Siddharth Deshpande, Developer in Cambridge, United Kingdom
Siddharth is available for hire
Hire Siddharth

Siddharth Deshpande

Verified Expert  in Engineering

Data Scientist and Developer

Location
Cambridge, United Kingdom
Toptal Member Since
June 27, 2022

Siddharth是一名跨学科研究人员,他的独特观点来自于翻译项目和他在材料工程方面的综合教育背景, biochemistry, healthcare, natural language processing (NLP), and data science. 他在处理生物结构化和非结构化数据以及使用最先进的人工智能技术解决复杂的医疗保健问题方面拥有丰富的经验.

Portfolio

Immersely
Amazon Web Services (AWS), Machine Learning, Game AI, Emotion Recognition...
Post Urban Ventures
Python, CTO, Deep Learning, Entrepreneurship, Pitch Preparation...
Richmond Ayirebide
Natural Language Processing (NLP), Python, Chatbots, Machine Learning, GPT...

Experience

Availability

Part-time

Preferred Environment

GPT, Natural Language Processing (NLP), Generative Pre-trained Transformers (GPT), Biomedical Skills, Machine Learning, Language Models, Unstructured Data Analysis, Data Visualization, Artificial Intelligence (AI), Biochemistry, Amazon Web Services (AWS), Python

The most amazing...

...我开发的是一个NLP框架,它从文档中提取生物医学实体并将其可视化为网络图,以发现新的生物医学关系.

Work Experience

Chief Technological Officer (Interim)

2022 - PRESENT
Immersely
  • Worked for Immersely, 是什么让游戏开发者能够创造出能够实时适应玩家情感的超个性化游戏, boosting engagement to create better, more commercially successful games.
  • 负责开发机器学习模型,使用生理信号来检测一个人在玩游戏时的情绪,以开发互动游戏体验.
  • 负责为公司开发技术路线图和后端技术基础设施.
Technologies: Amazon Web Services (AWS), Machine Learning, Game AI, Emotion Recognition, Python 3, Data Science, LangChain

Deep Tech Venture Builder

2022 - PRESENT
Post Urban Ventures
  • Validated technological feasibility of new startup ideas before funding, built technical prototypes (MVP) for pre-seed and seed round investor pitches, and supported early-stage startups with essential technical infrastructure.
  • 曾担任四家初创公司的临时首席技术官,并在Post Urban Ventures中担任两家初创公司的技术顾问.
  • Contributed to securing a £5 million grant in funding for startups successfully.
  • Involved in preparing technical pitch decks, offered expert advice and guidance, and helped promote startup success. Designed technical roadmaps for scaling startups after pre-seed and seed rounds.
Technologies: Python, CTO, Deep Learning, Entrepreneurship, Pitch Preparation, Artificial Intelligence (AI), Web Scraping, Data Science, Excel Expert, JSON, Interactive Charts, CSV File Processing, Language Models, Unstructured Data Analysis, Natural Language Processing (NLP), GPT, Generative Pre-trained Transformers (GPT), Machine Learning, Healthcare, Chatbots, Chatbot Conversation Design, OpenAI, LangChain, Weviate, Pinecone

Senior AI/ML and NLP Chatbot Developer

2023 - 2023
Richmond Ayirebide
  • 根据客户需求,利用ChatGPT开发了一个会计聊天机器人, finetuned GPT-3, and Telegram.
  • 简化了预处理和后处理,将结果格式化为易于查看的Excel表格.
  • 帮助为聊天机器人在云基础设施中的未来部署制定计划.
Technologies: Natural Language Processing (NLP), Python, Chatbots, Machine Learning, GPT, Generative Pre-trained Transformers (GPT), Artificial Intelligence (AI), Deep Learning, Chatbot Conversation Design, OpenAI, LangChain, Weviate, Pinecone

Chief Technological Officer (Interim)

2022 - 2022
Bioleap
  • Brought on board to develop the technical framework for Bioleap, a startup focused on developing AI-based single-cell models.
  • Managed the building of cloud capabilities in AWS, hired a competent technical team, and improved the current mechanistic models.
  • 与领先的生物建模实验室建立了多个战略技术合作伙伴关系. Built a cloud-based automation strategy for Bioleap models.
  • Established a technology strategy (tech stack), technical roadmap, and business plan to support the growth strategy.
Technologies: Artificial Intelligence (AI), Bioinformatics, Single-cell Modeling, Time Series Analysis, Computational Biology, Excel Expert, JSON, Interactive Charts, CSV File Processing, Language Models, Unstructured Data Analysis, Generative Pre-trained Transformers (GPT), GPT, Natural Language Processing (NLP), Machine Learning, Python, Healthcare, Data Science, CTO, Medical Diagnostics

NLP Data Scientist

2021 - 2022
Evaluate Ltd
  • 开发了一个新闻稿分类器,将新闻文章分为40个技术类, saving the company around 30,000 pounds per year in third-party API licenses.
  • Identified digital health innovations from clinical trials, news articles, 并为一个定制分析项目处理文档,该项目减少了日本客户手工文档分类的工作时间.
  • Created a core NLP framework to extract biomedical entities from unstructured texts and visualize them as a graphical network; the framework became popular for discovering new biomedical relations and was subsequently used in many Evaluate products.
Technologies: Python, GPT, Natural Language Processing (NLP), Generative Pre-trained Transformers (GPT), Amazon Web Services (AWS), Pharmacology, R&D, Data Science, Data Visualization, Machine Learning, Biomedical Skills, Bioinformatics, Microsoft Excel, Healthcare, Excel Expert, JSON, Interactive Charts, CSV File Processing, Language Models, Unstructured Data Analysis, Artificial Intelligence (AI), Spark NLP, PySpark, Spark ML, Chatbots

Data Scientist

2019 - 2021
Patsnap
  • Developed PatSnap Bio, 一个核心产品,是最大的序列搜索平台之一,被大型制药公司积极使用.
  • Created PatSnap Materials, another core product under Beta testing in China.
  • 积极参与PatSnap Bio和PatSnap Materials的产品开发和客户反馈过程.
  • Filed five patent applications involving my technology.
Technologies: Python, Generative Pre-trained Transformers (GPT), GPT, Natural Language Processing (NLP), Patents, Analytics, Product Development, Biology, Pharmacology, Composite Materials, Biomaterial, Engineering, Bioinformatics, Web Scraping, Machine Learning, Microsoft Excel, Data Science, Healthcare, Excel Expert, JSON, Interactive Charts, CSV File Processing, Language Models, Unstructured Data Analysis, Artificial Intelligence (AI), Spark NLP, PySpark, Spark ML

COVID-19 Scientific Journals Analysis

http://github.com/siddharth0112358/coronavirus_19
Analyzed the COVID-19 dataset, a collection of scientific papers related to COVID-19, using different NLP techniques. 该项目的目的是使用不同的NLP算法获得不同的见解,这可能有助于更好地理解研究论文.

Research papers available on GitHub:

•AutoDetect_COVID_FakeNews—用于检测有关COVID的假新闻的分类模型
•BERT_semantic_search -语义搜索,在COVID语料库中查找类似的句子以响应查询问题
•biorelated_sentence_extracaction_covid -从COVID语料库中提取生物相关的句子
•covid_19_topic_modelelling_top2vec -使用Top2Vec对COVID_19语料库进行主题建模
• COVID_explore_drugs - Explore drugs in the COVID corpus
•Covid - 19_ques_and_ans -基于doc2vec的Covid论文问答系统
•covid - 19_ner_text_summarization_and_topic_modeling - BART摘要和LDA主题建模和NER
• Covid_19_genome_analysis - COVID_19 genome analysis
• Covid_paper_rank_display - NER and covid papers recovery based on topic
• Medical_NER_Corona - NER on coronavirus dataset
• Mining_COVID_keywords - mining keywords using bigrams and trigrams

Alibaba Cloud Global AI Innovation Challenge

Won an Innovation Award for the project.

我的项目目标是分析天气对能源生产和需求的影响,并找到一个可以使用天气参数预测可再生能源生产和能源需求的解决方案.

SOLUTION HIGHLIGHTS

•利用气候和时间参数预测太阳能、风能和水能发电.
•能源需求预测使用时间和能源参数(模型1)和时间完成, energy, and climate parameters (Model 2). Model 2 showed slightly higher accuracy than Model 1. 结果表明,气候参数对能源需求的影响不像能源参数那样显著.
•能源价格预测使用时间和能源参数(模型1)和时间, energy, and climate parameters (Model 2). Model 2 showed higher accuracy than Model 1. It shows that climate parameters affect energy prices significantly.

For all the above cases, 10 million regression algorithms were tested. ExtraTreeRegressor算法表现最好,并用于建立回归模型.

URL: http://www.alibabacloud.com/blog/project-showcase-%7C-effect-of-weather-on-energy-generation-and-demand_598252

Conversational Chatbots

I built three conversational chatbots across different conversation channels, including Slack, WhatsApp, Dashboard, Discord, Telegram, and Facebook messenger. 我使用GPT-3开发了聊天机器人,附带了额外的约束和快速的工程设计.
•对话助手-这个机器人帮助模拟艰难的对话,以便客户可以事先练习对话. The client is scored on 2-3 conversation skills, 最后会生成一份报告,显示他的分数以及如何提高他的会话能力.
•时尚助手-该机器人根据客户需求和企业库存推荐时尚单品. It uses a combination of GPT-3 and DALL-E.
•谷歌机器人-这个机器人有一个谷歌搜索引擎的能力,并作为一个顾问/朋友,你可以问任何问题, 它会在后台运行谷歌搜索,为你提供最新的答案.
Bot previews can be shown during interviews.
2016 - 2019

Doctorate in Medicine

National University of Singapore - Singapore

2014 - 2015

Master's Degree in Materials Science and Engineering

National University of Singapore - Singapore

2010 - 2014

Bachelor's Degree in Metallurgy and Material Science

College of Engineering Pune - Pune, India

JANUARY 2023 - PRESENT

Healthcare NLP for Data Scientists

John Snow Labs

JANUARY 2023 - PRESENT

Spark NLP for Data Scientists

John Snow Labs

MAY 2022 - PRESENT

TensorFlow: Advanced Techniques Specialization

DeepLearning.AI | via Coursera

APRIL 2022 - PRESENT

Deep Learning for Healthcare Specialization

University of Illinois at Urbana-Champaign | via Coursera

MARCH 2022 - PRESENT

Customizing Your Models with TensorFlow 2

Imperial College London | via Coursera

MARCH 2022 - PRESENT

Generative Adversarial Networks (GANs) Specialization

DeepLearning.AI | via Coursera

JULY 2021 - PRESENT

Deployment of Machine Learning Models

Udemy

FEBRUARY 2021 - PRESENT

Natural Language Processing in Python

DataCamp

DECEMBER 2020 - PRESENT

Natural Language Processing Specialization

DeepLearning.AI | via Coursera

OCTOBER 2020 - PRESENT

AI in Healthcare Specialization

Stanford University | via Coursera

OCTOBER 2018 - PRESENT

Deep Learning Specialization

DeepLearning.AI | via Coursera

Libraries/APIs

TensorFlow, PySpark, Spark ML

Tools

Microsoft Excel, SOLIDWORKS

Industry Expertise

Bioinformatics, Healthcare

Languages

Python, Python 3

Storage

JSON

Platforms

Amazon Web Services (AWS)

Paradigms

Data Science

Other

Natural Language Processing (NLP), Machine Learning, Data Visualization, Biochemistry, Analytics, Biology, Pharmacology, R&D, Engineering, CSV File Processing, Excel Expert, Interactive Charts, Spark NLP, Chatbots, Patents, GPT, Generative Pre-trained Transformers (GPT), Biomedical Skills, Language Models, Unstructured Data Analysis, Artificial Intelligence (AI), Biomaterial, Composite Materials, Deep Learning, Dash, Deep Neural Networks, Convolutional Neural Networks (CNN), Sequence Models, Entrepreneurship, Web Scraping, Time Series Analysis, Computational Biology, Game AI, Emotion Recognition, Chatbot Conversation Design, LangChain, Weviate, Pinecone, Cell Biology, Materials Science, 3D Printing, Product Development, Model Deployment, Generative Adversarial Networks (GANs), Single-cell Modeling, CTO, Pitch Preparation, Medical Diagnostics, OpenAI, Generative Pre-trained Transformer 3 (GPT-3), Google Custom Search

Collaboration That Works

How to Work with Toptal

在数小时内,而不是数周或数月,我们的网络将为您直接匹配全球行业专家.

1

Share your needs

在与Toptal领域专家的电话中讨论您的需求并细化您的范围.
2

Choose your talent

在24小时内获得专业匹配人才的简短列表,以进行审查,面试和选择.
3

Start your risk-free talent trial

Work with your chosen talent on a trial basis for up to two weeks. Pay only if you decide to hire them.

Top talent is in high demand.

Start hiring