Karim Foda, Developer in London, United Kingdom
Karim is available for hire
Hire Karim

Karim Foda

Verified Expert  in Engineering

NLP Researcher and Developer

Location
London, United Kingdom
Toptal Member Since
July 6, 2020

Karim是一名NLP研究员,在构建旨在复制特定人类功能的机器学习(ML)模型方面拥有深入的实践经验, thereby accelerating a business's processes. Most recently, Karim的重点是通过会话聊天机器人训练大型语言模型(llm),用于自然语言理解(NLU)和自然语言生成(NLG).

Portfolio

Kaizan
Artificial Intelligence (AI), OpenAI GPT-4 API, OpenAI GPT-3 API...
Shortform
自然语言处理(NLP), OpenAI GPT-4 API, Elasticsearch...
Grata
JSON, Roku, Machine Learning, Deep Neural Networks...

Experience

Availability

Full-time

Preferred Environment

Python

The most amazing...

...我相信我已经建立了一个LongT5模型,它对自动生成自助书籍摘要进行了微调.

Work Experience

Lead NLP Engineer

2021 - PRESENT
Kaizan
  • Built a GPT-4-driven chatbot that combined factored cognition, LangChain, 和Elasticsearch,以增强组织员工对所有团队电话和电子邮件的完美记忆.
  • 开发了一个内部注释平台,以增加使用弱标签的手动注释,并设计了一个数据增强策略,将用户数据大小增加了四倍.
  • 使用hugs Face Transformers和微软的DeepSpeed库对Pegasus大型视频通话摘要数据模型进行了微调,以自动生成会议动作和摘要.
Technologies: Artificial Intelligence (AI), OpenAI GPT-4 API, OpenAI GPT-3 API, Language Models, Django, Hugging Face, Generative Pre-trained Transformers (GPT), Elasticsearch, PostgreSQL, Redis, Google Cloud, Docker, Causal Inference, Fine-tuning, Generative Artificial Intelligence (GenAI), Research

NLP Consultant

2021 - 2023
Shortform
  • 在BookSum数据集上对LongT5 XXL模型进行了三倍以上的预训练,该模型的表现优于LongT5 XL,可以为带有个性化评论的小说书籍编写连贯的阅读指南.
  • 构建由语言模型和矢量数据库搜索提供支持的代理,以帮助用户创建对特定书籍主要论点的扩展和反驳点.
  • 使用GPT-4和摘要的摘要方法部署了一个用于总结书籍章节的管道.
Technologies: 自然语言处理(NLP), OpenAI GPT-4 API, Elasticsearch, Google Cloud, Artificial Intelligence (AI), Docker, Hugging Face, Causal Inference, Fine-tuning, Generative Artificial Intelligence (GenAI)

NLP Engineer

2021 - 2022
Grata
  • 调整了一个t5-3b模型,使用从公司网站上抓取的文本以预定义的格式生成公司描述, achieving an 89% average BERTScore precision.
  • 在Amazon SageMaker上部署了一个微调过的t5-3b模型,从公司网站自动生成公司描述.
  • 自定义构建了一个问答数据集,以微调基于roberta的模型,从其网站自动提取公司的特定信息,例如交易名称, location, and products.
Technologies: JSON, Roku, Machine Learning, Deep Neural Networks, Natural Language Processing (NLP), GPT, Generative Pre-trained Transformers (GPT), Python 3, Sequence Models, BERT, PyTorch, Hugging Face, OpenAI, Artificial Intelligence (AI), Docker, Causal Inference, Fine-tuning, Generative Artificial Intelligence (GenAI)

NLP Engineer

2018 - 2021
Lloyds Banking Group
  • 开发Python脚本,从内部社交媒体网站提取评论, analyzed their change in sentiment over time, and visualized the findings in the Python Dash app.
  • 建立了一个聊天机器人,专注于通过情绪记录功能改善同事的心理健康,并使用GPT-2转换器,使其能够与用户进行基本对话.
  • Classified 100,使用由LDA主题分析模型识别的类别自动使用描述每个案例的逐字文本注释运行的000个客户案例.
  • 使用正则表达式在RDS数据库中检测和编码个人客户数据.
Technologies: Natural Language Generation (NLG), Generative Pre-trained Transformers (GPT), GPT, Natural Language Processing (NLP), R, Tableau, Python, Sequence Models, Hugging Face, Causal Inference

NLP Engineer

2020 - 2020
FACETITLE
  • 训练基于bert的NER模型,以95%的准确率检测电视节目字幕中提到的角色,并在Roku应用程序上实时显示他们的头像.
  • 创建了一个基于roberta的多类分类模型,该模型使用hug Face Transformer库对剧集评论的情感进行分类,准确率达到92%.
  • 咨询创始团队,帮助他们获得NSF种子基金资助.
Technologies: Machine Learning, Natural Language Processing (NLP), Web Scraping, GPT, Generative Pre-trained Transformers (GPT), Python

Data Scientist

2016 - 2018
Lloyds Banking Group
  • 使用基于熵的随机森林模型和双向lstm的预测集合,建立了欧元/美元汇率运动方向的分类模型.
  • 与金融业务伙伴和业务经理协调,开发透明的交易管道收入预测模型,准确率为5%.
  • 使用VECM和VAR模型分析英国脱欧前欧洲资产之间的日内相关性,以促进以德国资产为重点的策略.
  • 使用线性回归模型分析年度收入数据的时间序列,自动计算21个行业的年度收入预算.
Technologies: Generative Pre-trained Transformers (GPT), GPT, Natural Language Processing (NLP), R, Machine Learning, Visual Basic for Applications (VBA), Python

Data Engineer

2014 - 2016
Lloyds Banking Group
  • 为数字、商业银行和IT支持团队构建数据捕获和可视化工具.
  • 领导了一项服务改进计划,解决了52%的金融市场系统问题记录,并建立了跟踪日常表现的仪表板.
  • 对两个新的手机银行测试产品的财务可行性进行研究,并估算和贴现未来预测现金流,以推动5000万英镑的投资决策.
Technologies: Python, Visual Basic for Applications (VBA), Tableau

Emotion Classification Using a WAME Optimizer

Implemented the recently developed WAME optimizer by Mosca et al. 提高一种情绪分类卷积神经网络的性能. 我获得了比基准优化器(如Adam和RMSProp)更高的精度.
2018 - 2020

Master of Research Degree in Machine Learning

Birkbeck University of London - London, United Kingdom

2016 - 2018

Master's Degree in Finance

London Business School - London, United Kingdom

2010 - 2014

Master of Science Degree in Aeronautical Engineering

Durham University - Durham, United Kingdom

Libraries/APIs

TensorFlow深度学习库(TFLearn), Keras, TensorFlow, Pandas, DeepSpeech, PyTorch

Tools

MATLAB, Named-entity Recognition (NER), Tableau

Languages

Python, R, SQL, Bash, c++, Visual Basic for Applications (VBA), Python 3

Platforms

Docker, Google Cloud Platform (GCP)

Paradigms

Data Science

Storage

PostgreSQL, JSON, Elasticsearch, Redis, Google Cloud

Frameworks

Django

Other

Dashboard Design, Transformers, Natural Language Processing (NLP), Dash, Topic Modeling, Emotion Recognition, Sentiment Analysis, Machine Learning, Statistics, Artificial Intelligence (AI), Natural Language Generation (NLG), Neural Networks, Custom BERT, OCR, Hugging Face, Generative Pre-trained Transformer 3 (GPT-3), Language Models, DeepSpeed, GPT, Generative Pre-trained Transformers (GPT), Causal Inference, Bittensor, Fine-tuning, Generative Artificial Intelligence (GenAI), Research, Chatbots, Image Recognition, Web Scraping, Econometrics, Time Series Analysis, Deep Neural Networks, Recurrent Neural Networks (RNNs), Convolutional Neural Networks (CNN), Decision Tree Classification, Finite Element Analysis (FEA), Deep Learning, Generative Adversarial Networks (GANs), Roku, Voice, Sequence Models, BERT, OpenAI, OpenAI GPT-4 API, OpenAI GPT-3 API

Collaboration That Works

How to Work with Toptal

在数小时内,而不是数周或数月,我们的网络将为您直接匹配全球行业专家.

1

Share your needs

在与Toptal领域专家的电话中讨论您的需求并细化您的范围.
2

Choose your talent

在24小时内获得专业匹配人才的简短列表,以进行审查,面试和选择.
3

Start your risk-free talent trial

Work with your chosen talent on a trial basis for up to two weeks. Pay only if you decide to hire them.

Top talent is in high demand.

Start hiring