Joao Diogo de Oliveira, Developer in Fortaleza, State of Ceará, Brazil

Joao Diogo de Oliveira

Verified Expert in Engineering

Machine Learning Engineer and Developer

Location
Fortaleza - State of Ceará, Brazil
Toptal Member Since
October 20, 2022

Joao is an AI/ML engineer with more than 14 years of experience at Fortune 100 companies like Procter & Gamble and Hearst and startups in the healthcare, energy, and finance industries. Joao holds a master's degree in computer engineering from the University of Porto and has multiple certifications in ML and deep learning.

Portfolio

Hearst - Technology
Python, Artificial Intelligence (AI), Generative Pre-trained Transformers (GPT)...
Peyton & Greyson Solutions Inc,
Artificial Intelligence (AI), AI Design, Generative Adversarial Networks (GANs)...
Freelance Clients
Python 2, Python 3, Deep Learning, Statistics, Data Analytics, Python...

Experience

Availability

Part-time

Preferred Environment

Python 3, PyTorch, TensorFlow, R, Machine Learning, Google Cloud Platform (GCP), Amazon Web Services (AWS)

The most amazing...

...project I've led is forecasting power generation for over 300 wind and solar farms in a record time of 1.5 months.

Work Experience

MVP Developer

2023 - PRESENT
Hearst - Technology
  • Developed an MVP successfully, demonstrating the ease of replacing a legacy system within 3-4 weeks.
  • Used generative AI (GPT-3.5, GPT-4) and other frameworks and libraries (LangChain and LlamaIndex) to extract structured data from unstructured data. Achieved up to a 98% success rate.
  • Researched the newest trends in generative AI and drove their adoption across a broad audience. These included, but were not limited to, the newest models, like GPT-4, Turbo, Gemini, Claude, and multimodal models, and the newest frameworks, like LlamaIndex, LangChain, and AutoGPT.
  • Planned and built working pipelines for training and inference so they could be used seamlessly.
Technologies: Python, Artificial Intelligence (AI), Generative Pre-trained Transformers (GPT), AgentGPT, Generative Artificial Intelligence (GenAI), Google Cloud Platform (GCP), Azure, Gemini, AI Agents, Information Extraction, Generative AI, Large Language Models (LLMs), Data Science, Natural Language Processing (NLP), GPT, Amazon Web Services (AWS), OpenAI

AI Developer

2022 - PRESENT
Peyton & Greyson Solutions Inc,
  • Developed an AI application for writing automated proposals, saving specialized employees at least 20% of their time.
  • Designed and architected the entire IT solution: a) database choice and design; b) AWS serverless services; c) selection and setup of the web app back-end implementation; d) API configuration; e) complete AI model development and deployment with AgentGPT.
  • Tracked team members' progress and ensured milestones were met, from demos to key development deliverables.
  • Successfully customized a GPT-3 model for a specific business case.
Technologies: Artificial Intelligence (AI), AI Design, Generative Adversarial Networks (GANs), Language Models, OpenAI, APIs, Backendless, Amazon Web Services (AWS), AWS Lambda, Amazon RDS, Python, DaVinci, Large Language Models (LLMs), Models, AI Programming, Natural Language Understanding (NLU), Matplotlib, GPT, Generative Pre-trained Transformers (GPT), Natural Language Processing (NLP), Information Extraction, GitHub, Cloud Platforms, Data Pipelines, Early-stage Startups, Data Processing, Data Transformation, Back-end, ChatGPT, OpenAI GPT-3 API, Generative Pre-trained Transformer 3 (GPT-3), DevOps, Amazon SageMaker, Jupyter Notebook, OpenAI GPT-4 API, Kubernetes, Scraping, Analytics, Keras, Sentiment Analysis, Generative AI, Data Structures

IT Engineer | AI Engineer

2019 - PRESENT
Freelance Clients
  • Developed an AI project for energy prediction of solar and wind farms, totaling 2.6 GW of installed power.
  • Built a computer vision model that performs facial recognition.
  • Created a computer vision model that easily detects pneumonia from X-rays.
  • Provided consulting services to deliver wind certification for two offshore projects with a combined predicted installed power of 2 GW.
  • Maintained over 20 distributed Linux servers, updating, securing, and creating key performance indicators (KPIs).
Technologies: Python 2, Python 3, Deep Learning, Statistics, Data Analytics, Python, Data Science, Deep Neural Networks, Big Data Architecture, Linux, Datasets, Pandas, Machine Learning Operations (MLOps), Image Processing, Hardware, Large Language Models (LLMs), Models, AI Programming, Natural Language Processing (NLP), GPT, Generative Pre-trained Transformers (GPT), Data Processing Automation, Artificial Intelligence (AI), Image Generation, ARIMA, ARIMA Models, LSTM, SARIMA, R, Matplotlib, Information Extraction, GitHub, Cloud Platforms, Data Pipelines, Energy, Neural Networks, Regression Modeling, Data Processing, Data Transformation, CSV, Data Analysis, Back-end, DevOps, Amazon SageMaker, Jupyter Notebook, Speech Recognition, Scraping, Analytics, FFmpeg, Keras, Sentiment Analysis, Image Recognition, TensorFlow, PyTorch, Computer Vision, Generative AI, OpenAI, Speech to Text

Product Owner | Country Manager

2017 - PRESENT
Prewind
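  • Developed AI models, including deep learning, weather forecast, and energy prediction for multiple markets.
  • Performed business and data analysis for clients.
  • Led the successful establishment of a European research institute in Brazil.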
  • Managed a portfolio of clients with a combined production of over 3 GW of energy.
Technologies: Deep Learning, Artificial Intelligence (AI), Machine Learning, Data Analytics, Data Science, Data Visualization, Linux, Datasets, Pandas, Amazon Web Services (AWS), Python, Hardware, Models, Matplotlib, Information Extraction, GitHub, Early-stage Startups, Energy, Neural Networks, Data Transformation, CSV, Data Analysis, Back-end, DevOps, Workshop Facilitation, Analytics, Sentiment Analysis, Image Recognition

Managing Director

2013 - PRESENT
Niway Group
  • Managed the day-to-day operations of the group's investment projects, including a shopping mall, business towers, and representation before official government agencies.
  • Reversed a seven-year loss into profit by applying substantial and stable changes.
  • Oversaw the financial control of the construction of three towers, 12 floors each, with a total cost of R$ 43 million.
Technologies: Team Leadership, Finance, Data Science, Data Visualization, Python, Real Estate, CSV, Data Analysis, CTO, Workshop Facilitation, Analytics

Machine Learning Developer

2023 - 2023
EIS - Main
  • Conducted a feasibility study and implemented a POC for capturing, counting, and geo-locating valves in a point cloud project of an oil and gas plant scan.
  • Developed an AI model to identify valves in batches of images from a plant scan.
  • Implemented a method to automatically process and slice point cloud data, extracting images and converting them into 2D images.
Technologies: Machine Learning, Computer Vision, Deep Learning, Convolutional Neural Networks (CNN), Artificial Intelligence (AI), Point Clouds, Point Cloud Data, Image Processing, Natural Language Processing (NLP), Python, TensorFlow, PyTorch

Team Leader

2023 - 2023
Stop the Traffik
  • Analyzed the underlying tech issues in a volunteer organization and proposed a plan to tackle them through a team of 11 volunteers scattered over nine countries.
  • Led a team of ML/AI specialists to develop an AI model for sentiment analysis to automate the analysis and classification of trafficking articles, removing the manual labor currently applied.
  • Guided and led a team of ML/AI specialists to improve the legacy model that classified articles into relevant and non-relevant ones for the organization.
  • Steered the project's success and engagement through meetings to deliver the proposed outcomes to the organization. Participated in every part of the development (AI, DevOps, Python) to ensure commitments were met and delivered.
Technologies: IBM Cloud, Amazon SageMaker, Kubernetes, Data Science, Python, Artificial Intelligence (AI)

NLP Engineer

2023 - 2023
Mercatus Center at George Mason University
  • Developed a long text classification for documents within 96 labels. The purpose was to use different NLP techniques to get probabilities of the three-digit NAICS codes.
  • Explored literature on the most advanced techniques of text classification and long text and applied them. Combined the different techniques to achieve a better result, improving the F1 score by 15%.
  • Used AWS SageMaker to provide an effective and insightful training and inference pipeline.
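  • Achieved F1 scores of up to 0.95-0.98 in parts of the project, with others increasing from 0.4 to 0.7 using different techniques.
Technologies: Natural Language Processing (NLP), Python, GPT, Generative Pre-trained Transformers (GPT), NLPP, Deep Neural Networks, Amazon SageMaker, Transformers, Data Science, Artificial Intelligence (AI), TensorFlow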

Engineering Manager

2012 - 2013
Procter & Gamble
  • Implemented multiple line update projects across plants in France, Italy, and Spain.
  • Developed cost-saving solutions and deployed them across multiple plants.
  • Led technical discussions with suppliers to make sure they would meet the requirements.
Technologies: Agile, Project Design & Management, Process Management, APIs, Linux, Hardware, Supply Chain Management (SCM), Supply Chain Optimization, SARIMA, Data Processing, Data Analysis, Workshop Facilitation

Supply Chain Leader

2009 - 2012
Procter & Gamble
  • Led the design and implementation of a global pilot project to remodel the company's logistics sector.
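  • Found a solution to a complex inventory cost problem, achieving a reduction from $12 million to $7 million.
  • Participated in creating an in-house cross-docking supply chain prototype, saving $2 million per year.
  • Coached, guided, and coordinated the work of multiple team members.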
Technologies: Project Design & Management, Logistics, Agile, Forecasting, Data Science, Datasets, Supply Chain Management (SCM), Supply Chain Optimization, Data Processing, Data Analysis, Workshop Facilitation

NLP in Healthcare | Score Clinical Patient Notes

http://www.kaggle.com/c/nbme-score-clinical-patient-notes
A project to classify each patient's probable disease according to actual notes taken by doctors during clinical trials. My task was to build a natural language processing (NLP) model on top of the RoBERTa foundation model.

CV: X-ray Pneumonia Detection

http://github.com/joao-d-oliveira/X-Ray_PneumoniaDetection
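A computer vision model that receives X-ray images, detects the presence of foreign tissue, and predicts whether the image belongs to a patient with pneumonia. The model performed similarly to trained physicians, with an accuracy of 86% (no pneumonia) and 19% (pneumonia).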

Wind and Solar Farm Power Generation Forecast

A power generation forecast for over 300 wind and solar farms spread across Portugal. I performed the data analysis for the plants' geolocation and wind and solar profiles, structuring all the data, building an ensemble of about five models for each farm, and training and deploying the models.

Computer Vision - Face detection

A computer vision model, built with ML techniques, that does video-based facial recognition. I was instrumental in making the model and the necessary pipeline from the beginning. Additionally, I achieved a false acceptance rate (FAR) of approximately 10^-5, meeting clients' needs.

AI Development for Automated Proposal Generation

The application automates proposal writing. The idea was to develop a model, and a web application to support it, that would save specialized employees at least 20% of their time. I developed a working AI model based on GPT-3. I also designed and developed the structure and architecture of the web application, building most of the back-end functions and the entire database architecture.

CV: Image Captioning - Identify Objects and Write Captions

Developed a machine learning model that, through deep learning networks, analyzes images, identifies objects, and captions the images accordingly. The project got a BLEU-1 score of 0.679 for image captioning; a score of 0.6-0.7 is considered best in class.

Email NLP/NLU/NER Analysis

Extracting insights from emails through advanced NLP techniques: classifying emails within a set of predefined categories (achieving an overall accuracy of over 83%), extracting important information from the text, doing data analysis, summarization, and other relevant tasks.

Surgery Assistance Software

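AI software that performs speech recognition, interprets commands, and identifies the tools needed at a specific moment of the surgery. On top of that, the AI predicted (based on historical information) what the order of the tools within the surgery should be.

I designed and implemented the software's architecture and delivered the MVP.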

Languages

Python 3, SQL, Python, R, Python 2, C++

Libraries/APIs

PyTorch, TensorFlow, Scikit-learn, Pandas, LSTM, Matplotlib, Keras, OpenCV, PyTorch Lightning, FFmpeg

Tools

GitHub, Amazon SageMaker, ChatGPT, You Only Look Once (YOLO), NLPP

Paradigms

Data Science, Agile, DevOps, Anomaly Detection

Platforms

Linux, Amazon Web Services (AWS), Jupyter Notebook, Google Cloud Platform (GCP), Kubernetes, Docker, Azure, Backendless, AWS Lambda

Storage

Data Pipelines, PostgreSQL, MySQL

Other

Machine Learning, Deep Learning, Data Structures, Artificial Intelligence (AI), Algorithms, Team Leadership, Project Design & Management, Computer Vision, BERT, Natural Language Processing (NLP), Deep Neural Networks, Datasets, Language Models, OpenAI, Image Processing, Hardware, Large Language Models (LLMs), Models, AI Programming, Data Processing Automation, Real Estate, ARIMA, ARIMA Models, Supply Chain Management (SCM), Supply Chain Optimization, Forecasting, Information Extraction, Energy, Neural Networks, Regression Modeling, Data Processing, Data Transformation, CSV, Data Analysis, GPT, Generative Pre-trained Transformers (GPT), Back-end, Generative Pre-trained Transformer 3 (GPT-3), OpenAI GPT-4 API, Workshop Facilitation, Analytics, Convolutional Neural Networks (CNN), Sentiment Analysis, Generative AI, Data Analytics, Process Management, Logistics, Statistics, Computer Vision Algorithms, Data Visualization, Big Data Architecture, Machine Learning Operations (MLOps), Generative Adversarial Networks (GANs), DaVinci, SARIMA, Natural Language Understanding (NLU), Hugging Face, Cloud Platforms, Early-stage Startups, Generative Artificial Intelligence (GenAI), Web Development, Word Embedding, OpenAI GPT-3 API, API Integration, Speech Recognition, Scraping, Facial Recognition, Image Recognition, Speech to Text, Finance, Quantum Computing, Healthcare IT, Deep Reinforcement Learning, APIs, Object Detection, Generative Models, AI Design, Amazon RDS, Image Generation, CTO, Transformers, IBM Cloud, Prompt Engineering, Qiskit, AgentGPT, Point Clouds, Point Cloud Data, Gemini, AI Agents

2003 - 2009

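Master's Degree in Computer Science

University of Porto - Porto, Portugal

2007 - 2008

Exchange Program Coursework for a Master's Degree in Computer Science

Delft University of Technology - Delft, Netherlands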

AUGUST 2022 - PRESENT

Quantum Excellence Certificate

IBM | Qiskit Global Summer School 2022

JULY 2022 - PRESENT

AI for Healthcare

Udacity

JULY 2021 - PRESENT

Machine Learning

Stanford University

JULY 2021 - PRESENT

Deep Reinforcement Learning

Udacity

JUNE 2021 - PRESENT

Advanced Computer Vision - Machine Learning

Udacity
