AVAILABLE FOR WORKSAN FRANCISCO, CAEST. 2026

I find where your systems are not optimizing and rebuild them with AI that does.

I'm Ritin Wadekar, an AI Engineer at Onpoint Insights. I shipped an automated medical coding pipeline for a healthcare client. It's a system that turns clinical notes into billing codes approved by the government, built on official CMS data so every prediction is defensible. Before that, I built multi agent LLM pipelines, RAG systems, NLP, predictive and forecasting models, and recommendation engines across healthcare, pharma, and retail.

View Selected Work Explore Capabilities Contact Me

01 / WORK2023 TO 2026

Three companies. Five roles.
One thread: I always left the system smarter than I found it.

Onpoint Insights· FEATURED

AI Engineer

San Francisco, CA

Jan 2026 to Present

Researched, architected, and shipped a government compliant automated medical coding pipeline as an AI Engineer for a healthcare client.

7.66%

Coding error rate, well below the 10–20% industry standard

3×

More accurate than a standard baseline

$8.4B

Market we are targeting in medical coding

Delivered an automated medical coding pipeline in 2 months, owning the project end to end as an AI Engineer.

Led end to end R&D as an AI Engineer for a medical coding pipeline, delivering a production ready, government compliant product in two months. Built the solution for a market projected to reach $8.4B by 2033, helping position the client at the forefront of medical coding automation.

Identified a flawed LLM baseline sitting below 70% accuracy before it became the foundation of the entire system.

Researched automated ICD 10 prediction approaches, identifying that a vanilla LLM baseline failed to exceed 70% coding accuracy and was not viable for clinical use. Established a retrieval based architecture as the team's technical foundation, preventing the team from building on a flawed baseline.

Built a retrieval system grounded in government sources, achieving 7.66% error rate vs a 10–20% industry baseline.

Integrated three official government CMS coding sources into the pipeline's retrieval layer, grounding every prediction in clinically and legally accepted standards. Achieved an error rate under 7.66%, significantly outperforming the 10–20% error rate typical of manual medical coding.

Designed a two stage AI pipeline that tripled prediction accuracy over a standard LLM approach.

Designed a two stage AI pipeline that progressively narrows ICD 10 candidates before classification, outperforming standard LLM baselines by over 3× in prediction accuracy. The staged architecture mirrors expert coding workflows, making the system both accurate and interpretable.

Embedded Medicaid and Medicare compliance rules directly into the pipeline to handle government payer requirements.

Incorporated Medicaid and Medicare compliance rules directly into the automated coding pipeline, ensuring outputs meet government payer requirements at the architecture level. This targets the $1B+ annual loss the healthcare industry incurs from coding errors and non compliant claims.

PythonRAGVector DatabasesLLMsICD 10CPTRetrieval ArchitectureMedical CodingHealthcare AIGovernment Compliance

02 / FEATUREDAUTOMATED MEDICAL CODING PIPELINE

● Live · In Production

A government compliant medical coding pipeline, shipped in 2 months.

As the sole AI Engineer on a healthcare client engagement, I researched, architected, and shipped a government compliant medical coding pipeline. The system turns clinical notes into defensible billing codes, grounded in three official CMS sources (Centers for Medicare & Medicaid Services).

ACCURACY GAIN · VS AI BASELINE

3×

more accurate than a single pass LLM · proves the retrieve then rank architecture works for medical codes

ERROR RATE · VS INDUSTRY STANDARD

7.66%

well below the 10–20% manual coding industry standard · every prediction traceable to official CMS sources

End to end ownership

Researched, architected, and shipped as the sole AI Engineer on the engagement.

Caught the wrong approach early

Proved that an off the shelf large language model couldn't pass 70% accuracy on medical codes, before the team built the rest of the system on top of it.

Every answer traces back to official government sources

Built on top of three authoritative CMS coding manuals (Centers for Medicare & Medicaid Services), so every prediction has a paper trail back to a legally accepted source.

Two stage process that mirrors how expert coders work

Stage 1 narrows down the list of possible billing codes. Stage 2 picks the right one under tight rules. That's the same mental shortcut human medical coders use, and it tripled accuracy over a one step AI approach.

Compliance baked in at the right layers

Official CMS coding guidelines anchor every prediction from the start. Medicaid and Medicare payer rules then verify the code holds up under the patient's specific government insurance coverage. Both layers make the predictions defensible to regulators.

Built for a market projected to reach $8.4B by 2033

Positions the client at the frontier of medical coding automation as the total market grows, the kind of bet that compounds.

03 / IMPACTMEASURED · CLIENT IMPACT

Numbers from systems that actually shipped.

Click any number to jump to the project it came from.

Medical Coding Error Rate01

0.00%

error rate from the CMS-grounded medical coding pipeline I shipped

Accuracy vs Default AI02

0×

more accurate than a stock LLM, via the two-stage retrieval pipeline I designed

Projected Market03

$0.0B

medical coding market the automation pipeline I shipped is built for (by 2033)

Analyst Time Saved04

cut in analyst workflows from the multi-agent SQL system I built on Azure

AI Doc Search vs Keyword05

lift over keyword search from the RAG doc assistant I built (FAISS + intent routing)

Marketing ROI Lift06

0.0%

ROI lift for a top-3 pharma client from the causal attribution models I built

Invoice-Reading Accuracy07

extraction accuracy from the end-to-end OCR invoice pipeline I shipped

Transactions Processed08

warehouse transactions I mined to train a hybrid product recommender

Avg Order Size Lift09

0.0%

AOV lift from the hybrid recommender I built (Market Basket + Word2Vec)

Sales Forecast Error10

error on the monthly sales forecasting model I built for a manufacturer

Projected Sales Lift11

projected uplift from the regional pricing strategy I built for Conagra

Product Attributes Analyzed12

Conagra attributes I analyzed across 4 years of scanner data (Python + SAS)

App Downloads13

Series A EdTech app where I built churn models and ran S3 telemetry diagnostics

App Load Time Cut14

cut by ranking app latency bottlenecks from the AWS S3 user telemetry I analyzed

Lost-Payment Rate Cut15

drop in payment-page churn after the funnel diagnostics I ran on S3 data

Conagra Cost Savings16

$0K

saved by reallocating promo spend via the Clout & Vulnerability maps I built for Conagra

04 / PROJECTSCASE STUDIES

Ten systems. One pattern: find the bottleneck, rebuild it.

01Onpoint Insights● FEATURED

Automated Medical Coding Pipeline

Problem

Manual medical coding runs a 10–20% error rate and bleeds $1B+ annually from non-compliant claims, in a market projected to reach $8.4B by 2033.

Built

Two-stage retrieval pipeline grounded in three official CMS sources, with Medicaid and Medicare compliance rules built into the foundation of the system.

Approach

Step 1: find candidate codes from authoritative sources. Step 2: pick the right code under strict rules. Step 3: check compliance. Replaced a default large-language-model approach that capped below 70% accuracy.

PythonRAGVector DBLLMsICD-10CPTCMS Sources

→ Under 7.66% error rate · 3× off the shelf AI baseline · shipped in 2 months

02Onpoint Insights

AI Agents That Answer Analyst Questions on 20K+ Records

Problem

Analysts at a PPE (personal protective equipment) manufacturer spent a lot of time running custom database queries against 20K+ product records.

Built

Sequential team of AI agents: query decomposition → SQL construction → validation → response synthesis.

Approach

Azure-deployed agents using GPT-4o and o3-mini, with validation gates between each stage.

AzureGPT-4oo3-miniMulti-AgentSQL

→ 40% reduction in analyst effort

03Onpoint Insights

AI Document Assistant: Search Across 20+ Sources

Problem

Client teams couldn't answer questions across 20+ scattered documents with standard keyword search.

Built

AI assistant that retrieves answers across documents, with intent routing to the right source.

Approach

Transformer embeddings + FAISS vector index + LLM-driven intent routing.

FAISSTransformersRAGIntent Routing

→ 17% more accurate than standard keyword search

04Onpoint Insights

Marketing Attribution for a Top-3 Pharma Client

Problem

A top-3 pharma client needed to isolate the true ROI of individual marketing campaigns.

Built

Causal regression with counterfactual reasoning over clicks, impressions, conversions, cost-per-click, and spend.

Approach

Per-campaign impact isolation feeding data-driven reallocation recommendations.

Causal InferenceCounterfactualsStatistical Analysis

→ 4.2% ROI lift on campaign spend

05Onpoint Insights

Automated PDF Invoice Reader (OCR)

Problem

Manual invoice data entry was a bottleneck in client operations.

Built

End-to-end automation extracting structured data from PDF invoices into Excel.

Approach

Email-triggered workflows + data extraction models on Microsoft Power Automate.

Power AutomateOCRPDF Parsing

→ 92% extraction accuracy

06Onpoint Insights

Product Recommendations Across 10M+ Transactions

Problem

A B2B warehouse distributor needed contextual cross-sell across millions of historical transactions.

Built

Hybrid recommender combining Market Basket Analysis, Word2Vec, and learned hybrid models.

Approach

SQL-extracted 10M+ transactions → association rules + contextual recs deployed in production.

Word2VecMarket BasketSQLRecommendation

→ 6.7% lift in Average Order Size

07Onpoint Insights

Matching Messy Company Names Across Datasets

Problem

Partial company names across order datasets created reporting inconsistencies for a packaging client.

Built

NLP-driven fuzzy matching system to reconcile partial entities (think 'IBM' vs 'I.B.M. Corp').

Approach

Named Entity Recognition + cosine similarity, integrated with Power BI dashboards.

NERCosine SimilarityPower BINLP

→ Significant reduction in manual validation time

08Onpoint Insights

Sales Forecasting System

Problem

A blinds manufacturer needed reliable monthly forecasts for inventory and production planning.

Built

Time-series forecasting on three years of historical sales data.

Approach

Compared additive and multiplicative methods to capture seasonality before shipping the optimal model.

Time SeriesForecastingSeasonality

→ 12% error rate within acceptable range

09Conagra Brands

Conagra Meat Substitutes Growth Strategy

Problem

Conagra needed region-specific growth strategy for the Meat Substitutes category.

Built

Pricing, promotion, and demand-sensitivity framework with Clout & Vulnerability maps.

Approach

Python + SAS analysis across 100+ attributes and 4 years of regional sales data; controlled for statistical pitfalls (heteroscedasticity, endogeneity) in scanner data.

PythonSASPricingScanner DataCausal

→ Projected 7% sales uplift · $80K savings reallocation

10Creative Galileo

Predicting Drop-Off and Fixing Slow Spots in a 10M-Download App

Problem

An early-stage EdTech (Series A) with 10M+ downloads was losing users at the payment page.

Built

Churn prediction models + user-behavior diagnostics surfacing slow spots in the app.

Approach

scikit-learn + AWS SageMaker for risk scoring; S3 user-behavior data analysis for funnel drop-off ranking.

scikit-learnSageMakerAWS S3Telemetry

→ 10% faster load · 12% less lost-payment rate

05 / CAPABILITIESTHE STACK

A stack built to ship, not to pitch.

01Languages & Libraries

PythonPyTorchTensorFlowScikit-learnPandasNumPyPySparkKerasMatplotlibPlotlyRSAS

02Tools

TableauPowerBIAlteryxDatabricksMixpanelExcelPower AutomateGitGitHub ActionsCursorClaude Code

03Database Technologies

MySQLPostgreSQLMS SQL ServerMongoDBOracleApache HadoopSnowflake

04Cloud Platforms

AWSS3RedshiftQuickSightAthenaGlueAzureAzure KubernetesAzure Blob Storage

06 / ABOUTTHE THREAD

From numbers in Pune to AI in San Francisco.

It started with numbers. I was the kid who chased harder problems for fun and never quite stopped.

That pull took me through Pune University with honors in AI and machine learning, then to Creative Galileo for my first production machine-learning work, at an EdTech platform with 10M+ downloads.

From there I crossed the Pacific to UT Dallas for a master's in Business Analytics and AI. I joined Onpoint Insights as an intern and never left, working with clients across retail, pharma, and healthcare, and now building a government-compliant medical coding pipeline as their sole AI Engineer.

Every client taught me what AI looks like when it actually has to work.

Now I'm in San Francisco, doing what every step prepared me for: finding where companies are leaving growth on the table, and rebuilding their systems with AI that multiplies it.

07 / EDUCATIONPUNE TO TEXAS

The University of Texas at Dallas

M.S., Business Analytics & Artificial Intelligence

RICHARDSON, TXMAY 2025

Pune University

B.E. (Bachelor of Engineering), Electronics & Telecommunication · Honors in AI / ML

PUNE, INMAY 2023

08 / CONTACTREACH OUT

Let's build a system that finds the bottleneck and rebuilds it.

I'm open to AI engineering roles, advisory and consulting work, and serious technical conversations. Currently in San Francisco.

Email Ritin LinkedIn

EMAILritinwadekar6@gmail.com PHONE(972) 822-5748 LINKEDINwww.linkedin.com/in/ritin-wadekar/LOCATIONSan Francisco, CASTATUS● AVAILABLE FOR WORK

Three companies. Five roles.One thread: I always left the system smarter than I found it.

AI Engineer

A government compliant medical coding pipeline, shipped in 2 months.

Numbers from systems that actually shipped.

Ten systems. One pattern: find the bottleneck, rebuild it.

A stack built to ship, not to pitch.

01Languages & Libraries

02Tools

03Database Technologies

04Cloud Platforms

From numbers in Pune to AI in San Francisco.

Let's build a system that finds the bottleneck and rebuilds it.

Three companies. Five roles.
One thread: I always left the system smarter than I found it.