AVAILABLE FOR WORKSAN FRANCISCO, CAEST. 2026

I find where your systems are not optimizing and rebuild them with AI that does.

I'm Ritin Wadekar, an AI Engineer at Onpoint Insights. I shipped an automated medical coding pipeline for a healthcare client. It's a system that turns clinical notes into billing codes approved by the government, built on official CMS data so every prediction is defensible. Before that, I built multi agent LLM pipelines, RAG systems, NLP, predictive and forecasting models, and recommendation engines across healthcare, pharma, and retail.

01 / WORK2023 TO 2026

Three companies. Five roles.
One thread: I always left the system smarter than I found it.

Onpoint Insights· FEATURED

AI Engineer

San Francisco, CA
Jan 2026 to Present

Researched, architected, and shipped a government compliant automated medical coding pipeline as an AI Engineer for a healthcare client.

7.66%
Coding error rate, well below the 10–20% industry standard
More accurate than a standard baseline
$8.4B
Market we are targeting in medical coding
01
Delivered an automated medical coding pipeline in 2 months, owning the project end to end as an AI Engineer.
Led end to end R&D as an AI Engineer for a medical coding pipeline, delivering a production ready, government compliant product in two months. Built the solution for a market projected to reach $8.4B by 2033, helping position the client at the forefront of medical coding automation.
02
Identified a flawed LLM baseline sitting below 70% accuracy before it became the foundation of the entire system.
Researched automated ICD 10 prediction approaches, identifying that a vanilla LLM baseline failed to exceed 70% coding accuracy and was not viable for clinical use. Established a retrieval based architecture as the team's technical foundation, preventing the team from building on a flawed baseline.
03
Built a retrieval system grounded in government sources, achieving 7.66% error rate vs a 10–20% industry baseline.
Integrated three official government CMS coding sources into the pipeline's retrieval layer, grounding every prediction in clinically and legally accepted standards. Achieved an error rate under 7.66%, significantly outperforming the 10–20% error rate typical of manual medical coding.
04
Designed a two stage AI pipeline that tripled prediction accuracy over a standard LLM approach.
Designed a two stage AI pipeline that progressively narrows ICD 10 candidates before classification, outperforming standard LLM baselines by over 3× in prediction accuracy. The staged architecture mirrors expert coding workflows, making the system both accurate and interpretable.
05
Embedded Medicaid and Medicare compliance rules directly into the pipeline to handle government payer requirements.
Incorporated Medicaid and Medicare compliance rules directly into the automated coding pipeline, ensuring outputs meet government payer requirements at the architecture level. This targets the $1B+ annual loss the healthcare industry incurs from coding errors and non compliant claims.
PythonRAGVector DatabasesLLMsICD 10CPTRetrieval ArchitectureMedical CodingHealthcare AIGovernment Compliance
02 / FEATUREDAUTOMATED MEDICAL CODING PIPELINE
● Live · In Production

A government compliant medical coding pipeline, shipped in 2 months.

As the sole AI Engineer on a healthcare client engagement, I researched, architected, and shipped a government compliant medical coding pipeline. The system turns clinical notes into defensible billing codes, grounded in three official CMS sources (Centers for Medicare & Medicaid Services).

ACCURACY GAIN · VS AI BASELINE
more accurate than a single pass LLM · proves the retrieve then rank architecture works for medical codes
ERROR RATE · VS INDUSTRY STANDARD
7.66%
well below the 10–20% manual coding industry standard · every prediction traceable to official CMS sources
Diagram of the four step medical coding system. Step 1: receive the clinical note (a patient's medical record). Step 2: look up matching billing codes in three official government sources from CMS, the Centers for Medicare and Medicaid Services. Step 3: a two stage AI pipeline narrows candidates then picks the right code. Step 4: check Medicaid and Medicare compliance rules and output the final code, with an error rate under 7.66%.CLINICAL NOTEICD 10 / CPTCMS · ICD 10 SOURCEauthoritativeCMS · CPT SOURCEauthoritativeCMS · GUIDELINESauthoritativeSTAGE 1candidate narrowSTAGE 2constrained classifyMEDICAID RULESpayer complianceMEDICARE RULESpayer complianceGROUNDED CODE< 7.66% err
01
End to end ownership
Researched, architected, and shipped as the sole AI Engineer on the engagement.
02
Caught the wrong approach early
Proved that an off the shelf large language model couldn't pass 70% accuracy on medical codes, before the team built the rest of the system on top of it.
03
Every answer traces back to official government sources
Built on top of three authoritative CMS coding manuals (Centers for Medicare & Medicaid Services), so every prediction has a paper trail back to a legally accepted source.
04
Two stage process that mirrors how expert coders work
Stage 1 narrows down the list of possible billing codes. Stage 2 picks the right one under tight rules. That's the same mental shortcut human medical coders use, and it tripled accuracy over a one step AI approach.
05
Compliance baked in at the right layers
Official CMS coding guidelines anchor every prediction from the start. Medicaid and Medicare payer rules then verify the code holds up under the patient's specific government insurance coverage. Both layers make the predictions defensible to regulators.
06
Built for a market projected to reach $8.4B by 2033
Positions the client at the frontier of medical coding automation as the total market grows, the kind of bet that compounds.
03 / IMPACTMEASURED · CLIENT IMPACT

Numbers from systems that actually shipped.

Click any number to jump to the project it came from.

04 / PROJECTSCASE STUDIES

Ten systems. One pattern: find the bottleneck, rebuild it.

01Onpoint Insights● FEATURED
Automated Medical Coding Pipeline
Problem
Manual medical coding runs a 10–20% error rate and bleeds $1B+ annually from non-compliant claims, in a market projected to reach $8.4B by 2033.
Built
Two-stage retrieval pipeline grounded in three official CMS sources, with Medicaid and Medicare compliance rules built into the foundation of the system.
Approach
Step 1: find candidate codes from authoritative sources. Step 2: pick the right code under strict rules. Step 3: check compliance. Replaced a default large-language-model approach that capped below 70% accuracy.
PythonRAGVector DBLLMsICD-10CPTCMS Sources
Under 7.66% error rate · 3× off the shelf AI baseline · shipped in 2 months
02Onpoint Insights
AI Agents That Answer Analyst Questions on 20K+ Records
Problem
Analysts at a PPE (personal protective equipment) manufacturer spent a lot of time running custom database queries against 20K+ product records.
Built
Sequential team of AI agents: query decomposition → SQL construction → validation → response synthesis.
Approach
Azure-deployed agents using GPT-4o and o3-mini, with validation gates between each stage.
AzureGPT-4oo3-miniMulti-AgentSQL
40% reduction in analyst effort
03Onpoint Insights
AI Document Assistant: Search Across 20+ Sources
Problem
Client teams couldn't answer questions across 20+ scattered documents with standard keyword search.
Built
AI assistant that retrieves answers across documents, with intent routing to the right source.
Approach
Transformer embeddings + FAISS vector index + LLM-driven intent routing.
FAISSTransformersRAGIntent Routing
17% more accurate than standard keyword search
04Onpoint Insights
Marketing Attribution for a Top-3 Pharma Client
Problem
A top-3 pharma client needed to isolate the true ROI of individual marketing campaigns.
Built
Causal regression with counterfactual reasoning over clicks, impressions, conversions, cost-per-click, and spend.
Approach
Per-campaign impact isolation feeding data-driven reallocation recommendations.
Causal InferenceCounterfactualsStatistical Analysis
4.2% ROI lift on campaign spend
05Onpoint Insights
Automated PDF Invoice Reader (OCR)
Problem
Manual invoice data entry was a bottleneck in client operations.
Built
End-to-end automation extracting structured data from PDF invoices into Excel.
Approach
Email-triggered workflows + data extraction models on Microsoft Power Automate.
Power AutomateOCRPDF Parsing
92% extraction accuracy
06Onpoint Insights
Product Recommendations Across 10M+ Transactions
Problem
A B2B warehouse distributor needed contextual cross-sell across millions of historical transactions.
Built
Hybrid recommender combining Market Basket Analysis, Word2Vec, and learned hybrid models.
Approach
SQL-extracted 10M+ transactions → association rules + contextual recs deployed in production.
Word2VecMarket BasketSQLRecommendation
6.7% lift in Average Order Size
07Onpoint Insights
Matching Messy Company Names Across Datasets
Problem
Partial company names across order datasets created reporting inconsistencies for a packaging client.
Built
NLP-driven fuzzy matching system to reconcile partial entities (think 'IBM' vs 'I.B.M. Corp').
Approach
Named Entity Recognition + cosine similarity, integrated with Power BI dashboards.
NERCosine SimilarityPower BINLP
Significant reduction in manual validation time
08Onpoint Insights
Sales Forecasting System
Problem
A blinds manufacturer needed reliable monthly forecasts for inventory and production planning.
Built
Time-series forecasting on three years of historical sales data.
Approach
Compared additive and multiplicative methods to capture seasonality before shipping the optimal model.
Time SeriesForecastingSeasonality
12% error rate within acceptable range
09Conagra Brands
Conagra Meat Substitutes Growth Strategy
Problem
Conagra needed region-specific growth strategy for the Meat Substitutes category.
Built
Pricing, promotion, and demand-sensitivity framework with Clout & Vulnerability maps.
Approach
Python + SAS analysis across 100+ attributes and 4 years of regional sales data; controlled for statistical pitfalls (heteroscedasticity, endogeneity) in scanner data.
PythonSASPricingScanner DataCausal
Projected 7% sales uplift · $80K savings reallocation
10Creative Galileo
Predicting Drop-Off and Fixing Slow Spots in a 10M-Download App
Problem
An early-stage EdTech (Series A) with 10M+ downloads was losing users at the payment page.
Built
Churn prediction models + user-behavior diagnostics surfacing slow spots in the app.
Approach
scikit-learn + AWS SageMaker for risk scoring; S3 user-behavior data analysis for funnel drop-off ranking.
scikit-learnSageMakerAWS S3Telemetry
10% faster load · 12% less lost-payment rate
05 / CAPABILITIESTHE STACK

A stack built to ship, not to pitch.

01Languages & Libraries

PythonPyTorchTensorFlowScikit-learnPandasNumPyPySparkKerasMatplotlibPlotlyRSAS

02Tools

TableauPowerBIAlteryxDatabricksMixpanelExcelPower AutomateGitGitHub ActionsCursorClaude Code

03Database Technologies

MySQLPostgreSQLMS SQL ServerMongoDBOracleApache HadoopSnowflake

04Cloud Platforms

AWSS3RedshiftQuickSightAthenaGlueAzureAzure KubernetesAzure Blob Storage
06 / ABOUTTHE THREAD

From numbers in Pune to AI in San Francisco.

It started with numbers. I was the kid who chased harder problems for fun and never quite stopped.

That pull took me through Pune University with honors in AI and machine learning, then to Creative Galileo for my first production machine-learning work, at an EdTech platform with 10M+ downloads.

From there I crossed the Pacific to UT Dallas for a master's in Business Analytics and AI. I joined Onpoint Insights as an intern and never left, working with clients across retail, pharma, and healthcare, and now building a government-compliant medical coding pipeline as their sole AI Engineer.

Every client taught me what AI looks like when it actually has to work.

Now I'm in San Francisco, doing what every step prepared me for: finding where companies are leaving growth on the table, and rebuilding their systems with AI that multiplies it.

07 / EDUCATIONPUNE TO TEXAS
The University of Texas at Dallas
M.S., Business Analytics & Artificial Intelligence
RICHARDSON, TXMAY 2025
Pune University
B.E. (Bachelor of Engineering), Electronics & Telecommunication · Honors in AI / ML
PUNE, INMAY 2023
08 / CONTACTREACH OUT

Let's build a system that finds the bottleneck and rebuilds it.

I'm open to AI engineering roles, advisory and consulting work, and serious technical conversations. Currently in San Francisco.