Kumo Co-Founder Hema Raghavan Named to Inc.’s 2026 Female Founders 500

Learn more
7Classification · Provider Fraud

Provider Fraud Detection

Which healthcare providers are submitting suspicious claims?

Book a demo and get a free trial of the full platform: data science agent, fine-tune capabilities, and forward-deployed engineer support.

By submitting, you accept the Terms and Privacy Policy.

Loved by data scientists, ML engineers & CXOs at

Catalina Logo

A real-world example

Which healthcare providers are submitting suspicious claims?

Healthcare fraud accounts for 3-10% of total health spending, costing $68-230B annually in the US (National Health Care Anti-Fraud Association). Provider-driven fraud (upcoding, unbundling, phantom billing, unnecessary procedures) is the largest category, yet most schemes are detected only through retrospective audits 12-24 months after the billing occurs. By then, the insurer has already paid out millions. A single fraudulent medical practice can bill $5-15M before detection. SIU teams can only audit 1-2% of providers annually, so targeting accuracy is critical.

How KumoRFM solves this

Relational intelligence built for insurance data

Kumo connects providers, claims, patients, referral networks, procedure codes, and billing patterns into a relational graph. The model detects that Provider PRV-501 bills 3x the average number of high-complexity procedures, shares patients with a referring provider at an unusually high rate (92% of referrals come from one source), and has a billing-code distribution that deviates significantly from peer providers in the same specialty and region. These graph-based signals surface suspicious providers 6-12 months earlier than traditional audit triggers.

From data to predictions

See the full pipeline in action

Connect your tables, write a PQL query, and get predictions with built-in explainability — all in minutes, not months.

1

Your data

The relational tables Kumo learns from

PROVIDERS

provider_idnamespecialtyregionyears_in_network
PRV-501MedPro Spine ClinicOrthopedicsSoutheast4.2
PRV-502City General RadiologyRadiologyNortheast12.8
PRV-503Sunrise Physical TherapyPT/RehabWest7.5

BILLING_PATTERNS

provider_idavg_claim_amounthigh_complexity_rateclaims_per_patientvs_peer_avg
PRV-501$4,80078%8.43.2x peer avg
PRV-502$1,20032%3.11.1x peer avg
PRV-503$85015%12.21.8x peer avg

REFERRAL_NETWORK

provider_idtop_referrerreferral_concentrationpatient_overlap_pct
PRV-501Dr. R. Martinez92%88%
PRV-502Multiple (15+)12%8%
PRV-503Dr. K. Patel65%52%

PROCEDURE_ANALYSIS

provider_idtop_codefrequencypeer_frequencyupcoding_signal
PRV-50199214 (Moderate)12%45%Low usage (possible upcoding to 99215)
PRV-50199215 (High)68%22%3.1x above peer norm
PRV-50397110 (Therapeutic)45%38%1.2x above peer norm
2

Write your PQL query

Describe what to predict in 2–3 lines — Kumo handles the rest

PQL
PREDICT BOOL(PROVIDERS.FRAUD_CONFIRMED = 'True', 0, 12, months)
FOR EACH PROVIDERS.PROVIDER_ID
WHERE BILLING_PATTERNS.VS_PEER_AVG > 1.5
3

Prediction output

Every entity gets a score, updated continuously

PROVIDER_IDSPECIALTYFRAUD_SCOREEST_OVERPAYMENTSIU_PRIORITY
PRV-501Orthopedics0.89$2.4M/yrCritical
PRV-503PT/Rehab0.52$420K/yrHigh
PRV-502Radiology0.08$0Low
4

Understand why

Every prediction includes feature attributions — no black boxes

Provider PRV-501 (MedPro Spine Clinic)

Predicted: 89% fraud probability, est. $2.4M/yr overpayment

Top contributing features

High-complexity code rate (99215)

68% vs 22% peer

28% attribution

Referral concentration from single source

92%

25% attribution

Claims per patient far above peer

8.4 vs 2.6

21% attribution

Patient overlap with referring provider

88%

15% attribution

Average claim amount anomaly

$4,800 vs $1,500

11% attribution

Feature attributions are computed automatically for every prediction. No separate tooling required. Learn more about Kumo explainability

Bottom line: Detect fraudulent providers 6-12 months earlier and recover $100-300M in annual overpayments for a top-10 health insurer while focusing SIU resources on the highest-impact investigations.

Topics covered

provider fraud detection AIhealthcare fraud analyticsmedical billing fraud predictioninsurance provider audit AIgraph neural network provider fraudKumoRFMrelational deep learning insuranceupcoding detectionphantom billing detectionhealthcare claims analytics

One Platform. One Model. Predict Instantly.

KumoRFM

Relational Foundation Model

Turn structured relational data into predictions in seconds. KumoRFM delivers zero-shot predictions that rival months of traditional data science. No training, feature engineering, or infrastructure required. Just connect your data and start predicting.

For critical use cases, fine-tune KumoRFM on your data using the Kumo platform and Data Science Agent for 30%+ higher accuracy than traditional models.

Book a demo and get a free trial of the full platform: data science agent, fine-tune capabilities, and forward-deployed engineer support.