Kumo Co-Founder Hema Raghavan Named to Inc.’s 2026 Female Founders 500

Learn more

Entity Resolution & Identity

Entity Resolution, Identity Matching & Data Quality

Your data is fragmented across systems. Kumo uses graph-based resolution to match identities, deduplicate records, and link entities — going far beyond fuzzy string matching to understand true relational similarity.

Identity MatchingDuplicate DetectionRecord LinkingHousehold MappingAccount DedupCross-Device Matching

Book a demo and get a free trial of the full platform: data science agent, fine-tune capabilities, and forward-deployed engineer support.

By submitting, you accept the Terms and Privacy Policy.

6

Prediction types

Identity to cross-device

0

Rules to maintain

Learned automatically

<1 hr

To production

Per use case

0

Third-party cookies

Behavioral matching

Loved by data scientists, ML engineers & CXOs at

Catalina Logo

Traditional entity resolution vs. Kumo

See how graph-learning AI compares to rules-based matching and manual ETL.

Matching approach

Traditional

Fuzzy string matching + rules

With Kumo

Graph-based relational similarity

Cross-system linking

Traditional

Manual ETL mapping

With Kumo

Learned from relational patterns

Household mapping

Traditional

Address-based heuristics

With Kumo

Behavioral + transactional graph

Cross-device

Traditional

Third-party cookies

With Kumo

Behavioral relational patterns

Maintenance

Traditional

Rules need constant updating

With Kumo

Model learns automatically

Setup time

Traditional

Months of rules engineering

With Kumo

Under 1 hour

How It Works

Simply connect your data, start asking predictions, and get results.Want more control? Fine-tune the model for your specific use case.

Connect your data
STEP 1

Connect your data

Integrates directly with your warehouse, no additional pipeline setup.

Ask a predictive question
STEP 2

Ask a predictive question

Ask questions in plain English and let Kumo do the modeling for you.

Act on predictions
STEP 3

Act on predictions

Get clear predictions and push them instantly into your workflows.

churn_prediction.pql
PREDICT COUNT(transactions.*, 0, 90, days) = 0
FOR EACH customers.customer_id
WHERE COUNT(transactions.*, -60, 0, days) > 0

3 lines. No feature engineering. No pipeline code.

For developers

Predict in a few lines of SQL

Kumo's Predictive Query Language (PQL) replaces months of feature engineering, model training, and pipeline work with a few lines of SQL-like syntax. Describe what you want to predict — Kumo handles the rest.

Why Kumo

01Zero-Shot Foundation Models

Get accurate predictions on relational data instantly—no training or ML setup required.

Read the KumoRFM announcement
Snail
02Real-Time Predictions
03Native Data Warehouse Integration
04Fine-Tuning at Scale
05Enterprise-Grade Security
06Transparent Explainability

Built by pioneers in AI

Vanja Josifovski

Vanja Josifovski

CEO and Co-Founder

Former CTO at Airbnb and Pinterest

Jure Leskovec

Jure Leskovec

Co-Founder & Chief Scientist

Stanford Professor · Co-creator of RDL and GNN

Hema Raghavan

Hema Raghavan

Co-Founder & Head of Engineering

Former Sr. Director of Engineering at LinkedIn

Loved by data scientists, ML engineers & CXOs at

Catalina Logo

Peer-reviewed

Built on world-class research

Kumo is built on 40+ peer-reviewed papers at NeurIPS, ICML, and KDD. The methodology is public and reproducible.

RFMZero-shotFine-tunedTransfer
ICML 2024

KumoRFM: A Relational Foundation Model for Predictive Analytics

K. Huang, M. Fey, J. Leskovec et al.

A foundation model for relational data — pre-trained across schemas, it delivers accurate predictions out of the box and improves with fine-tuning.

Read paper
ABC
NeurIPS 2024

Relational Deep Learning: Graph Representation Learning on Relational Databases

M. Fey, W. Hu, K. Huang, J. Leskovec et al.

Learning predictive models directly on relational databases, eliminating the feature engineering pipeline.

Read paper
T1T2T3T4T5+20+20+23+22+35BaselineKumo30 tasks
NeurIPS 2024 · Datasets Track

RelBench: A Benchmark for Deep Learning on Relational Databases

J. Robinson, R. Miao, K. Huang et al.

An open benchmark for evaluating relational prediction methods across 11 databases and 30 tasks.

Read paper

Your fragmented data already contains the identity signal.

See what Kumo can resolve from your existing relational database.