CDI — Pre-Bill Reconciliation
Documentation QualityAsync jobs requiredPre-bill reconciliation of intended codes against chart evidence — integrity risks, capture, and queries.
About
A pre-bill reconciliation engine: the provider supplies the ICD-10 codes they intend to bill, and a map-reduce LLM pipeline compares each against the chart's documented evidence — surfacing over-coding (integrity risk), under-coding (capture opportunity), and specificity gaps.
Each finding carries its track, finding type, the supportable code, evidence quotes, an impact category (HCC/RAF, CC/MCC, DRG), and — where a query is warranted — a non-leading physician query that must clear a non-leading + evidence-supported audit.
How it works
- 1Document upload → OCR (or paste pre-OCR'd text)
- 2Normalize the provider's intended ICD-10 set
- 3Map-reduce reconciliation: confirm / capture / specificity-upgrade / integrity-risk per condition
- 4Audit pass validates each drafted query is non-leading and evidence-supported
Intended use
- •Pre-bill coder / CDI review before claim submission
- •Integrity programs flagging unsupported intended codes
- •Specificity and capture worklists with HCC/RAF and CC/MCC impact
Key outputs
- ▸findings[] — finding_type, intended_code, supportable_code, recommended_code, evidence, impact
- ▸suggested_query (+ audit: non_leading, evidence_supported) when a query is warranted
- ▸summary — counts by finding type, codeable_now, requires_query, estimated_impact
Model comparison
F1 on Gwen's healthcare benchmark for this task — the Gwen pipeline vs the prompt-optimized model alone, with the uplift the pipeline adds, per model.
| # | Model | Gwen pipeline | Model only | Uplift |
|---|---|---|---|---|
| 1 | GPT-5.5Best | 0.955 | 0.868 | +0.087 |
| 2 | Claude Opus 4.8 | 0.955 | 0.885 | +0.070 |
| 3 | Gemini 3.5 Flash | 0.953 | 0.923 | +0.029 |
Endpoints
Try each endpoint with your signed-in session — usage counts toward your monthly budget.
Use synthetic data only. Do not submit real patient records or PHI when testing endpoints.
Limitations & caveats
- –Integrity-risk (over-coding) detection requires the intended_codes input
- –Drafts queries — human CDI review before sending remains the expected workflow
- –Multi-pass LLM (1–3 minutes); the async /jobs flow is mandatory for production uploads