Skip to main content

Beyond parsing X12: Closing the gap for revenue cycle workflows in healthcare

by Aaron Zavora and Neel Shapur

  • The data problem is solved; the workflow problem isn't. Healthcare billers have fully parsed 835/834/837 EDI data in their lake, but still spend half their day in spreadsheets and SQL queries instead of actually working denials and appeals.
  • Genpact and Databricks built an operational workbench that sits directly on Unity Catalog gold views (no ETL shadow copies, PHI never leaves the secure perimeter), giving billers a purpose-built UI with a denials queue, remittance drawer, timely-filing age alerts, and one-click appeal drafting.
  • GenAI is the next layer, integrating Claude via Databricks Foundation Model APIs to read the CARC code, pull the original 837, review clinical documentation, and auto-draft the appeal letter so billers review and approve rather than write from scratch.

It’s Monday at 8 AM. A medical biller opens her queue.

Over the weekend, Friday’s 835 remittance files landed perfectly in your data lake. Every Claim Adjustment Reason Code (CARC) and Provider Level Adjustment (PLB) code was parsed, decoded, and normalized. Out of the 412 claims in the file, 38 are short-paid. The timely-filing window on the oldest denial is 27 days out.

She has all the data she needs. What she doesn’t have is a place to act on it.

Instead, she spends her morning manually walking the 2100 and 2110 loops, the nested claim and service-line details buried inside every EDI file, in a SQL query, pasting short pays into a spreadsheet, and reconciling them against a payment register. By the time she actually picks up the phone to fight a denial, half her day is gone.

According to KFF, insurers denied 1 in 5 in-network claims on HealthCare.gov in 2023, and fewer than 1% of those denials were ever appealed, meaning most short-payments simply stay short.

The reality of modern healthcare IT is this: The data problem is largely solved. The workflow problem is not.

This is the operational gap in healthcare X12-the missing layer right above the parsing engine. To fix it, Genpact and Databricks built a unified operational workbench that lives entirely inside your existing Databricks environment. PHI never leaves your secure perimeter, the UI queries the data in place, and row-level security is enforced automatically.

Here is how we get your billers out of spreadsheets and back to working claims.

Where the Pipeline Usually Stops

X12 remains the backbone of US healthcare payment (835, 834, 837). The open-source x12-edi-parser authored by the Databricks team is the perfect starting point. It takes raw files, understands the loops, and writes normalized records to Delta Lake.

But while that gets a data analyst to a SQL query, it doesn't get a biller to an appeal.

The medallion pipeline from raw X12 files down to the React UI, with the Unity Catalog PHI boundary clearly dashed around the Bronze/Silver/Gold data layers to show compliance.

What We Built: The Operational Workbench

To bridge this gap, we built a solution in two layers: extending the underlying parser for production-grade reality, and building a secure, intuitive operational surface for your team.

Layer 1: Extending the Engine

Real-world RCM needs lookups that standard open-source parsers don't always catch. We extended the engine to include:

  • Contextual Enrollment (834): End-to-end sponsor and dependency tracking.
  • Decoded Adjustments (835): Code-table-decoded CAS group/reason fields. This is the difference between a biller staring at "CO-45" versus reading "CO-45: charge exceeds fee schedule" on their screen.
  • Upstream Contributions: We didn't fork the code; these schema and test extensions ship right back to the open-source community.

Layer 2: The Biller’s Desktop

This is the operational surface—a secure web application sitting directly on top of your Databricks SQL connector. Every number on every screen is a live query against a Unity Catalog gold view. There is no ETL shadow copy and no synchronized cache.

The workbench features six core views designed for the way RCM teams actually work:

Working the Claims:

  • The Remittance Drawer (835): A biller sees exactly which CARC codes are driving the billed-versus-paid gap. They can open any claim down to the service-line detail and draft an appeal or correction without leaving the screen.
  • Denials Workbench: Timely-filing windows make age the most critical metric on the floor. Rows bleed red when they pass 30 days. You can filter by payer, CARC, or queue. The filter state encodes in the URL, meaning a manager can paste a prioritized worklist cleanly into Slack for their team.
  • Enrollment (834): Benefits coordinators can see new hires, plan changes, and terminations in one view, with one-click drill-ins to the exact offending segment when a record fails.

The Denials Workbench. Notice the two rows past 30 days bleeding red, age badges calling out criticality, and the at-risk dollar total sitting in the stat bar for immediate visibility.

Claim CLM-4209 is selected. The drawer displays three service lines, decodes the CO-45 adjustment into plain English, and exposes direct Draft Appeal / Correction action buttons.

Managing the Floor:

  • Leadership Dashboard: Claims MTD, payment match rates, active members, and open denials in a single view. If a payer feed flatlines, you see it the same day.
  • Quality Gates: Mismatched segment counts mean a payer will reject an entire file. Our quality layer catches those mismatches at ingest, before the file ships.
  • Audit & Security: Every PHI reveal and workflow state change is logged. If a user needs break-glass PHI access, the system requires a written reason, which is automatically audit-logged.

The break-glass modal showing the security lock, the pre-populated reason field, the HIPAA compliance warning, and the log-entry preview before the biller confirms access.

Looking Ahead: GenAI Appeal Drafting

Getting the data in front of the biller is step one. Step two is accelerating the work.

The next frontier for this workbench is integrating a Claude model via Databricks Foundation Model APIs. Soon, the system will read the CARC code, pull the original 837, review the clinical documentation, and dynamically draft the appeal letter. Instead of writing from a blank page, your biller simply acts as the reviewer and approver.

Try It On Your Own Data

Deploying this in your environment is a matter of configuration, not code. It pairs seamlessly with the Databricks X12 EDI accelerator.

Give us two weeks, your Unity Catalog schema, and twenty denials already sitting in your queue. Time your biller working those denials today—then time them working a matched cohort in the workbench.

You own the data, you own the stopwatch, and you own the conclusion.

To scope this for your organization, reach out to Neel Shapur ([email protected]) or Aaron Zavora at Databricks.

Get the latest posts in your inbox

Subscribe to our blog and get the latest posts delivered to your inbox.