Frequently Asked Questions
Healthcare price transparency data is publicly available under federal law — but it takes serious engineering to make it usable. Here's how DeductibleData turns terabytes of raw insurer data into the specific datasets you need.
Last updated March 31, 2026
About Healthcare Price Transparency
Understanding the data, the regulation, and why it matters.
What is healthcare price transparency data?
What is the Transparency-in-Coverage Rule?
What are Machine Readable Files (MRFs)?
Machine Readable Files are the standardized data files that health insurers publish to comply with the TiC Rule. They contain negotiated rates in JSON format. Individual files can exceed 100GB, with a single payer's complete dataset reaching terabytes. The nested JSON structure, thousands of files per payer, and sheer scale make them impractical to process without specialized cloud infrastructure and engineering. DeductibleData handles this complexity and delivers only the data you need.
What is Schema 2.0 and how does it affect the data?
Schema 2.0 is the updated data format for Machine Readable Files, enforced since February 2, 2026. Key changes include requiring a business_name alongside every EIN, consolidating billing code types, adding APR-DRG severity levels, and moving provider groups to inline-only format. These changes improve data quality and consistency, meaning DeductibleData's extractions from Schema 2.0 files contain richer, more reliable information than earlier formats.
Is healthcare price transparency data HIPAA protected?
No. MRF data contains zero patient information. The TiC Rule specifically requires disclosure of negotiated rates, which are contractual prices between insurers and providers. The data includes provider business identifiers (NPI, TIN) that are already public information. There is no Protected Health Information (PHI) in any of our deliverables.
What is the difference between in-network and out-of-network rates?
In-network rates are the negotiated prices that insurers have agreed to pay contracted providers — typically lower than list prices. Out-of-network allowed amounts reflect historical payments insurers have made for services from non-contracted providers during a rolling 90-day period, often based on Medicare rates, usual and customary charges, or plan-specific methodologies. DeductibleData primarily provides in-network negotiated rate data from MRF files.
How does price transparency data compare to claims data?
Price transparency data shows the negotiated rates between insurers and providers — what an insurer has agreed to pay for a service. Claims data shows what was actually billed and paid for specific patient encounters. TiC data is publicly available and covers every contracted rate, while claims data is proprietary and limited to actual utilization. They complement each other: TiC data is broader (every rate, every provider), while claims data captures real-world utilization patterns and patient volume.
Use Cases and Customers
Who uses this data and what they do with it.
Who uses healthcare price transparency data?
Healthcare price transparency data is used by a range of professionals and organizations:
- Healthcare consultants — benchmark market rates and advise on payer contract negotiations
- Hospital systems — compare negotiated rates against peers and identify reimbursement gaps
- Health tech companies — build cost estimation, plan comparison, and price shopping tools
- Employers and benefits advisors — evaluate health plan options and healthcare spending
- Litigators — use negotiated rate data as expert evidence in billing disputes and underpayment cases
- Researchers — study healthcare economics, pricing variation, and policy impact
What makes DeductibleData different from downloading MRFs myself?
Can DeductibleData serve as the data backend for my product or platform?
Data Coverage and Delivery
What data is available, how it's delivered, and how fresh it is.
What payers do you have data for?
Do you have Medicare or Medicaid rates?
DeductibleData focuses on commercial payer negotiated rates published under the Transparency-in-Coverage Rule. Medicare and Medicaid rates are set by CMS and published through separate channels (Medicare Physician Fee Schedule, OPPS, IPPS). Our data captures what commercial insurers like UnitedHealthcare, BCBS, and Aetna pay providers — which is often significantly different from government rates and is the data most consultants and hospital systems need for contract benchmarking.
Do you have historical pricing data?
How current is the data?
Insurers are required to publish MRF data monthly under federal regulation. We continuously monitor and index new publications, typically processing them within days of release. When you request a data pull, we extract from the most recent available files. The data reflects currently negotiated rates, though actual prices paid may vary based on specific plan terms.
What format is the data delivered in?
Data is delivered in one of two formats depending on size:
- CSV — for most requests. Compatible with Excel, Google Sheets, Python, R, SQL databases, and virtually any analysis tool. Each file includes standardized columns for provider information, billing codes, negotiated rates, and payer details.
- Parquet — for large-scale extractions that exceed Excel's ~1 million row limit. Delivered via cloud storage, optimized for Python, R, and cloud data warehouses.
We provide documentation explaining every field in your deliverable.
How large will my data deliverable be?
How long does delivery take?
Delivery takes hours to 7 business days depending on data volume, with targeted requests completing fastest. Small, targeted requests (under 10GB) typically complete within hours. Medium requests take 1-3 business days. Very large extractions covering all providers and all codes for a major payer can take 3-7 business days due to the scale of processing. You'll receive email notifications with progress updates throughout.
Can I filter by provider type or geographic area?
Do you offer claims volume data alongside negotiated rates?
What happens if my data processing job fails?
We monitor every job and are notified immediately if an issue occurs. In most cases, we can resume processing from where it left off. You won't be charged for incomplete deliveries, and we'll keep you informed via email until your data is ready for download.
Pricing and Billing
How pricing works, what's included, and subscription options.
How is DeductibleData pricing calculated?
What billing codes can I filter by?
Our data includes 16 billing code types. You can filter by any of the following:
- CPT (Current Procedural Terminology) — medical procedures
- HCPCS (Healthcare Common Procedure Coding System) — services, equipment, and drugs
- MS-DRG (Diagnosis Related Group) — Medicare inpatient hospital services
- NDC (National Drug Code) — prescription drugs and pharmacy-dispensed products
- Revenue codes — facility and department charges
- CDT — dental procedures
- APC (Ambulatory Payment Classification) — outpatient hospital services
- AP-DRG / APR-DRG — alternative DRG classification systems
- ICD — diagnosis codes
- And additional types including EAPG, HIPPS, and others
Search for codes on our custom data pull page.
What is an NPI number and how do I use it to filter data?
Can I get a subscription for recurring data updates?
Can DeductibleData host my data for team access?
How We Compare
Pricing, access, and how DeductibleData stacks up against other TiC data vendors.
How does DeductibleData pricing compare to other TiC data vendors?
| Vendor | Starting Price | Self-Serve Checkout | Contract Required |
|---|---|---|---|
| DeductibleData | $164/mo | Yes, credit card | No |
| Serif Health* | $1,000/mo ($12K/yr) | No, demo required | Yes |
| Turquoise Health** | $20K-$50K+/yr (est.) | No, demo required | Yes |
| Most others | Not disclosed | No | Yes |
*Serif Health pricing published on their blog. **Turquoise Health estimate based on enterprise sales model and $55M in venture funding (public record).
See your exact price on our real-time pricing calculator.
Why doesn't DeductibleData require a demo or sales call?
Our product is self-explanatory and our pricing is transparent. You configure exactly what you need — payers, providers, billing codes — on our custom data pull page, see the price in real time, and check out with a credit card.
No scoping call, no contract negotiation, no waiting for a quote. Most vendors in this space require demos because their pricing is opaque and varies by customer. We believe if you can't show your price, you're not confident in it.
What is the difference between pay-per-pull and an annual contract?
Pay-per-pull means you pay only when you need data, with no minimum commitment and no lock-in. Annual contracts, which most vendors in this space require, lock you into a fixed payment regardless of how much data you actually use. DeductibleData offers both one-time purchases and optional subscriptions (monthly, quarterly, or bi-annual), but subscriptions are opt-in — you're never required to commit upfront.
How does DeductibleData differ from platforms like Turquoise Health or Serif Health?
Three key differences:
- Price — DeductibleData starts at $164/month. That's roughly 6x cheaper than Serif Health ($12K/year) and an estimated 10-25x cheaper than Turquoise Health.
- Access — We're the only vendor with self-serve credit card checkout. No demo, no sales call, no contract.
- Focus — We deliver clean, filtered negotiated rate data scoped to your exact specifications, rather than bundling it into a broad compliance or analytics platform.
If you need specific MRF rate data without the overhead of an enterprise platform, we're built for that.
What is the difference between MRF pricing data and claims data?
- MRF data — contracted negotiated rates published under the Transparency-in-Coverage Rule. What insurers have agreed to pay providers. This is what DeductibleData provides.
- Claims data (835/837 transactions) — what was actually billed and paid per patient encounter, including procedure volume and utilization patterns. Proprietary, not publicly available.
The two are complementary: MRF data tells you the price, claims data tells you the volume. DeductibleData partners with claims data specialists for customers who need both pricing and volume in a single deliverable. Contact us to scope a blended request.
Is DeductibleData's lower price because the data quality is lower?
No. Every vendor in this space pulls from the same source: the Machine Readable Files that health insurers are federally required to publish under the TiC Rule. The provider IDs, billing codes, negotiated rates, and payer metadata are identical regardless of who processes them. DeductibleData's lower price reflects our business model — automated pipeline, no enterprise sales team, no venture capital overhead — not a difference in data quality.
Explore: AI-Powered Data Research
What the Explore feature is, how it works, and what you can do with it.
What is Explore?
Explore is an AI-powered research interface built into every DeductibleData job. Once your data pull is complete, Explore lets you ask natural language questions directly against your extracted dataset — without writing SQL or downloading the file. It translates your questions into database queries, runs them against your data, and returns plain-English answers with supporting numbers.
What can I query with Explore?
You can query up to 5 billion rows of negotiated rate data using plain English. Common use cases:
- Compare negotiated rates across payers for the same billing code
- Find the highest and lowest paying insurers for a specific procedure
- Benchmark rates at a target hospital against regional or national peers
- Identify billing codes where rates are statistical outliers
- Summarize rate distributions by provider type or geography
- Forensic analysis of rate patterns for litigation support
Who can access Explore?
Explore is available to all users — no additional subscription required. It is accessible directly from any completed job in your account dashboard. There is no paywall or demo requirement to use it.
Is there a limit on how many questions I can ask?
Yes. Each job includes up to 20 Explore queries. This limit is enforced server-side and resets per job — not per account. Most analyses are completable within 20 focused questions. If you need to run an extended research session, you can initiate a new data pull job to reset your query count.
Does Explore require me to know SQL or programming?
No. Explore accepts plain English questions. The underlying system generates and executes the necessary database queries on your behalf. You describe what you want to know; Explore figures out how to retrieve it.
NPI Inclusion and Exclusion Criteria
How DeductibleData builds its provider index and determines which NPIs are included or excluded.
What data source does DeductibleData use to build its NPI index?
DeductibleData builds its provider index from the NPPES (National Plan and Provider Enumeration System) registry, maintained by CMS. NPPES is the authoritative federal database of all healthcare providers assigned National Provider Identifiers. CMS publishes a full monthly replacement file and a separate monthly deactivation file. As of March 2026, CMS has migrated to Version 2 of the downloadable files; we use Version 2 as the current standard.
How does DeductibleData determine whether an NPI is active?
An NPI is treated as active if it exists in the current NPPES registry and does not appear in the CMS monthly deactivation file. The four official deactivation reason codes we screen against are:
- DT — Death
- DB — Disbandment (organization dissolved)
- FR — Fraud
- OT — Other (retirement, no longer practicing, etc.)
NPIs flagged with any of these codes are excluded from our provider index.
Does DeductibleData cross-reference NPIs against federal exclusion lists?
Yes. Our NPI index is cross-referenced against two federal exclusion databases:
- OIG LEIE (List of Excluded Individuals/Entities, HHS) — individuals and entities excluded from participating in Medicare, Medicaid, and federal healthcare programs due to fraud, patient abuse, or program-related crimes. Updated monthly.
- SAM.gov exclusions (GSA) — government-wide debarments including procurement fraud, tax delinquency, and national security violations. Broader than healthcare-specific exclusions. Note: SAM.gov does not use NPI records, so provider matching relies on name and EIN.
Providers appearing on either list are flagged in our dataset. A clean record on one list does not guarantee a clean record on the other — both are checked independently.
What types of providers are included in or excluded from the NPI index?
Our index includes all Type 1 (individual practitioners) and Type 2 (organizational) NPIs that meet all three criteria:
- Active in current NPPES registry (not deactivated)
- Not flagged in CMS monthly deactivation file
- Not listed on OIG LEIE or SAM.gov exclusion databases
We do not restrict by provider taxonomy — the index covers physicians, hospitals, ambulatory surgery centers, group practices, therapy providers, and all other NPI-holding entities.
How often is the NPI index updated?
The NPI index is refreshed monthly, aligned with CMS's publication schedule for NPPES updates and the OIG LEIE. This means provider status — active, deactivated, or excluded — reflects the most current available federal data.
Compliance Checks and Legal Use
What compliance validations we perform on the data and how it can be used in legal and regulatory contexts.
What compliance checks does DeductibleData perform on its data?
DeductibleData cross-references its NPI index against three public federal databases monthly:
- NPPES active NPI file (CMS) — provider deactivation status
- OIG LEIE (HHS) — exclusion from federal healthcare programs
- SAM.gov (GSA) — government-wide federal debarment
We do not independently verify the accuracy of the underlying negotiated rates. Those are published directly by payers under the Transparency-in-Coverage Rule and delivered as reported. Payer-reported rates are contractually negotiated prices and may not reflect actual payment amounts in every case.
Can TiC negotiated rate data be used in litigation?
TiC negotiated rate data has been used as evidence in healthcare billing disputes, underpayment cases, and payer contract litigation. The data is payer-reported under federal regulatory mandate — a primary source for establishing contracted rates.
Important caveats:
- Published negotiated rates are contracted amounts, not guaranteed payment amounts — actual payments may vary based on plan terms and billing circumstances
- Rates in MRF files are payer-reported and have not been independently audited
- The legal weight of TiC data in any specific case depends on jurisdiction, claim type, and how the data is introduced as evidence
Use of this data in litigation or compliance proceedings should involve qualified legal counsel.
Does using TiC rate data for benchmarking raise antitrust concerns?
No — analyzing publicly mandated TiC data for benchmarking and contract negotiation is not an antitrust violation. The TiC Rule specifically requires payers to publish these rates; using them is the intended purpose of the regulation.
Antitrust risk in healthcare pricing data applies to payers coordinating rates using shared algorithms or intermediaries — not to analysts, consultants, or providers consuming publicly available rate data. The DOJ has specifically addressed algorithmic price coordination among insurers (see ongoing MultiPlan litigation), but that concern is about payer behavior, not data consumers.
If you have specific antitrust questions about your use case, consult qualified legal counsel.
Does DeductibleData check providers against state Medicaid exclusion lists?
Not currently. Every U.S. state maintains its own Medicaid exclusion list, separate from both the federal OIG LEIE and SAM.gov. A provider can be excluded from a state Medicaid program without appearing on either federal list. Our current compliance checks cover the two federal exclusion databases (OIG LEIE and SAM.gov) and the NPPES deactivation file. If your use case requires state-level Medicaid exclusion screening, we recommend supplementing our data with a specialized exclusion monitoring service.
Still have questions?
Our team is ready to help you find the healthcare pricing data you need.