Updated: October 10th, 2025
Overview
Merative advances health and social care by providing healthcare data and technology solutions for patients, program managers, radiologists, and researchers. It collaborates with more than 4,500 healthcare providers, top US health plans, government agencies, Fortune 100 employers, and all leading life sciences companies to drive shared health progress 1,2.
Merative’s MarketScan Research Databases provide longitudinal, patient-level data that cover the full continuum of healthcare costs and outcomes, including detailed prescription drug information. These databases, with data from over 273 million unique patients across diverse care points, support numerous research applications such as pharmacoeconomic outcome evaluations, economic burden studies, and therapeutic pathway analyses, documented in more than 2,600 peer-reviewed publications 3.
MarketScan offers several advantages for longitudinal research: comprehensive patient-level detail, coverage of the full continuum of care, and detailed prescription drug information. Its large sample size supports studies of distinct patient populations, and its linked data support research across disease areas while preserving appropriate claims linkages and HIPAA compliance. Equipped with analytic tools for efficient data exploration, MarketScan helps researchers understand disease progression, treatment patterns, and health outcomes for patients, employers, health plans, and government entities 3,5.
The YBIC license includes the following MarketScan datasets 5:
Commercial Database (CCAE): Contains data from active employees, early retirees, COBRA continuees, and dependents with employer-sponsored plans, including lab results. Its table structure includes Inpatient Admissions, Facility Header, Inpatient Services, Outpatient Services, Outpatient Pharmaceutical Claims, Annual Enrollment Summary, Enrollment Detail, and Lab Results. This dataset represents approximately 69 million patients and 8.7 billion records.
Medicare Database (MDCR): Originally designed for Medicare-eligible retirees with employer-sponsored Medicare Supplemental and Medicare Advantage plans, primarily containing fee-for-service plan data. Its table structure matches that of the Commercial Database and includes both Medicare-paid and employer-paid supplemental insurance amounts, limited to plans where both types of payments are available and evident on claims. This dataset represents approximately 4 million patients and 1.55 billion records.
Medicaid Database (MDCD): Captures healthcare service use for Medicaid enrollees across various states, covering both fee-for-service and managed care plans. It includes records of inpatient services, admissions, outpatient services, prescription drug claims, long-term care, and demographic variables such as age, gender, Federal Aid, Disability, TANF, and race. This dataset represents approximately 28 million patients and 8.26 billion records.
Dental Database: An independent product that can be linked to specific years and versions of the Merative MarketScan Commercial Database and the MarketScan Medicare Database. This dataset represents approximately 28 million patients and 1.73 billion records.
Commercial Insurance Weights Database: The Merative MarketScan Commercial (CCAE) and Medicare (MDCR) Databases contain data on individuals with employer-sponsored insurance (ESI), either as primary or supplemental coverage. The MarketScan Commercial Insurance Weights, created using the Public Use Microdata Sample (PUMS) from the American Community Survey (ACS), project these data to the national population with ESI.
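To illustrate how projection weights of this kind are typically applied, the following is a minimal sketch in Python. It assumes each sampled enrollee carries a weight equal to the number of people in the national ESI population that the enrollee represents. The field names (`stratum`, `has_condition`, `weight`) and the sample values are hypothetical and are not taken from the MarketScan data dictionary; consult the documentation supplied with the licensed data for the actual variables.

```python
from dataclasses import dataclass


@dataclass
class SampleRow:
    stratum: str         # hypothetical weighting cell (e.g. age group / sex / region)
    has_condition: bool  # hypothetical outcome flag derived from claims
    weight: float        # hypothetical: people in the national ESI population represented


def projected_count(rows):
    """Estimated number of people with the condition in the weighted population."""
    return sum(r.weight for r in rows if r.has_condition)


def weighted_prevalence(rows):
    """Share of the weighted population with the condition."""
    total = sum(r.weight for r in rows)
    if total == 0:
        raise ValueError("weights sum to zero")
    return projected_count(rows) / total


# Made-up rows for illustration only.
rows = [
    SampleRow("18-34/F/Northeast", True, 120.0),
    SampleRow("18-34/F/Northeast", False, 120.0),
    SampleRow("35-54/M/South", True, 80.0),
    SampleRow("35-54/M/South", False, 80.0),
    SampleRow("35-54/M/South", False, 80.0),
]

unweighted = sum(r.has_condition for r in rows) / len(rows)
print(f"unweighted prevalence: {unweighted:.3f}")
print(f"weighted prevalence:   {weighted_prevalence(rows):.3f}")
print(f"projected cases:       {projected_count(rows):.0f}")
```

The weighted and unweighted estimates differ whenever the sample over- or under-represents some cells relative to the national population, which is the situation weights are meant to correct.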
Gaining Access
Do I Qualify?
The YBIC has licensed the MarketScan database for Yale community members to use in their research.
Typical Timeline
Timelines vary from applicant to applicant.
Step-by-Step Guide
Researchers must submit an application requesting access. The application lists the dataset and population of interest, describes the research plan, discloses funding sources, and includes a signed data use agreement. The request form is available on the MarketScan Database DataMed webpage.
After approval, the Harvey Cushing/John Hay Whitney Medical Library, the Yale Center for Clinical Investigation (YCCI), and YBIC work together to assist with data retrieval and analysis and to ensure compliance with data use agreements.
Valuable Links
YBIC - MarketScan: Find links to the DataMed webpage for the dataset and training 4.
MarketScan use at Yale: Find additional details about the datasets covered by Yale’s license, access training, and apply for research access 5.
Publications
This section presents a selection of PubMed articles that utilize the dataset and are authored by individuals affiliated with the Yale School of Public Health. These articles are provided to inspire researchers and students to use the data in their own work.
- Out-Of-Network Spending Mostly Declined In Privately Insured Populations With A Few Notable Exceptions From 2008 To 2016.
Zirui Song, William Johnson, Kevin Kennedy, Jean Fuglesten Biniek, Jacob Wallace
Health affairs (Project Hope) doi: 10.1377/hlthaff.2019.01776
PMID: 32479236
- Out-of-Network Laboratory Test Spending, Utilization, and Prices in the US.
Zirui Song, Timothy Lillehaugen, Jacob Wallace
JAMA doi: 10.1001/jama.2021.0720
PMID: 33904879
- Assessment of Underuse and Overuse of Screening Tests for Co-occurring Conditions Among Children With Obesity.
Mona Sharifi, Alyson B Goodman, Kao-Ping Chua
JAMA network open 2022 Jul 1 doi: 10.1001/jamanetworkopen.2022.22101
PMID: 35834247
- Impact of reduced human papillomavirus vaccination coverage rates due to COVID-19 in the United States: A model based analysis.
Vincent Daniels, Kunal Saxena, Craig Roberts, Smita Kothari, Shelby Corman, Lixia Yao, Linda Niccolai
Vaccine 2021 Apr 6 doi: 10.1016/j.vaccine.2021.04.003
PMID: 33875269
- Trends in Prescription Opioid Use in Motor Vehicle Crash Injuries in the United States: 2014-2018.
Lan Jin, Sten H Vermund, Yawei Zhang
International journal of environmental research and public health 2022 Nov 4 doi: 10.3390/ijerph192114445
PMID: 36361324
- Treatment Access among Younger Medicaid Beneficiaries with Multiple Myeloma.
Mark A Fiala, Mengmeng Ji, Yi-Hsuan Shih, John Huber, Mei Wang, Kimberly J Johnson, Hamlet Gasoyan, Rong Wang, Graham A Colditz, Shi-Yi Wang, Su-Hsin Chang
Clinical lymphoma, myeloma & leukemia 2024 Aug 2 doi: 10.1016/j.clml.2024.07.017
PMID: 39209567
- High-Deductible Health Plans and Out-of-Pocket Health Care Costs Among Younger Patients With Multiple Myeloma.
Mark Aaron Fiala, Mengmeng Ji, Michael Slade, John H Huber, Yi-Hsuan Shih, Mei Wang, Graham A Colditz, Shi-Yi Wang, Ravi Vij, Su-Hsin Chang
JCO oncology practice 2025 May 12 doi: 10.1200/OP-24-00978
PMID: 40354593