Data Sources

IBM® Marketscan® Research Databases

The IBM® Marketscan® Research Databases contain individual-level, de-identified healthcare claims information from employers, healthplans, and Medicaid programs. CoDES center has in-house data on IBM® Marketscan® Commercial Database, IBM®Marketscan® Medicare Supplemental Database, and IBM® Marketscan® Health Risk Assessment Database. 

The IBM® Marketscan® Commerical database includes 2005-2018 health insurance claims for inpatient, outpatient, and outpatient pharmacy encounters, as well as enrollment data from large employers and health plans across the United States who provide healthcare coverage for more than 180 million employees, their spouses, and dependents. 

The IBM® Marketscan® Medicare Supplemental Database includes 2005-2018 enrollment records along with inpatient, outpatient, ancillary, and drug claims for 12.5 million retirees in the United States with Medicare supplemental coverage through privately-insured fee-for-service, point-of-service, or capitated health plans.

The IBM® Marketscan® Health Risk Assessment (HRA) Database includes 2012-2018 self-reported biometric and health-related behavioral data obtained through surveys of employees of large US corporations and health plans. HRA data can be linked to medical, pharmacy, and enrollment data for these employees in the IBM® Marketscan® Commercial Database and used to examine the relationships between health behaviors/risk and health outcomes or medical expenditures.

Medicaid Analytic eXtract (MAX)

MAX data contains claims for medical care and drug benefits received by beneficiaries with Medicaid insurance coverage, the state-run programs for low-income and categorically eligible individuals and families. The CoDES center has in-house MAX data for over 100 million beneficiaries residing in the 29 most populous states from 1999-2010 (AL, AR, CA, FL, GA, IA, ID, IL, IN, KS, KY, LA, MA, MN, MO, MS, NC, NE, NJ, NM,  NY, OH, SC, TN, TX, VA, WA, WI, WV) and in all 50 states from 2011-2012. The 29 states included in the 1999-2010 MAX data represent 85% of all Medicaid beneficiaries.

Birth Certificate Records

Dr. Winterstein and her research team have been working for several years to build linkages between Medicaid data and Vital Statistic birth and death certificates. To that end, Vital Statistic birth certificates were obtained from the Florida Department of Health, Texas Department of State Health Services and New Jersey Department of Health. A subset of the birth certificates (for Medicaid beneficiaries with pregnancy-related claims) has been linked to mothers and their infants for livebirths occurring among Medicaid-enrolled women in Florida (1999-2012), New Jersey (1999-2010), and Texas (1999-2012).

5% Medicare Sample

Medicare is a federal health insurance program that provides coverage to people aged 65 years or older and those with disabilities or end-stage renal disease. Annual Medicare enrollment has exceeded 50 million since 2012. The 5% Medicare sample is a 5% random sample of all Medicare beneficiaries and cover claims for inpatient, skilled care nursing facility, and hospice care (Part A) as well as outpatient care (Part B) and prescription drugs (Part D). The CoDES center has in-house 5% Medicare data for the years 2011 through 2015.