Font Size:
|
||||||
MEPS HC-228 2021 Full Year Population CharacteristicsMarch 2023 Agency for Healthcare Research and Quality A. Data Use Agreement A. Data Use AgreementIndividual identifiers have been removed from the micro-data contained in these files. Nevertheless, under sections 308 (d) and 903 (c) of the Public Health Service Act (42 U.S.C. 242m and 42 U.S.C. 299 a-1), data collected by the Agency for Healthcare Research and Quality (AHRQ) and/or the National Center for Health Statistics (NCHS) may not be used for any purpose other than for the purpose for which they were supplied; any effort to determine the identity of any reported cases is prohibited by law. Therefore in accordance with the above referenced Federal Statute, it is understood that:
By using these data you signify your agreement to comply with the above stated statutorily based requirements with the knowledge that deliberately making a false statement in any matter within the jurisdiction of any department or agency of the Federal Government violates Title 18 part 1 Chapter 47 Section 1001 and is punishable by a fine of up to $10,000 or up to 5 years in prison. The Agency for Healthcare Research and Quality requests that users cite AHRQ and the Medical Expenditure Panel Survey as the data source in any publications or research based upon these data. B. Background1.0 Household ComponentThe Medical Expenditure Panel Survey (MEPS) provides nationally representative estimates of health care use, expenditures, sources of payment, and health insurance coverage for the U.S. civilian noninstitutionalized population. The MEPS Household Component (HC) also provides estimates of respondents’ health status, demographic and socio-economic characteristics, employment, access to care, and satisfaction with health care. Estimates can be produced for individuals, families, and selected population subgroups. The panel design of the survey includes five rounds of interviews covering two full calendar years. Additional rounds were added in 2020 and 2021, covering third and fourth years respectively, to compensate for the smaller number of completed interviews in later panels. These extra rounds provide data for examining person-level changes in selected variables such as expenditures, health insurance coverage, and health status. Using computer assisted personal interviewing (CAPI) technology, information about each household member is collected, and the survey builds on this information from interview to interview. All data for a sampled household are reported by a single household respondent. The MEPS HC was initiated in 1996. Each year a new panel of sample households is selected. Because the data collected are comparable to those from earlier medical expenditure surveys conducted in 1977 and 1987, it is possible to analyze long-term trends. Each annual MEPS HC sample size is about 15,000 households. Data can be analyzed at either the person or event level. Data must be weighted to produce national estimates. The set of households selected for each panel of the MEPS HC is a subsample of households participating in the previous year’s National Health Interview Survey (NHIS) conducted by the National Center for Health Statistics. The NHIS sampling frame provides a nationally representative sample of the U.S. civilian noninstitutionalized population. In 2006, the NHIS implemented a new sample design, which included Asian persons in addition to households with Black and Hispanic persons in the oversampling of minority populations. NHIS introduced a new sample design in 2016 that discontinued oversampling of these minority groups. 2.0 Medical Provider ComponentUpon completion of the household CAPI interview and obtaining permission from the household survey respondents, a sample of medical providers are contacted by telephone to obtain information that household respondents can not accurately provide. This part of the MEPS is called the Medical Provider Component (MPC) and information is collected on dates of visits, diagnosis and procedure codes, charges and payments. The Pharmacy Component (PC), a subcomponent of the MPC, does not collect charges or diagnosis and procedure codes but does collect drug detail information, including National Drug Code (NDC) and medicine name, as well as amounts of payment. The MPC is not designed to yield national estimates. It is primarily used as an imputation source to supplement/replace household reported expenditure information. 3.0 Survey Management and Data CollectionMEPS HC and MPC data are collected under the authority of the Public Health Service Act. Data are collected under contract with Westat, Inc. (MEPS HC) and Research Triangle Institute (MEPS MPC). Data sets and summary statistics are edited and published in accordance with the confidentiality provisions of the Public Health Service Act and the Privacy Act. The National Center for Health Statistics (NCHS) provides consultation and technical assistance. As soon as data collection and editing are completed, the MEPS survey data are released to the public in staged releases of micro data files, and tables via the MEPS website and datatools.ahrq.gov. Additional information on MEPS is available from the MEPS project manager or the MEPS public use data manager at the Center for Financing, Access, and Cost Trends, Agency for Healthcare Research and Quality, 5600 Fishers Lane, Rockville, MD 20857 (301-427-1406). C. Technical and Programming Information1.0 General InformationThis documentation describes the 2021 full-year population characteristics data file from the Medical Expenditure Panel Survey Household Component (MEPS HC). Released as an ASCII file (with related R, SAS, SPSS, and Stata programming statements and data user information) and a SAS dataset, SAS transport dataset, Stata dataset, and Excel file, this public use file provides information collected on a nationally representative sample of the civilian noninstitutionalized population of the United States for calendar year 2021. The file contains 990 variables and has a logical record length of 2,246 with an additional 2-byte carriage return/line feed at the end of each record. This file consists of MEPS survey data obtained in Rounds 7, 8, and 9 of Panel 23; Rounds 5, 6, and 7 of Panel 24; Rounds 3, 4, and 5 of Panel 25; and Rounds 1, 2, and 3 of Panel 26, the rounds for the MEPS panels covering calendar year 2021. 2021 is the first data year to include four panels of data; Panel 23 was extended to include Rounds 7, 8, and 9 and Panel 24 was extended to include Rounds 6, and 7. In addition, the Panel 24 Round 5 reference period was extended into 2021 instead of ending on 12/31/2020. An overview of the impact of this change on variables and variable names is described in Section 2.4. This file contains variables pertaining to survey administration, demographics, person-level conditions, health status, disability days, quality of care, employment, and health insurance. The 2021 full-year expenditure, medical care use counts and income data will be forthcoming. The following documentation offers a brief overview of the types and levels of data provided, content and structure of the files, and programming information. It contains the following sections:
Both weighted and unweighted frequencies of most variables included in the 2021 full-year population characteristics data file are provided in the accompanying codebook file. The exceptions to this are weight variables, variance estimation variables, and variables that have a separate weight (SAQ, DCS, SDOH). Only unweighted frequencies of these variables are included in the accompanying codebook file. See the Weights Variables list in Section D, Variable-Source Crosswalk. A database of all MEPS products released to date can be found on the MEPS website. 2.0 Data File InformationThis public use dataset contains variables and frequency distributions associated with 28,336 persons who participated in the MEPS Household Component of the Medical Expenditure Panel Survey in 2021. These persons received a positive person-level weight, a family-level weight, or both (some participating persons belonged to families characterized as family-level nonrespondents while some members of participating families were not eligible for a person-level weight). Note that persons who will have a positive family weight but not a positive person weight have been placed on this public use file to maintain consistency in terms of file structure with the upcoming public use file with expenditure and income data. Those will be the only records without a positive person weight appearing on this file. Note that unlike some previous MEPS Population Characteristic files, family weights are not included on this release. As indicated above, all persons included on this file that do not have positive person weights will have a positive family weight on the final 2021 Consolidated PUF: HC-233. These 28,336 persons were part of one of the four MEPS panels for whom data were collected in 2021: Rounds 7, 8, and 9 of Panel 23; Rounds 5, 6, and 7 of Panel 24; Rounds 3, 4, and 5 of Panel 25; or Rounds 1, 2, and 3 of Panel 26. Of these persons, 27,332 were assigned a positive person-level weight. In conjunction with the person-level weight variable (PERWT21P) provided on this file, data for persons with a positive person-level weight can be used to make estimates for the civilian noninstitutionalized U.S. population for 2021. The MEPS CAPI design has changed significantly beginning with the specifications for Panel 21 Round 5/Panel 22 Round 3/Panel 23 Round 1. In addition, three rounds of data collection were added for Panel 23 (Rounds 7, 8, and 9) and three rounds of data collection were added for Panel 24 (Rounds 5, 6, and 7) in 2021.
2.1 Codebook StructureThe codebook and data file sequence lists variables in the following order:
2.2 Reserved Codes
The value -15 (CANNOT BE COMPUTED) is assigned to MEPS constructed variables in cases where there is not enough information from the MEPS instrument to calculate the constructed variables. “Not enough information” is often the result of skip patterns in the data or from missing information resulting from MEPS responses of -7 (REFUSED) or -8 (DK). Note that reserved code -8 includes cases where the information from the question was “not ascertained” or where the respondent chose “don’t know”. 2.3 Codebook Format
2.4 Variable NamingIn general, variable names reflect the content of the variable. Edited variables end in an X and are so noted in the variable label. The last two characters in round-specific variables have denoted the rounds of data collection, for example, Round 3, 4, or 5 of Panel 25, and Round 1, 2, or 3 of Panel 26. Historically round dates have been indicated by two numbers following the variable name; the first number representing the round for second panel persons (Panel 25), the second number representing the round for first panel persons (Panel 26). For example, EMPST31 refers to employment status on the Round 3 interview date for Panel 25 persons and employment status on the Round 1 interview date for Panel 26 persons. The variable names in the 2021 files have not been renamed from prior years, despite the addition of Rounds 5, 6, and 7 of Panel 24 and Rounds 7, 8, and 9 of Panel 23, and those round numbers (5, 6, and 7 for Panel 24 and 7, 8, and 9 for Panel 23) will not be included in the variable names. The variable names containing “53” (for example, AGE53X, INS53X, INSAT53X, EMPST53, and HELD53X) have data from Round 9 of Panel 23, Round 7 of Panel 24, Round 5 of Panel 25, and Round 3 of Panel 26. The variable names that contain “42” (for example, AGE42X, INS42X, INSAT42X, EMPST42, and HELD42X) have data from Round 8 of Panel 23, Round 6 of Panel 24, Round 4 of Panel 25, and Round 2 of Panel 26. Variable names containing “31” have data from Round 7 of Panel 23, Round 5 of Panel 24, Round 3 of Panel 25, and Round 1 of Panel 26. As variable collection, universe, or categories are altered, the variable name will be appended with “_Myy” to indicate in which year the alterations took place. Details about these alterations can be found throughout this document. Variables contained in this delivery were derived either from the questionnaire itself or from the CAPI. The source of each variable is identified in the section of the documentation entitled “Section D. Variable-Source Crosswalk.” Sources for each variable are indicated in one of four ways: (1) variables derived from CAPI or assigned in sampling are so indicated; (2) variables derived from complex algorithms associated with re-enumeration are labeled “RE Section”; (3) variables that are collected by one or more specific questions in the instrument have those question numbers listed in the Source column; and (4) variables constructed from multiple questions using complex algorithms are labeled “Constructed.” 2.5 File ContentsUsers of MEPS data should be aware that the survey collects data for all sample persons who were in the survey target population at any time during the survey period. In other words, a small proportion of individuals in MEPS analytic files are not members of the survey target population (i.e., civilian noninstitutionalized) for the entire survey period. These persons include those who had periods during which they lived in an institution (e.g., nursing home or prison), were in the military, or lived out of the country, as well as those who were born (or adopted) into MEPS sample households or died during the year. They are considered sample persons for the survey and are included in MEPS data files with positive person weights, but no data were collected for the periods they were not inscope and their annual data for variables like health care utilization, expenditures, and insurance coverage reflect only the part of the year they were inscope for the survey. Persons who are inscope for only part of the year should not be confused with non-respondents. Sample persons who are classified as non-respondents to one or more rounds of data collection (i.e., initial non-respondents and drop outs over time) are not included in MEPS annual files, and survey weights for full-year respondents are inflated through statistical adjustment procedures to compensate for both full and part-year nonresponse (see Section 3.0 “Survey Sample Information” for more information). The AHRQ website provides more details about the identification and analytic considerations regarding sample persons who are inscope only part of the year. 2.5.1 Survey Administration Variables (DUID–RURSLT53)The survey administration variables contain information related to conducting the interview, household and family composition, and person-level and RU-level status codes. Data for the survey administration variables were derived from the sampling process, the CAPI programs, or were computed based on information provided by the respondent in the Reenumeration section of the questionnaire. Most survey administration variables on this file are asked during every round of the MEPS interview. They describe data for Rounds 7/5/3/1, 8/6/4/2, 9/7/5/3 status and status as of December 31, 2021. This year, the data collected includes a fourth panel, Panel 23, which was extended past the previous seven rounds of data collection to include eighth, and ninth round interviews. In order to incorporate the fourth year Panel 23 Round 7, Round 8, and Round 9 data into the delivery without adding new variables, a decision was made to use the ‘31’/‘42’/‘53’ variables to also hold the fourth year panel data. Retaining the standard ‘31’/‘42’/‘53’ variable names, even with the addition of the fourth year panel, will allow easier comparison to previous FY data. Data collection for Panel 24 was planned to go beyond the usual five rounds, so for 2021, Round 5 was collected as a (2020-2021) cross-year round. Alternatively, Panel 25 data collection was planned to end at five rounds, and thus, Panel 25 Round 5 was collected as a (2021) terminal round (not as a 2021-2022 cross-year round). For example, where variables ending in ‘53’ would normally hold data for Panel 25 Round 5 and Panel 26 Round 3, they also now hold data for Panel 24 Round 7 and Panel 23 Round 9. Similarly, Panel 24 Round 6 and Panel 23 Round 8 data have been added to the ‘42’ variables. Panel 24 Round 5 and Panel 23 Round 7 data have been added to the ‘31’ variables. This means that the ‘31’ variables contain data for Rounds 1, 3, 5 and 7, the ‘42’ variables contain data for Rounds 2, 4, 6 and 8, and the ‘53’ variables contain data for Rounds 3, 5, 7, and 9. The December 31, 2021 variables were developed in two ways. Those used in the construction of eligibility, inscope, and the end reference date were based on an exact date. The remaining variables were constructed using data from specific rounds, if available. If data were missing from the target round but were available in another round, data from that other round were used in the variable construction. If no valid data were available during any round of data collection, an appropriate reserved code was assigned. Dwelling Units, Reporting Units, and Families The definitions of Dwelling Units (DUs) in the MEPS Household Survey are generally consistent with the definitions employed for the National Health Interview Survey (NHIS). The Dwelling Unit ID (DUID) is a seven-digit ID number consisting of a 2-digit panel number followed by a five-digit random number assigned after the case was sampled for MEPS. A three-digit person number (PID) uniquely identifies each person within the DU. The variable DUPERSID is the combination of the variables DUID and PID. As part of the new CAPI design, the lengths of the ID variables have changed in the file. An additional 2 bytes in the IDs resulted from adding a 2-digit panel number to the beginning of all the IDs. Users should be mindful of the different ID structures/lengths when combining MEPS files from 1996-2017 with MEPS files from 2018-2021. PANEL is a constructed variable used to specify the panel number for the person. PANEL will indicate Panel 23, Panel 24, Panel 25, or Panel 26 for each person on the file. Panel 23 is the panel that started in 2018, Panel 24 is the panel that started in 2019, Panel 25 is the panel that started in 2020, and Panel 26 is the panel that started in 2021. The panel number is included as the first two digits of the DUID and DUPERSID. A Reporting Unit (RU) is a person or group of persons in the sampled DU who are related by blood, marriage, adoption, or other family association. Each RU was interviewed as a single entity for MEPS. Thus, the RU serves chiefly as a family-based “survey” operations unit rather than an analytic unit. Members of each RU within the DU are identified in the pertinent three rounds by the round-specific variables RULETR31, RULETR42, and RULETR53. End-of-year status (as of December 31, 2021 or the last round they were in the survey) is indicated by the RULETR21 variable. Regardless of the legal status of their association, two persons living together as a “family” unit were treated as a single RU if they chose to be so identified. Examples of different types of RUs are:
Unmarried college students (less than 24 years of age) who usually live in the sampled household but were living away from home and going to school at the time of the Round 3/1 MEPS interview were treated as an RU separate from that of their parents for the purpose of data collection. The round-specific variables RUSIZE31, RUSIZE42, RUSIZE53, and the end-of-year status variable RUSIZE21 indicate the number of persons in each RU, treating students as single RUs separate from their parents. Thus, students are not included in the RUSIZE count of their parents’ RU. However, for many analytic objectives, the student RUs would be combined with their parents’ RU, treating the combined entity as a single family. Family identifier and size variables are described below and include students with their parents’ RU. The round-specific variables FAMID31, FAMID42, FAMID53, and the end-of-year status variable FAMID21 identify a family (i.e., persons related to one another by blood, marriage, adoption, or self-identified as a single unit) for each round and as of December 31, 2020. The FAMID variables differ from the RULETR variables only in that student RUs are combined with their parents’ RU. One other family identifier, FAMIDYR, is provided on this file. The annualized family ID letter, FAMIDYR, identifies eligible members of the eligible annualized families within a DU. In order to identify a person’s family affiliation, users must create a unique set of FAMID variables by concatenating the DU identifier and the FAMID variable. Foster care relationships and fostered members of households are no longer included in the MEPS data. This change was implemented with the 2017 Consolidated Public Use Files, so users combining many years of data may encounter foster relationships/members in earlier MEPS files. The round-specific variables FAMSZE31, FAMSZE42, FAMSZE53, and the end-of-year status variable FAMSZE21 indicate the number of persons associated with a single family unit after students are linked to their associated parent RUs for analytical purposes. Family-level analyses should use the FAMSZE variables. Note that the variables RUSIZE31, RUSIZE42, RUSIZE53, RUSIZE21, FAMSZE31, FAMSZE42, FAMSZE53, and FAMSZE20 exclude persons who are ineligible for data collection (i.e., those where ELGRND31 NE 1, ELGRND42 NE 1, ELGRND53 NE 1 or ELGRND21 NE 1); analysts should exclude ineligible persons in a given round from all family-level analyses for that round. The round-specific variables RURSLT31, RURSLT42, and RURSLT53 indicate the RU response status for each round. Users should note that the values for RURSLT31 differ from those for RURSLT42 and RURSLT53.
Standard or primary RUs are the original RUs from NHIS. A new RU is one created when members of the household leave the primary RU and are followed according to the rules of the survey. A student RU is an unmarried college student (under 24 years of age) who is considered a usual member of the household, but was living away from home while going to school, and was treated as a Reporting Unit (RU) separate from his or her parents’ RU for the purpose of data collection. RUCLAS21 was set based on the RUCLAS values from Rounds 5/3/1, 6/4/2, and 7/5/3. If the person was present in the responding RU in Round 7/5/3, then RUCLAS21 was set to RUCLAS53. If the person was not present in a responding RU in Round 7/5/3 but was present in Round 6/4/2, then RUCLAS21 was set to RUCLAS42. If the person was not present in either Rounds 6/4/2 or 7/5/3 but was present in Round 5/3/1, then RUCLAS21 was set to RUCLAS31. If the person was not linked to a responding RU during any round, then RUCLAS21 was set to -15. Geographic Variables The round-specific variables REGION31, REGION42, REGION53, and the end-of-year status variable REGION21 indicate the Census region for the RU. REGION21 indicates the region for the 2021 portion of Round 9/7/5/3. For most analyses, REGION21 should be used.
Reference Period Dates The reference period is the period of time for which data were collected in each round for each person. The reference period dates were determined during the interview for each person by the CAPI program. The round-specific beginning reference period dates are included for each person. These variables include BEGRFM31, BEGRFY31, BEGRFM42, BEGRFY42, BEGRFM53, and BEGRFY53. The reference period for Round 1 for most persons began on January 1, 2021 and ended on the date of the Round 1 interview. For RU members who joined later in Round 1, the beginning Round 1 reference date was the date the person entered the RU. For all subsequent rounds, the reference period for most persons began on the date of the previous round’s interview and ended on the date of the current round’s interview. Persons who joined after the previous round’s interview had their beginning reference date for the round set to the day they joined the RU. The round-specific ending reference period dates for Rounds 7/5/3/1, 8/6/4/2, and 9/7/5/3 as well as the end-of-year reference period end date variables are also included for each person. These variables include ENDRFM31, ENDRFY31, ENDRFM42, ENDRFY42, ENDRFM53, ENDRFY53, ENDRFM21, and ENDRFY21. For most persons in the sample, the date of the round’s interview is the reference period end date. Note that the end date of the reference period for a person is prior to the date of the interview if the person was deceased during the round, left the RU, was institutionalized prior to that round’s interview, or left the RU to join the military. For a small number of cases, the reference period dates may be recoded for confidentiality. Reference Person Identifiers The round-specific variables REFPRS31, REFPRS42, and REFPRS53 and the end-of-year status variable REFPRS21 identify the reference person for Rounds 7/5/3/1, 8/6/4/2, and 9/7/5/3, and as of December 31, 2021 (or the last round they were in the survey). In general, the reference person is defined as the household member 16 years of age or older who owns or rents the home. If more than one person meets this description, the household respondent identifies one from among them. If the respondent is unable to identify a person fitting this definition, the questionnaire asks for the head of household and this person is then considered the reference person for that RU. This information is collected in the Reenumeration section of the CAPI questionnaire. Respondent Identifiers The respondent is the person who answered the interview questions for the Reporting Unit (RU). The round-specific variables RESP31, RESP42, and RESP53 and the end-of-year status variable RESP21 identify the respondent for Rounds 7/5/3/1, 8/6/4/2, and 9/7/5/3 and as of December 31, 2021 (or the last round they were in the survey). Only one respondent is identified for each RU. In instances where the interview was completed in more than one session, only the first respondent is indicated. There are two types of respondents. The respondent can be either an RU member or a non-RU member proxy. The round-specific variables PROXY31, PROXY42, and PROXY53 and the end-of-year status variable PROXY21 identify the type of respondent for Rounds 7/5/3/1, 8/6/4/2, 9/7/5/3 and as of December 31, 2021 (or the last round they were in the survey). Language of Interview The language of interview variable (INTVLANG) is a summary value of the round-specific RU-level information section question (RU30), which asks the interviewer to record the language in which the interview was completed: English, Spanish, Both English and Spanish, Other Language. Given the first round that the person was part of the study and the person’s associated RU for that round, INTVLANG is assigned the interview language value reported for the person’s RU for the round. Person Status A number of variables describe the various components reflecting each person’s status for each round of data collection. These variables provide information about a person’s in-scope status, Keyness status, eligibility status, and disposition status. These variables include: KEYNESS, INSCOP31, INSCOP42, INSCOP53, INSCOP21, INSC1231, INSCOPE, ELGRND31, ELGRND42, ELGRND53, ELGRND21, PSTATS31, PSTATS42, and PSTATS53. These variables are set based on sampling information and responses provided in the Reenumeration section of the CAPI questionnaire. Through the Reenumeration section of the CAPI questionnaire, each member of an RU was classified as “Key” or “Non-Key”, “in-scope” or “out-of-scope”, and “eligible” or “ineligible” for MEPS data collection. To be included in the set of persons used in the derivation of MEPS person-level estimates, a person had to be a member of the civilian noninstitutionalized population for at least one day during 2021. Because a person’s eligibility for the survey might have changed since the NHIS interview, a sampling reenumeration of household membership was conducted at the start of each round’s interview. Only persons who were “inscope” sometime during the year, were “Key”, and responded for the full period in which they were inscope were assigned positive person-level weights, and thus are to be used in the derivation of person-level national estimates from the MEPS. Note: If analysts want to subset to infants born during 2021, then newborns should be identified using AGE21X = 0 rather than PSTATSxy = 51. Inscope The round-specific variables INSCOP31, INSCOP42, and INSCOP53 indicate a person’s in-scope status for Rounds 7/5/3/1, 8/6/4/2, and 9/7/5/3. INSCOP21, INSC1231, and INSCOPE indicate a person’s in-scope status for the portion of Round 9/7/5/3 that covers 2021, the person’s in-scope status as of 12/31/21, and whether a person was ever in-scope during the calendar year 2021. A person was considered as in-scope during a round or a referenced time period if he or she was a member of the U.S. civilian, noninstitutionalized population at some time during that round or that time period. The values of these variables taken in conjunction allow one to determine in-scope status over time (for example, becoming inscope in the middle of a round, as would be the case for newborns).
Keyness The term “Keyness” is related to an individual’s chance of being included in MEPS. A person is Key if that person is linked for sampling purposes to the set of NHIS sampled households designated for inclusion in MEPS. Specifically, a Key person was either a member of a responding NHIS household at the time of interview, or joined a family associated with such a household after being out-of-scope at the time of the NHIS (examples of the latter situation include newborns and those returning from military service, an institution, or residence in a foreign country). A non-Key person is one whose chance of selection for the NHIS (and MEPS) was associated with a household eligible but not sampled for the NHIS and who later became a member of a MEPS Reporting Unit. MEPS data (e.g., utilization and expenditures) were collected for the period of time a non-Key person was part of the sampled unit to provide information for family-level analyses. However, non-Key persons who leave a sample household unaccompanied by a Key, in-scope member were not followed for subsequent interviews. Non-Key individuals do not receive sample person-level weights and thus do not contribute to person-level national estimates. The variable KEYNESS indicates a person’s Keyness status. This variable is not round-specific. Instead, it is set at the time the person enters MEPS, and the person’s Keyness status never changes. Once a person is determined to be Key, that person will always be Key. It should be pointed out that a person might be Key even though not part of the civilian, noninstitutionalized portion of the U.S. population. For example, a person in the military may have been living with his or her civilian spouse and children in a household sampled for NHIS. The person in the military would be considered a Key person for MEPS; however, such a person would not be eligible to receive a person-level sample weight if he or she was never inscope during 2021. Eligibility The eligibility of a person for MEPS pertains to whether or not data were to be collected for that person. All of the Key in-scope persons of a sampled RU were eligible for data collection. The only non-Key persons eligible for data collection were those who happened to be living in an RU with at least one Key, in-scope person. Their eligibility continued only for the time that they were living with at least one such person. The only out-of-scope persons eligible for data collection were those who were living with Key in-scope persons, again only for the time they were living with such a person. Only military persons can meet this description (for example, a person on full-time active duty military, living with a spouse who is Key). A person may be classified as eligible for an entire round or for some part of a round. For persons who are eligible for only part of a round (for example, persons may have been institutionalized during a round), data were collected for the period of time for which that person was classified as eligible. The round-specific variables ELGRND31, ELGRND42, ELGRND53 and the end-of-year status variable ELGRND21 indicate a person’s eligibility status for Rounds 7/5/3/1, 8/6/4/2, and 9/7/5/3 and as of December 31, 2021. Person Disposition Status The round-specific variables PSTATS31, PSTATS42, and PSTATS53 indicate a person’s response and eligibility status for each round of interviewing. The PSTATSxy variables indicate the reasons for either continuing or terminating data collection for each person in the MEPS. Using this variable, one could identify persons who moved during the reference period, died, were born, institutionalized, or who were in the military. Analysts should note that PSTATS53 provides a summary for all of Round 9/7/5/3, including transitions that occurred after 2021. Note that some categories may be collapsed for confidentiality purposes.
2.5.2 Navigating the MEPS Data with Information on Person Disposition StatusSince the variables PSTATS31, PSTATS42, and PSTATS53 indicate the reasons for either continuing or terminating data collection for each person in MEPS, these variables can be used to explain the beginning and ending dates for each individual’s reference period of data collection, as well as which sections in the instrument each individual did not receive. By using the information included in the following table, analysts will be able to determine for each individual which sections of the MEPS questionnaire collected data elements for that person. Some individuals have a reference period that spans an entire round, while other individuals may have data collected only for a portion of the round. When an individual’s reference period does not coincide with the RU reference period, the individual’s start date may be a later date, or the end date may be an earlier date, or both. In addition, some individuals have reference period information coded as “Inapplicable” (e.g., for individuals who were not actually in the household). The information in this table indicates the beginning and ending dates of reference periods for persons with various values of PSTATS31, PSTATS42, and PSTATS53. The actual dates for each individual can be found in the following variables included on this file: BEGRFM31, BEGRFM42, BEGRFM53, BEGRFY31, BEGRFY42, BEGRFY53, ENDRFM31, ENDRFM42, ENDRFM53, ENDRFY31, ENDRFY42, ENDRFY53, ENDRFM21, and ENDRFY21. The table below also describes the section or sections of the questionnaire that were NOT asked for each value of PSTATS31, PSTATS42, and PSTATS53. For example, the Priority Condition Enumeration (PE) section has questions that are not asked for deceased persons. The Closing (CL) section also contains some questions or question rosters that exclude certain persons depending on whether the person died, became institutionalized, or otherwise left the RU; however, no one is considered to have skipped the entire section. Some questions or sections (e.g., Health Status (HE), Employment (RJ, EM, EW)) are skipped if individuals are not within a certain age range. Since the PSTATS variables do not address skip patterns based on age, analysts will need to use the appropriate age variables. The paper-and-pencil Self-Administered Questionnaire (SAQ) was designed to collect information during Panel 26 Round 2, Panel 25 Round 4, Panel 24 Round 6 and Panel 23 Round 8. A person was considered eligible to receive an SAQ if that person did not have a status of deceased or institutionalized, did not move out of the U.S. or to a military facility, was not a non-response at the time of the Round 2, Round 4 Round 6, or Round 8 interview date, and was 18 years of age or older. No RU members added in Round 3, Round 5, or Round 7 were asked to complete an SAQ questionnaire. Because PSTATS variables do not address skip patterns based on age, this questionnaire was not included in the table below. Once again, analysts will need to use the appropriate age variable, which in this case would be AGE42X. The documentation for this questionnaire appears in the SAQ section of this document under “Health Status Variables.”
2.5.3 Demographic Variables (AGE31X–YRSINUS)General Information Demographic variables provide information about the demographic characteristics of each person from the MEPS HC. The characteristics include age, sex, race, ethnicity, marital status, educational attainment, and military service. As noted below, some variables have edited and imputed values. Most demographic variables on this file were asked during every round of the MEPS interview. These variables describe data for Rounds 7, 8, and 9 for Panel 23 (the panel that started in 2018), Rounds 5, 6, and 7 of Panel 24 (the panel that started in 2019); Rounds 3, 4 and 5 of Panel 25 (the panel that started in 2020); Rounds 1, 2 and 3 of Panel 26 (the panel that started in 2021); and status as of December 31, 2021. Demographic variables that have variable names that contain ‘31’, ‘42’, or ‘53’ are round-specific variables. As mentioned in Section 2.4 “Variable Naming”, fourth year panel data for Rounds 7, 8, and 9 of Panel 23 and third year panel data for Rounds 5, 6, and 7 for Panel 24 are included in the ‘31’/’42’/’53’ sets of variables. For example, AGE31X represents the age data relevant to Round 7 of Panel 23, Round 5 of Panel 24, Round 3 of Panel 25 or Round 1 of Panel 26, The variable PANEL indicates the panel from which the data were derived. A value of 23 indicates Panel 23 data, a value of 24 indicates Panel 24 data, a value of 25 indicates Panel 25 data, and a value of 26 indicates Panel 26 data. The remaining demographic variables on this file are not round-specific. The variables describing demographic status of the person as of December 31, 2021 were developed in two ways. First, the age variable (AGE21X) represents the exact age, calculated from date of birth and indicates age status as of 12/31/21. For the remaining December 31st variables [i.e., related to marital status (MARRY21X, SPOUID21, SPOUIN21), student status (FTSTU21X), and the relationship to reference persons (REFRL21X)], the following algorithm was used: data were taken from the Round 9/7/5/3 counterpart if non-missing; else, if missing, data were taken from the Round 8/6/4/2 counterpart; else from the Round 7/5/3/1 counterpart. If no valid data were available during any of these rounds of data collection, the algorithm assigned the missing value (other than -1 “Inapplicable”) from the first round that the person was part of the study. When all three rounds were set to -1, a value of -15 “Cannot be Computed” was assigned. Age Date of birth and age for each RU member were asked or verified during each MEPS interview (DOBMM, DOBYY, AGE31X, AGE42X, AGE53X). If date of birth was available, age was calculated based on the difference between date of birth and date of interview. Inconsistencies between the calculated age and the age reported during the CAPI interview were reviewed and resolved. For purposes of confidentiality, the variables AGE31X, AGE42X, AGE53X, AGE21X, and AGELAST were top-coded at 85 years. When date of birth was not provided but age was provided (either from the MEPS interviews or the 2017-2020 NHIS data), the month and year of birth were assigned randomly from among the possible valid options. For any cases still not accounted for, age was imputed using:
For example, a mother’s age is imputed as her child’s age plus 26, where 26 is the mean age difference between MEPS mothers and their children. A wife’s age is imputed as the husband’s age minus 3, where 3 is the mean age difference between MEPS wives and husbands. Age was imputed in this way for 19 persons on this file. AGELAST indicates a person’s age from the last time the person was eligible for data collection during a specific calendar year. The age range for this variable is between 0 and 85. Sex Data on the gender of each RU member (SEX) were initially determined from the 2017 NHIS for Panel 23, from the 2018 NHIS for Panel 24, from the 2019 NHIS for Panel 25, and from the 2020 NHIS for Panel 26. The SEX variable was verified and, if necessary, corrected during each MEPS interview. The data for new RU members (persons who were not members of the RU at the time of the NHIS interviews) were also obtained during each MEPS round. When gender of the RU member was not available from the NHIS interviews and was not determined during one of the subsequent MEPS interviews, it was assigned in the following way. The person’s first name was used to assign gender if obvious (no cases were resolved in this way). If the person’s first name provided no indication of gender, then family relationships were reviewed (no cases were resolved this way). If neither of these approaches made it possible to determine the individual’s gender, gender was randomly assigned (no cases were resolved this way). Race and Ethnicity Group The race and the ethnic background questions were asked for each RU member during the MEPS interview. If the information was not obtained in Round 1, the questions were asked in subsequent rounds. It should be noted that race/ethnicity questions in the MEPS were revised starting with data collection in 2013 for Panel 16 Round 5, Panel 17 Round 3, and Panel 18 Round 1; this affected data starting with the FY 2012 file. Previously, there were two race questions, but starting with data collection in 2013, there is only one race question. All Asian categories listed in the second question were moved to the new single question. In addition, the new race question had additional detail for the Native Hawaiian and Other Pacific Islander categories. The main change for ethnicity is that the new questions allowed respondents to report more than one Hispanic ethnicity. Race/ethnicity data from earlier years may not be directly comparable. The following table shows the variables used for FY 2002-2011 and FY 2012-2021, with these exceptions: 1) in FY 2012, RACEV1X categories 4 and 5 were not combined but are combined starting with 2013, and 2) RACEV2X and HISPNCAT were first introduced in 2013.
Race and ethnicity variables and their response categories for years prior to 2002 are available in the documentation for the FY Consolidated PUF for each data year. Values for these variables were obtained based on the following priority order. If available, data collected were used to determine race and ethnicity. If race and/or ethnicity were not reported in the interview, then data obtained from the originally collected NHIS data were used (27 cases were resolved this way for race, and 10 cases were resolved this way for ethnicity). If still not determined, the race, and/or ethnicity were assigned based on relationship to other members of the DU using a priority ordering that gave precedence to blood relatives in the immediate family (this approach was used on 12 persons to set race and 7 persons to set ethnicity). For the FY12 and FY13 PUFs, three new race variables were constructed for both the old and the new questions: RACEVER, RACEV1X, and RACETHX. The variable RACEVER was constructed to indicate which version of the race question(s) was asked and was included in only the 2012 and 2013 FY PUFs. RACEVER has been dropped starting with the 2014 PUF. The variables RACEV1X and RACETHX replace the variables RACEX and RACETHNX from 2002-2011. A new race variable, RACEV2X, was constructed only for the new race question and was added for the first time to the 2013 files. RACEV2X was set to -1 “Inapplicable” for persons that were not asked the new race question in FY13 only. This variable includes the expanded detail Asian categories and continues to be constructed for all PUFs. The “multiple races reported” categories for RACEV1X and RACEV2X differ in the 2013-2015 PUFs but are the same starting with the 2016 PUF. In the 2013-2015 PUFs, persons with multiple Asian races or multiple Hawaiian/Pacific Islander races were considered multiple races for RACEV2X and were not considered multiple races for RACEV1X. Starting with the 2016 PUFs, persons with multiple Asian races or multiple Hawaiian/Pacific Islander races were no longer considered multiple races in RACEV2X. For the FY12 and FY13 PUFs, the two Hispanic ethnicity variables from previous years were included: HISPANX and HISPCAT. The HISPANX variable continues to be constructed. The HISPCAT variable was constructed for specific Hispanic categories based only on the old question in FY12 and FY13 and HISPCAT has been dropped starting with the 2014 PUF. A new ethnicity variable, HISPNCAT, based on the new question, was introduced starting with 2013. HISPNCAT includes similar categories as HISPCAT but in a different order, and contains an additional category, 8 “Multiple Hispanic Groups Reported”, to represent any multiple responses reported. HISPNCAT was set to -1 “Inapplicable” for persons that were not asked the new ethnicity question in FY13. This variable continues to be constructed for all PUFs. Categories have been collapsed in the variables RACEV1X, RACEV2X and HISPNCAT. For RACEV1X, new with the 2012 PUF, categories 4 and 5 were collapsed in category 4 as “ASIAN/NATV HAWAIIAN/PACFC ISL-NO OTH” starting with the 2013 PUF. For RACEV2X, new with and starting with the 2013 PUF, categories 7, 8, 9, 10, and 11 were collapsed in category 10 as “OTH ASIAN/NATV HAWAIIAN/PACFC ISL-NO OTH,” and for HISPNCAT, new with and starting with the 2013 PUF, categories 6 and 7 were collapsed in category 6 as “OTH LAT AM/HISP/LATINO/SPNSH ORGN-NO OTH”. Language Variables: OTHLGSPK, WHTLGSPK, and HWELLSPK Language variables (OTHLGSPK, WHTLGSPK, and HWELLSPK) were collected at the person level in the round in which the person entered the MEPS survey. Beginning with Panel 23 Round 1, the household respondent was asked for each person, age 5 or older, a person-level question to determine whether that person speaks a language other than English at home (RE1170, OTHLGSPK). If the response to OTHLGSPK was ‘yes’, then two other questions were asked. WHTLGSPK (RE1170) is a person-level question that asks whether the non-English language spoken at home is Spanish or some other language, and HWELLSPK (RE1170) is a person-level question that asks how well that person can speak English. If the response to OTHLGSPK was ‘No’, then WHTLGSPK and HWELLSPK are set to ‘-1’ (Inapplicable). Family members who are deceased or institutionalized in Round 1 are coded with a value of ‘-1’ (Inapplicable). Minors under age 5 in households have all three variables coded to “5” (Under 5 years old - Inapplicable). For users doing multiyear analyses, please carefully review prior years’ documentation to assure correct interpretation of language variables over time. Foreign Born Status Three questions regarding foreign born status were asked in the Demographic section to ascertain whether a person was born in the U.S. (RE1170), what year they came to the U.S. (RE1170) if not born in the U.S., and years lived in the U.S. (RE1170) if the response to RE1170 was ‘Don’t Know’. They replaced similar questions that had been asked in the Access to Care section prior to 2013. The three questions were only asked once for each eligible person, that is, the first round the person was included in the interview. These new questions were asked of everyone, except deceased and institutionalized persons. The data from RE1170 are reported as the constructed variable BORNUSA. The data from RE1170 (YRCAMEUS) and RE1170 (YRSINUSA) were used to calculate the number of years a person has lived in the U.S. for the constructed variable, YRSINUS. Please note that YRSINUS is a discrete variable and has collapsed categories: 1 “less than 1 year”; 2 “1 yr., less than 5 years”; 3 “5 yrs., less than 10 years”; 4 “10 yrs., less than 15 years”; 5 “15 years or more”. Marital Status and Spouse ID Current marital status was collected and/or updated during every round of the MEPS interview. This information was obtained in RE100 and RE1170 and is reported as MARRY31X, MARRY42X, MARRY53X, and MARRY21X. Persons under the age of 16 were coded as 6 “Under 16 - Inapplicable”. If marital status of a specified round differed from that of the previous round, then the marital status of the specified round was edited to reflect a change during the round (e.g., married in round, divorced in round, separated in round, or widowed in round). In instances where there were discrepancies between the marital statuses of two individuals within a family, other person-level variables were reviewed to determine the edited marital status for each individual. Thus, when one spouse was reported as married and the other spouse reported as widowed, the data were reviewed to determine if one partner should be coded as 8 “Widowed in Round”. Edits were performed to ensure some consistency across rounds. First, a person could not be coded as “Never Married” after previously being coded as any other marital status (e.g., “Widowed”). Second, a person could not be coded as “Under 16 - Inapplicable” after being previously coded as any other marital status. Third, a person could not be coded as “Married in Round” after being coded as “Married” in the round immediately preceding. Fourth, a person could not be coded as an “in Round” code (e.g., “Widowed in Round”) in two subsequent rounds. Since marital status can change across rounds and it was not feasible to edit every combination of values across rounds, unlikely sequences for marital status across the round-specific variables do exist. The person identifier for each individual’s spouse is reported in SPOUID31, SPOUID42, SPOUID53, and SPOUID21. These are the PIDs (within each family) of the person identified as the spouse during Round 7/5/3/1, Round 8/6/4/2, and Round 9/7/5/3 and as of December 31, 2021, respectively. If no spouse was identified in the household, the variable was coded as 995 “No Spouse in House”. Those with unknown marital status are coded as 996 “Marital Status Unknown”. Persons under the age of 16 are coded as 997 “Less than 16 Years Old”. The SPOUIN31, SPOUIN42, SPOUIN53, and SPOUIN21 variables indicate whether a person’s spouse was present in the RU during Round 7/5/3/1, Round 8/6/4/2, Round 9/7/5/3 and as of December 31, 2021 respectively. If the person had no spouse in the household, the value was coded as 2 “Not Married/No Spouse”. For persons under the age of 16 the value was coded as 3 “Under 16 - Inapplicable”. The SPOUID and SPOUIN variables were obtained from RE900, where the respondent was asked to identify how each pair of persons in the household was related. Analysts should note that this information was collected in a set of questions separate from the questions that asked about marital status. While editing was performed to ensure that SPOUID and SPOUIN are consistent within each round, there was no consistency check between these variables and marital status in a given round. Apparent discrepancies between marital status and spouse information may be due to any of the following causes:
Student Status and Educational Attainment The variables FTSTU31X, FTSTU42X, FTSTU53X and FTSTU21X indicate whether the person was a full-time student at the interview date (or 12/31/21 for FTSTU21X). These variables have valid values for all persons between the ages of 17 - 23 inclusive. When this question was asked during Round 1 of Panel 26, it was based on age as of the 2021 NHIS interview date. Education questions were only asked when persons first entered MEPS, typically Round 1 for most people. It should be noted that education questions were changed with data collection in 2012 and then changed back to the original questions with data collection in 2015. The variables associated with the original education questions (data collection in 2011 and prior years and 2015 and subsequent years) are EDUCYR and HIDEG. The variable associated with the interim education question (data collection in 2012-2014) is EDUYRDEG (or EDUYRDG with collapsed categories). The variable EDRECODE relates variables for the original and interim education questions. As a result, different education variables are in the 2011-2015 PUFs based on the panel and round when a person first entered MEPS. The PUF documentation for each of the 2011-2015 years contains details about which education variables are in the respective files. Starting with the 2016 PUFs, EDUCYR and HIDEG are the only education variables on the PUFs. EDUCYR contains the number of years of education completed when entering MEPS for individuals 5 years or older. Children under the age of 5 years were coded as -1 “Inapplicable” regardless of whether they attended school. Individuals who were 5 years of age or older and had never attended school were coded as 0. The user should note that EDUCYR is an unedited variable and minimal data cleaning was performed on this variable. HIDEG contains information on the highest degree of education attained at the time the individual entered MEPS. Information was obtained from three questions: highest grade completed, high school diploma, and highest degree. Persons under 16 years of age when they first entered MEPS were coded as 8 “Under 16 - Inapplicable”. In cases where the response to the highest degree question was “No Degree” and the response to the highest grade question was 13 through 17, the variable HIDEG was coded as 3 “High School Diploma”. If the response to the highest grade completed was “Refused” or “Don’t Know” and the response to the highest degree question was “No Degree”, the variable HIDEG was coded as 1 “No Degree”. The user should note that HIDEG is an unedited variable and minimal data cleaning was performed on this variable. Military Service Information on active duty military status was collected during each round of the MEPS interview. Persons currently on full-time active duty status are identified in the variables ACTDTY31, ACTDTY42, and ACTDTY53. Those under 16 years of age were coded as 3 “Under 16 - Inapplicable”, and those over the age of 59 were coded as 4 “Over 59 - Inapplicable”. Relationship to the Reference Person within Reporting Units For each Reporting Unit (RU), the person who owns or rents the DU is usually defined as the reference person. For student RUs, the student is defined as the reference person. (For additional information on reference persons, see the documentation on survey administration variables.) The relationship variables indicate the relationship of each individual to the reference person of the Reporting Unit (RU) in a given round. For confidentiality, starting in 2013, detailed relationships were combined into more general categories in the variables REFRL31X, REFRL42X, REFRL53X, and REFRL21X. These variables replaced RFREL31X, RFREL42X, RFREL53X, and RFRELyyX used before 2013. The new and old variables are defined differently, so researchers using multiple years of MEPS should refer to prior years’ documentation to assure consistency in their data. Note that categories for Child (4), Parent (7), and Sibling (8) for REFRL31X, REFRL42X, REFRL53X, and REFRL21X changed in 2017. In 2013-2016, these categories included biological, adoptive, step relationships, as well as in-law and foster relationships. Starting in 2017, in-law relationships are included in 91 OTHER RELATED, SPECIFY.
For the reference person, these variables have the value “Household reference person”; for all other persons in the RU, relationship to the reference person is indicated by codes representing “Spouse”, “Unmarried Partner”, “Child”, etc. A code of 91, meaning “Other Related, Specify”, was used to indicate rarely observed relationship descriptions such as “Mother of Partner”, “Partner of Sister”, etc. If the relationship of an individual to the reference person was not determined during the round-specific interview, relationships between other RU members were used, where possible, to assign a relationship to the reference person. If MEPS data from calendar year 2021 were not sufficient to identify the relationship of an individual to the reference person, relationship variables from the 2019 MEPS or NHIS data were used to assign a relationship. In the event that a meaningful value could not be determined or data were missing, the relationship variable was assigned a missing value code. If the relationship of two individuals indicated they were spouses, but both had marital status indicating they were not married, their relationship was changed to non-marital partners. In addition, the relationship variables were edited to insure that they did not change across rounds for RUs in which the reference person did not change, with the exception of relationships identified as partner, or spouse relationships. 2.5.4 Person-Level Condition Variables (RTHLTH31–ADHDAGED)Perceived Health Status Perceived health status (RTHLTH31, RTHLTH42, and RTHLTH53) and perceived mental health status (MNHLTH31, MNHLTH42, and MNHLTH53) were collected in the Priority Conditions Enumeration (PE) section. The target persons of the questions are all current or institutionalized persons regardless of age. These questions (PE10 and PE20) asked the respondent to rate each person in the family according to the following categories: excellent, very good, good, fair, and poor. Priority Condition Variables (HIBPDX–ADHDAGED) The PE section was asked in its entirety in Round 1 for all current or institutionalized persons, and in Panel 26 Round 2, Panel 25 Round 4, Panel 24 Round 6, and Panel 23 Round 8 for only new RU members. In Panel 26 Round 3 and Panel 24 Round 7, the specific condition questions (except joint pain and chronic bronchitis) were asked only if the person had not reported the condition in a previous round. Also, PE questions were asked in Panel 24 Round 5 to collect calendar year data in a manner parallel to the other panels. Priority condition variables whose names end in “DX” indicate whether the person was ever diagnosed with the condition. Chronic bronchitis, joint pain, and asthma follow-up questions (ASSTIL31/53, ASATAK31/53, and ASTHEP31/53 described below) reflect data obtained in Round 7 of Panel 23, Rounds 5 and 7 of Panel 24, Round 3 of Panel 25, and Rounds 1 and 3 of Panel 26. Diagnoses data (except attention deficit hyperactivity disorder/attention deficit disorder, diabetes, and asthma) were collected for persons over 17 years of age. If edited age is within range for the variable to be set, but the source data are missing because person’s age in CAPI is not within range, the constructed variable is set to “Cannot be Computed” (-15). Following the same pattern, attention deficit hyperactivity disorder/attention deficit disorder is asked of persons age 5 to 17, and diabetes and asthma are asked of persons of all ages. Exceptions to this pattern are the variables JTPAIN31_M18 and CHBRON31, which are described in greater detail below. Questions were asked regarding the following conditions:
These conditions were selected because of their relatively high prevalence, and because generally accepted standards for appropriate clinical care have been developed. This information thus supplements other information on medical conditions that is gathered in other parts of the interview. Condition data were collected at the person-by-round level (indicating if the person was ever diagnosed with the condition) and at the condition level. If the person reported having been diagnosed with a condition, the person-by-round variable was set to ‘1’ (Yes) and a condition record for that medical condition was created. Editing of these variables focused on checking that skip patterns were consistent. High Blood Pressure Questions about high blood pressure (hypertension) were asked only of persons aged 18 or older. Consequently, persons aged 17 or younger were coded as “Inapplicable” (-1) on these variables. HIBPDX ascertained whether the person had ever been diagnosed as having high blood pressure (other than during pregnancy). Those who had received this diagnosis were also asked if they had been told on two or more different visits that they had high blood pressure (BPMLDX). The age of diagnosis for high blood pressure (HIBPAGED) is included in this file. This variable is top-coded to 85 years of age. Heart Disease Heart disease questions were asked only of persons aged 18 or older. Consequently, persons aged 17 or younger were coded as “Inapplicable” (-1) on all the variables in this set. CHDDX – asked if the person had ever been diagnosed as having coronary heart disease ANGIDX – asked if the person had ever been diagnosed as having angina, or angina pectoris MIDX – asked if the person had ever been diagnosed as having a heart attack, or myocardial infarction OHRTDX – asked if the person had ever been diagnosed with any other kind of heart disease or condition The age of diagnosis for coronary heart disease (CHDAGED), angina (ANGIAGED), heart attack or myocardial infarction (MIAGED), and other kind of heart disease (OHRTAGED) are included in this file. These variables are top-coded to 85 years of age. Respondents who answered “Yes” to a person being diagnosed with any other kind of heart disease or condition (OHRTDX), were asked a follow up question (OHRTTYPE) to specify other heart diseases or conditions. Stroke STRKDX asked if the person (aged 18 or older) had ever been diagnosed as having had a stroke or transient ischemic attack (TIA or ministroke). Persons aged 17 or younger were coded as “Inapplicable” (-1). The age of diagnosis for stroke or TIA (STRKAGED) is included in this file. This variable is top-coded to 85 years of age. Emphysema EMPHDX asked if the person (aged 18 or older) had ever been diagnosed with emphysema. Persons aged 17 or younger were coded as “Inapplicable” (-1). The age of diagnosis for emphysema (EMPHAGED) is included in this file. This variable is top-coded to 85 years of age. Chronic Bronchitis CHBRON31 asked if the person (aged 18 or older) has had chronic bronchitis in the last 12 months. Persons aged 17 or younger were coded as “Inapplicable” (-1). High Cholesterol Questions about high cholesterol were asked of persons aged 18 or older. Consequently, persons aged 17 or younger were coded as “Inapplicable” (-1) on these variables. CHOLDX ascertained whether the person had ever been diagnosed as having high cholesterol. The age of diagnosis for high cholesterol (CHOLAGED) is included in this file. This variable is top-coded to 85 years of age. Cancer Questions about cancer were asked only of persons aged 18 or older. Consequently, persons aged 17 or younger were coded as “Inapplicable” (-1) on these variables. CANCERDX ascertained whether the person had ever been diagnosed as having cancer or a malignancy of any kind. If the respondent answered “Yes” they were asked at PE140 what type of cancer was diagnosed. CABLADDR, CABLOOD, CABREAST, CACERVIX, CACOLON, CALUNG, CALYMPH, CAMELANO, CAMUSCLE, CAOTHER, CAPROSTA, CASKINNM, CASKINDK, and CAUTERUS indicate selection of cancer of the bladder, blood, breast, cervix, colon, or lung; lymphoma or melanoma; cancer of the soft tissue, muscle, or fat; other type of cancer, cancer of the prostate, skin, or uterus. Cancer of the cervix or uterus could not be reported for males, and cancer of the prostate could not be reported for females. Recoding of Cancer Variables Specific cancer diagnosis variables with a frequency count fewer than 20 and those considered clinically rare (i.e., appear on the National Institutes of Health’s list of rare diseases), were removed from the file for confidentiality reasons, and the corresponding variable CAOTHER, indicating diagnosis of a cancer that is not counted individually, was recoded to “Yes” (1) as necessary. In data year 2021, the clinically rare cancers are:
The variable CABREAST, which indicates diagnosis of breast cancer, was recoded to “Inapplicable” (-1) for males for confidentiality reasons. The corresponding value of the general cancer diagnosis variable, CANCERDX, was recoded to “Cannot be Computed” (-15), and the corresponding values of remaining specific cancer variables were recoded to “Inapplicable” (-1). Diabetes Prior to 2018, diabetes diagnosis was asked for each person aged 18 or older. Beginning in 2018, DIABDX_M18 replaces DIABDX where diabetes is now asked for all ages. DIABDX_M18 indicates whether each person had ever been diagnosed with diabetes (excluding gestational diabetes). The age of diagnosis for diabetes (DIABAGED) is included in this file. This variable is top-coded to 85 years of age. Each person 18 years or older said to have received a diagnosis of diabetes was asked to complete a special self-administered questionnaire. The documentation for this questionnaire appears in the Diabetes Care Survey (DCS) section of the full year Consolidated file documentation. Joint Pain JTPAIN31_M18 asked if the person (aged 18 or older) had experienced pain, swelling, or stiffness around a joint in the last 12 months. This question is not intended to be used as an indicator of a diagnosis of arthritis. Persons aged 17 or younger were coded as “Inapplicable” (-1). Joint pain questions are skipped if the person already has an arthritis condition that is specified on the conditions roster in the PE section. Arthritis ARTHDX asked if the person (aged 18 or older) had ever been diagnosed with arthritis. Persons aged 17 or younger were coded as “Inapplicable” (-1). Respondents who answered “Yes” were asked a follow up question to determine the type of arthritis. ARTHTYPE indicates if the diagnosis was for Rheumatoid Arthritis (1), Osteoarthritis (2), or non-specific arthritis (3). The age of diagnosis for arthritis (ARTHAGED) is included in this file and may be recoded in some cases to “Cannot be Computed” (-15) for confidentiality reasons. This variable is top-coded to 85 years of age. Asthma ASTHDX indicates whether a person had ever been diagnosed with asthma. The age of diagnosis for asthma (ASTHAGED) is included in this file. This variable is top-coded to 85 years of age. Respondents who answered “Yes” to asthma diagnosis were asked additional questions. ASSTIL31 asked if the person still had asthma. ASATAK31 asked whether the person had experienced an episode of asthma or an asthma attack in the past 12 months. If the person did not experience an asthma attack in the past 12 months, a follow-up question (ASTHEP31) asked when the last asthma episode or asthma attack occurred. Additional follow-up questions regarding asthma medication used for quick relief (ASACUT31), preventive medicine (ASPREV31), and peak flow meters (ASPKFL31) were asked. These questions were asked if the person reported having been diagnosed with asthma (ASTHDX = 1). ASACUT31 asked whether, during the last three months, the person had used the kind of prescription inhaler “that you breathe in through your mouth” that gives quick relief from asthma symptoms. ASPREV31 asked whether the person had ever taken the preventive kind of asthma medicine used every day to protect the lungs and prevent attacks, including both oral medicine and inhalers. ASPKFL31 indicates whether the person with asthma had a peak flow meter at home. Respondents who answered “Yes” to ASACUT31 were asked whether the person had used more than three canisters of this type of inhaler in the past three months (ASMRCN31). Respondents who answered “Yes” to ASPREV31 were asked whether the person now took this kind of medication daily or almost daily (ASDALY31). Respondents who answered “Yes” to ASPKFL31 were asked if the person ever used the peak flow meter (ASEVFL31). Those respondents who answered “Yes” to ASEVFL31 were asked when the person last used the peak flow meter (ASWNFL31). Beginning in 2018, questions regarding asthma medication used for quick relief, preventive medicine, and peak flow meters are now implemented starting with Panel 22 Round 3 and Panel 23 Round 1. With the extension of rounds and an additional panel beginning in 2020, Round 5/3 asthma variables have been added. The asthma variables included in this file are: ASSTIL31 (Does Person Still Have Asthma - RD 3/1) ASSTIL53 (Does Person Still Have Asthma - RD 5/3) ASATAK31 (Asthma Attack Last 12 Mos - RD 3/1) ASATAK53 (Asthma Attack Last 12 Mos - RD 5/3) ASTHEP31 (When Was Last Episode of Asthma - RD 3/1) ASTHEP53 (When Was Last Episode of Asthma - RD 5/3) ASACUT31 (Used Acute Pres Inhaler Last 3 Mos- RD 3/1) ASACUT53 (Used Acute Pres Inhaler Last 3 Mos- RD 5/3) ASPREV31 (Ever Used Prev Daily Asthma Meds - RD 3/1) ASPREV53 (Ever Used Prev Daily Asthma Meds - RD 5/3) ASPKFL31 (Have Peak Flow Meter at Home - RD 3/1) ASPKFL53 (Have Peak Flow Meter at Home - RD 5/3) ASMRCN31 (Used >3 Acute Cn Pres Inh Last 3 Mos - RD 3/1) ASMRCN53 (Used >3 Acute Cn Pres Inh Last 3 Mos - RD 5/3) ASDALY31 (Now Take Prev Daily Asthma Meds - RD 3/1) ASDALY53 (Now Take Prev Daily Asthma Meds - RD 5/3) ASEVFL31 (Ever Used Peak Flow Meter - RD 3/1) ASEVFL53 (Ever Used Peak Flow Meter - RD 5/3) ASWNFL31 (When Last Used Peak Flow Meter - RD 3/1) ASWNFL53 (When Last Used Peak Flow Meter - RD 5/3) It may appear that there are discrepancies between the diagnosis variable and the follow-up variables. If a person reported asthma in the PE section in Panel 26 Round 3, ASATAK31 and ASSTIL31 will be set to “Inapplicable” (-1) as the person had not reported asthma in Round 1. Attention Deficit Hyperactivity Disorder/Attention Deficit Disorder ADHDADDX asked if persons aged 5 through 17 had ever been diagnosed as having Attention Deficit Hyperactivity Disorder or Attention Deficit Disorder. Persons younger than 5 or older than 17 were coded as “Inapplicable” (-1). The age of diagnosis for attention deficit hyperactivity disorder/attention deficit disorder (ADHDAGED) is included in this file. 2.5.5 Health Status Variables (IADLHP31– ADOVER42)Due to the overlapping panel design of the MEPS (Round 7 for Panel 23, Round 5 for Panel 24, Round 3 for Panel 25 and Round 1 for Panel 26 overlapped; Round 8 for Panel 23, Round 6 for Panel 24, Round 4 for Panel 25, and Round 2 for Panel 26 overlapped; and Round 7 for Panel 24 and Round 3 for Panel 26 overlapped), data from overlapping rounds have been combined across panels. In 2020, data collection was expanded beyond five rounds. In 2021, variables ending in “31” reflect data obtained in Round 7 of Panel 23, Round 5 of Panel 24, Round 3 of Panel 25 and Round 1 of Panel 26. Variables ending in “42” reflect data obtained in Round 8 of Panel 23, Round 6 of Panel 24, Round 4 of Panel 25 and Round 2 of Panel 26. Variables ending in “53” reflect data obtained in Round 7 of Panel 24 and Round 3 of Panel 26. Health Status variables whose names end in “21” indicate a full-year measurement. For persons in Panel 25, Round 3 extended from 2020 into 2021. Therefore, for these people, some information from late 2020 is included for variables that have names ending in “31”. Health Status variables in this data release can be classified into several conceptually distinct sets:
In general, Health Status variables involved the construction of person-level variables based on information collected in the Health Status section of the questionnaire. Many Health Status questions were initially asked at the family level to ascertain if anyone in the household had a particular problem or limitation. These were followed up with questions to determine which household member had each problem or limitation. All information ascertained at the family level has been brought to the person level for this file. Logical edits were performed in constructing the person-level variables to assure that family-level and person-level values were consistent. Particular attention was given to cases where missing values were reported at the family level to ensure that appropriate information was carried to the person level. Inapplicable cases occurred when a question was never asked because of a skip pattern in the survey (e.g., some follow-up verification questions were not asked about individuals who were 13 years of age or older; questions pertaining to children’s health status were not asked about individuals older than 17). Inapplicable cases are coded as -1. In addition, deceased persons were coded as “Inapplicable” (-1). Each of the sets of variables listed above will be described in turn. IADL and ADL Limitations IADL Help The Instrumental Activities of Daily Living (IADL) Help or Supervision variable IADLHP31 was constructed from a series of three questions administered in the Health Status section of the interview in Panel 23 Round 7, Panel 24 Round 5, Panel 25 Round 3 and Panel 26 Round 1. In 2021, the IADL questions were also administered in Panel 24 Round 7 and Panel 26 Round 3 and the new variable IADLHP53 is included in this file. The initial question (HE10) determined if anyone in the family received help or supervision with IADLs such as using the telephone, paying bills, taking medications, preparing light meals, doing laundry, or going shopping. If the response was “Yes”, a follow-up question (HE20) was asked to determine which household member(s) received this help or supervision. For persons under age 13, a final verification question (HE30) was asked to confirm that the IADL help or supervision was the result of an impairment or physical or mental health problem. If the response to the final verification question was “No”, IADLHP31 or IADLHP53 was coded “No” for persons under the age of 13. If no one in the family was identified as receiving help or supervision with IADLs, all members of the family were coded as receiving no IADL help or supervision. In cases where the response to the family-level question was “Refused” (-7), “Don’t Know” (-8), or “Cannot be Computed” (-15), all persons were coded according to the family-level response. In cases where the response to the family-level question (HE10) was “Yes” but no specific individuals were identified in the follow-up question as having IADL difficulties, all persons were coded as “Don’t Know” (-8). ADL Help The Activities of Daily Living (ADL) Help or Supervision variable ADLHLP31 was constructed in the same manner, and for the same persons, as the IADL help variable, but using questions HE40-HE60 in Panel 23 Round 7, Panel 24 Round 5, Panel 25 Round 3 and Panel 26 Round 1. In 2021, the ADL questions were also administered in Panel 24 Round 7 and Panel 26 Round 3 and the variable ADLHLP53 is included in this file. Coding conventions for missing data were the same as for the IADL variable. Functional and Activity Limitations A series of health status questions was asked related to functional limitations; use of assistive technology and social/recreational limitations; work, housework, and school limitations; and cognitive limitations. The ‘31’ versions of these variables incorporate data collected in Panel 23 Round 7, Panel 24 Round 5, Panel 25 Round 3 and Panel 26 Round 1. The ‘53’ versions of these variables incorporate data collected in Panel 24 Round 7 and Panel 26 Round 3. Functional Limitations A series of questions was asked that pertained to functional limitations, which are defined as difficulty in performing certain specific physical actions. WLKLIM31/53 was the filter question. These variables were derived from a question (HE90) that was asked at the family level: “Does anyone in the family have difficulties walking, climbing stairs, grasping objects, reaching overhead, lifting, bending or stooping, or standing for long periods of time?” If the answer was “No”, then all family members were coded as “No” (2) on WLKLIM31/53. If the answer was “Yes”, then the specific persons who had any of these difficulties were identified and coded as “Yes” (1), and remaining family members were coded as “No” (2). If the response to the family-level question was “Don’t Know” (-8), “Refused” (-7), “Cannot be Computed” (-15), or “Inapplicable” (-1), then the corresponding missing value code was applied to each family member’s value for WLKLIM31/53. If the answer to HE90 was “Yes” (1) but no specific individual was named as experiencing such difficulties, then each family member was assigned “Don’t Know” (-8). Deceased persons were assigned a code of “Inapplicable” (-1) for WLKLIM31/53. If WLKLIM31/53 was coded “Yes” (1) for any family member, a subsequent series of questions was administered. The series of questions for which WLKLIM31/53 served as a filter is as follows: LFTDIF31/53 - difficulty lifting 10 pounds STPDIF31/53 - difficulty walking up 10 steps WLKDIF31/53 - difficulty walking 3 blocks MILDIF31/53 - difficulty walking a mile STNDIF31/53 - difficulty standing 20 minutes BENDIF31/53 - difficulty bending or stooping RCHDIF31/53 - difficulty reaching over head FNGRDF31/53 - difficulty using fingers to grasp This series of questions was asked separately for each person whose response to WLKLIM31/53 was coded “Yes” (1). The series of questions was not asked for other individual family members whose response to WLKLIM31/53 was “No” (2). In addition, this series was not asked about family members who were less than 13 years of age, regardless of their status on WLKLIM31/53. These questions were not asked about deceased family members. In such cases (i.e., WLKLIM31/53 = 2, or age < 13, or PSTATS31/53 = 23, 24, or 31), each question in the series was coded as “Inapplicable” (-1). Finally, if responses to WLKLIM31/53 were “Refused” (-7), “Don’t Know” (-8), “Cannot be Computed” (-15), or otherwise “Inapplicable” (-1), then each question in this series was coded as “Inapplicable” (-1). Analysts should note that WLKLIM31/53 was asked of all household members, regardless of age. For the subsequent series of questions, however, persons less than 13 years old were skipped and coded as “Inapplicable” (-1). Therefore, it is possible for someone age 12 or younger to have a code of “Yes” (1) on WLKLIM31/53, and also to have codes of “Inapplicable” on the subsequent series of questions. Use of Assistive Technology and Social/Recreational Limitations The variables indicating use of assistive technology (AIDHLP31/53 from question HE70) and social/recreational limitations (SOCLIM31/53, from question HE230) were collected initially at the family level. If there was a “Yes” (1) response to the family-level question, a second question identified the specific individual(s) to whom the “Yes” response pertained. Each individual identified as having the difficulty was coded “Yes” (1) for the appropriate variable; all remaining family members were coded “No” (2). If the family-level response was “Refused” (-7), “Don’t Know” (-8), or “Cannot be Computed” (-15), all persons were coded with the family-level response. In cases where the family-level response was “Yes” but no specific individual was identified as having difficulty, all family members were coded as “Don’t Know” (-8). Work, Housework, and School Limitations The variable indicating any limitation in work, housework, or school (ACTLIM31/53) was constructed using questions HE190-HE200. Specifically, information was collected initially at the family level. If there was a “Yes” (1) response to the family-level question (HE190), a second question (HE200) identified the specific individual(s) to whom the “Yes” (1) response pertained. Each individual identified as having a limitation was coded “Yes” (1) for the appropriate variable; all remaining family members were coded “No” (2). If the family-level response was “Refused” (-7), “Don’t Know” (-8), or “Cannot be Computed” (-15), all persons were coded with the family-level response. In cases where the family-level response was “Yes” (1) but no specific individual was identified as having limitation, all family members were coded as “Don’t Know” (-8). Persons less than five years old were coded as “Inapplicable” (-1) on ACTLIM31/53. If ACTLIM31/53 was “Yes” (1) and the person was 5 years of age or older, a follow-up question (HE210) was asked to identify the specific limitation or limitations for each person. These included working at a job (WRKLIM31/53), doing housework (HSELIM31/53), or going to school (SCHLIM31/53). Respondents could answer “Yes” (1) or “No” (2) to each activity; thus a person could report limitations in multiple activities. WRKLIM31/53, HSELIM31/53, and SCHLIM31/53 have values of “Yes” (1) or “No” (2) only if ACTLIM31/53 was “Yes” (1); each variable was coded as “Inapplicable” (-1) if ACTLIM31/53 was “No” (2). When ACTLIM31/53 was “Refused” (-7), these variables were all coded as “Refused” (-7); when ACTLIM31/53 was “Don’t Know” (-8), these variables were all coded as “Don’t Know” (-8); and when ACTLIM31/53 was “Cannot be Computed” (-15), these variables were all coded as “Cannot be Computed” (-15). If a person was under 5 years old or was deceased, WRKLIM31/53, HSELIM31/53, and SCHLIM31/53 were each coded as “Inapplicable” (-1). An additional question (UNABLE31/53) asked if the person was completely unable to work at a job, do housework, or go to school. Those persons who were coded “No” (2), “Refused” (-7), “Don’t Know” (-8), or “Cannot be Computed” (-15) on ACTLIM31/53, were under 5 years of age, or were deceased were coded as “Inapplicable” (-1) on UNABLE31/53. UNABLE31/53 was asked once for whichever set of WRKLIM31/53, HSELIM31/53, and SCHLIM31/53 the person had limitations; if a person was limited in more than one of these three activities, UNABLE31/53 did not specify if the person was completely unable to perform all of them, or only some of them. Cognitive Limitations The variable indicating any cognitive limitation (COGLIM31/53) was collected at the family level as a three-part question (HE250A to HE250C), asking if any of the adults in the family (1) experience confusion or memory loss, (2) have problems making decisions, or (3) require supervision for their own safety. If a “Yes” response was obtained to any item, the persons affected were identified in HE260, and COGLIM31/53 was coded as “Yes” (1). Remaining family members not identified were coded as “No” (2) for COGLIM31/53. If responses to HE250A through HE250C were all “No”, or if two of three were “No” (2) and the remaining was “Refused” (-7), “Don’t Know” (-8), or “Cannot be Computed” (-15), all family members were coded as “No” (2). If responses to the three questions were combinations of “Don’t Know” (-8), “Refused” (-7), and missing, all persons were coded as “Don’t Know” (-8). If the response to any of the three questions was “Yes” (1) but no individual was identified in HE260, all persons were coded as “Don’t Know” (-8). COGLIM31/53 reflects whether any of the three component questions is “Yes” (1). Family members with one, two, or three specific cognitive limitations cannot be distinguished. In addition, because the question asked specifically about adult family members, all persons less than 18 years of age are coded as “Inapplicable” (-1) on this question. Hearing, Vision Problems A series of questions (HE270 to HE310), asked in Panel 23 Round 8, Panel 24 Round 6, Panel 25 Round 4 and Panel 26 Round 2, provides information on hearing and visual impairment. Household members less than one year old and deceased RU members were coded as “Inapplicable” (-1). The hearing impairment variable, DFHEAR42, indicates whether a person has serious difficulty hearing. This variable was based on two questions, HE270 and HE280. The initial question (HE270) determined if anyone in the family had difficulty hearing. If the response was “Yes” (1), a follow-up question (HE280) was asked to determine which household member(s) had a hearing impairment. If the family-level response was “Don’t Know” (-8), “Refused” (-7), or “Cannot be Computed” (-15), all persons were coded with the family-level response. In cases where the family-level response was “Yes” (1) but no specific individual was identified as having serious difficulty hearing, all family members were coded as “Don’t Know” (-8). The visual impairment variable, DFSEE42, indicates whether a person has serious difficulty seeing. This variable was based on two questions, HE290C and HE300. The initial question (HE290C) determined if anyone in the family had difficulty seeing. If the response was “Yes” (1), a follow-up question (HE300) was asked to determine which household member(s) had a seeing impairment. If the family-level response was “Don’t Know” (-8), “Refused” (-7), or “Cannot be Computed” (-15), all persons were coded with the family-level response. In cases where the family-level response was “Yes” (1) but no specific individual was identified as having serious difficulty seeing, all family members were coded as “Don’t Know” (-8). Disability Status A series of questions (HE310 to HE380) in Panel 23 Round 8, Panel 24 Round 6, Panel 25 Round 4 and Panel 26 Round 2 provides information on cognitive difficulty, difficulty walking or climbing stairs, difficulty dressing or bathing, and difficulty doing errands. Questions regarding cognitive difficulty, difficulty walking or climbing stairs, and difficulty dressing or bathing were asked of household members 5 years of age and older. The question regarding difficulty doing errands was asked of household members 15 years of age and older. Deceased RU members were coded “Inapplicable” (-1). DFCOG42 indicates whether a person had serious cognitive difficulty. This variable was based on two questions, HE310 and HE320. The initial question (HE310) determined if anyone in the family had difficulty concentrating, remembering or making decisions. If the response was “Yes” (1), a follow-up question (HE320) was asked to determine which household member(s) had difficulty concentrating, remembering or making decisions. If the family-level response was “Don’t Know” (-8), “Refused” (-7), or “Cannot be Computed” (-15), all persons were coded with the family-level response. In cases where the family-level response was “Yes” (1) but no specific individual was identified as having serious cognitive difficulty, all family members were coded as “Don’t Know” (-8). DFWLKC42 indicates whether a person has serious difficulty walking or climbing stairs. This variable was based on two questions, HE330 and HE340. The initial question (HE330) determined if anyone in the family had serious difficulty walking or climbing stairs. If the response was “Yes” (1), a follow-up question (HE340) was asked to determine which household member(s) had difficulty walking or climbing stairs. If the family-level response was “Don’t Know” (-8), “Refused” (-7), or “Cannot be Computed” (-15), all persons were coded with the family-level response. In cases where the family-level response was “Yes” (1) but no specific individual was identified as having serious difficulty walking or climbing stairs, all family members were coded as “Don’t Know” (-8). DFDRSB42 indicates whether a person has difficulty dressing or bathing. This variable was based on two questions, HE350 and HE360. The initial question (HE350) determined if anyone in the family had difficulty dressing or bathing. If the response was “Yes” (1), a follow-up question (HE360) was asked to determine which household member(s) had difficulty dressing or bathing. If the family-level response was “Don’t Know” (-8), “Refused” (-7), or “Cannot be Computed” (-15), all persons were coded with the family-level response. In cases where the family-level response was “Yes” (1) but no specific individual was identified as having difficulty dressing or bathing, all family members were coded as “Don’t Know” (-8). DFERND42 indicates whether a person has difficulty doing errands alone. This variable was based on two questions, HE370 and HE380. The initial question (HE370) determined if anyone in the family had difficulty doing errands alone. If the response was “Yes” (1), a follow-up question (HE380) was asked to determine which household member(s) had difficulty doing errands alone. If the family-level response was “Don’t Know” (-8), “Refused” (-7), or “Cannot be Computed” (-15), all persons were coded with the family-level response. In cases where the family-level response was “Yes” (1) but no specific individual was identified as having difficulty doing errands alone, all family members were coded as “Don’t Know” (-8). Any Limitation Rounds 7 and 8 (Panel 23)/ Rounds 5, 6 and 7 (Panel 24) / Rounds 3 and 4 (Panel 25)/ Rounds 1, 2 and 3 (Panel 26) ANYLMI21 summarizes whether a person had any IADL, ADL, functional, or activity limitations in any of the pertinent rounds. ANYLMI21 was built using the component variables IADLHP31/53, ADLHLP31/53, WLKLIM31/53, ACTLIM31/53, DFSEE42, and DFHEAR42. If any of these components was coded “Yes”, then ANYLMI21 was coded “Yes” (1). If all components were coded “No”, then ANYLMI21 was coded “No” (2). If all the components were “Inapplicable” (-1), then ANYLMI21 was coded as “Inapplicable” (-1). If all the components had missing value codes (i.e., -7, -8, or -1), ANYLMI21 was coded as “Cannot be Computed” (-15). If some components were “No” and others had missing value codes, ANYLMI21 was coded as “Cannot be Computed” (-15). The exception to this latter rule was for children younger than five years old, who were not asked questions that are the basis for ACTLIM31/53; for these RU members, if all other components were “No”, then ANYLMI21 was coded as “No” (2). The variable label for ANYLMI21 departs slightly from conventions. Typically, variables that end in “21” refer only to 2021. However, some of the variables used to construct ANYLMI21 were assessed in 2022, so some information from early 2022 is incorporated into this variable. Child Health and Preventive Care Questions were asked about each child (under the age of 18 excluding deceased children) in the applicable age subgroups to which they pertained. For the Child Supplement variables, a code of “Inapplicable” (-1) was assigned if a person was deceased, was not in the appropriate Round 2, 4, 6, or 8 or was not in the applicable age subgroup as of the interview date. This public use dataset contains variables and frequency distributions from the Child Health and Preventive Care (CS) Section associated with 7,519 children who were eligible for the (CS) section. Children were eligible for this section when PSTATS42 was not equal to 23, 24, 31 (Deceased) and 0 <= AGE42X <= 17. Of these children, 5,379 were assigned a positive person-level weight for 2021 (PERWT21P > 0). Cases not eligible for the (CS) section should be excluded from estimates made with the (CS) section. Starting in 2018, the Consumer Assessment of Healthcare Providers and Systems (CAHPS) and Columbia Impairment Scale (CIS) series of questions will be administered every other year. CAPI will administer the CAHPS and CIS series as follows:
Therefore, since Panel 26 Round 1 collection started in 2021, Panel 25 Round 1 collection started in 2020, Panel 24 Round 1 collection started in 2019 and Panel 23 Round 1 started in 2018, the CAHPS and CIS questions were asked, and these variables are included in the 2021 dataset. In addition, the child preventive care series will be administered every other year beginning in 2018. CAPI will administer the child preventive care series as follows:
Therefore, the child preventive care questions were not asked in 2021 and are not included in the 2021 dataset.
Children with Special Health Care Needs Screener (ages 0 - 17) The Children with Special Health Care Needs (CSHCN) Screener instrument was developed through a national collaborative process as part of the Child and Adolescent Health Measurement Initiative (CAHMI) coordinated by the Foundation for Accountability. A key reference for this screener instrument is Bethel et al (2002).These questions are asked about children ages 0 -17. In general, the CSHCN screener identifies children with activity limitation or need or use of more health care or other services than is usual for most children of the same age. When a response to a gate question was set to “No” (2), “Refused” (-7), “Don’t Know” (-8), or “Cannot be Computed” (-15), follow-up variables based on the gate question were coded as “Inapplicable” (-1). The variable CSHCN42 identifies children with special health care needs, and was created using the CSHCN screener questions according to the specifications in the reference above. The CSHCN screener questions consist of a series of question sequences about the following five health consequences: the need or use of medicines prescribed by a doctor; the need or use of more medical care, mental health, or education services than is usual for most children; being limited or prevented in doing things most children can do; the need or use of special therapy such as physical, occupational, or speech therapy; and the need or use of treatment or counseling for emotional, developmental, or behavioral problems. Parents who responded “yes” to any of the “initial” questions in the five question sequences were then asked to respond to up to two follow-up questions about whether the health consequence was attributable to a medical, behavioral, or other health condition lasting or expected to last at least 12 months. Children with positive responses to at least one of the five health consequences along with all of the follow-up questions were identified as having a Special Health Care Need. Children with a “no” response for all five question sequences were considered NOT to have a Special Health Care Need. Those children whose “special health care need” status could not be determined (due to missing data for any of the questions) were coded as “Unknown” for CSHCN42. More information about the CSHCN screener questions can be obtained from the website for the Child and Adolescent Health Measurement Initiative. The CSHCN screener questions were: CHPMED42 - child needs or uses prescribed medicines CHPMHB42 - prescribed medicines were because of a medical, behavioral, or other health condition CHPMCN42 - health condition that causes a person to need prescribed medicines has lasted or is expected to last for at least 12 months CHSERV42 - child needs or uses more medical care, mental health, or education services than is usual for most children of the same age CHSRHB42 - child needs or uses more medical and other service because of a medical, behavioral, or other health condition CHSRCN42 - health condition that causes a person to need or use more medical and other services has lasted or is expected to last for at least 12 months CHLIMI42 - child is limited or prevented in any way in ability to do the things most children of the same age can do CHLIHB42 - child is limited in the ability to do the things most children can do because of a medical, behavioral, or other health condition CHLICO42 - health condition that causes a person to be limited in the ability to do the things most children can do has lasted or is expected to last for at least 12 months CHTHER42 - child needs or gets special therapy such as physical, occupational, or speech therapy CHTHHB42 - child needs or gets special therapy because of a medical, behavioral, or other health condition CHTHCO42 - health condition that causes a person to need or get special therapy has lasted or is expected to last for at least 12 months CHCOUN42 - child has an emotional, developmental, or behavioral problem for which he or she needs or gets treatment or counseling CHEMPB42 - problem for which a person needs or gets treatment or counseling is a condition that has lasted or is expected to last for at least 12 months CSHCN42 - identifies children with special health care needs Columbia Impairment Scale (ages 5 - 17) (included in alternating years only) These questions inquired about possible child behavioral problems and were asked in previous years. Respondents were asked to rate on a scale from 0 to 4, where “0” indicates “No Problem” and “4” indicates “A Very Big Problem”, how much of a problem the child has with thirteen specified activities. A key reference for the Columbia Impairment Scale is Bird et al (1996). Certain questions in this series were coded to “Asked, but Inapplicable” (99) when the question was not applicable for a specific child. For example, if a child’s mother was deceased, a question about how much of a problem a child has getting along with his/her mother would be set to “Asked, but Inapplicable” (99). Similarly, the question about problems getting along with siblings would be set to “Asked, but Inapplicable” (99) for children with no siblings. Variables in this set include: GETTRB42 - problem with getting into trouble MOMPRO42 - getting along with mother DADPRO42 - getting along with father UNHAP42 - feeling unhappy or sad SCHLBH42 - (his/her) behavior at school HAVFUN42 - having fun ADUPRO42 - getting along with adults NERVAF42 - feeling nervous or afraid SIBPRO42 - along with brothers and sisters KIDPRO42 - getting along with other kids SPRPRO42 - getting involved in activities like sports or hobbies SCHPRO42 - (his/her) schoolwork HOMEBH42 - (his/her) behavior at home CAHPS® (Consumer Assessment of Healthcare Providers and Systems) ages 0 - 17 (included in alternating years only) The health care quality measures were taken from the health plan version of CAHPS®, an AHRQ-sponsored family of survey instruments designed to measure quality of care from the consumer’s perspective. All of the CAHPS® variables refer to events experienced in the last 12 months. The variables included from the CAHPS® are: CHILCR42 - whether a person had an illness, injury, or condition that needed care right away from a clinic, emergency room, or doctor’s office CHILWW42 - how often a person got care as soon as was needed (coded as “-1 Inapplicable” when CHILCR42 = 2, -7, -8, or -15) CHRTCR42 - whether any appointments were made CHRTWW42 - how often a person got an appointment for health care as soon as was needed (coded as “-1 Inapplicable” when CHRTCR42 = 2, -7, -8, or -15) CHAPPT42 - how many times a person went to a doctor’s office or clinic for health care CHLIST42 - how often a person’s doctors or other health providers listened carefully to the parent (coded as “-1 Inapplicable” when CHAPPT42 = 0, -7, -8, or CHEXPL42 - how often a person’s doctors or other health providers explained things in a way the parent could understand (coded as “-1 Inapplicable” when CHAPPT42 = 0, -7, -8, or -15) CHRESP42 - how often a person’s doctors or other health providers showed respect for what the parent had to say (coded as “-1 Inapplicable” when CHAPPT42 = 0, -7, -8, or -15) CHPRTM42 - how often doctors or other health providers spent enough time with a person (coded as “-1 Inapplicable” when CHAPPT42 = 0, -7, -8, or -15) CHHECR42 - rating of health care from 0 to 10 where 0 = Worst health care possible and 10 = Best health care possible (coded as “-1 Inapplicable” when CHAPPT42 = 0, -7, -8, or -15) CHSPEC42_M18 - whether a person made an appointment to see a specialist CHEYRE42_M18 - how often did a person get appointments to see a specialist (coded as “-1 Inapplicable” when CHSPEC42 = 2, -7, -8, or -15) Additional Health Variables LSTETH53 (has person lost all natural (permanent) teeth), PHYEXE53 (currently spends half hour or more in moderate to vigorous physical activity at least five times a week), and OFTSMK53 (how often smoke cigarettes) are asked in the Additional Healthcare Questions (AH) section. These questions are asked every year of each person 18 years or older. A code of “Inapplicable” (-1) was assigned if the person was deceased or less than 18 years old. In 2021, these variables include data collected in Panel 23 Round 9, Panel 24 Round 7, Panel 25 Round 5, and Panel 26 Round 3. Self-Administered Questionnaire (SAQ) The Self-Administered Questionnaire (SAQ) variables will be released on the 2021 Consolidated file. Diabetes Care Survey (DCS) The Diabetes Care Survey (DCS) is a self-administered paper-and-pencil questionnaire fielded during Panel 23 Round 9, Panel 24 Round 7, Panel 25 Round 5 and Panel 26 Round 3. These data and documentation of the data will be included only in the full year Consolidated file (HC-233). Social Determinants of Health Survey (SDOH) The Social Determinants of Health Survey (SDOH) includes questions about the social determinants of health, such as housing, financial well-being, food security, social support, discrimination, and physical and sexual violence. The SDOH constitutes the first MEPS instrument to be administered both as a paper-and-pencil and web questionnaire. The SDOH was fielded during Panel 23 Round 7, Panel 24 Round 5, Panel 25 Round 3, and Panel 26 Round 1 of the MEPS data collection. All adults age 18 and older as of the Round 1, 3, 5, or 7 interview date (AGE31X >= 18) in MEPS households were asked to complete the SDOH. No gate questions were included, all questions were asked of all SDOH respondents, regardless of age, sex, or health status. The variable SDOHELIG indicates the person’s eligibility status for the SDOH. SDOHELIG was used to construct the variables based on the SDOH data. SDOHELIG was coded “0” (Not Eligible for SDOH) if there was no record for the person in the round, if the person was not key, if the person was deceased or institutionalized, moved out of the U.S., moved to a military facility, if the person’s disposition status was inapplicable, or if the person was less than 18 years old. SDOHELIG was coded “1” (Eligible for SDOH and Has SDOH Data) if an SDOH record existed for the person in Round 1 (for Panel 26), Round 3 (for Panel 25), Round 5 (for Panel 24), or Round 7 (for Panel 23). SDOHELIG was coded “2” (Eligible for SDOH, but No SDOH Data) if no SDOH record existed for the person in the round. This variable was used as a building block for all other constructed SDOH variables. A question that was only included on the paper form asked if the respondent was the person represented in the form. If a person was unable to respond to the SDOH, the paper questionnaire was completed by a proxy. The relationship of the proxy to the adult represented in the questionnaire is indicated by the variable SDPROX42. SDPROX42 was coded “1” (Self-administered) if the respondent was the person represented in the questionnaire. A code of “-1” (Inapplicable) was assigned if a person was not eligible or was eligible but no data existed (SDOHELIG = 0 or 2). Month and year of survey completion are included - SDCMPM and SDCMPY. A special weight variable (SDOHWT21P) has been designed to be used with the SDOH for persons who were age 18 and older at the interview date. This weight adjusts for non-response and weights to the U.S. civilian noninstitutionalized population (see Section 3.0 “Survey Sample Information” for details). The variables created from the SDOH begin with “SD.” If a person was not assigned a positive SDOH weight, all SDOH variables except SDOHELIG were coded “Inapplicable” (-1). SDLIFE - How satisfied with life SDHOME - How satisfied with house/apartment where person lives SDMEDCARE - Rate neighborhood availability of places to get medical care SDPARKS - Rate neighborhood availability of parks and playgrounds SDHLTHFOOD - Rate neighborhood availability of places to buy healthy food SDSFCRIME - Rate neighborhood safety from crime and violence SDPUBTRANS - Rate neighborhood access to public transportation SDAFRDHOME - Rate neighborhood availability of affordable housing SDNOTRANS - During last 12 months, has lack of transportation kept person from medical appointments, meetings, work, or getting things needed for daily living SDLATERENT - During last 12 months, household’s rent/mortgage was late or unpaid because household could not afford to pay SDLATEUTIL - During last 12 months, household unable to pay electric, gas, oil, or water bills on time SDSHUTUTIL - During last 12 months, home utilities were threatened to be shut off SDPROBPEST - The place where person lives has problems with pests SDPROBMOLD - The place where person lives has problems with mold SDPROBLEAD - The place where person lives has problems with lead paint or pipes SDPROBHEAT - The place where person lives has problems with lack of heat SDPROBCOOK - The place where person lives has problems with oven or stove not working SDPROBSMKDET - The place where person lives has problems with smoke detectors missing or not working SDPROBLEAKS - The place where person lives has problems with water leaks SDPROBNONE - The place where person lives has problems with none of the above SDWRRYFD - During last 12 months, worried about food running out before person got money to buy more SDNOFOOD - During last 12 months, food bought didn’t last and person did not have money to get more SDPAYBASICS - How hard is it to pay for basics like food, housing, medical care, and heating SDUNEXPEXP - How confident able to come up with $400 if an unexpected expense arose within the next month SDMISSCCLN - During last 12 months, missed a payment on a credit card or loan (not including mortgage) SDDEBT - During last 12 months, contacted by debt collection agency SDDAYEXER - During last 30 days, how many days per week engaged in moderate exercise SDMINSEXER - Average amount of minutes spent doing moderate exercise SDSTRESS - Rate current stress level SDFAMILY - How much help expected to receive from family if problem arose (for example, sickness or moving) SDFRIENDS - How much help expected to receive from friends if problem arose (for example, sickness or moving) SDCOMM - How much help expected to receive from community if problem arose (for example, sickness or moving) SDTLKPHN - How often talk on the phone (or by video) to family, friends, or neighbors in a typical week SDGETTGT - How often get together with friends or relatives in a typical week SDCHURCH - How often attend church or religious services (in-person or online) SDCLUBORG - How often attend meetings of clubs or organizations (in-person or online) SDCOMPAN - How often feel the lack of companionship SDLEFTOUT - How often feel left out SDISOL - How often feel isolated from others SDENICPROD - Ever used an electric nicotine product SDDSCRMDR - Ever experienced discrimination at doctor’s office, clinic, or hospital SDDSCRMWRK - Ever experienced discrimination at work SDDSCRMJOB - Ever experienced discrimination when applying for jobs SDDSCRMHS - Ever experienced discrimination when trying to rent or buy housing SDDSCRMPOL - Ever experienced discrimination when interacting with law enforcement SDDSCRMPUB - Ever experienced discrimination when applying for social services or public assistance SDDSCRMSTR - Ever experienced discrimination at a restaurant or store SDPHYSHURT - How often does anyone physically hurt person SDINSULT - How often does anyone insult or talk down to person SDTHRHARM - How often does anyone threaten person with harm SDSCREAM - How often does anyone scream or curse at person SDHMDEPR - When under 18 years of age, lived with someone who was depressed, mentally ill, or suicidal SDHMALC - When under 18 years of age, lived with a problem drinker or alcoholic SDHMDRG - When under 18 years of age, lived with someone who used illegal street drugs or abused prescription medication SDHMJAIL - When under 18 years of age, lived with someone who served time in prison, jail, or correctional facility SDHMDIV - Parents separated or divorced SDHMBEAT - How often did parents or adults in home slap, hit, kick, punch, or beat each other up SDHURTCHLD - How often did parents or adults in home hit, beat, kick, or physically hurt person SDINSCHLD - How often did parents or adults in home swear, insult, or put person down SDTCHCHLD - How often did an adult or someone at least 5 years older touch person sexually SDTCHADLT - How often did an adult or someone at least 5 years older try to make person touch them sexually SDFRCSXCH - How often did an adult or someone at least 5 years older force the person to have sex 2.5.6 Disability Days Indicator Variables (DDNWRK21–OTHNDD21)The Disability Days (DD) section of the core interview contains questions about time lost from work because of a physical illness or injury, or a mental or emotional problem. Data were collected on each individual in the household. These questions were repeated in each round of interviews; this file contains data from Rounds 7, 8, and 9 of the MEPS Panel 23, initiated in 2018, Rounds 5, 6, and 7 of the MEPS Panel 24, initiated in 2019, Rounds 3, 4, and 5 of the MEPS Panel 25, initiated in 2020, and Rounds 1, 2, and 3 of the MEPS Panel 26, initiated in 2021. Beginning in FY 2015, annualized versions of these variables are constructed for release rather than the previously released versions, which were round- and panel-specific. The number at the end of the variable name (21) identifies the variable as representing data from 2021. Due to confidentiality concerns, the annual Disability Days variables, which represent the number of days a person missed work (DDNWRK21 and OTHNDD21), are top-coded to mask values that exceed the top one-half of one percent of the population. These annual variables use building block variables for construction, which represent an individual panel within the data year. The reference period for the Disability Days questions is the time period between the beginning of the panel or the previous interview date and the current interview date. Analysts should be aware that Round 7 of Panel 23, Round 5 of Panel 24, and Round 3 of Panel 25 are conducted across years. The Disability Days variables reflect only the data pertinent to the calendar year (i.e., the current delivery year of 2021). Analysts who are interested in examining Disability Days data across years can link to other person-level PUFs using the DUPERSID. The flow of the Disability Days section relies on the person’s age as of the interview date. Therefore, the round-specific constructed age variables (AGE31X, AGE42X, and AGE53X) are used to construct the comparable round-specific Disability Days building block variables. Due to the age-specific nature of the Disability Days section, age data from other rounds are not used should the person’s age for the round be missing. The variable DDNWRK21 represents the number of times the person lost a half-day or more from work because of illness, injury, or mental or emotional problems during the calendar year. A response of “no work days lost” was coded zero; if the person did not work, this variable was coded -1 (Inapplicable). The analyst should note that there are cases where EMPST## = 1 or 2 (has current job or job to return to) where DDNWRK21 contains a positive value, indicating the number of times the person lost a half-day or more from work. This is because the responses to the Disability Days questions are independent of the responses to the employment questions. Persons who were less than 16 years old or whose age is missing (AGE##X is set to -1) were not asked about work days lost, thus this variable is coded -1 (Inapplicable) for these persons. A final set of variables indicates if an individual took a half-day or more off from work to care for the health problems of another individual in the family and the number of days missed. OTHDYS21 indicates if a person missed work because of someone else’s illness, injury, or health care needs, for example to take care of a sick child or relative. This variable has three possible answers: yes - missed work to care for another (coded 1); no - did not miss work to care for another (coded 2); or the person does not work (coded -1), based on the setting of DDNWRK21. Persons younger than 16 and persons whose age is missing were not asked this question and are also coded as -1 (in a small number of cases this was not done for the 1996 data, the analyst will need to make this edit when doing longitudinal analyses). OTHNDD21 indicates the number of days in which work was lost because of another’s health problem. Persons younger than 16, those whose age is missing, those who do not work, and those who answer “no” to OTHDYS21 are skipped out of OTHNDD21 and receive a code of -1. Note that, because Disability Days variables use only those Round 7 of Panel 23, Round 5 of Panel 24, and Round 3 of Panel 25 data pertinent to the data year, it is possible to have a person report missing work to care for the health problems of another individual (OTHDYS21 = 1) but report no days missed (OTHNDD21 = 0). This combination indicates that the person did not miss those workdays during the data year. Editing was done on these variables to preserve the skip patterns. No imputation was done for those with missing data. 2.5.7 Access to Care Variables (ACCELI42-BOOSTERSHOT53)The variables ACCELI42 through AFRDPM42 describe data from the Access to Care (AC) section of the MEPS HC questionnaire, which was administered in Panel 23 Round 8, Panel 24 Round 6, Panel 25 Round 4, and Panel 26 Round 2. This supplement gathers information on family members’ usual source of health care; characteristics of usual source of health care providers; access to and satisfaction with the usual source of health care provider; and affordability of medical treatment, dental treatment, and prescription medicines. The variable ACCELI42 indicates whether persons were eligible to receive the Access to Care questions. Persons with ACCELI42 set to ‘-1’ (Inapplicable) should be excluded from estimates made with the Access to Care data. The COVID (CV) section of the MEPS HC questionnaire was administered in Panel 23 Rounds 7 – 9, Panel 24 Rounds 5 – 7, Panel 25 Rounds 3 – 5, and Panel 26 Rounds 1 – 3. This supplement gathers information on whether a person was delayed in getting medical care (CVDLAYCA31, CVDLAYCA42, CVDLAYCA53), dental treatment (CVDLAYDN31, CVDLAYDN42, CVDLAYDN53), and prescription medicine (CVDLAYPM31, CVDLAYPM42, CVDLAYPM53) due to the coronavirus pandemic, as well as whether the person ever acquired a COVID-19 vaccination (CVVACCINE42, CVVACCINE53) and ever had a COVID-19 booster vaccination (BOOSTERSHOT53). Family Members’ Usual Source of Health Care For each individual family member, the AC section ascertains whether there is a particular doctor’s office, clinic, health center, or other place that the individual usually goes to if he/she is sick or needs advice about his/her health (HAVEUS42). PRACTP42 indicates whether a usual source of care (USC) provider has his or her own practice that is not part of a group practice, health center, clinic, or other facility. For those family members who have a USC provider, AC30 ascertains the type of practice: 1 Own Practice, Not Part of Group/Facility 2 Practice Associated with Group/Facility YNOUSC42_M18 indicates the main reason why a person does not have a USC provider. For those family members who do not have a USC provider, question AC40 ascertains the main reason why: 1 Seldom or Never Sick 2 Recently Moved to Area 3 Just Changed Insurance Plans 4 No Health Insurance, Oth Insurance-Related Issue 5 Don’t Know Where to Go for Care 6 USC in This Area No Longer Available 7 Likes to Go to Different Places for Different Health Needs 8 Don’t Use Doctors/Treat Self 9 Cost of Medical Care 10 No Health Insurance 91 Other Reason In 2018, YNOUSC42 was renamed to YNOUSC42_M18 because the list of answer categories changed. Characteristics of Usual Source of Health Care Providers The AC section collects information about the different characteristics of each unique USC provider for a given family. If a person does not have a USC provider (HAVEUS42 is set to ‘2’ (No), ‘-7’ (Refused), ‘-8’ (Don’t Know) or ‘-15’ (Cannot be Computed)), then these variables are set to ‘-1’ (Inapplicable). The basis for the AC provider questions is PROVTY42_M18. This variable indicates whether the person’s provider is a facility (‘1’), a person (‘2’), or a person-in-facility (‘3’). PROVTY42_M18 is a copy of PROVTYPE_M18 (Provider Type) for persons who have a USC provider. Depending on how PROVTYPE_M18 is set, persons are asked about the provider’s location, the provider’s personal characteristics (e.g., race), the provider’s accessibility, and the person’s satisfaction with the provider. In 2018, PROVTY42 was renamed PROVTY42_M18 because of changes to CAPI. Provider Location Two variables indicate the location of the provider. For facility or person-in-facility type providers, PLCTYP42 indicates whether the person’s facility is a Hospital Clinic or Outpatient Department (‘1’), Hospital Emergency Room (‘2’), or Other Kind of Place (‘3’). According to CAPI flow, persons do not report the type of facility for person-type providers; therefore, if PROVTY42_M18 is set to ‘2’ (Person), PLCTYP42 is set to ‘-1’ (Inapplicable). For all provider types, including person-type, LOCATN42 indicates whether the person’s provider is located in an Office (‘1’), a Hospital but Not the Emergency Room (‘2’), or a Hospital Emergency Room (‘3’). Personal Characteristics of Providers For person and person-in-facility type providers, TYPEPE42 indicates what type of doctor or other medical provider the person’s provider is. The possible values include: 1 MD - General/Family Practice 2 MD - Internal Medicine 3 MD - Pediatrics 4 MD - OB/Gyn 5 MD - Surgery 6 MD - Other 7 Chiropractor 8 Nurse 9 Nurse Practitioner 10 Physician’s Assistant 11 Other Non-MD Provider 12 Unknown 13 MD - Cardiologist 14 Doctor of Osteopathy 15 MD – Endocrinologist 16 MD – Gastroenterologist 17 MD – Geriatrician 18 MD – Nephrologist 19 MD – Oncologist 20 MD – Pulmonologist 21 MD – Rheumatologist 22 Psychiatrist / Psychologist 23 MD – Neurologist 24 Alternative Care Provider TYPEPE42 is constructed using variables collected at several questions: AC70 “Is provider a medical doctor?” (PROV.MEDTYPE_M18); AC80 “Is provider a nurse, nurse practitioner, physician’s assistant, midwife, or some other kind of person?” (PROV.OTHTYPE_M18); and AC90 “What is provider’s specialty?” (PROV.MDSPECLT_M18). If respondents choose ‘91’ (Other) at AC80 or AC90, they are asked at AC80OS or AC90OS, respectively, to provide a verbal explanation of the type of provider or medical doctor. These text strings can be recoded to one of the existing categorical values listed above or, if the frequency of the response warrants it, additional categorical values. Recoding is described in greater detail below. The AC section also collects demographic information about person and person-in-facility type providers (PROVTY42 = 2 or 3). Six variables indicate the provider’s race: WHITPR42 (White), BLCKPR42 (Black/African American), ASIANP42 (Asian), NATAMP42 (Indian/Native American/Alaska Native), PACISP42 (Other Pacific Islander) and OTHRCP42 (Other Race). The respondent may choose more than one race for a single provider. These variables reflect the answer categories given at AC110. In addition to the race variables, two other demographic variables are created: HSPLAP42 indicates whether or not the provider is Hispanic or Latino, and GENDRP42 indicates whether the provider is Male (‘1’) or Female (‘2’). Using Constructed Variables to Describe the Usual Source of Care Provider These variables describing a person’s USC provider can be used in combination to present a broader picture of the provider. For example, a person-in-facility provider with a particular person named who is a white, Hispanic, female pediatrician, with no other race specified, and whose location is in a hospital is coded as: PROVTY42_M18 = 3 PLCTYP42 = 1 TYPEPE42 = 3 HSPLAP42 = 1 WHITPR42 = 1 BLCKPR42 = 2 ASIANP42 = 2 NATAMP42 = 2 PACISP42 = 2 OTHRCP42 = 2 GENDRP42 = 2 LOCATN42 = 2 Access to and Satisfaction with the Provider The AC section collects information regarding the person’s ability to access the USC provider as well as the person’s satisfaction with the USC provider. Access to the Provider TMTKUS42 indicates how long it takes the person to travel to the USC provider: Less Than 15 Minutes (‘1’), 15 to 30 Minutes (‘2’), 31 to 60 Minutes (‘3’), 61 to 90 Minutes (‘4’), 91 Minutes to 120 Minutes (‘5’), or More than 120 Minutes (‘6’). OFFHOU42, PHNREG42, and AFTHOU42 assess aspects of the provider that may make it difficult for the person to get in contact with the USC provider. OFFHOU42 indicates whether the provider has office hours at night or on the weekend. The remaining two variables reflect the person’s rating of the difficulty of accessing the USC provider by phone (PHNREG42), and after hours (AFTHOU42). The person has the following choices: Very Difficult (‘1’), Somewhat Difficult (‘2’), Not Too Difficult (‘3’), or Not at All Difficult (‘4’). Satisfaction with the Provider These variables reflect the person’s satisfaction with the USC provider. The person’s level of satisfaction with the USC provider is examined in four ways: Does the USC provider: usually ask about prescription medications and treatments other doctors may give them (TREATM42), ask the person to help make decisions between a choice of treatments (DECIDE42), present and explain all options to the person (EXPLOP42), and speak the person’s language or provide translator services (PRVSPK42). PRVSPK42 is set to a value other than ‘-1’ (Inapplicable) for persons eligible for the Access to Care supplement who had a usual source of care, were identified as speaking a language other than English at home (OTHLGSPK = ‘1’) and speaking English either “Not Well” or “Not at All” (HWELLSPK = ‘3’ or ‘4’). PRVSPK42 is set to ‘-1’ (Inapplicable) for all persons not meeting these criteria or who were deceased, institutionalized, or younger than 5 years of age. If the person was under 5 years old in Round 1 and age 5 in Round 2 of the first year panel or Round 4 of the second year panel, and the source data are missing, PRVSPK42 was set to ‘-1’ (Inapplicable); if the source data are available, PRVSPK42 was set per specifications. Affordability of Medical Care, Dental Care, and Prescription Medicines The Access to Care supplement gathers information on whether care was not received or was delayed because of cost in the past 12 months. These questions are split into three sections inquiring about medical care, dental care, and prescription medicines. Each section inquires whether the person did not receive care because they could not afford it (AFRDCA42, AFRDDN42, AFRDPM42). The affordability variables indicate with a value of ‘1’ (Yes) that the person needed care but was unable to afford it, a value of ‘2’ (No) that the person did not have any unmet needs for that type of care because of the cost. Respondents were also asked if anyone in the household delayed receiving care because of worry about cost (DLAYCA42, DLAYDN42, DLAYPM42). The delay variables indicate with a value of ‘1’ (Yes) that the person was delayed in receiving that type of care because of worry about the cost, and a value of ‘2’ (No) for these variables indicates that the person was not delayed in seeking that type of care because of the worry about the cost. Editing the Access to Care Variables Editing consisted primarily of logical editing for consistency with skip patterns. Other editing included the construction of new response values and new variables describing the recoding of “other specify” text items into existing or new categorical values, which are described below. Not all variables or categories that appear in the Access to Care section of the HC questionnaire are included on the file, as some small cell sizes have been suppressed to maintain confidentiality. Recoding of Additional Other Specify Text Items For Access to Care items AC80 and AC90, the “other specify” text responses were reviewed and coded as an existing or new value for the related categorical variable (AC80 and AC90). OTHTYPE_M18 and MDSPECLT_M18 are used to construct the variable TYPEPE42. These variables’ text strings can be recoded to each other’s categories. For example, for persons who indicate that their USC provider is not a medical doctor (PROV.MEDTYPE = ‘2’), the other type of USC provider is other (PROV.OTHTYPE = 91), and the text string collected is “GYNECOLOGIST,” TYPEPE42 would be set to ‘4’ (MD - OB/GYN) instead of ‘11’ (OTHER NON-MD PROVIDER.) Delayed Medical Care, Dental Care, and Prescription Medicines due to the Coronavirus Pandemic The CV section was administered in Panel 23 Rounds 7 - 9, Panel 24 Rounds 5 - 7, Panel 25 Rounds 3 - 5, and Panel 26 Rounds 1 - 3. The CV section gathered information on family members’ abilities to receive treatment without delay from March 2020 (for Panel 23 Round 7, Panel 24 Round 5, Panel 25 Round 3, and Panel 26 Round 1) and the last interview date for all other panels.
These questions asked whether anyone in the household delayed receiving care because of the coronavirus pandemic. If the respondent answered ‘1’ (Yes), they were asked to identify who in the household delayed care. Within a household that had care delayed, the variables CVDLAYCA31, CVDLAYCA42, CVDLAYCA53 (Delay Med Care For COVID R3/1, R4/2, R5/3), CVDLAYDN31, CVDLAYDN42, CVDLAYDN53 (Delay Getting Dental For COVID R3/1, R4/2, R5/3), and CVDLAYPM31, CVDLAYPM42, CVDLAYPM53 (Delay Getting PMED For COVID R3/1, R4/2, R5/3) indicate with a value of ‘1’ (Yes) that the person was delayed in receiving that care during the pandemic; a value of ‘2’ (No) for these variables indicates that the person was not delayed in receiving that type of care during the pandemic. COVID-19 Vaccination Status The CV section also gathers information regarding vaccination and booster shots ever received for COVID-19 for all members of the RU. CVVACCINE42 and CVVACCINE53 represent round-specific measures of ever having received the COVID-19 vaccination. Sample members who were reportedly ever vaccinated as of round 42 (CVVACINE42=1) were not asked in round 53 whether they were ever vaccinated and therefore had CVVACINE53 coded -1 (inapplicable). BOOSTERSHOT53 was collected only for Panel 23 Round 9, Panel 24 Round 7, Panel 25 Round 5, and Panel 26 Round 3, and it indicates whether the person had ever received a COVID-19 vaccine booster shot before the end of the reference period. 2.5.8 Employment Variables (EMPST31–OFREMP53)Employment questions were asked of all persons 16 years and older at the time of the interview. Employment variables consist of person-level indicators such as employment status and job-related variables such as hourly wage. All job-specific variables refer to a person’s current main job. The current main job, defined by the respondent, indicates the main source of employment. Historically, most employment variables pertain to the interview date for Rounds 1- 4, and to December 31 of the delivery year for Round 5 of a second year panel. In 2021, Employment variables have been constructed to reflect responses from additional panels. Panel 24 was fielded for a third year and includes responses from Round 5, Round 6, and Round 7 interviews. Panel 23 was fielded for a fourth year and includes responses from Round 7, Round 8, and Round 9 interviews. In 2021, Panel 25 Round 3, Panel 24 Round 5, and Panel 23 Round 7 were fielded as cross-year rounds where respondents were asked to provide information about the reference period between the prior interview date in 2020 (Rounds 2, 4, and 6 respectively) and the current round interview date in 2021. Panel 26 Round 3 and Panel 24 Round 7 were also fielded as cross-year rounds where respondents were asked to provide information about the reference period between the prior interview date in 2021 (Rounds 2 and 6 respectively) and current round interview date (occurring in 2022). In contrast, Panel 23 Round 9 and Panel 25 Round 5 were fielded as 2021 terminal rounds where respondents were asked to provide relevant information between the prior interview date in 2021 (Rounds 8 and 4, respectively) and 12/31/2021.
No additional adjustments were necessary for 2021 variables, however, users combining 2021 data with 2020 MEPS data should refer to 2020 documentation to fully understand adjustments that were made for Panel 23 variables in Round 5 and Round 6 in the 2020 file. Variable naming protocol is consistent with all prior years. Historically, round dates have been indicated by two numbers following the variable name; the first number representing the round for second panel persons (Panel 25), the second number representing the round for first panel persons (Panel 26). For example, EMPST31 refers to employment status on the Round 3 interview date for Panel 25 persons and employment status on the Round 1 interview date for Panel 26 persons. In 2021, a third year panel and a fourth year panel are included in each of the ‘31’, ‘42’, and ‘53’ variables, but round numbers of these panels (5/6/7 and 7/8/9 respectively) will not be included in the variable name. For example, the 2021 version of EMPST31 (noted above) will also include employment status on the Round 7 interview date for Panel 23 persons and the employment status on the Round 5 interview date of Panel 24 persons. All employment variables reflect the inclusion of the third year of Panel 24 and the fourth year of Panel 23. Panel 23 Round 7 and Panel 24 Round 5 information is contained in the ‘31’ variables, Panel 23 Round 8 and Panel 24 Round 6 information is contained in the ‘42’ variables, and Panel 23 Round 9 and Panel 24 Round 7 information is contained in the ‘53’ variables. (Some users might find it helpful to think of these variables as (75)31, (86)42, and (97)53, even though the naming convention remains as 31, 42, and 53). With the exception of some health insurance and wage variables, no attempt has been made to logically edit any employment variables. When missing, values were imputed for certain persons’ hourly wages. Due to confidentiality concerns, hourly wages greater than or equal to $105.77 were top-coded to -10 and the number of employees variable was top-coded at 500. With the exception of a variable indicating whether the employer has more than one location (MORE31, MORE42, MORE53), all employer-specific variables on the Population Characteristics Public Use release file refer to the specific establishment that is the location of a person’s current main job. For users interested in additional jobs (i.e. current miscellaneous, former main job, and others) or additional details about the current main job, please refer to the Jobs Public Use release file for the current delivery year. The MEPS employment section used dependent interviewing in Rounds 2 through 9. If employment status and certain job characteristics did not change from the previous round, as identified in the Review of Employment (RJ) section, the respondent was skipped through the main employment section. A code of “Determined in Previous Round” (-2) is used to indicate that the information in question was obtained in a previous round. For example, if HRWG42X (Round 8 interview date hourly wage for Panel 23 persons or Round 6 interview date hourly wage for Panel 24 persons or Round 4 interview date hourly wage for Panel 25 persons or Round 2 interview date hourly wage for Panel 26 persons) is coded as “Determined in Previous Round” (-2), it means that hourly wage was collected in a previous round. In this case, users would need to refer to HRWG31X (Round 7 interview date hourly wage for Panel 23 persons or Round 5 interview date hourly wage for Panel 24 persons or Round 3 interview date hourly wage for Panel 25 persons or Round 1 interview date hourly wage for Panel 26 persons) to obtain the value for HRWG42X. The “-2” value for HRWG42X indicates that the person was skipped past the hourly wage question at the time of the Round 8/6/4/2 interview. The same coding applies to HRWG53X when a person was skipped past the Round 9/7/5/3 interview. Note that users may find a positive value in the HRWG31X (Round 7/5/3/1 hourly wage) or they may find an “Inapplicable” code of -1. Unlike HRWG42X and HRWG53X, HRWG31X does not contain -2 values. For persons skipped in Round 7/5/3/1, the prior year hourly wage value is used to populate HRWG31X. The prior year round from which the wage was collected for such cases can be found in the RNDGLG31 variable. These cases will be discussed in more detail below. To determine who should be skipped through various employment questions, certain information, such as employment status, had to be asked in every round. Note that “-2” codes do not apply to questions asked in every round, like employment status. Additionally, information on whether the person currently worked at more than one job (MORJOB) or whether the person held health insurance from a current main employer (HELDX) was asked in every round, and, therefore, those variables also have no “-2” codes. For (a) Panel 23 persons who have a current main job in Round 7 that continued from a job first reported in Round 1 or 2 of 2018, (b) Panel 23 persons who have a current main job in Round 7 that continued from a job first reported in Round 3 or 4 of 2019, (c) Panel 23 persons who have a current main job in Round 7 that continued from a job first reported in Round 5 or Round 6 of 2020, (d) Panel 24 persons who have a current main job in Round 5 that continued from a job first reported in Round 1 or 2 of 2019, (e) Panel 24 persons who have a current main job in Round 6 that continued from a job first reported in Round 3 or Round 4 of 2020, or (f) Panel 25 persons who have a current main job in Round 3 that continued from a job first reported in Round 1 or Round 2 of 2020, the “-2” code is not used. This is because prior year employment variables do not reside on the current 2021 Population Characteristics file and are therefore not easily accessible for users (and in some cases, the data could be impossible to obtain). Instead, the values for the variables resulting from skipped questions are copied from the appropriate prior year file (2018, 2019, or 2020) to the 2021 Population Characteristics Public Use release ‘31’ variable, depending on the round in which the job first became the current main job:
The accompanying 2021 variable RNDFLG31 indicates the round from which these data were collected. For example, if the Panel 25 person has a Round 3 current main job that continues from Round 2 and was first reported as the current main job in Round 2, HRWG31X in the 2021 Population Characteristics Public Use release will be a copy of the HRWG42X variable from the 2020 Population Characteristics Public Use release, and RNDFLG31 in the 2021 Population Characteristics Public Use release will be “2”, indicating the round in which the job was first reported as the current main job. Employment Status (EMPST31, EMPST42, and EMPST53) Employment status was asked for all persons age 16 or older. Allowable responses to the employment status questions were as follows:
These responses were mutually exclusive. A current main job was defined for persons who either reported that they were currently employed and identified a current main job or who reported and identified a job to return to. Therefore, job-specific information such as hourly wage exists for persons not presently working at the interview date but who have a job to return to as of the interview date. The analyst should note that there are cases where EMPSTrr = 1 or 2 (has current job or job to return to) where DDNWRK21 contains a positive value, indicating the number of times the person lost a half-day or more from work. This is because the responses to the Disability Days questions in the survey are independent of the responses to the employment questions. Data Collection Round for Round 7, 5, 3, or 1 CMJ (RNDFLG31) As mentioned above, for (a) a Panel 25 person with a Round 3 current main job (CMJ) that is a continuation CMJ from Round 1 or Round 2, (b) a Panel 24 person with a Round 5 CMJ that is a continuation CMJ from Round 1, 2, 3, or 4, or (c) a Panel 23 person with a Round 7 CMJ that is a continuation CMJ from Round 1, 2, 3, 4, 5, or 6, the value for most ’31’ variables will be copied forward from the 2018, 2019, or 2020 Population Characteristics Public Use release from the variable representing the round in which the job was first reported as the CMJ. Therefore, for persons in Panel 23, Panel 24, or Panel 25, RNDFLG31 indicates the 2018, 2019, or 2020 round in which the Round 7, Round 5 or Round 3 CMJ was first reported as the CMJ and provides a timeframe for the reported wage information and other job details. RNDFLG31 is used with many ‘31’ variables to indicate the round on which the reported information is based. RNDFLG31 is set to “Inapplicable” (-1) for persons in any panel who are under age 16 or who do not have a CMJ in Panel 23 Round 7, Panel 24 Round 5, Panel 25 Round 3, or Panel 26 Round 1. For persons who are part of Panel 23, RNDFLG31 is also set to “Inapplicable” (-1) if the person is out-of-scope in the 2021 portion of Round 7. For persons who are part of Panel 24, RNDFLG31 is also set to “Inapplicable” (-1) if the person is out-of-scope in the 2021 portion of Round 5. For persons who are part of Panel 25, RNDFLG31 is also set to “Inapplicable” (-1) if the person is out-of-scope in the 2021 portion of Round 3. For persons who are part of Panel 26, RNDFLG31 is also set to “Inapplicable” (-1) if the person is out-of-scope in Round 1. Values for RNDFLG31 are set as follows: 1 continuing Panel 23 Round 7/Panel 24 Round 5/Panel 25 Round 3 CMJ reported first in Round 1, or Panel 26 Round 1 CMJ newly reported as current main in Round 1 2 continuing Panel 23 Round 7/Panel 24 Round 5/Panel 25 Round 3 CMJ reported first in Round 2 3 continuing Panel 23 Round 7/Panel 24 Round 5 CMJ reported first in Round 3 or Panel 25 Round 3 CMJ newly reported as current main in Round 3 4 continuing Panel 23 Round 7/Panel 24 Round 5 CMJ reported first in Round 4 5 continuing Panel 23 Round 7 CMJ reported first in Round 5 or Panel 24 Round 5 CMJ newly reported as current main in Round 5 6 continuing Panel 23 Round 7 CMJ reported first in Round 6 7 Panel 23 Round 7 CMJ newly reported as current main in Round 7 -15 Panel 23 Round 7/Panel 24 Round 5 CMJ/Panel 25 Round 3 CMJ is a continuation CMJ (wage information and other details were not collected in Round 7/Round 5/Round 3) but the Round 6/Round 4/Round 2 CMJ record either does not exist or is not the same job. This setting applies even in cases where there is a corresponding Round 1, 2, 3, 4, or 5 CMJ for Panel 23, Round 1, 2, or 3 CMJ for Panel 24 or Round 1 CMJ for Panel 25. This can occur in rare instances because corrections made to a person’s record in a current file cannot be made to that record in an earlier file due to database processing constraints. Corrections are made based on respondent comments in subsequent rounds that affect employment information previously reported. Users may refer to previously released Jobs Public Use release files to review rosters as follows:
Self-Employed (SELFCM31, SELFCM42, and SELFCM53) Information on whether an individual was self-employed at the current main job was obtained for all persons who reported a current main job. If an individual reports that they are self-employed at their current main job, they are also asked to identify whether the self-employed business was incorporated, a proprietorship, or a partnership (BSNTY31, BSNTY42, BSNTY53). These questions are not asked of individuals who were not self-employed and, as a result, individuals who are not self-employed are coded with “Inapplicable” (-1). Alternatively, there are several variables that are only constructed for wage earners (not self-employed). These include benefits, employment characteristics, and hourly wage variables (covered in the following two sections). As noted below, self-employed individuals are coded with “Inapplicable” (-1) for benefits, employment characteristics, and hourly wage variables. Benefits and Employment Characteristics (PAYDR31/42/53, SICPAY31/42/53, PAYVAC31/42/53, RETPLN31/42/53, MORE31/42/53, JOBORG31/42/53) Several variables are constructed only for individuals who report not being self-employed at their current main job. These individuals are asked questions to indicate whether the establishment reported as the main source of employment offered any of the following benefits:
They are also asked information on whether the firm had more than one business location (MORE31, MORE42, MORE53) and whether the establishment was a private for-profit, nonprofit, or a government entity (JOBORG31, JOBORG42, JOBORG53). For persons who were self-employed at their current main job, all of the variables detailed in this section were coded as “Inapplicable” (-1). Hourly Wage (HRWG31X, HRWG42X, HRWG53X), Wage Update Variable (DIFFWG31, DIFFWG42, DIFFWG53), and Updated Hourly Wage (NHRWG31, NHRWG42, NHRWG53) Hourly wage was constructed for all persons who reported a current main job that was not self-employment (SELFCM). HRWG31X, HRWG42X, and HRWG53X provide the wage amount reported initially for a person’s current main job. If a current main job continues into subsequent rounds DIFFWG31, DIFFWG42, and DIFFWG53 indicate if the wage has changed since the previous round. If the job continues and there is a different wage at that job, NHRWG31, NHRWG42, and NHRWG53 indicate the new wage amount. The initial hourly wage variables (HRWG31X, HRWG42X, HRWG53X) on this file should be considered along with their accompanying variables - HRHOW31, HRHOW42, and HRHOW53 - which indicate how the respective round hourly wage was constructed. Hourly wage could be derived, as applicable, from a large number of source variables. In the simplest case, hourly wage was reported directly by the respondent. For other persons, construction of the hourly wage was based upon salary, the time period on which the salary was based, and the number of hours worked per time period. If the number of hours worked per time period was not available, a value of 40 hours per week was assumed, as identified in the HRHOW variable. To assist interviewers during collection of wage amounts, CAPI prompts the respondent to confirm wages reported in the Employment Wage section if a wage amount falls outside a specified wage range.
Where there was insufficient information available for calculating the initial hourly wage, the initial hourly wage variables HRWG31X, HRWG42X, and HRWG53X were imputed using a weighted sequential hot-deck procedure for individuals who reported a current main job (and were not self-employed) but did not know their wage or refused to report a wage. Hourly wage for persons for whom employment status was not known was coded as “Cannot be Computed” (-15). Additionally, wages were imputed for wage earners who reported a wage range instead of a specific wage value. For each of these persons, a value was imputed from other persons on the file who did report a specific value that fell within the reported range. Wages from 2018, 2019, 2020, and 2021 were eligible donors in this process, expanding the donor pool to cover four years instead of the typical two years. The expansion of the donor pool to use four years of donors instead of two allowed AHRQ to maintain a similar sized donor pool to prior releases - but it does mean that some recipients are assigned a donor wage from four years prior. The variables HRWGIM31, HRWGIM42, and HRWGIM53 identify persons whose wages were imputed. Note that wages were imputed only for persons with a positive person and/or positive family weight. The variables DIFFWG31, DIFFWG42, and DIFFWG53 indicate whether a person’s wage amount was different in the current round (from the previous round) at a continuing, current main job. NHRWG31, NHRWG42, and NHRWG53 contain the updated wage amount in cases where a person indicates a change in wages (DIFFWG = 1). While the question regarding wage changes pertains to the primary wage at the main job, occasionally respondents update a person’s supplemental wage at the main job. In these cases, users should note that the HRWG31X, HRWG42X, HRWG53X variable may contain the same value as the NHRWG31, NHRWG42, NHRWG53 variable. For all Panel 26 Round 1 persons, DIFFWG31 and NHRWG31 are set to “Inapplicable” (-1) because this was the first round that wages could be reported for those persons. In Rounds 2 through 9, no imputation was performed on NHRWG31, NHRWG42, and NHRWG53. Instead, where an updated wage amount is “Don’t Know” (-8) or is “Refused” (-7), NHRWG31, NHRWG42, and/or NHRWG53 is set to “Cannot be Computed” (-15). For persons whose hourly wage variable HRWG31X, HRWG42X, and /or HRWG53X was imputed and the respondent provides an updated wage amount in a subsequent round, the new wage, NHRWG31, NHRWG42, and/or NHRWG53, is not presented. Instead, NHRWG31, NHRWG42, and/or NHRWG53 is set to “Initial Wage Imputed” (-13) to indicate that the initial HRWG31X, HRWG42X, and/or HRWG53X was imputed. Users are able to access the value reported for updated wage for these jobs by referring to the 2021 Jobs Public Use release file. In 2021, wage information is logically edited for consistency using established rules and guidance from AHRQ. Outliers are checked for persons who report a wage change and the new reported wage (a) is substantially different from prior wage (change >=100%), (b) is no different than prior wage, (c) is low in value ($0 < wage < $1) or, (d) has a value higher than prior year’s top code value. There are numerous sources for these types of errors, including keystroke or respondent error. In 2021, approximately 100 wages are reviewed per panel, resulting in approximately 90 HRWGrrX/NHRWGrr wage variable edits (overall). Users should note that outlier edits were not performed in 2020 and therefore should be mindful when using the wage variables, especially when comparing 2020 wages to wages in other data years. To help users identify cases that would have been reviewed (but not necessarily edited) in this process, the 2020 data includes wage outlier flag variables, OUTFLAGrr. These round-specific wage outlier flag variables OUTFLAG31, OUTFLAG42, and OUTFLAG53 indicated that a person’s updated wage at the current main job would have been programmatically selected for review during the 2020 wage outlier editing process (but not necessarily edited). Although the OUTFLAGrr variables only appear on the 2020 file, they could be relevant to continuing wages on the 2021 file that were first reported in 2020. More information on these variables may be found in MEPS HC-224: 2020 Full Year Consolidated Data file documentation. OUTFLAGrr variables were not constructed for the 2021 Population Characteristics file since outlier reviews were performed. For reasons of confidentiality, the hourly wage variables were top-coded. A value of -10 indicates that the hourly wage was greater than or equal to $105.77. The wage top-code process uses the highest calculated wage for an individual regardless of whether it was reported in HRWG31X, HRWG42X, and HRWG53X or NHRWG31, NHRWG42, and NHRWG53 variable. All wages for a person were top-coded if any wage variable was at or above the top-code amount. In order to protect the confidentiality of persons across deliveries, the same top-code amount used in this 2021 Population Characteristics file was also applied to the 2021 Jobs file. Because a person can have other jobs besides a current main job which are included in the corresponding 2021 Jobs file, wages at these other jobs were reviewed in the top-coding process. In some cases for these persons, wages reported at the current main job were below the top-code amount while the wage at another job had to be top-coded. To further protect the confidentiality of such persons across deliveries, wages reported at all jobs in the 2021 Jobs file were top-coded and the wages at their current main job (HRWG31X, HRWG42X, HRWG53X, NHRWG31, NHRWG42, and NHRWG53) included in this 2021 Population Characteristics Public Use file were also top-coded. Health Insurance (HELD31X, HELD42X, HELD53X, OFFER31X, OFFER42X, OFFER53X, CHOIC31, CHOIC42, CHOIC53, DISVW31X, DISVW42X, DISVW53X, OFREMP31, OFREMP42, OFREMP53) There are several employment-related health insurance measures included in this release: health insurance held at a current main job (HELD31X, HELD42X, HELD53X), health insurance offered through a current main job (OFFER31X, OFFER42X, OFFER53X), health insurance offered to anyone through the current main job employer (OFREMP31, OFREMP42, OFREMP53), and choice of health plans available through the current main job (CHOIC31, CHOIC42, CHOIC53). This collection of variables reflect the insurance status in the current round. They are logically edited for consistency for each round. MEPS asks if the person holds health insurance through the current main job (HELDX) in the first round in which a person is reported as having that job. If the person does not hold health insurance at the job, then a follow-up question is asked as to whether the person was offered insurance but declined coverage (OFFERX). If the person neither holds nor was offered health insurance at the job, then an additional question is asked to determine whether any other employees at the current main job were offered health insurance (OFREMP). If the person either holds insurance from the job or was offered insurance at the job, then an additional question is asked to determine whether a choice of health plans is available at the job (CHOIC). Prior to Panel 23 Round 9, Panel 24 Round 7, Panel 25 Round 5, and Panel 26 Round 3, in cases where HELDX or OFFERX were “Refused” (-7) or “Don’t Know” (-8), CHOIC was also coded -7/-8, even though the question that populates CHOIC was not asked. As of Panel 23 Round 9, Panel 24 Round 7, Panel 25 Round 5, and Panel 26 Round 3, CHOIC is now coded “Cannot be Computed” (-15) for cases where HELDX or OFFERX were “Refused” (-7) or “Don’t Know” (-8) to reflect that a value cannot be calculated for CHOIC as a result of skip patterns. In the rounds after the job is first reported, the Review of Jobs (RJ) section has the same series of questions with one exception, it does not ask whether there is a choice of health insurance plans at an employer. Choice of health insurance plan is only asked in the round the job was first reported (in the Employment section). In rounds after the job is first reported, a “held” question (whether a person now holds health insurance through the employer) is asked in the Review of Jobs section. This is to determine if there has been any change in coverage. RJ70 (HELDX) is asked if insurance was offered, but not taken by the employee, when the job was first reported and no coverage has been reported since the initial round. RJ80 (HELDX) is asked where:
MEPS then includes several clarifying questions regarding health insurance status and availability to the jobholder through an employer. Where the person does not report, does not know, or refuses to indicate the insurance status at RJ70, or reports no coverage at RJ80, the respondent is asked if the person was offered insurance (OFFERX). Lastly, when a respondent indicates that the jobholder of a reviewed job neither holds nor was offered health insurance at the job, the respondent is asked if any other employees at the job were offered health insurance (OFREMP). If a person does hold insurance through their job, then that person is not asked the offer question and the OFFERX variable is automatically set to “Yes” (1). Data users should note that OFREMP is automatically set to 1 in cases where the jobholder has health insurance coverage through the job (HELDX=1) or in cases where health insurance is offered to the employee at their job (OFFERX=1). Responses in the Employment and Review of Jobs sections for health insurance held were recoded to be consistent with the variables in the Health Insurance section of the survey. For persons who responded in the Employment section or Review of Jobs section that they held health insurance coverage through the employer but then disavowed the coverage in the Health Insurance section, MEPS includes follow-up questions regarding whether health insurance is offered (either to the employee or any other employee depending on responses to questions) and whether more than one plan is available. This information is used in an edit process whereby responses to these variables in the Health Insurance section are transferred into the Employment section or Review of Jobs section. Consequently, more information is now available on MEPS file in the OFFERX, OFREMP, and CHOIC variables. Consistent with prior years, the round-specific flag variable DISVWX continues to be constructed and reflects the disavowal at the current main job in the round. Beginning Panel 23 Round 9, Panel 24 Round 7, Panel 25 Round 5, and Panel 26 Round 3, two CAPI changes impact how insurance information is collected in the Employment and Review of Jobs sections.
1 EMPLOYER 2 UNION 3 BOTH EMPLOYER AND UNION (EMPLOYER IS PRIMARY) 4 BOTH EMPLOYER AND UNION (UNION IS PRIMARY) The result is that persons who report insurance via both union and employer sources in the Employment section will no longer have the secondary source insurance coverage recorded in the Health Insurance section. However, respondents continue to have the opportunity to report any additional private coverage in the Health Insurance section at HX190/HX200. Hours (HOUR31, HOUR42, HOUR53) The hours variables refer to usual hours worked per week at the current main job. Note that, in cases where the respondent estimated hours worked per week at 35 hours or more, HOUR31, HOUR42, and HOUR53 were set to ‘40.’ Temporary (TEMPJB31, TEMPJB42, TEMPJB53) and Seasonal (SSNLJB31, SSNLJB42, SSNLJB53) Jobs The temporary job variables (TEMPJB31, TEMPJB42, TEMPJB53) indicate whether a newly reported current main job lasts for only a limited amount of time or until the completion of a project. The seasonal job variables (SSNLJB31, SSNLJB42, SSNLJB53) indicate whether the newly reported current main job is only available during certain times of the year. SSNLJB is “Yes” (1) if the job is only available during certain times of the year; SSNLJB is “No” (2) if the job is year round. Teachers and other school personnel who work only during the school year are considered to work year round. Both variables are set on current main jobs whether a person is self-employed or not. These questions are asked only in the round the job is newly reported. Consequently, in rounds following the initial report, a code of “Determined In Previous Round” (-2), is used to indicate that the information in the question was obtained in a previous round. This differs from some previous files where both questions were asked in each round and -2 was not an allowed value. Analysts using either of these variables over multiple years of MEPS should refer to documentation for each year to assure consistency for the variable. Number of Employees (NUMEMP31, NUMEMP42, NUMEMP53) NUMEMP indicates the number of employees at the location of the person’s current main job. Due to confidentiality concerns, this variable has been top-coded at 500 or more employees. For respondents who do not know the specific number of employees at the establishment, a categorical question is offered as an alternative. In these cases, a numerical value for NUMEMP is constructed using a median estimated size calculated from donors within the reported categorical range. As always, median values may vary across panels/rounds since calculations are panel/round specific. One noticeable difference in 2021, however, was on medians calculated for NUMEMP31 in the highest estimated range (101-500 employees). Where Panel 26 had median establishment size of 220, Panel 25 had median establishment size of 200, Panel 24 had a median establishment size of 300, and Panel 23 had a median establishment size of 150. Otherwise, differences were generally less pronounced. CAPI does not accept an establishment size value of 0 to indicate the total number of employees working at a self-employed business. Where a person is not self-employed at a job, an establishment size of 0 is allowed. NUMEMP is set to “Cannot be Computed” (-15) for these cases. Other Employment Variables Information about industry and occupation types for a person’s current main job at the interview date is also contained in this release. Based on verbatim text fields collected during the interview, numeric industry and occupation codes are assigned by trained coders at the Census Bureau. The coders used 2007 Census Industry and 2010 Census Occupation Coding schemes, which were developed for the Bureau’s Current Population Survey and American Community Survey. Users should note that coding schemes are comparable for the FY 2010 through FY 2021 data files. Earlier versions of Census coding schemes were used on files prior to FY 2010. Current main jobs were initially coded at the 4-digit level for both industry and occupation. Then, for confidentiality reasons, these codes were condensed into broader groups for release on the file. INDCAT31, INDCAT42, and INDCAT53 represent the condensed industry codes for a person’s current main job at the interview date. OCCCAT31, OCCCAT42, and OCCCAT53 represent the condensed occupation codes for a person’s current main job at the interview date. This release incorporates crosswalks showing how the detailed 2007 Census industry codes (Appendix 1) and 2010 Census occupation codes (Appendix 2) were collapsed into the condensed codes on the file. The schemes used in this file can be linked directly to the 2007 North American Industry Code System (NAICS) and the 2010 Standard Occupation Code scheme (SOC) by going to the U.S. Census Bureau website where a variety of additional crosswalks is also available. Information indicating whether a person belonged to a labor union (UNION31, UNION42, and UNION53) is also contained in this release. The month and year that the current main job started for Rounds 7, 8, and 9 of Panel 23; Rounds 5, 6, and 7 of Panel 24; Rounds 3, 4, and 5 of Panel 25; and Rounds 1, 2 and 3 of Panel 26; are provided in this release (STJBMM31, STJBYY31, STJBMM42, STJBYY42, STJBMM53, and STJBYY53). In Full Year 2021, STJBYY31, STJBYY42, and STJBYY53 are bottom coded to a value of ‘1951’ to preserve age confidentiality. This value is calculated by taking the delivery year of 2021 and subtracting the age top code value of 85, then adding back 15, the age of a person in the year before entering the work force as defined in MEPS. Thus, the bottom code value will be different in each delivery year. For Panel 23 Round 5 jobs that were first reported in Round 4 or Round 5 and Panel 24 Round 3 jobs that were first reported in Round 1 or Round 2, the bottom code continues to be set to the value calculated in the Full Year 2019 delivery year, 1949. For Panel 23 Round 5 jobs that were first reported in Round 1 or Round 2, the bottom code continues to be set to the value calculated in the Full Year 2018 delivery year, 1948. There are two measures included in this release that relate to a person’s work history over a lifetime. One indicates whether a person ever retired from a job as of the Round 9 interview date for Panel 23 persons, or as of the Round 7 interview date for Panel 24 persons, or as of the Round 5 interview date for Panel 25 persons, or as of the Round 3 interview date for Panel 26 persons (EVRETIRE). The other indicates whether a person ever worked for pay as of the Round 9 interview date for Panel 23 persons, as of the Round 7 interview date for Panel 24 persons, as of the Round 5 interview date for Panel 25 persons, or as of the Round 3 interview date for Panel 26 persons (EVRWRK). The latter was asked of everyone who indicated that they were not working as of the round interview date. Therefore, anyone who indicated current employment or who had a job during any of the previous or current rounds was skipped past the question identifying whether the person ever worked for pay. These individuals were coded as “Inapplicable” (-1). All persons who ever reported a job and were 55 years or older as of the round interview date were asked if they “ever retired”. Since both of these variables are not round specific, there are no “Determined in Previous Round” (-2) codes. This release contains variables indicating the main reason a person did not work since the start of the reference period (NWK31, NWK42, and NWK53). If a person was not employed at all during the reference period (at the interview date or at any time during the reference period) but was employed some time prior to the reference period, the person was asked to choose from a list the main reason he or she did not work during the reference period. The “Inapplicable” (-1) category for the NWK variables includes:
A measure of whether an individual had more than one job on the round interview date (MORJOB31, MORJOB42, and MORJOB53) is provided on this release. In addition to those under 16 and those individuals who were out-of-scope, the “Inapplicable” (-1) category includes those who did not report having a current main job. Because this is not a job-specific variable, there are no “Determined in Previous Rounds” (-2) codes. This release contains a variable indicating if a current main job changed between the seventh and eighth rounds for Panel 23 persons, between the fifth and sixth rounds for Panel 24 persons, between the third and fourth rounds for Panel 25 persons, or between the first and second rounds for Panel 26 persons (CHGJ3142). It also contains a variable indicating if a current main job changed between the eighth and ninth rounds for Panel 23 persons, between the sixth and seventh rounds for Panel 24 persons, between the fourth and fifth rounds for Panel 25 persons, or between the second and third rounds for Panel 26 persons (CHGJ4253). In addition to the “Inapplicable” (-1), “Refused” (-7), “Don’t Know” (-8), and “Cannot be Computed” (-15) categories, the change job variables were coded to represent the following: 1 person left previous round current main job and now has a new current main job; 2 person still working at the previous round’s current main job but, as of the new round, no longer considers this job to be the current main job and defines a new current main job (previous round’s current main job is now a current miscellaneous job); 3 person left previous round’s current main job and does not have a new job; 4 person did not change current main job. Finally, this release contains the reason given by the respondent for the job change (YCHJ3142 and YCHJ4253). The reasons for a job change were listed in the CAPI questionnaire and a respondent was asked to choose the main reason from this list. Beginning Panel 23 Round 9, Panel 24 Round 7, Panel 25 Round 5, and Panel 26 Round 3, in addition to those out-of-scope, those under 16, those not having a current main job, and workers who did not change jobs the “Inapplicable” (-1) category for YCHJ3142 and YCHJ4253 now also includes workers who continue to work at the main job but no longer consider it their main job (CHGJrrrr = Changed CMJ/Previous CMJ is Now Current Miscellaneous job” (2)). These persons did not leave the job and therefore were not asked why they left a job. Prior to this change, these persons were set to “Cannot be Computed” (-15). 2.5.9 Health Insurance Variables (TRIJAyyX–PMEDPY53)Throughout Section 2.5.9 references to yy represent the year, 21, references to mm indicate the month (JA through DE), and references to rr indicate a combination of rounds (31/42/53, where the first r denotes the interview round for Panel 25 and the second r denotes the round for Panel 26) or the end of the calendar year (21). For the two extended panels - Panel 23 and Panel 24 - the “31” variables contain data from Round 7 (Panel 23) or Round 5 (Panel 24), the “42” variables contain data from Round 8 (Panel 23) or Round 6 (Panel 24), and the “53” variables contain data from Round 9 (Panel 23) or Round 7 (Panel 24). Beginning Panel 22 Round 3/Panel 23 Round 1, design changes to the health insurance section may impact trend analyses. Analysts should note that a series of questions were added to the HX section of the questionnaire to confirm whether a person who did not initially report any comprehensive coverage during a round has insurance. Starting at HX210, questions were presented to respondents who at that point in the instrument had not yet reported any sources of health insurance coverage, or only reported a source of health insurance without hospital and physician benefits, to determine whether they had coverage that included hospital and physician benefits. If the respondent answered affirmatively at HX210, subsequent questions identified the specific type of coverage (e.g. Medicaid, Private, etc.). This may cause analysts to see changes to the insurance variables-particularly, changes to the monthly health insurance coverage indicators PUBmmyyX, PRImmyyX, INSmmyyX; and the summary health insurance coverage indicators UNINSyy, INSCOVyy, INSURCyy, PUBrrX, PUBATrrX, PRIVrr, PRIVATrr, INSrrX, and INSATrrX. Other changes were made in FY 2018 to the health insurance questions that may affect the continuity of estimates. These changes include modifications to the Medicaid/SCHIP, and TRICARE/CHAMPVA questions to ask if each person in the household is covered using the person’s name in the question text (e.g. “Was Person 1 covered?” “What about Person 2?” etc.). Additionally, in Rounds 2 and 3, respondents are now required to answer “Yes” or “No” for each person individually when reviewing coverage from a previous round for these insurance sources. Changes to the Medicare Round 1 series were also made to probe separately for persons in the RU who were 65 years of age or older versus RU members who were under 65 years of age. Similar to the Medicaid and TRICARE series, Medicare coverage questions were asked for each RU member who was at least 65 years old. The aforementioned changes to the administration of the insurance section may also be evident in the Managed Care Variables (TRISTyyX-PRVHMOyy) because more respondents are now more likely to be asked about managed care. Respondents were allowed to simultaneously report Medicaid and other public hospital/physician coverage. Analysts should be aware that they might see changes in coverage trends in the constructed variables relating to Medicaid, edited Medicaid, or Other Public coverage as well as respondents reporting both after FY 2018. The variables VERFLG31, VERFLG42, and VERFLGyy indicate the round in which comprehensive health insurance coverage was first reported through the verification series of questions collected in the loop that starts at HX210 (HXLoop_40). These values will be carried through to subsequent rounds (e.g., from VERFLG31 to VERFLG42) if the coverage initially added through the verification loop continues, and no other comprehensive source of coverage is reported for that person outside of the verification loop. If previously reported coverage through the verification series ends and, in a future round, new comprehensive coverage is reported through the verification loop, then the VERFLG31/42/yy variable will reflect the corresponding round of the newly reported coverage. The VERFLG variables were set to ‘95’ to indicate that: 1) coverage was reported outside verification; 2) the person did not have coverage; or 3) the person would have been assigned edited coverage even though they may have reported coverage in the verification loop. As an example of the latter, a person who is age 65 or older and reports Medicare coverage through verification but also reports receipt of social security would have MCAREX set to ‘1’ because of the report of social security so the report of coverage in the verification module would not have changed their coverage status in the MEPS. In FY 2019, the construction of the VERFLG variables was modified such that all persons ages 65 and older who gained edited Medicare through the Medicare coverage of their spouse also have a value of 95 in the verification variables, provided that the coverage of the spouse was added outside of the verification series. Persons who report coverage under Indian Health Service (IHS) are identified in the constructed variables IHSrr, IHSATrr, and IHSmmyy. Persons reporting only IHS coverage are not considered covered for the summary insurance measures PUBmmyyX, PUByyX, INSmmyyX, INSCOVyy, and INSURCyy. Persons who report coverage under Veteran’s Administration (VA) can be identified in this file in the constructed variables VAPROGrr, VAPRATrr, VAEVyy, as well as the monthly variables VAPRmmyy. Monthly Health Insurance Indicators (TRIJAyyX-INSDEyyX) Constructed and edited variables are provided that indicate any coverage in each month of 2021 for the sources of health insurance coverage collected during the MEPS interviews (Panel 23 Rounds 7 through 9, Panel 24 Rounds 5 through 7, Panel 25 Rounds 3 through 5, and Panel 26 Rounds 1 through 3). One edit to the private insurance variables corrects for a problem concerning covered benefits that occurred when respondents reported a change in any of their private health insurance plan names. Additional edits address issues of missing data on the time period of coverage for both public and private coverage that was either reviewed or initially reported in a given round. Other edits, described below, were performed on the Medicare and Medicaid or State Children’s Health Insurance Program (SCHIP) variables to assign persons to coverage from these sources. Observations that contain edits assigning persons to Medicare or Medicaid/SCHIP coverage can be identified by comparing the edited and unedited versions of the Medicare and Medicaid/SCHIP variables. Starting October 1, 2001, persons 65 years and older have been able to retain TRICARE coverage in addition to Medicare. Therefore, unlike in earlier MEPS public use files, persons 65 years and older do not have their reported TRICARE coverage (TRIJAyyX - TRIDEyyX) overturned. TRICARE acts as a supplemental insurance for Medicare, similar to Medigap insurance. Public sources include Medicare, TRICARE/CHAMPVA, Medicaid, SCHIP, and other public hospital/physician coverage. IHS is not included as a public source of coverage. Medicare Medicare (MCRJAyy - MCRDEyy) coverage was edited (MCRJAyyX - MCRDEyyX) for persons age 65 or over. Within this age group, individuals were assigned Medicare coverage if:
Note that age (AGErrX) is checked for edited Medicare, however date of birth is not considered. Edited Medicare is somewhat imprecise with regard to a person’s 65th birthday. Medicaid/SCHIP and Other Public Hospital/Physician Coverage Questions about other public hospital/physician coverage were asked in an attempt to identify Medicaid or SCHIP recipients who may not have recognized their coverage as such. Beginning Panel 22 Round 3/Panel 23 Round 1, these questions were asked even if a respondent reported Medicaid or SCHIP directly. (Previously, other public hospital/physician coverage was only asked for respondents who did not report Medicaid or SCHIP.) Respondents reporting other public hospital/physician coverage were asked follow-up questions to determine if the coverage was through a specific Medicaid HMO or if it included some other managed care characteristics. Respondents who identified managed care from either source were asked if the recipient paid anything for the coverage and/or if a government source paid for the coverage. The Medicaid/SCHIP variables (MCDJAyy - MCDDEyy) have been edited (MCDJAyyX - MCDDEyyX) to include persons who paid nothing for their other public hospital/physician insurance when such coverage was through a Medicaid HMO or reported to include some other managed care characteristics. To assist users in further editing sources of insurance, this file contains variables constructed from the other public hospital/physician series that indicate:
The variables GVAJAyy - GVADEyy, GVBJAyy - GVBDEyy, and GVCJAyy - GVCDEyy are provided only to assist in editing and should not be used to make separate insurance estimates for these types of insurance categories. Any Public Insurance in Month The file also includes summary measures that indicate whether or not a sample person has any public insurance in a month (PUBJAyyX - PUBDEyyX). Persons identified as covered by public insurance are those reporting coverage under TRICARE, Medicare, Medicaid or SCHIP, other public hospital/physician programs, or Veteran’s Administration (VA). IHS is not included as a public source of coverage. Note that further edits may be made to the public insurance variables in later MEPS data releases to address cases where private coverage through a federally-facilitated, state-based or state partnership exchange/marketplace may have been originally reported as public insurance. These potential edits could affect the variables MCAIDyyX, GOVTAyy, GOVTByy, GOVTCyy, and PUByyX. Private Insurance Variables identifying private insurance in general (PRIJAyy - PRIDEyy) and specific private insurance sources [such as employer/union group insurance (PEGJAyy - PEGDEyy); non-group (PNGJAyy - PNGDEyy); other group (POGJAyy - POGDEyy)]; and private insurance through a federally-facilitated, state-based or state partnership exchange/marketplace (PRXJAyy - PRXDEyy) were constructed. Private insurance sources identify coverage in effect at any time during each month of 2021. Separate variables beginning with the letter “H” identify policyholders (e.g., HPEJAyy - HPEDEyy). Both types of variables indicate coverage or policyholder status for a particular source and do not identify persons who may have more than one policy of a given source where they are just covered or are also a policyholder (for example, someone who is a policyholder for one employer/union group plan and also a dependent on another employer/union group plan held by his/her spouse). In some cases, the policyholder was unable to characterize the source of insurance (PDKJAyy - PDKDEyy). Prior to FY 2018, persons covered under policyholders living outside the RU were identified in POUJAyy - POUDEyy and PROUTrr. Beginning FY 2018, the constructed variables PRIEUOrr and PRINEOrr are included. PRIEUOrr indicates coverage from a policyholder living outside the RU where the source is through an employer, and PRINEOrr indicates coverage from a policyholder living outside the RU where the source is not through an employer. These variables are based on responses to a follow-up question for respondents who indicate coverage from a policyholder outside the household. The question HP130 asks “Is the {INSURANCE SOURCE NAME} health coverage {POLICYHOLDER} has through an employer or previous employer?” If the respondent’s answer to HP130 was unknown, the person’s coverage is now included in PRIDKrr. An individual was considered to have private health insurance coverage if, at a minimum, that coverage provided benefits for hospital and physician services (including Medicare supplemental coverage). Note, however, that persons covered by private insurance through an exchange/marketplace (PRSTXrr and PRXJAyy - PRXDEyy) were considered to have private health coverage if that coverage provided hospital/physician services, but excluded coverage that was explicitly identified as Medicare supplemental coverage (HX620/OE130=5). If a person reported Medicare supplemental coverage through the exchange/marketplace, then the source of the insurance purchased was edited to reflect coverage “from a professional association” (HP40=1) or coverage “from a group or association” (HX200/HX300=1). Further descriptions of the exchange variables are detailed below. Sources of insurance with missing information regarding the type of coverage were assumed to contain hospital/physician coverage. Persons who reported private insurance that did not provide hospital/physician insurance were not counted as privately insured. Coverage indicated by these variables may be from any type of job whereas the employment section insurance variables delivered on this file reflect only coverage through a current main job. Health insurance through a job or union (PEGJAyy - PEGDEyy) was initially asked about in the Employment Section of the interview and later confirmed in the Health Insurance Section. Insurance that was reported in the employment section through a job classified as self-employed with firm size of 1 is included in the other private insurance variables: PEGJAyy-PEGDEyy; PNGJAyy-PNGDEyy; POGJAyy-POGDEyy; PDKJAyy-PDKDEyy; HPEJAyy-HPEDEyy; HPNJAyy-HPNDEyy; HPOJAyy-HPODEyy; HPDJAyy-HPDDEyy; and PRIEUrr, PRINGrr, PRIOGrr, and PRIDKrr based on responses at HP40. Private insurance that was not employment-related (POGJAyy - POGDEyy, PNGJAyy - PNGDEyy, PDKJAyy - PDKDEyy, PNEJAyy - PNEDEyy, and PRXJAyy - PRXDEyy) was reported in the Health Insurance Section only. Beginning in Panel 14 Round 5/Panel 15 Round 3, “High Risk Pool” was added to the list of categories (HX03 =10 and HX23 =13). Beginning FY 2010, High Risk Pool was included in all Other Group insurance categories. Beginning in Panel 22 Round 3/Panel 23 Round 1, the response category “High Risk Pool” was removed from HP40, HX200, and HX300. “Federal/State Exchange” is included in the list of private insurance categories (HP40=8 and HX200/HX300 =11). Information on federal/state exchanges is also collected at question HP50 (“Is this coverage through {state exchange name}?”) for respondents reporting insurance from a group, directly from an insurance company or HMO, from an insurance agent or from an “other” unspecified source and at OE40 in Rounds 3, 5, and 7 only (“Is this coverage through {state exchange name}?”) for respondents who previously reported private insurance coverage from an insurance company or HMO, or from an insurance agent that was not through an exchange/marketplace. Note that the state-specific name for the exchange/marketplace was used when asking these questions and was also used on the list of private insurance categories at HP40, HX200, and HX300. The variables PRSTXrr have been constructed to include persons less than 65 years old who report private insurance through a federally-facilitated, state-based or state partnership exchange/marketplace at HP40, HX200, or HX300, or persons 65 years old or older who report private insurance through a federally-facilitated, state-based or state partnership exchange/marketplace at HP40, HX200, or HX300 and who were not covered by Medicare. In addition, persons who reported a source of insurance at HX200 or HX300 that was not through an exchange/marketplace (e.g. through a group or directly from an insurance company) but who answered yes to HP50 or OE40 were also classified as having exchange/marketplace coverage instead of being assigned to the category they originally reported. In addition to reporting coverage through an exchange/marketplace, coverage needed to have been identified as hospital/physician coverage at HX620/OE130 (=1 or missing ( -7, -8)), but not identified as having Medicare supplemental coverage (HX620/OE130=5). The variables PRSTXrr contain information on private coverage that was reported as obtained through a federally-facilitated, state-based or state partnership marketplace. Consistent with the approach used in the Current Population Survey and the National Health Interview Survey, MEPS respondents reporting public coverage were asked whether the public coverage was obtained through a federal or state marketplace in case respondents were confused about whether the source of coverage was public or private. Responses to these questions were not used to edit the PRSTXrr variables. Any Insurance in Month The file also includes summary measures that indicate whether or not a person has any insurance in a month (INSJAyyX - INSDEyyX). Persons identified as insured are those reporting coverage under TRICARE, Medicare, Medicaid, SCHIP, other public hospital/physician or private hospital/physician insurance (including Medigap plans), or Veteran’s Administration (VA). A person is considered uninsured if not covered by one of these insurance sources. IHS is not included as a source of coverage. Summary Insurance Coverage Indicators (PRVEVyy-INSURCyy) The variables PRVEVyy - UNINSyy summarize health insurance coverage for the person in 2021 for the following types of insurance: private (PRVEVyy); TRICARE/CHAMPVA (TRIEVyy); Medicaid or SCHIP (MCDEVyy); Medicare (MCREVyy); other public coverage (GVAEVyy); other public coverage that is an HMO (GVBEVyy); other public coverage where a premium is paid (GVCEVyy). Each variable was constructed based on the values of the corresponding 12 month-by-month health insurance variables described above. For persons not in scope for the full year, these summary variables are based on the period of eligibility. If the person was not in scope for all 12 months throughout the year, the values are based on the months the person was eligible. A value of 1 indicates that the person was covered for at least one day of at least one month during 2021. A value of 2 indicates that the person was not covered for a given type of insurance for all of 2021. The variable UNINSyy summarizes PRVEVyy - GVAEVyy. Where PRVEVyy - GVAEVyy are all equal to 2, then UNINSyy equals 1, person was uninsured for all of 2021. Otherwise, UNINSyy is set to 2, insured for all or part of 2021. For user convenience, this file contains a constructed variable INSCOVyy that summarizes health insurance coverage for the person in 2021, with the following three values: 1 = ANY PRIVATE (Person had any private insurance coverage [including TRICARE/CHAMPVA] any time during 2021) 2 = PUBLIC ONLY (Person had only public insurance coverage [excluding TRICARE/CHAMPVA] during 2021) 3 = UNINSURED (Person was uninsured during all of 2021) INSURCyy summarizes health insurance coverage for the person in 2021 using eight categories of insurance separated by age using the person’s age on December 31st, 2021: 1 = ANY PRIVATE (0-64) (Person is between 0 and 64 years old and is covered by private insurance or TRICARE/CHAMPVA in 2021) 2 = PUBLIC ONLY (0-64) (Person is between 0 and 64 years old and is covered by public insurance only (excluding TRICARE/CHAMPVA) in 2021) 3 = UNINSURED (0-64) (Person is between 0 and 64 years old and is uninsured for all of 2021) 4 = EDITED MEDICARE ONLY (65+) (Person is 65 years old or more and is covered by edited Medicare only in 2021) 5 = EDITED MEDICARE & PRIV (65+) (Person is 65 years old or more and is covered by edited Medicare and private insurance or TRICARE/CHAMPVA in 2021) 6 = EDITED MEDICARE & OTH PUB ONLY (65+) (Person is 65 years old or more and is covered by edited Medicare and public insurance including edited Medicaid/SCHIP or other public coverage but excluding TRICARE/CHAMPVA in 2021) 7 = UNINSURED (65+) (Person is 65 years old or more and is uninsured for all of 2021) 8 = NO MEDICARE BUT ANY PUBLIC/PRIVATE (65+) (Person is 65 years old or more and is not covered by Medicare but is covered by private insurance, Medicaid, TRICARE/CHAMPVA, Veteran’s Administration, or other public coverage in 2021) Please note, beginning in 2012, Category 7 was revised to categorize persons who are 65 years or older and uninsured, and Category 8 was added to include persons 65 years or older who do not have Medicare, but are covered by public or private insurance. Please note that IHS is not included as a source of coverage for either INSCOVyy or INSURCyy. Please note that both INSCOVyy and INSURCyy categorize TRICARE as private coverage. All other health insurance indicators included in this data release categorize TRICARE as public coverage. If an analyst wishes to consider TRICARE public coverage, the variable can easily be reconstructed using the PRVEVyy and TRIEVyy variables. Also note that these categories are mutually exclusive, with preference given to private insurance and TRICARE. Persons with both private insurance/TRICARE and public insurance will be coded as “1” for INSCOVyy and INSURCyy. Users wishing to compare INSCOVyy and INSURCyy across years should note at least two changes beginning in 2018 that may affect the continuity of estimates: 1) increased reports of coverage due to the inclusion of the coverage verification series; and 2) the inclusion of Veteran’s Administration coverage as a public coverage source. Flexible Spending Accounts (FSAGT31-PFSAMT31) Respondents in Rounds 1, 3, 5 and 7 were asked if any RU members set aside pre-tax dollars of their own money to pay for out-of-pocket health care expenses. If an RU has a Flexible Spending Account (FSA), then FSAGT31 was set to 1 (Yes), and two follow-up questions were asked - HASFSA31 and PFSAMT31. HASFSA31 was set for each RU member to indicate which RU member has an FSA. The constructed variable PFSAMT31 indicates the total amount the individual RU member contributed to his or her FSA. If no RU member has an FSA, then both HASFSA31 and PFSAMT31 are set to -1 (Inapplicable). Unedited Health Insurance Variables (PREVCOVR-MORECOVR) Duration of Uninsurance If a person was identified as being without insurance as of January 1st in the MEPS Round 1 interview, a series of follow-up questions was asked to determine the duration of uninsurance prior to the start of the MEPS survey. Persons who were insured as of January 1st, and persons with a date of birth on or after December 31, 2021 or whose age category was less than 1 year old were skipped past this loop of questions. These questions are asked in Round 1 only. PREVCOVR indicates if the person was covered by insurance in the two years prior to the MEPS Round 1 interview. For persons who reported only non-comprehensive coverage as of January 1st, a question was asked to determine if they had been covered by more comprehensive coverage that paid for medical and doctors’ bills in the previous two years (MORECOVR). Beginning Panel 23 Round 1, the follow-up questions to PREVCOVR and MORECOVR that collected information on the most recent month and year of coverage (COVRMM, COVRYY, INSENDMM, INSENDYY) and type of coverage (Employer-sponsored (WASESTB), Medicare (WASMCARE), Medicaid/SCHIP (WASMCAID), TRICARE/CHAMPVA (WASCHAMP), VA/Military Care (WASVA), Other public (WASOTGOV, WASAFDC, WASSSI, WASSTAT1-4, WASOTHER), private coverage purchased through a group, association or insurance company (WASPRIV)) are no longer asked. Therefore, these variables will no longer be constructed. Note that these variables are unedited and have been taken directly as they were recorded from the raw data. There may be inconsistencies with the health insurance variables released on public use files that indicate that an individual is uninsured in January. Out-of-scope persons in both panels have been set to “Inapplicable” (-1) for PREVCOVR and MORECOVR. All other persons have PREVCOVR and MORECOVR copied directly from the value of the unedited source variable. Persons whose January 1st insurance coverage status could not be determined due to their reference period beginning after January 1st were also asked the follow-up questions described above. In these cases, persons who reported comprehensive coverage were asked if they were ever without insurance. Those who were uninsured were asked to determine the duration of uninsurance prior to the start of their reference period. Those who reported only non-comprehensive coverage were asked if they had been covered by comprehensive coverage that paid for medical and doctors’ bills in the previous two years. Coverage is determined by health insurance status during the whole reference period or the month of January and ignores that these persons were not in the household on January 1st. Health Insurance Coverage Variables - At Any Time/At Interview Date/At 12-31 Variables (TRICR31X-INSATyyX) Constructed and edited variables are provided that indicate health insurance coverage at any time in a given round, as well as at the MEPS interview dates and on December 31, 2021. Note that for persons who left the RU before the MEPS interview date or before December 31st, the variables measuring coverage at the interview date or on December 31st represent coverage at the date the person left the RU. Variables indicating coverage for Panel 25 members for any time in the round that end in “31” indicate coverage for the portion of Round 3 that occurred in calendar year 2021, unless noted otherwise (see “Dental and Prescription Drug Private Insurance” section). This is also true for data from Panel 24 Round 5 and Panel 23 Round 7 - the 2021 portion of those rounds is contained in the “31” variables. Variables indicating coverage for Panel 26 members ending in “53” indicate coverage at any time in Round 3, including the portion of the round that occurred in calendar year 2022. Similarly, all of the Panel 24 Round 7 data, including data collected in 2022, are contained in the “53” variables. For Round 3 coverage for Panel 26 members or Round 7 data for Panel 24 members that occurred in calendar year 2021, users should use variables ending in “yy”. The Panel 23 Round 8 data, Panel 24 Round 6 data, Panel 25 Round 4 data, and Panel 26 Round 2 data are stored in the 42 variables. The health insurance variables are constructed for the sources of health insurance coverage collected during the MEPS interviews (Panel 23 Rounds 7 through 9, Panel 24 Rounds 5 through 7, Panel 25 Rounds 3 through 5, and Panel 26 Rounds 1 through 3). Note that the Medicare variables on this file as well as the private insurance variables that indicate the particular source of private coverage (rather than any private coverage) only measure coverage at the interview date and on December 31st. Users should also note that the same general editing rules were followed for the month-by-month health insurance variables released on this public use file (see the Section “Monthly Health Insurance Indicators” for details). Editing programs checking for consistencies between these sets of variables were developed in order to provide as much consistency as possible between the round-specific indicators and the month-by-month indicators of insurance. Public sources include Medicare, TRICARE, Veteran’s Administration (VA), Medicaid/SCHIP, and other public hospital/physician coverage. IHS was not considered a public coverage source. Medicare Medicare coverage variables (MCARErr) and the edited versions of these variables (MCARErrX) were constructed similarly to the month-by-month Medicare variables. Since Medicare coverage is logically edited to continue for a person once it has been reported in MEPS, the Medicare coverage variables can be considered as either “coverage at any time in the round” or “coverage at the interview date” variables, with the same caveats as noted above regarding persons who left the RU prior to the interview date or regarding coverage on December 31st variables and restrictions on Round 3, Round 5, and Round 7 coverage to reflect coverage in 2021. Medicaid/SCHIP and Other Public Hospital/Physician Coverage Medicaid/SCHIP variables (MCAIDrr) and the edited versions of these variables (MCAIDrrX, and MCDATrrX) were constructed similarly to the month-by-month Medicaid/SCHIP variables. Other public coverage variables indicating coverage through other public hospital/physician insurance (GOVTArr and GOVAATrr); other public coverage that is an HMO (GOVTBrr and GOVBATrr); and other public coverage that pays a premium (GOVTCrr and GOVCATrr) were constructed similarly to the month-by-month Other Public variables. Any Public Insurance Any public insurance variables (PUBrrX and PUBATrrX) were constructed similarly to the month-by-month any public insurance variables. The variables indicating coverage through Veteran’s Administration (VAPROGrr and VAPRATrr) are included in this file and were constructed similarly to the Veteran’s Administration month-by-month variables. Private Insurance Variables identifying private insurance in general (PRIVrr and PRIVATrr) and specific private insurance sources (such as employer/union group insurance [PRIEUrr]; other group coverage [PRIOGrr]; coverage through an unknown private category [PRIDKrr]; coverage from a policyholder living outside the RU that is employer based coverage [PRIEUOrr]; coverage from a policyholder living outside the RU that is not employer-based coverage [PRINEOrr]; and coverage through an exchange [PRSTXrr]) were constructed similarly to the month-by-month variables in the Section “Monthly Health Insurance Indicators.” Variables indicating any private insurance coverage are available for the following time periods: at any time in a given round, at the interview date, and on December 31st. The variables for the specific sources of private coverage are only available for coverage on the interview dates and on December 31st. Any Insurance in Period Any insurance variables (INSrrX and INSATrrX) were constructed similarly to the month-by-month any insurance program variables. FY 2021 PUF Managed Care Variables (TRIST31X-PRVHMOyy) In addition to the month-by-month indicators of coverage, there are round-specific health insurance variables indicating coverage by an HMO or managed care plan. Managed care variables have been constructed from information on health insurance coverage at any time in a reference period and the characteristics of the plan. A separate set of managed care variables has been constructed for private insurance, Medicaid/SCHIP, and Medicare coverage. The purpose of these variables is to provide information on managed care participation during the portion of the three rounds (i.e., reference periods) that fall within the same calendar year. Managed care variables for calendar year 2021 are based on responses to health insurance questions asked during the Round 7, 8, and 9 interviews of Panel 23, the Round 5, 6, and 7 interviews of Panel 24, the Round 3, 4, and 5 interviews of Panel 25, and the Round 1, 2, and 3 interviews of Panel 26. Each managed care variable ends in “rr” where the first r denotes the interview round for Panel 25 and the second r denotes the round for Panel 26, respectively. For the two extended panels - Panel 23 and Panel 24 - the “31” variables contain data from Round 7 (Panel 23) or Round 5 (Panel 24), the “42” variables contain data from Round 8 (Panel 23) or Round 6 (Panel 24), and the “53” variables contain data from Round 9 (Panel 23) or Round 7 (Panel 24). The variables ending in “31” and “42” correspond to the first two interviews of each panel in the calendar year. Because Round 3 interviews typically overlap the final months of one year and the beginning months of the next year, the “31” managed care variables for Panels 24 and 25 indicate whether or not a person has coverage from a managed care plan in the 2021 calendar year. This is also the case for the Panel 23 Round 7 and Panel 24 Round 5 data - the “31” managed care variables are limited to the 2021 calendar year. Similarly, the Panel 24 Round 7 and Panel 26 Round 3 managed care variables indicate whether or not a person has coverage from a managed care plan in the 2021 calendar year, and the variables have been given the suffix “yy” (as opposed to “53”) to emphasize the restricted time frame. Further descriptions of the implications to managed care plan coverage due to the overlapping calendar year in Rounds 3 and 5 are detailed below. Construction of the managed care variables is straightforward, but three caveats are appropriate. First, MEPS estimates of the number of persons in HMOs are higher than figures reported by other sources, particularly those based on HMO industry data. The differences stem from the use of household-reported information, which may include respondent error, to determine HMO coverage in MEPS. Second, the managed care questions are asked about the last plan held by a person through his or her establishment (employer or insurer) even though the person could have had a different plan through the establishment at an earlier point during the interview period. As a result, in instances where a person changed his or her establishment-related insurance, the managed care variables describe the characteristics of the last plan held through the establishment. Third, the “yy” versions of the managed care variables are developed from Rounds 3, 5, and 7 variables that cover different time frames. Health insurance status variables for Round 3 are restricted to the same calendar year as the Round 1 and 2 data. The Rounds 3, 5, and 7 variables describing plan type, on the other hand, overlap the next calendar year, 2022. As a consequence, the “yy” managed care variables may not describe the characteristics of the last plan held in the calendar year if the person changed plans after the first of the year. The variables PRVHMOrr indicate coverage by a private HMO in Panel 26 Rounds 1 - 3, Panel 25 Rounds 3 - 5, Panel 24 Rounds 5 - 7, and Panel 23 Rounds 7 - 9. The variables MCRPHOrr indicate coverage by a Medicare managed care plan (or “Medicare Advantage” plan) in Panel 26 Rounds 1 - 3, Panel 25 Rounds 3 - 5, Panel 24 Rounds 5 - 7, and Panel 23 Rounds 7 - 9. The variables MCRPDrr indicate coverage by Medicare prescription drug benefit, also known as Part D, in Panel 26 Rounds 1 - 3, Panel 25 Rounds 3 - 5, and Panel 24 Rounds 5 - 7, and Panel 23 Rounds 7 - 9. The edited version of the Medicare prescription drug coverage variables (MCRPDrrX) include persons who are covered by both edited Medicare and edited Medicaid. The variables MCDHMOrr and MCDMCrr indicate coverage by a Medicaid or SCHIP HMO or managed care plan in Panel 26 Rounds 1 - 3, Panel 25 Rounds 3 - 5, Panel 24 Rounds 5 - 7, and Panel 23 Rounds 7 - 9. The Tricare plan variables are similarly defined. For Panel 26, the “31” version indicates coverage at any time in Round 1, the “42” version indicates coverage at any time in Round 2, and the “yy” version represents coverage at any time during the 2021 portion of Round 3. For Panel 25, the “31” version indicates coverage at any time during the 2021 portion of Round 3, the “42” version indicates coverage at any time in Round 4, and the “yy” version represents coverage at any time during Round 5 since Round 5 ends on 12/31/2021 for Panel 25. For Panel 24, the “31” version indicates coverage at any time during the 2021 portion of Round 5, the “42” version indicates coverage at any time in Round 6, and the “yy” version represents coverage at any time during the 2021 portion of Round 7. For Panel 23, the “31” version indicates coverage at any time during the 2021 portion of Round 7, the “42” version indicates coverage at any time during Round 8, and the “yy” version represents coverage at any time during Round 9, since Round 9 ends on 12/31/2021 for Panel 23. In the health insurance section of the questionnaire, respondents reporting private health insurance were asked to identify what types of coverage a person had via a checklist. If the respondent selected prescription drug or dental coverage from this checklist, variables were constructed to indicate prescription drug or dental coverage respectively. It should be noted, however, that in some cases respondents may have failed to identify prescription drug or dental coverage that was included as part of a hospital and physician plan. TRICARE Plan Variables In Fall 2022 the response options for the CAPI TRICARE questions HX125_01, HX260 and PR280_01 were changed. Options “Tricare Standard”, “Tricare Prime”, “Tricare Extra”, and “Tricare For Life” were replaced by the single response option “Tricare”. As a result the previous plan-specific variables TRICARE Standard (TRISTrrX), TRICARE Prime (TRIPRrrX), TRICARE Extra (TRIEXrrX), and TRICARE for Life (TRILIrrX) were dropped from the 2021 data, and new variable TRIrrX (Person Covered by Tricare at Any Time during the Reference Period) was added. Beginning in Panel 9 Rounds 4 and 5/Panel 10 Rounds 1 through 3, CHAMPVA was added to the list of TRICARE/CHAMPVA Plans collected in the instrument. Therefore, the variables TRICH42/yyX were created. The “31” version of this variable was constructed starting in 2006. It should be noted that the TRICARE Plan information was elicited from a pick-list, code-all-that-apply question that asked which type of TRICARE plan the person obtained. Beginning Panel 22 Round 3/Panel 23 Round 1, questions related to military health coverage were asked at the person-level. If it was reported that someone in the RU had coverage through military health care, a follow-up question was asked to determine who in the RU was covered; then, the pick-list, code-all-that-apply question described above was asked to determine which type of military coverage the person obtained. VA (Veteran’s Administration) was added to this list beginning Panel 22 Round 3/Panel 23 Round 1. In each round, the TRICARE variable has four possible values: 1 The person was covered by the TRICARE plan 2 The person was covered by CHAMPVA but not TRICARE 3 The person was not covered by TRICARE/CHAMPVA. -1 The person was out-of-scope. Medicare Managed Care Plans, Part B, and Prescription Drug Benefit Persons were assigned Medicare coverage based on their responses to the health insurance questions or through logical editing of the survey data. A small number of persons were edited to have Medicare. For this group, coverage through a managed care plan, Part B, and coverage by prescription drug plan questions were not asked. Since no Medicare establishment-person pair exists for this group, the persons’ Medicare managed care, Part B, and prescription drug benefit statuses are set to -15 (Cannot be Computed). For those persons who reported Medicare coverage based on their responses to the health insurance questions, the Medicare managed care plan, Part B, and prescription drug benefit questions were asked. Medicare managed care plan and prescription drug benefit questions were asked for each round a person indicates Medicare coverage. Medicare Part B questions were asked during the first report of Medicare only. The Medicare Part B indicator for those persons who indicated not having a Medicare card available was introduced for Panel 14 Round 2 and Panel 13 Round 4. For those persons who reported having Medicare coverage in Round 1, but did not have a Medicare card available, Medicare Part B coverage was set to -15 (Cannot be Computed). The Medicare prescription drug benefit variables (MCRPDrr) have been edited (MCRPDrrX) to turn on coverage for all persons who are covered by both edited Medicare and edited Medicaid regardless of the status on their unedited Medicare prescription drug benefit variable. In each round, the variables MCRPHOrr have five possible values: 1 The person was covered by Medicare and covered through a Medicare Managed Care or Medicare Advantage Plan. 2 The person was covered by Medicare but not covered through a Medicare Managed Care or Medicare Advantage Plan. 3 The person was not covered by Medicare. -15 The person was covered by Medicare but whether the coverage is through a Medicare Managed Care or Medicare Advantage Plan cannot be computed. -1 The person was out-of-scope. In each round, the variables MCRPDrr/MCRPDrrX have five possible values: 1 The person was covered by Medicare and covered by prescription drug benefit. 2 The person was covered by Medicare but not covered by prescription drug benefit. 3 The person was not covered by Medicare. -15 The person was covered by Medicare but prescription drug benefit coverage cannot be computed. -1 The person was out-of-scope. In each round, the variables MCRPBrr have five possible values: 1 The person was covered by Medicare and covered by Part B. 2 The person was covered by Medicare but not covered by Part B. 3 The person was not covered by Medicare. -15 The person was covered by Medicare but Part B cannot be computed. -1 The person was out-of-scope. Medicaid/SCHIP Managed Care Plans Persons were assigned Medicaid or SCHIP coverage based on their responses to the health insurance questions or through logical editing of the survey data. The number of persons who were edited to have Medicaid or SCHIP coverage is small. These persons indicated coverage through an Other Government program that was identified as being in a Medicaid HMO or gatekeeper plan that did not require premium payment from the insured party. By definition, respondents were asked about the managed care characteristics of this insurance coverage. Medicaid/SCHIP HMOs If Medicaid/SCHIP or Other Government programs were identified as the source of hospital/physician insurance coverage, the respondent was asked about the characteristics of the plan. The variables MCDHMOrr were set to “Yes” if an affirmative response was provided to the following question: Under {Medicaid{, also known as {STATE NAME FOR MEDICAID},} or {STATE CHIP NAME}/{PROGRAM NAME FROM HX160/HX270}, the program sponsored by a state or local government agency which provides hospital and physician benefits,}} {{are/is}/{were/was}} {PERSON 1}, {PERSON 2},{PERSON 3}, {PERSON 4},{PERSON N} enrolled in an HMO, that is a Health Maintenance Organization {between {START DATE} and {END DATE}}? [With an HMO, you must generally receive care from HMO physicians. If another doctor is seen, the expense is not covered unless you were referred by the HMO, or there was a medical emergency.] In subsequent rounds, for persons who had been previously identified as covered by Medicaid, the respondent was asked whether the name of the person’s insurance plan had changed since the previous interview. An affirmative response triggered the previous set of questions about managed care (name on list of Medicaid HMOs or signed up with an HMO). In each round, the variables MCDHMOrr have five possible values: 1 The person was covered by a Medicaid/SCHIP HMO. 2 The person was covered by Medicaid/SCHIP but the plan was not an HMO. 3 The person was not covered by Medicaid/SCHIP. -15 The person was covered by Medicaid/SCHIP but the plan type cannot be computed. -1 The person was out-of-scope. Medicaid/SCHIP Gatekeeper Plans If a person did not belong to a Medicaid/SCHIP HMO, a third question was used to determine whether the person was in a gatekeeper plan. The variables MCDMCrr were set to “Yes” if the respondent provided an affirmative response to the following question: {Does/Between {START DATE} and {END DATE}, did} {Medicaid{, {STATE NAME FOR MEDICAID},}or {STATE CHIP NAME}/{PROGRAM NAME FROM HX160/HX270}, the program sponsored by a state or local government agency which provides hospital and physician benefits,} require {PERSON 1}, {PERSON 2},{PERSON 3}, {PERSON 4}, {PERSON N} to sign up with a certain primary care doctor, group of doctors, or with a certain clinic which they must go to for all of their routine care? PROBE: Do not include emergency care or care from a specialist they were referred to. In each round, the variables MCDMCrr have five possible values: 1 The person was covered by a Medicaid/SCHIP gatekeeper plan. 2 The person was covered by Medicaid/SCHIP, but it was not a gatekeeper plan. 3 The person was not covered by Medicaid/SCHIP. -15 The person was covered by Medicaid/SCHIP but the plan type cannot be computed. -1 The person was out-of-scope. Private Managed Care Plans Persons with private insurance were identified from their responses to questions in the health insurance section of the MEPS questionnaire. In some cases, persons were assigned private insurance as a result of comments collected during the interview, but data editing was minimal. As a consequence, most persons with private insurance were asked about the characteristics of their plan, and their responses were used to identify HMO and gatekeeper plans. Private HMOs Persons with private insurance were classified as being covered by an HMO if they met any of the three following conditions:
{Is/Was} {your/{POLICYHOLDER}'s} {NAME OF INSURER} an HMO {as of {END DATE}}? {When answering this question, do not consider {your/his/her} insurance through Medicare.} [With an HMO, you must generally receive care from HMO physicians. For other doctors, the expense is not covered unless you were referred by the HMO or there was a medical emergency.] In subsequent rounds, policyholders were asked whether the name of their insurance plan had changed since the previous interview. An affirmative response triggered the detailed question about managed care (i.e., was the insurer an HMO). Some insured persons have more than one private plan. In these cases, if the policyholder identified any plan as an HMO, the variables PRVHMOrr were set to “Yes.” If a person had multiple plans and one or more were identified as not being an HMO and the other(s) had missing plan type information, the person-level variable was set to missing. Additionally, if a person had multiple plans and none were identified as an HMO, the person-level variable was set to “No.” In each round, the variables PRVHMOrr have five possible values: 1 The person was covered by a private HMO. 2 The person was covered by private insurance, but it was not an HMO. 3 The person was not covered by private insurance. -15 The person was covered by private insurance, but the plan type cannot be computed. -1 The person was out-of-scope. Dental and Prescription Drug Private Insurance Variables (DENTIN31-PMDINSyy) Dental Private Insurance Variables Round-specific variables (DENTINrr) are provided that indicate the person was covered by a private health insurance plan that included at least some dental coverage for each round of 2021. It should be noted that the information was elicited from a pick-list, code-all-that-apply, question that asked what type of health insurance a person obtained through an establishment. The list included: hospital and physician benefits including coverage through an HMO, Medigap coverage, vision coverage, dental, and prescription drugs. It is possible that some dental coverage provided by hospital and physician plans was not independently enumerated in this question. Users should also note that persons with missing information on dental benefits for all reported private plans and those who reported that they did not have dental coverage for one or more plans but had missing information on other plans are coded as not having private dental coverage. Persons with reported dental coverage from at least one reported private plan were coded as having private dental coverage. DENTIN53 reflects coverage for all of Panel 26 Round 3, all of Panel 25 Round 5, all of Panel 24 Round 7, and all of Panel 23 Round 9 where the end reference year for Panels 24 and 26 could extend into 2022. DENTIN31 for Panel 25 Round 3, Panel 24 Round 5, and Panel 23 Round 7 reflects coverage in 2020 and 2021 since the reference period for all three rounds spans both years. A second version of these dental coverage indicators was built to reflect only current year coverage (DNTINSrr). Prescription Drug Private Insurance Variables Round-specific variables (PMEDINrr) are provided that indicate the person was covered by a private health insurance plan that included at least some prescription drug insurance coverage for each round of 2021. It should be noted that the information was elicited from a pick-list, code-all-that-apply, question that asked what type of health insurance a person obtained through an establishment. The list included: hospital and physician benefits including coverage through an HMO, Medigap coverage, vision coverage, dental, and prescription drugs. It is possible some prescription drug coverage provided by hospital and physician plans was not independently enumerated in this question. Persons with reported prescription drug coverage from at least one reported private plan were coded as having private prescription drug coverage. Users should note that persons with missing information on prescription drug benefits for all reported private plans and those who reported that they did not have prescription drug coverage for one or more plans but had missing information on other plans are coded as not having private prescription drug coverage. PMEDIN53 reflects coverage for all of Panel 26 Round 3, all of Panel 25 Round 5, all of Panel 24 Round 7, and all of Panel 23 Round 9, where the end reference year for Panels 24 and 26 could extend into 2022. PMEDIN31 for Panel 25 Round 3, Panel 24 Round 5, and Panel 23 Round 7 reflects coverage in 2020 and 2021 since the reference period for all three rounds spans both years. A second version of these prescription drug coverage indicators was built to reflect only current year coverage (PMDINSrr). Medical Debt Variables (PROBPY42 - PYUNBL42) Questions relating to medical debt were asked in the health insurance section. Respondents in Round 2, Round 4, Round 6, or Round 8 were asked questions HX770 (“In the past 12 months did anyone in the family have problems paying or were unable to pay any medical bills?”), HX780 (“Does anyone in your family currently have any medical bills that are being paid off over time?”), and HX790 (“Does anyone in your family currently have any medical bills that you are unable to pay at all?”). The corresponding constructed variables PROBPY42, CRFMPY42, and PYUNBL42 are included in this file. PROBPY42 was set to 1 (Yes) if the respondent indicated that someone in their family had problems paying or were unable to pay any medical bills. Additional questions ascertained if anyone in the family currently had medical bills that were being paid off over time (CRFMPY42), and if anyone in the family currently had any medical bills that were unable to be paid at all (PYUNBL42). If the respondent indicated that someone in their family currently has any medical bills that are being paid off over time, then CRFMPY42 was set to 1 (Yes). Note that if the respondent indicates that no one in their family had problems paying medical bills, then PYUNBL42 is set to -1 (Inapplicable). Prescription Drug Usual Third Party Payer Variables (PMEDUP31-PMEDPY53) Round-specific variables are provided that indicate whether the sample member had a usual third party payer for prescription medications (PMEDUPrr), and if so, what type of payer (PMEDPYrr). These questions were asked only of sample members who reportedly had at least one prescription medication purchase in the round. In each interview, if the sample member reportedly had a third party payer, then the respondent was asked the name of the sample member’s usual third party payer. These responses were coded into the following source of payment categories in PMEDPYrr: Private Insurance, Medicare, Medicaid, VA/CHAMPVA, TRICARE, State/Local Government, and Other. Users should note that these questions were asked in the Prescription Medicine (PM) section of the questionnaire, and that no attempt was made to reconcile the responses with information collected in the health insurance section of the questionnaire. 2.5.10 Person-Level Medical Utilization Variables (OBTOTV21-HHINFD21)The person-level medical utilization variables will be provided in the forthcoming full year Consolidated file. 2.5.11 Changes in Variable ListVariables were added and removed from the file due to changes in the questions asked in 2021 relative to prior years. The MEPS HC questionnaires can be found on the MEPS website. Following is a list of changes to the variable list for the 2021 full-year data file. Added
Added (included in alternating years only, will not be included in 2022):
Deleted
Deleted (included in alternating years only, will be included in 2022):
2.6 Linking to Other Files2.6.1 Event and Condition FilesRecords on this file can be linked to 2021 MEPS HC public use event and conditions files by the sample person identifier (DUPERSID). The Panel 25 cases on this file (PANEL=25), the Panel 24 cases on this file (PANEL=24), and the Panel 23 cases on this file (PANEL=23) can also be linked back to the 2020 MEPS HC public use event and condition files. In addition, the Panel 23 and Panel 24 cases can be linked back to the 2019 MEPS HC public use event and conditions files, and the Panel 23 cases can be linked back to the 2018 MEPS HC public use event and condition files. 2.6.2 National Health Interview SurveyThe set of households selected for MEPS is a subsample of those participating in the National Health Interview Survey (NHIS); thus, each MEPS panel can also be linked back to the previous year’s NHIS public use data files. For information on obtaining MEPS/NHIS link files please see the AHRQ website. 2.6.3 Longitudinal AnalysisPanel-specific longitudinal files are available for downloading in the data section of the MEPS website. For all four panels (Panel 23, Panel 24, Panel 25, and Panel 26), the longitudinal file comprises MEPS survey data obtained in Rounds 1 through 5 of the panel and can be used to analyze changes over a two-year period. In addition, for Panel 23 a longitudinal file that comprises MEPS survey data obtained in Rounds 1 through 7 of the panel can be used to analyze changes over a three-year period. For Panel 24 a file representing a three-year period will also be established, and for Panel 23 a file representing a four-year period will be established. Variables in the file pertaining to survey administration, demographics, employment, health status, disability days, quality of care, patient satisfaction, health insurance, and medical care use and expenditures were obtained from the MEPS full-year Consolidated files from the two years covered by that panel. For more details or to download the data files, please see Longitudinal Weight files at the AHRQ website. 3.0 Survey Sample Information3.1 Discussion of Pandemic Effects on Quality of 2021 MEPS Data3.1.1 SummaryThe challenges associated with MEPS data collection in 2020 after the onset of the COVID-19 pandemic continued into 2021. The major modifications to the standard MEPS study design remained in effect, permitting data to be collected safely but with accompanying concerns related to the quality of the data obtained. These data quality issues are discussed below. The suggestion made in the documentation for the FY 2020 MEPS Consolidated PUF data (as well as for most federal major in-person surveys conducted in 2021 and 2020) still holds. Researchers are counseled to take care in the interpretation of estimates based on data collected from these two calendar years. This includes the comparison of such estimates to those of other years and corresponding trend analyses. 3.1.2 OverviewSection 3.1 of the documentation for the 2020 Full Year Consolidated Data File provides a general discussion of the impact of the COVID-19 pandemic on several other major in-person federal surveys as well as on MEPS. In addition, it offers a detailed look at how MEPS was modified to permit safe data collection and the development of useful estimates at a time when the way the U.S. health care system functioned underwent many transformations in order to meet population needs. In this corresponding 2021 document, focus is placed mostly on MEPS data quality in 2021. However, it also includes how data quality issues related to the two federal surveys most closely connected to it, the National Health Interview Survey (NHIS) carried out by the National Center for Health Statistics (NCHS) and the Current Population Survey (CPS) carried out by the Census Bureau, have an impact on the data quality issues of MEPS. Specifically, the following discussion describes: 1) data quality issues experienced by the NHIS and CPS that affect MEPS; 2) modifications to the MEPS sample design in 2021 due to the continuing pandemic; and 3) potential data quality issues in the FY 2021 MEPS data related to the COVID-19 pandemic. 3.1.3 Data Quality Issues for MEPS in 2021 Directly Associated with Data Quality Concerns for the NHIS and CPSHouseholds fielded for Round 1 of MEPS in each year have been selected as a subsample from among the NHIS responding households from the prior year. The MEPS first year panel in 2021 was Panel 26. The households fielded for MEPS in Round 1 of Panel 26 were thus selected from NHIS responding households in 2020. It is important to note here that the NHIS households eligible for use in MEPS are restricted to the first three quarters of the NHIS as the fourth quarter households cannot be made available in time for MEPS data collection early in the next calendar year. The onset of the pandemic in 2020 at a national level took place in mid-March of that year, when the NHIS data collection for the first quarter of 2020 was virtually completed and that of the second quarter was about to begin. The NHIS had to make a rapid transition from in-person to telephone interviewing in order to attempt to gather NHIS data for the second quarter of 2020. While NCHS was able to make the transition, assessments made by NCHS at the time indicated a much lower response rate than is typically experienced during Quarter 2 and the quality of Quarter 2 data was of particular concern. NCHS thus modified the 2020 NHIS sample design for Quarters 3 and 4. A randomly selected subsample of the sampled housing units originally selected for fielding in Quarters 3 and 4 of 2020 was removed from the sample to be fielded. This reduced sample for Quarters 3 and 4 was then enhanced by randomly selecting responding households from the 2019 NHIS for interviewing in 2020 as well. In consideration of the data quality issues and sample design modifications associated with the 2020 NHIS, the MEPS sample design for FY 2021 was modified, as will be discussed shortly. With respect to the CPS, the quality of CPS data is always of particular importance to MEPS as March CPS-ASEC estimates serve as the basis of control totals for the raking component of the MEPS weighting process. These control totals incorporate the following demographic variables: age, sex, race/ethnicity, region, MSA status, educational attainment, and poverty status. The CPS estimates of educational attainment and poverty status used in the development of the FY 2021 MEPS PUFs were of particular concern. Evaluations of these estimates undertaken by the Census Bureau have shown that they suffered from bias due to survey nonresponse with CPS income estimates being on the high side and the estimate of those under poverty being on the low side. The impact of these CPS estimates on the quality of MEPS estimates has been carefully considered. The approach used for the MEPS FY 2021 Population Characteristics PUF sample weights, where educational attainment is employed in the raking process but poverty status is not, is discussed in Section 3.5. A set of references discussing the fielding of these surveys during the pandemic and since, including possible bias concerns, can be found in the References section of this document. 3.1.4 Modifications to the MEPS-HC 2021 Sample DesignTwo key factors were thus expected to raise issues with MEPS plans for fielding a 2021 sample. First, 2020 NHIS data quality and sample size issues were of particular concern for Quarter 2 of that year. Second, roughly half of the NHIS sampled households for Quarter 3 would also have been respondents in the 2019 NHIS so that many of the Quarter 3 NHIS respondents were expected to have already been sampled and fielded for Panel 25 of MEPS. It thus became clear that it would be prudent to modify the 2021 MEPS sample design for MEPS Panel 26. Action had to be taken immediately because the MEPS sample selection from NHIS responding households begins in the late summer/early fall of each year. AHRQ contacted NCHS, reviewing the various issues and asking if it would be possible that responding households in NHIS Panels 2 and 4 from Quarter 1 of 2020 be made available for MEPS sample selection. Virtually all of these households were interviewed in-person prior to the major onset of the pandemic, so the Quarter 1 response rates for all four NHIS panels were consistent with prior years and the data quality issues associated with the pandemic could be avoided. NCHS was fully supportive of this approach and made NHIS Panels 2 and 4 for Quarter 1 available for use by MEPS. Thus, for MEPS Panel 26, the NHIS responding households subsampled from MEPS were selected from among all NHIS responding households in Quarter 1 as well as those responding in Quarter 3 that were not originally sampled for the 2019 NHIS. As an adjunct to this modification, it was decided to take advantage of the additional PSUs (sampled localities) available from NHIS Panels 2 and 4 and appearing in the MEPS sample for the first time. State level estimation is of interest to MEPS, and the added PSUs would serve to increase the precision for state level estimates. State estimates that would be expected to benefit the most from these added PSUs were the “middle-sized” states. The largest states already had large sample sizes while precision for the smallest states would remain low. As a result, the MEPS sample focused on oversampling the “middle-sized” states rather than Hispanics, Blacks, and Asians, as has usually been the practice. Finally, it was decided to collect data for Panels 23 and 24 for nine rounds, so that these two panels will ultimately contribute to MEPS estimates for four calendar years. In so doing, the number of respondents to MEPS will be kept at a relatively high level despite the decline in response rates due to the pandemic. The MEPS FY 2021 PUF records thus consist of data obtained from the following MEPS Panels and corresponding rounds: Panel 23, Rounds 7-9; Panel 24, Rounds 5-7; Panel 25, Rounds 3-5; and Panel 26, Rounds 1-3. 3.1.5 Data Quality Issues for MEPS for FY 2021Three sources of potential bias were identified for MEPS for FY 2020: long recall period for Round 6 of Panel 23; switching from in-person to telephone interviewing which likely had a larger impact on Panel 25; and the impact of CPS bias on the MEPS weights. A number of statistically significant differences were found between panels for FY 2020. Those findings are discussed in MEPS HC-224. With this in mind, there were a number of uncertainties for FY 2021 warranting examination. Would Panel 23 data quality increase substantially once the issue of an extensive recall period was eliminated? Would the switching from in-person to telephone interview in Round 1 continue to impact Panel 25 estimates? Since Panel 26 was the first year MEPS panel in 2021, would Panel 26 estimates tend to be different than those of the other three panels? Preliminary analyses undertaken to examine the quality of MEPS FY 2021 data appearing on the Population Characteristics PUF have been focused on the comparison of health insurance status distribution (some private insurance, some public insurance, no health insurance) for the MEPS target population between the panels fielded. These comparisons were undertaken for the full sample and the three age groups of 0-17, 18-64, and 65+. The analyses undertaken thus far suggest no major differences between the four panels for the distribution of health insurance status. Even though slight differences were observed with Panel 25 (e.g., the distribution associated with the age range 18-64 showed a higher percentage of all public insurance compared to the other three panels while those at least 65 years of age showed a lower percentage of some private insurance compared to the other three panels), no statistically significant differences were detected. In summary, the FY 2021 Population Characteristic PUF weights can be expected to produce useful estimates for initial analyses of MEPS 2021 data. Further analyses of MEPS estimates will be conducted as part of the production of the FY 2021 Consolidated PUF to be released later in 2023. This will help identify any additional data quality issues as well as possible improvements that could be implemented. 3.1.6 Discussion and GuidanceThe various actions taken in the development of the person-level weights for the MEPS FY 2021 Population Characteristics PUF were designed to limit the potential for bias in the data due to changes in data collection and response bias. However, evaluations of MEPS data quality in 2021, consistent with those of other Federal surveys fielded in 2021, suggest that users of the MEPS FY 2021 Population Characteristics PUF should exercise caution when interpreting estimates and assessing analyses based on these data as well as in comparing 2021 estimates to those of prior years. 3.2 Background on Sample Design and Response RatesThe MEPS is designed to produce estimates at the national and regional level over time for the civilian, noninstitutionalized population of the United States and some subpopulations of interest. The data in this public use file pertain to calendar year 2021. The data were collected in Rounds 1, 2, and 3 for MEPS Panel 26, Rounds 3, 4, and 5 for MEPS Panel 25, Rounds 5, 6 and 7 for MEPS Panel 24, and Rounds 7, 8, and 9 for MEPS Panel 23. As usual, Round 3 for a MEPS panel (this time for Panel 26) has been designed to overlap two calendar years, as illustrated below. However it may be noted that, with the fielding of a third and fourth panel in 2021 (as indicated in the data quality discussion in Section 3.1), the structure of other rounds has changed. Round 7 of Panel 23 and Round 5 of Panel 24 serve the same purpose. Thus, Round 7 of Panel 23 was fielded in 2020 and designed to collect data for the remainder of 2020 as well as the period of time from January 1, 2021 up through the date of the Round 7 interview. Round 5 of Panel 24 was designed for the same purpose, collecting data associated with both 2020 and 2021. This was done to permit all three of these panels to provide data for the FY 2021 MEPS data sets as well as those for FY 2020. A sample design feature shared by Panel 23, Panel 24, and Panel 25 involved the partitioning of the sample domain “Other” (serving as the catchall stratum and consisting mainly of households with “White” members) into two sample domains. This was done for the first time in Panel 16. The two domains distinguished between those households characterized as “complete” respondents to the NHIS; and those characterized as “partial completes.” NHIS “partial completes” typically have a lower response rate to MEPS and for all three MEPS panels the “partial” domain was sampled at a lower rate than the “complete” domain. This approach has served to reduce survey costs, since the “partials” tend to have higher costs in gaining survey participation, but has also increased sample variability due to the resulting increased variability in sampling rates. Starting with Panel 25, the “Other, Partial” domain includes the NHIS households that have provided only a roster of household members. For detailed information on the MEPS sample design, see Chowdhury et al (2019). This feature was not of particular emphasis in Panel 26, for reasons discussed in Section 3.1.4. 3.2.1 MEPS-Linked to the National Health Interview Survey (NHIS)Each responding household found in this 2021 MEPS dataset is associated with one of the four separate and overlapping MEPS panels, Panel 23, Panel 24, Panel 25, and Panel 26. These panels consist of subsamples of households participating in the 2017, 2018, 2019, and 2020 NHIS, respectively. The Full Year 2018 PUF was the first one where both MEPS panels reflect the new NHIS sample design first implemented in 2016. Whenever there is a change in sample or study design, it is good survey practice to assess whether such a change could affect the sample estimates. For example, increased coverage of the target populations with an updated sample design based on data from the latest Census can improve the accuracy of the sample estimates. MEPS estimates have been and will continue to be evaluated to determine if an important change in the survey estimates might be associated with a change in design. Discussion on the potential effects of such MEPS design change in 2021 appears in the data quality section, 3.1. Background on the two NHIS sample designs of interest here is provided next. Background on the NHIS Sample Redesign Implemented in 2016 Beginning in 2016, NCHS implemented another new sample design for the NHIS, which differed substantially from the prior design. Each of the 50 states as well as the District of Columbia served as explicit strata for sample selection purposes with the intent of providing the capability of state-level NHIS estimates obtained through pooling across years if the sample size for a single year would result in unreliable estimates. In contrast to the previous design, households in areas with relatively high concentrations of minorities are not oversampled. PSUs are still formed at the county level. However, within sampled PSUs, the clusters of addresses that have been sampled for each year of the NHIS are not in the form of segments (consisting of one or more census blocks) as was done for the previous NHIS designs. For the 2016 NHIS, each such cluster consisted of roughly 25 subclusters selected using random systematic sampling across the full geography of the PSU. Each subcluster is made up of, generally, four nearby addresses or roughly 100 addresses in all. The number of subclusters per cluster can vary from year to year. Another major change is that the list of DUs (addresses) was obtained from the Computerized Delivery Sequence File (CDSF) of the U.S. Postal Service, which is a different approach than the standard listing process for area probability samples used in the pre-2016 designs. While addresses in the CDSF provide very high coverage of most areas of the country, coverage in rural areas can be somewhat lower. For rural areas where this was a concern, address lists were created through the conventional listing process. A description of the NHIS sample design is provided by NCHS on the NHIS website. Panel 23 Household Sample Size A subsample of 9,700 households (occupied DUs) was randomly selected for MEPS Panel 23 from NHIS responding households in 2017, of which 9,694 were fielded for MEPS after the elimination of any units characterized as ineligible for fielding. Panel 24 Household Sample Size A subsample of 9,700 households was randomly selected for MEPS Panel 24 from the households responding to the 2018 NHIS, of which 9,684 were fielded for MEPS after the elimination of any units characterized as ineligible for fielding. Panel 25 Household Sample Size A subsample of 9,900 households was randomly selected for MEPS Panel 25 from the households responding to the 2019 NHIS, of which 9,888 were fielded for MEPS after the elimination of any units characterized as ineligible for fielding. Panel 26 Household Sample Size A subsample of 9,510 households was randomly selected for MEPS Panel 26 from the households responding to the 2020 NHIS, of which all 9,510 were fielded for MEPS after the elimination of any units characterized as ineligible for fielding. Implications of the New Design on MEPS Estimates Under the new design, MEPS sampled households reflect the clustering of the NHIS, as described above but to a somewhat lesser degree due to the sampling from NHIS respondents. Due to the spreading of the NHIS sample in small subclusters across the PSU and the sampling limited to only NHIS respondents, the impact of clustering on the variance of MEPS estimates may be more limited. Also, in contrast to the previous design, the NHIS sampling rates at the address level currently do not vary due to oversampling of minorities (although this could change in subsequent years). On balance, the overall variation in sampling rates/weights at the national level for the NHIS is expected to be lower with a corresponding positive impact on the precision of MEPS estimates. However, with a reduction in the sample sizes of minority households, precision levels of MEPS estimates for Blacks, Hispanics, and Asians may be reduced to some extent. 3.2.2 Sample Weights and Variance EstimationIn the dataset “MEPS HC-228: 2021 Full Year Population Characteristics,” weight variables are provided for generating MEPS estimates of totals, means, percentages, and rates for persons and families in the civilian noninstitutionalized population. Procedures and considerations associated with the construction and interpretation of person and family-level estimates using these and other variables are discussed below. It should be noted that NCHS has made a modification to the NHIS sample design that has affected the MEPS variance structure. This is discussed in more detail in Section 3.8.1. 3.3 The MEPS Sampling Process and Response Rates: An OverviewFor most MEPS panels, a sample representing about three-eighths of the NHIS responding households is made available for use in MEPS. This was the case for MEPS Panel 23, Panel 24, and Panel 25. For Panel 26, the sample made available for use in MEPS represented about five-eighths of the NHIS responding households for the reasons discussed in Section 3.1.4. Because the MEPS subsampling has to be done soon after NHIS responding households are identified, a small percentage of the NHIS households initially characterized as NHIS respondents are later classified as nonrespondents for the purposes of NHIS data analysis. This actually serves to increase the overall MEPS response rate slightly since the percentage of NHIS households designated for use in MEPS (all those characterized initially as respondents from the NHIS panels and quarters used by MEPS for a given year) is slightly larger than the final NHIS household-level response rate and some NHIS nonresponding households do participate in MEPS. However, as a result, these NHIS nonrespondents who are MEPS participants have no NHIS data available to link with MEPS data. Once the MEPS sample is selected from among the NHIS households, characterized as NHIS respondents, RUs consisting entirely of military personnel are deleted from the sample. Military personnel not living in the same RU as civilians are ineligible for MEPS. After these exclusions, all RUs associated with households, selected from among those identified as NHIS responding households are then fielded in the first round of MEPS. Table 3.1 shows in Rows A, B, and C the three informational components just discussed. Row A indicates the percentage of NHIS households eligible for MEPS. Row B indicates the number of NHIS households sampled for MEPS. Row C indicates the number of sampled households actually fielded for MEPS (after dropping the military members discussed above and a small number of NHIS households sampled in error). Note that all response rates discussed here are unweighted.
*Among the panels and quarters of the NHIS allocated to MEPS, the percentage of households that were considered to be NHIS respondents at the time the MEPS sample was selected. 3.3.1 Response RatesIn order to produce annual health care estimates for calendar year 2021 based on the full MEPS sample data from the MEPS Panel 23, Panel 24, Panel 25, and Panel 26 the four panels are combined. More specifically, full calendar year 2021 data collected in Rounds 7 through 9 for the MEPS Panel 23, Rounds 5, 6 and 7 for the MEPS Panel 24 and Rounds 3 through 5 for the MEPS Panel 25 sample are pooled with data from the first three rounds of data collection for the MEPS Panel 26 sample (the general approach is described below). As mentioned above, all response rates discussed here are unweighted. To understand the calculation of MEPS response rates, some features related to MEPS data collection should be noted. When an RU is visited for a round of data collection, changes in RU membership are identified. Such changes include the formation of student RUs as well as other new RUs created when RU members from a previous round have moved to another location in the U.S. Thus, the number of RUs eligible for MEPS interviewing in a given round is determined after data collection is fully completed. The ratio of the number of RUs completing the MEPS interview in a given round to the number of RUs characterized as eligible to complete the interview for that round represents the “conditional” response rate for that round expressed as a proportion. It is “conditional” in that it pertains to the set of RUs characterized as eligible for MEPS for that round and thus is “conditioned” on prior participation rather than representing the overall response rate through that round. For example, in Table 3.1, for Panel 26 Round 2 the ratio of 4,799 (Row G) to 6,045 (Row F) multiplied by 100 represents the response rate for the round (79.4 percent when computed), conditioned on the set of RUs characterized as eligible for MEPS for that round. Taking the product of the percentage of the NHIS sample eligible for MEPS (Row A) with the product of the ratios for a consecutive set of MEPS rounds beginning with Round 1 produces the overall response rate through the last MEPS round specified. The overall unweighted response rate for 2021 for the combined sample after pooling the respondents across the four panels was obtained by computing the product of the compositing factor associated with each panel (discussed in section 3.5.7 describing the development of the final weight for the FY 2021 Population Characteristics File) and the corresponding overall panel response rate and then summing the four products. Panel 26 represents about 31.5 percent of the combined sample size, Panel 25 represents about 24.7 percent of the combined sample size, Panel 24 represents about 22.1 percent and Panel 23 represents the remaining 21.7 percent. Thus, the combined response rate of 21.8 percent was computed as 0.22 times 22.0 (22.0 is the overall Panel 23 response rate through Round 9) plus 0.22 times 20.7 (20.7 is the overall Panel 24 response rate through Round 7) plus 0.25 times 19.6 (19.6 is the overall Panel 25 response rate through Round 5) plus 0.31 times 24.3 (24.3 is the overall Panel 26 response rate through Round 3.) The overall response rate of 21.8 percent for 2021 is lower than that for 2020 (27.6 percent), reflecting the continued impact of the pandemic on data collection efforts. 3.3.2 Panel 26 Response RatesFor MEPS Panel 26 Round 1, 9,510 households were fielded in 2021 (Row C of Table 3.1), a randomly selected subsample of the households responding to the 2020 NHIS. Table 3.1 shows the number of RUs eligible for interviewing in each Round of Panel 26 as well as the number of RUs completing the MEPS interview. Computing the individual round “conditional” response rates as described in Section 3.3.1 and then taking the product of these three response rates and the factor 60.6 (the percentage of the NHIS sampled households characterized as responding at the time of sample selection of households for MEPS) yields an overall response rate of 24.3 percent for Panel 26 through Round 3. 3.3.3 Panel 25 Response RatesA total of 9,888 households were fielded in 2020 for MEPS Panel 25 (as indicated in Row C of Table 3.1), a randomly selected subsample of the households responding to the 2019 National Health Interview Survey (NHIS). Table 3.1 shows the number of RUs eligible for interviewing and the number completing the interview for all five rounds of Panel 25. The overall response rate for Panel 25 was computed in a similar fashion to that of Panel 26 but covering all five rounds of MEPS interviewing as well as the factor representing the percentage of NHIS sampled households eligible for MEPS. The overall response rate for Panel 25 through Round 5 is 19.6 percent. 3.3.4 Panel 24 Response RatesA total of 9,684 households were fielded in 2019 for MEPS Panel 24 (as indicated in Row C of Table 3.1), a randomly selected subsample of the households responding to the 2018 National Health Interview Survey (NHIS). Table 3.1 shows the number of RUs eligible for interviewing and the number completing the interview for all seven rounds of Panel 24. The overall response rate for Panel 24 was computed in a similar fashion to that of Panel 25 but covering all seven rounds of MEPS interviewing as well as the factor representing the percentage of NHIS sampled households eligible for MEPS. The overall response rate for Panel 24 through Round 7 is 20.7 percent. 3.3.5 Panel 23 Response RatesA total of 9,694 households were fielded in 2018 for MEPS Panel 23 (as indicated in Row C of Table 3.1), a randomly selected subsample of the households responding to the 2017 National Health Interview Survey (NHIS). Table 3.1 shows the number of RUs eligible for interviewing and the number completing the interview for all nine rounds of Panel 23. The overall response rate for Panel 23 was computed in a similar fashion to that of Panel 24 but covering all nine rounds of MEPS interviewing as well the factor representing the percentage of NHIS sampled households eligible for MEPS. The overall response rate for Panel 23 through Round 9 is 22.0 percent. 3.3.6 Annual (Combined Panel) Response RateA combined panel response rate for the survey respondents in this data set is obtained by taking a weighted average of the panel specific response rates. The Panel 23 response rate was weighted by a factor of 0.22, the Panel 24 response rate was weighted by a factor of 0.22, the Panel 25 response rate was weighted by a factor of 0.25, and the Panel 26 response rate was weighted by a factor of 0.31, reflecting approximately the distribution of the overall sample between the four panels. The resulting combined response rate for the combined panels was computed as (0.22 x 22.0) plus (0.22 x 20.7) plus (0.25 x 19.6) plus (0.31 x 24.3) or 21.8 percent (as shown in Table 3.1). 3.3.7 OversamplingOversampling is a feature of the MEPS sample design, helping to increase the precision of estimates for some subgroups of interest. Before going into details related to MEPS, the concept of oversampling will be discussed. In a sample where all persons in a population are selected with the same probability and survey coverage of the population is high, the sample distribution is expected to be proportionate to the population distribution. For example, if Hispanics represent 15 percent of the general population, one would expect roughly 15 percent of the persons sampled to be Hispanic. However, in order to improve the precision of estimates for specific subgroups of a population, one might decide to select samples from those subgroups at higher rates than the remainder of the population. Thus, one might select Hispanics at twice the rate (i.e., at double the probability) of persons not oversampled. As a result, an oversampled subgroup comprises a higher proportion of the sample than it represents in the general population. Sample weights ensure that population estimates are not distorted by a disproportionate contribution from oversampled subgroups. Base sample weights for oversampled groups will be smaller than for the portion of the population not oversampled. For example, if a subgroup is sampled at roughly twice the rate of sample selection for the remainder of the population not oversampled, members of the oversampled subgroup will receive base or initial sample weights (prior to nonresponse or poststratification adjustments) that are roughly half the size of the group not oversampled. As mentioned above, oversampling is implemented to increase the sample sizes and thus improve the precision of survey estimates for particular subgroups of the population. The “cost” of oversampling is that the precision of estimates for the general population and subgroups not oversampled will be reduced to some extent compared to the precision one could have achieved if the same overall sample size were selected without any oversampling. The NHIS no longer oversamples households with members who are Asian, Black, or Hispanic. Nevertheless, these minority groups are still of analytic interest for MEPS. As a result for Panels 23, 24, and 25, all households in the Asian, Hispanic, and Black domains were sampled with certainty (i.e., all households assigned to those domains were included in the MEPS). In addition, all households in Panel 23 who had a member who was a veteran were also selected with certainty. Among all remaining households for Panel 23, the “Other, complete” domain was sampled at a rate of about 69 percent while the “Other, partial complete” domain was sampled at a rate of about 43 percent. For Panel 24, the corresponding sampling rates for the “Other, complete” domain and the “Other, partial complete” domain were about 79 percent and 50 percent, respectively. For Panel 25, the corresponding sampling rates for the “Other, complete” domain and the “Other, partial complete” domain were about 77 percent and 50 percent, respectively. The somewhat lower sampling rates for Panel 23 in the two “Other” domains arose due to the oversampling of veterans in that panel. With a specified overall sample size of 9,700 fewer were needed from those assigned to the “Other” domains in that panel. Within the “noncertainty” strata (the “Other” domains) for the three panels, responding NHIS households were selected for MEPS using a systematic sample selection procedure from among those eligible. The selection of the households was with probability proportionate to size (pps) where the size measure was the inverse of the NHIS initial probability of selection. The pps sampling was undertaken to help reduce the variability in the MEPS weights incurred due to the variability of the NHIS sampling rates. As discussed in Section 3.1.4, the Panel 26 sample focused on oversampling the “middle-sized” states rather than Hispanics, Blacks, and Asians. A note with respect to the interpretation of MEPS response rates, which are unweighted. Typically, sample allocations across sample domains change from one MEPS panel to another. The sample domains used may also vary by panel as is the case for Panel 23 versus Panel 24, Panel 25, and Panel 26. When one compares unweighted measures (e.g., response rates) between panels and years, one should take into account such differences. Suppose, for example, members of one domain have a lower propensity to respond than those of another domain. Then, if that domain has been allocated a higher proportion of the sample, the corresponding panel may have a lower unweighted response rate simply because of the differences in sample allocation. 3.4 Background on Person-Level Estimation Using this MEPS Public Use Release3.4.1 OverviewThere is a single full year person-level weight variable called PERWT21P. However, care should be taken in its application as it permits both “point-in-time” and “range of time” estimates, depending on the variables used to define the set of persons of interest for analysis. A person-level weight was assigned for each key, in-scope person who responded to MEPS for the full period of time that he or she was in-scope during 2021. A key person was either a member of a responding NHIS household at the time of interview or joined a family associated with such a household after being out-of-scope at the time of the NHIS (the latter circumstance includes newborns as well as those returning from military service, an institution, or residence in a foreign country). A person is in-scope whenever he or she is a member of the civilian, noninstitutionalized portion of the U.S. population. 3.4.2 Developing Person-Level EstimatesThe data in this file can be used to develop estimates on persons in the civilian, noninstitutionalized population at any time during 2021 and for the slightly smaller population of persons in the civilian, noninstitutionalized population on December 31, 2021. To obtain a cross-sectional (point-in-time) estimate for in-scope persons living in the country on December 31, 2021, the analysis should be restricted to cases where INSC1231=1 (the person is in-scope on December 31, 2021). The weight variable PERWT21P must be applied to the analytic variable(s) of interest to obtain either type of national estimate. Table 3.2 contains a summary of cases to include and sample sizes for the two populations described above.
3.5 Details on Person-Level Weights Construction3.5.1 OverviewThe person-level weight PERWT21P was developed in three stages. The person-level weight for Panel 23 was created, including both an adjustment for nonresponse over time and raking. The raking involved controlling to several sets of marginal control totals reflecting Current Population Survey (CPS) population estimates based on six different variables (identified below). The person-level weights for Panel 24, Panel 25, and Panel 26 were created, also including an adjustment for nonresponse over time and raking, where the raking established consistency with CPS population estimates based on the same six variables. A composite weight was formed from the Panel 23, Panel 24, Panel 25, and Panel 26 weights by multiplying the panel weights by factors corresponding to the relative effective sample sizes of the four panels. Then a final raking was undertaken on this composite weight variable, based on the same six variables used previously. 3.5.2 MEPS Panel 23 Weight Development ProcessThe person-level weight for MEPS Panel 23 was developed using the 2020 full-year weight for an individual as a “base” weight for survey participants present in 2021. For key, in-scope members who joined an RU some time in 2021 after being out-of-scope in 2020, the initially assigned person-level weight was the corresponding 2020 family weight. The weighting process included an adjustment for person-level nonresponse over Rounds 8 and 9 as well as raking to population control figures for December 2021 for key, responding persons in-scope on December 31, 2021. These control totals were derived by scaling back the population distribution obtained from the March 2022 CPS to reflect the December 31, 2021 estimated population total (estimated based on Census projections for January 1, 2022). Variables used for person-level raking included: education of the reference person (three categories: no degree; high school/GED only or some college; Bachelor’s or higher degree); Census region (Northeast, Midwest, South, West); MSA status (MSA, non-MSA); race/ethnicity (Hispanic; Black, non-Hispanic; Asian, non-Hispanic; and other); sex; and age. (It may be noted that for confidentiality reasons, the MSA status variables are no longer released for public use.) The final weight for key responding persons who were not in-scope on December 31, 2021 but were in-scope earlier in the year was the nonresponse-adjusted person weight without raking. The 2020 full-year weight used as the base weight for Panel 23 was derived from the 2018 MEPS Round 1 weight and reflected adjustment for nonresponse over the remaining data collection rounds in 2018, 2019, and 2020 as well as raking to the December 2018, December 2019, and December 2020 population control figures. 3.5.3 MEPS Panel 24 Weight Development ProcessThe person-level weight for MEPS Panel 24 was developed using the 2020 full-year weight for an individual as a “base” weight for survey participants present in 2021. For key, in-scope members who joined an RU some time in 2021 after being out-of-scope in 2020, the initially assigned person-level weight was the corresponding 2020 family weight. The weighting process also included an adjustment for person-level nonresponse over Rounds 6 and 7 as well as raking to the same population control figures for December 2021 used for the MEPS Panel 23 weights for key, responding persons in-scope on December 31, 2021. The same six variables employed for Panel 23 raking (education level, census region, MSA status, race/ethnicity, sex, and age) were also used for Panel 24 raking. Similar to Panel 23, the Panel 24 final weight for key, responding persons not in-scope on December 31, 2021 but in-scope earlier in the year was the nonresponse-adjusted person weight without raking. Note that the 2020 full-year weight that was used as the base weight for Panel 24 was derived using the 2019 MEPS Round 1 weight and reflected adjustment for nonresponse over the remaining data collection rounds in 2019 and 2020 as well as raking to the December 2019 and December 2020 population control figures. 3.5.4 MEPS Panel 25 Weight Development ProcessThe person-level weight for MEPS Panel 25 was developed using the 2020 full-year weight for an individual as a “base” weight for survey participants present in 2021. For key, in-scope members who joined an RU sometime in 2021 after being out-of-scope in 2020, the initially assigned person-level weight was the corresponding 2020 family weight. The weighting process also included an adjustment for person-level nonresponse over Rounds 4 and 5 as well as raking to the same population control figures for December 2021 used for the MEPS Panels 23 and 24 weights for key, responding persons in-scope on December 31, 2021. The same six variables employed for Panels 23 and 24 raking (education level, census region, MSA status, race/ethnicity, sex, and age) were also used for Panel 25 raking. Similar to Panels 23 and 24, the Panel 25 final weight for key, responding persons not in-scope on December 31, 2021 but in-scope earlier in the year was the nonresponse-adjusted person weight without raking. Note that the 2020 full-year weight that was used as the base weight for Panel 25 was derived using the 2020 MEPS Round 1 weight and reflected adjustment for nonresponse over the remaining data collection rounds in 2020 as well as raking to the December 2020 population control figures. 3.5.5 MEPS Panel 26 Weight Development ProcessThe person-level weight for MEPS Panel 26 was developed using the 2021 MEPS Round 1 person-level weight as a “base” weight. The MEPS Round 1 weights incorporated the following components: the original household probability of selection for the NHIS and for the NHIS subsample reserved for MEPS and an adjustment for NHIS nonresponse, the probability of selection for MEPS from NHIS, an adjustment for nonresponse at the dwelling unit level for Round 1, and poststratification to control figures at the person level obtained from the March CPS of the corresponding year. For key, in-scope members who joined an RU after Round 1, the Round 1 DU weight served as a “base” weight. The weighting process also included an adjustment for nonresponse over the remaining data collection rounds in 2021 as well as raking to the same population control figures for December 2021 used for the MEPS Panel 23, Panel 24, and Panel 25 weights for key, responding persons in-scope on December 31, 2021. The same six variables employed for Panel 23, Panel 24, and Panel 25 raking (education level of the reference person, census region, MSA status, race/ethnicity, sex, and age) were also used for Panel 26 raking. Similar to Panel 23, Panel 24, and Panel 25, the Panel 26 final weight for key, responding persons who were not in-scope on December 31, 2021 but were in-scope earlier in the year was the nonresponse-adjusted person weight without raking. 3.5.6 RakingBeginning with the Full Year 2002 files, “raking” has been employed for the “Full Year” MEPS weighting to calibrate survey weights to match designated population control totals, replacing the poststratification process previously employed. Raking is a commonly used process for adjusting survey weights so that estimates of subpopulation totals match more stable figures available from independent sources. It can be thought of as multi-dimensional poststratification that requires an iterative solution. Survey weights are poststratified to several sets of control figures (dimensions) in a sequential and continuous fashion until convergence is achieved. Convergence is the state where survey weights satisfy the criteria that the sums of the survey weights for the subgroups represented by the various dimensions are simultaneously within a specified distance of the corresponding control figures (e.g., within 1, 5, 10, etc. of the control totals). For instance, if one dimension in a raking effort was sex by MSA status and the specified distance was 5, then, after convergence has been achieved, the sum of the survey weights for males in MSA areas would be within ±5 of the control figure for males in MSA areas, the sum for females in MSA areas would be within ±5, etc. 3.5.7 The Final (Non-Poverty Adjusted) Weight for the 2021 Population Characteristics FileAs mentioned earlier, after raking the weights from each panel separately, a composited weight for use in representing the full set of MEPS respondents was formed from the individual panel weights by multiplying the weights of those in a given panel by the corresponding compositing factor associated with that panel. Then a final raking was undertaken on this composited weight variable, based on the same six variables used previously for raking. The purpose of the compositing factors is to establish an appropriate weight for estimation purposes across all FY 2021 MEPS respondents from the four panels after pooling their records into a single full-sample data base. If estimates from each of the four panels were unbiased, any four factors that are all less than 1 and that sum to 1 would be suitable. Using the relative nominal sample sizes (the proportions that the number of respondents in a panel represent among the total number of respondents in the four panels) has worked well for MEPS in previous years. However, choosing factors that reflect the relative ‘effective’ sample size (the inverse of the relative amount of variability in the individual panel estimates attributable to the variability of the sample weights and sample size) helps limit the variability of the estimates obtained using the composited weights across the four samples pooled. Beginning with the Full Year 2020 files, we have chosen to use relative effective sample size in order to increase the effectiveness of the compositing factors to some extent. One reason for doing this is to account for the more variable panel weights due to increasing nonresponse. The effective sample sizes were computed by dividing the sample sizes of each panel by the design effect associated with the variability of the nonresponse-adjusted person weights (i.e., prior to raking the weights of a panel) across the person-level respondents in the panel. The relative effective sample size was then computed by taking the ratio of the effective sample size for a panel to the sum of the effective sample sizes across the four panels. It should be noted that, as discussed in the data quality subsection in 3.1, there is evidence of bias in the preliminary estimates related to events for Panel 25. The compositing factors used produced full-sample estimates that were consistent with Panels 23, 24 and 26. indicating the compositing may also have been effective in limiting survey bias. This is viewed as a very positive result in terms of the development of estimates associated with data collected in 2021. Variables used in the raking of the composited person-level weights utilized the same variables that were employed in forming control totals for the individual panels (derived from CPS data). These were: education of the reference person (no degree, high school/GED no college or some college, Bachelor’s or a higher degree); census region (Northeast, Midwest, South, West); MSA status (MSA, non-MSA); race/ethnicity (Hispanic, Black but non-Hispanic, Asian, and other); sex; and age. Persons included in the raking process were those in-scope on December 31, 2021. (It may be noted that poverty status is included as a raking variable for producing the weight for the full-year consolidated file but not included in this version of the MEPS weights. This is because the poverty status variable is not available at the time this version of the MEPS weights is created. Additional time is required to process the income data collected and then assign persons to a poverty status category.) In addition, the weights of some persons out-of-scope on December 31, 2021 were poststratified. Specifically, the weights of persons out-of-scope on December 31, 2021 that were in-scope sometime during the year but were residing in a nursing home at the end of the year were adjusted to compensate for expected undercoverage of this subpopulation. Overall, the population estimate for the civilian, noninstitutionalized population over the course of the year (PERWT21P>0) is 331,249,393 (see Table 3.3). The estimated population total for those in-scope on December 31, 2021 (PERWT21P>0 and INSC1231=1) is 327,209,772.
3.5.8 A Note on MEPS Population EstimatesBeginning with the 2021 Full Year data, MEPS transitioned to 2020 census-based population estimates from the CPS for poststratification and raking. CPS estimates began reflecting 2020 census-based data in 2022, and the March 2022 CPS data serve as the basis for the 2021 MEPS weight calibration efforts. Use of the updated population controls will have a noticeable effect on estimated totals for some population subgroups. The article Adjustments to Household Survey Population Estimates in January 2022 compares some 2021 CPS estimates for those aged 16 and older “as published” with those that would have been generated had the updated population controls been used. Among the more notable increases were for the following subgroups: those aged 16-19 (about a half million more, a 3.5 percent increase); and Asians (170 thousand more, a 1 percent increase). Corresponding changes were thus anticipated for MEPS full year data beginning with the 2021 MEPS PUF. 3.5.9 CoverageThe target population associated with this MEPS database is the 2021 U.S. civilian, noninstitutionalized population. However, the MEPS sampled households are a subsample of the NHIS households interviewed in 2017 (Panel 23), 2018 (Panel 24), 2019 (Panel 25), and 2020 (Panel 26). New households created after the NHIS interviews for the respective panels and consisting exclusively of persons who entered the target population after 2017 (Panel 23), after 2018 (Panel 24), after 2019 (Panel 25), or after 2020 (Panel 26) are not covered by MEPS. Neither are previously out-of-scope persons who join an existing household but are unrelated to the current household residents. Persons not covered by a given MEPS panel thus include some members of the following groups: immigrants, persons leaving the military, U.S. citizens returning from residence in another country, and persons leaving institutions. Those not covered represent a small proportion of the MEPS target population. 3.6 No Family, SAQ, or DCS Weights on this Public Use FileDue to relatively limited opportunities for family-level analysis with the data on this file, family weights are not included on this file. However, family weights will be created for the Full Year 2021 MEPS Consolidated public use file where expenditure and income data are provided. To maintain consistency in terms of file structure with the upcoming public use file with expenditure and income data, records for those persons who will receive a positive family weight but not a positive person weight have been placed on this public use file. These records will be the only records without a positive person weight appearing on this file. While not appearing on this PUF, the family weights and those associated with the Self-Administered Questionnaire (SAQ) and the Diabetes Care Survey (DCS) will be provided on the 2021 FY Consolidated PUF. 3.7 Weights and Response Rates for the Social Determinants of Health SurveyFor analytic purposes, a single person-level weight variable, SDOHWT21P, has been provided for use with the data obtained from the Social Determinants of Health (SDOH) questionnaire. This questionnaire was administered in Panel 26, Round 1, Panel 25 Round 3, Panel 24 Round 5, and Panel 23 Round 7. These were the rounds associated with the initial fielding of each panel in 2021. The questionnaire was to be completed by each adult (person aged 18 or older) in the family. Thus, the target population for the SDOH is adults in the civilian, noninstitutionalized population at the time data were collected for Rounds 1/3/5/7 (generally speaking, the late winter/spring of 2021). The final full-year person-level SDOH weight for 2021 was constructed as follows with only those with a 2021 full year person weight (PERWT21P>0) eligible to receive the 2021 SDOH weight. The weighting process for the SDOH weight initiated with the 2021 full year person weights of each panel. The full sample person-level weights were adjusted to account for nonresponse to the SDOH for each panel separately. A raking process similar to the full sample weight were then implemented for each panel. The final steps were to composite the weights from thefour panels and re-rake the composited weights. March 2021 CPS control totals for each panel separately. Variables used in the nonresponse adjustment process were region, MSA status, family size, marital status, level of education, employment status of the reference person, health status, health insurance status, age, sex, and race/ethnicity. The March 2021 CPS control totals were formed based on the following variables: educational attainment of the reference person, region, MSA status, age, sex, and race/ethnicity. The age categories were developed after excluding ages under 18 since only adults were eligible for the SDOH. This final weight was assigned the variable name SDOHWT21P. In all, there were 18,400 persons assigned an SDOH weight with the sum of the weights being 252,273,615 (an estimate of the civilian, noninstitutionalized population aged 18 or older at the time the SDOH was administered). The unweighted response rates for the 2021 SDOH for each panel were: Panel 23, 79.7 percent; Panel 24, 75.5 percent; Panel 25, 76.8 percent; and Panel 26, 68.5. The pooled unweighted response rate for the SDOH survey respondents has been computed by taking a weighted average of the panel-specific response rates, where the weights were the same factors used for the weight compositing as described in Section 3.3 (these values are: Panel 23, 0.22; Panel 24, 0.22; Panel 25, 0.25; and Panel 26, 0.31). The pooled unweighted response rate for the combined panels for the 2021 SDOH is 74.5 percent. 3.8 Variance EstimationTo obtain estimates of variability (such as the standard error of sample estimates or corresponding confidence intervals) for MEPS estimates, analysts need to take into account the complex sample design of MEPS for both person-level and family-level analyses. Several methodologies have been developed for estimating standard errors for surveys with a complex sample design, including the Taylor-series linearization method, balanced repeated replication, and jackknife replication. Various software packages provide analysts with the capability of implementing these methodologies. MEPS analysts most commonly use the Taylor Series approach. Although this data file does not contain replicate weights, the capability of employing replicate weights constructed using the Balanced Repeated Replication (BRR) methodology is also provided if needed to develop variances for more complex estimators (see Section 3.8.2). 3.8.1 Taylor-series Linearization MethodThe variables needed to calculate appropriate standard errors based on the Taylor-series linearization method are included on this and all other MEPS public use files. Software packages that permit the use of the Taylor-series linearization method include SUDAAN, R, Stata, SAS (version 8.2 and higher), and SPSS (version 12.0 and higher). For complete information on the capabilities of a package, analysts should refer to the corresponding software user documentation. Using the Taylor-series linearization method, variance estimation strata and the variance estimation PSUs within these strata must be specified. The variables VARSTR and VARPSU on this MEPS data file serve to identify the sampling strata and primary sampling units required by the variance estimation programs. Specifying a “with replacement” design in one of the previously mentioned computer software packages will provide estimated standard errors appropriate for assessing the variability of MEPS survey estimates. It should be noted that the number of degrees of freedom associated with estimates of variability indicated by such a package may not appropriately reflect the number available. For variables of interest distributed throughout the country (and thus the MEPS sample PSUs), one can generally expect to have at least 100 degrees of freedom associated with the estimated standard errors for national estimates based on this MEPS database. Prior to 2002, MEPS variance strata and PSUs were developed independently from year to year, and the last two characters of the strata and PSU variable names denoted the year. Beginning with the 2002 Point-in-Time PUF, the approach changed with the intention that variance strata and PSUs would be developed to be compatible with all future PUFs until the NHIS design changed. Thus, when pooling data across years 2002 through the Panel 11 component of the 2007 files, the variance strata and PSU variables provided can be used without modification for variance estimation purposes for estimates covering multiple years of data. There are 203 variance estimation strata, each stratum with either two or three variance estimation PSUs. From Panel 12 of the 2007 files, a new set of variance strata and PSUs were developed because of the introduction of a new NHIS design. There are 165 variance strata with either two or three variance estimation PSUs per stratum, starting from Panel 12. Therefore, there are a total of 368 (203+165) variance strata in the 2007 Full Year file as it consists of two panels that were selected under two independent NHIS sample designs. Since both MEPS panels in the Full Year files from 2008 through 2016 were based on the next NHIS design, there are only 165 variance strata. These variance strata (VARSTR values) have been numbered from 1001 to 1165 so that they can be readily distinguished from those developed under the former NHIS sample design if data are pooled for several years. The NHIS sample design was changed again in 2016, effectively changing the MEPS design beginning with calendar year 2017. From Panel 22 of the 2017 files, a new set of variance strata and PSUs were developed. There are 117 variance strata with either two or three variance estimation PSUs per stratum. Therefore, there are a total of 282 (165+117) variance strata in the 2017 Full Year file as it consists of two panels that were selected under two independent NHIS sample designs. To make the pooling of data across multiple years of MEPS more straightforward, the numbering system for the variance strata has changed. Those strata associated with the new design were numbered from 2001 to 2117. However, the NHIS sample design was further modified in 2018. With the modification in the 2018 NHIS sample design, the MEPS variance structure for the 2019 Full Year file was also modified, reducing the number of variance strata to 105. Consistency was maintained with the prior structure in that the 2019 Full Year file variance strata were also numbered within the range of values from 2001-2117, although there are now gaps in the values assigned within this range. Due to the modification, each stratum could contain up to five variance estimation PSUs. For Panel 26 in the 2021 Full Year file, additional NHIS sample was used for MEPS to account for increasing nonresponse during the pandemic (as discussed in Section 3.1.4). The additional sample was assigned to the existing variance strata, so the 2021 Full Year file continues to have 105 variance strata, numbered 2001-2117, with a few gaps in the values in that range. In many cases, the additional sample was assigned to new variance estimation PSUs, so in the 2021 Full Year file, each stratum could contain up to eight variance estimation PSUs. Some analysts may be interested in pooling data across multiple years of MEPS data. If pooling across years is to be undertaken, it should be noted that, to obtain appropriate standard errors when doing so, it is necessary to specify a common variance structure. Prior to 2002, each annual MEPS public use file was released with a variance structure unique to the particular MEPS sample in that year. Starting in 2002, the annual MEPS public use files were released with a common variance structure that allowed users to pool data from 2002 through 2018. However, with the need to modify the variance structure beginning with 2019, this can no longer be routinely done. To ensure that variance strata are identified appropriately for variance estimation purposes when pooling MEPS data across several years, one can proceed as follows:
3.8.2 Balanced Repeated Replication (BRR) MethodBRR replicate weights are not provided on this MEPS PUF for the purposes of variance estimation. However, a file containing a BRR replication structure is made available so that the users can form replicate weights, if desired, from the final MEPS weight to compute variances of MEPS estimates using either BRR or Fay’s modified BRR (Fay, 1989) methods. The replicate weights are useful to compute variances of complex non-linear estimators for which a Taylor linear form is not easy to derive and not available in commonly used software. For instance, it is not possible to calculate the variances of a median or the ratio of two medians using the Taylor linearization method. For these types of estimators, users may calculate a variance using BRR or Fay’s modified BRR methods. However, it should be noted that the replicate weights have been derived from the final weight through a shortcut approach. Specifically, the replicate weights are not computed starting with the base weight and all adjustments made in different stages of weighting are not applied independently in each replicate. Thus, the variances computed using this one-step BRR do not capture the effects of all weighting adjustments that would be captured in a set of fully developed BRR replicate weights. The Taylor Series approach does not fully capture the effects of the different weighting adjustments either. The dataset, HC-036BRR, MEPS 1996-2021 Replicates for Variance Estimation File, contains the information necessary to construct the BRR replicates. It contains a set of 128 flags (BRR1-BRR128) in the form of half sample indicators, each of which is coded 0 or 1 to indicate whether the person should or should not be included in that particular replicate. These flags can be used in conjunction with the full-year weight to construct the BRR replicate weights. For analysis of MEPS data pooled across years, the BRR replicates can be formed in the same way using the HC-036, MEPS 1996-2021 Pooled Linkage Variance Estimation File. For more information about creating BRR replicates, users can refer to the documentation for the HC-036BRR pooled linkage file on the AHRQ website. 3.9 Using MEPS Data for Trend AnalysisFirst, of course, we note that there are uncertainties associated with 2020 and 2021 data quality for reasons discussed earlier in the data quality section (Section 3.1). Preliminary evaluations of a set of MEPS estimates of particular importance suggest that they are of reasonable quality. Nevertheless, analysts are advised to exercise caution in interpreting these estimates, particularly in terms of trend analyses since access to health care was substantially affected by the COVID-19 pandemic as were related factors such as health insurance and employment status for many people. MEPS began in 1996, and the utility of the survey for analyzing health care trends expands with each additional year of data; however, when examining trends over time using MEPS, the length of time being analyzed should be considered. In particular, large shifts in survey estimates over short periods of time (e.g. from one year to the next) that are statistically significant should be interpreted with caution unless they are attributable to known factors such as changes in public policy, economic conditions, or MEPS survey methodology. With respect to methodological considerations, in 2013 MEPS introduced an effort focused on field procedure changes such as interviewer training to obtain more complete information about health care utilization from MEPS respondents with full implementation in 2014. This effort likely resulted in improved data quality and a reduction in underreporting starting in the second half of 2013 and throughout 2014 full year files and have had some impact on analyses involving trends in utilization across years. The changes in the NHIS sample design in 2016 and 2018 could also potentially affect trend analyses. The new NHIS sample design is based on more up-to-date information related to the distribution of housing units across the U.S. As a result, it can be expected to better cover the full U.S. civilian, noninstitutionalized population, the target population for MEPS, as well as many of its subpopulations. Better coverage of the target population helps to reduce the potential for bias in both NHIS and MEPS estimates. Another change with the potential to affect trend analysis involved major modifications to the MEPS instrument design and data collection process, particularly in the events sections of the instrument. These were introduced in the Spring of 2018 and thus affected data beginning with Round 1 of Panel 23, Round 3 of Panel 22, and Round 5 of Panel 21. Since the Full Year 2017 PUFs were established from data collected in Rounds 1-3 of Panel 22 and Rounds 3-5 of Panel 21, they reflected two different instrument designs. In order to mitigate the effect of such differences within the same full year file, the Panel 22 Round 3 data and the Panel 21 Round 5 data were transformed to make them as consistent as possible with data collected under the previous design. The changes in the instrument were designed to make the data collection effort more efficient and easy to administer. In addition, expectations were that data on some items, such as those related to health care events, would be more complete with the potential of identifying more events. Increases in service use reported, since the implementation of these changes are consistent with these expectations. Data users should be aware of possible impacts on the data and especially trend analyses for these data years due to the design transition. Process changes, such as data editing and imputation, may also affect trend analyses. For example, users should refer to Section 2.5.11 in the 2021 Consolidated file (HC-233) and, for more detail, the documentation for the prescription drug file (HC-229A) when analyzing prescription drug spending over time. As always, it is recommended that data users review relevant sections of the documentation for descriptions of these types of changes that might affect the interpretation of changes over time before undertaking trend analyses. Analysts may also wish to consider using statistical techniques to smooth or stabilize analyses of trends using MEPS data such as comparing pooled time periods (e.g. 1996-1997 versus 2011-2012), working with moving averages or using modeling techniques with several consecutive years of MEPS data to test the fit of specified patterns over time. Finally, statistical significance tests should be conducted to assess the likelihood that observed trends are not attributable to sampling variation. In addition, researchers should be aware of the impact of multiple comparisons on Type I error. Without making appropriate allowance for multiple comparisons, undertaking numerous statistical significance tests of trends increases the likelihood of concluding that a change has taken place when one has not. ReferencesBethel C.D., Read D., Stein R.E.K., Blumberg S.J., Wells N., Newacheck P.W. (2002). Identifying Children with Special Health Care Needs: Development and Evaluation of a Short Screening Instrument. Ambulatory Pediatrics 2(1), 38-48. Bramlett, M.D., Dahlhamer, J.M., & Bose, J. (2021, September). Weighting Procedures and Bias Assessment for the 2020 National Health Interview Survey. Centers for Disease Control and Prevention. Chowdhury, S.R., Machlin, S.R., & Gwet, K.L. Sample Designs of the Medical Expenditure Panel Survey Household Component, 1996-2006 and 2007-2016. Methodology Report #33. January 2019. Agency for Healthcare Research and Quality, Rockville, MD. Current Population Survey: 2021 Annual Social and Economic (ASEC) Supplement. (2021). U.S. Census Bureau. Dahlhamer, J.M., Bramlett, M.D., Maitland, A., & Blumberg, S.J. (2021). Preliminary evaluation of nonresponse bias due to the COVID-19 pandemic on National Health Interview Survey estimates, April-June 2020. National Center for Health Statistics. Fay, R.E. (1989). Theory and Application of Replicate Weighting for Variance Calculations. Proceedings of the Survey Research Methods Sections, ASA, 212-217. Lau, D.T., Sosa, P., Dasgupta, N., & He, H. (2021). Impact of the COVID-19 Pandemic on Public Health Surveillance and Survey Data Collections in the United States. American Journal of Public Health, 111 (12), pp. 2118-2121. Rothbaum, J. & Bee, A. (2021, May 3). Coronavirus Infects Surveys, Too: Survey Nonresponse Bias and the Coronavirus Pandemic. U.S. Census Bureau. Rothbaum, J. & Bee, A. (2022, September 13). How Has the Pandemic Continued to Affect Survey Response? Using Administrative Data to Evaluate Nonresponse in the 2022 Current Population Survey Annual Social and Economic Supplement. U.S. Census Bureau. Zuvekas, S.H. & Kashihara, D. (2021). The Impacts of the COVID-19 Pandemic on the Medical Expenditure Panel Survey. American Journal of Public Health, 111 (12), pp. 2157-2166. D. Variable-Source CrosswalkFOR MEPS HC-228: 2021 FULL YEAR DATA FILE
Appendix 1
|
Condensed Industry Code | Census Industry Code Range | Description |
---|---|---|
1 | 0170 - 0290 | Natural Resources |
2 | 0370 - 0490 | Mining |
3 | 0770 | Construction |
4 | 1070 - 3990 | Manufacturing |
5 | 4070 - 5790 | Wholesale and Retail Trade |
6 | 0570 - 0690, 6070 - 6390 | Transportation and Utilities |
7 | 6470 - 6780 | Information |
8 | 6870 - 7190 | Financial Activities |
9 | 7270 - 7790 | Professional and Business Services |
10 | 7860 - 8470 | Education, Health, and Social Services |
11 | 8560 - 8690 | Leisure and Hospitality |
12 | 8770 - 9290 | Other Services |
13 | 9370 - 9590 | Public Administration |
14 | 9890 | Military |
15 | 9990 | Unclassifiable Industry |
MEPS uses the 4-digit Census occupation and industry coding systems developed for the Current Population Survey and the American Community Survey.
For industry coding, MEPS uses the 2007 4-digit Census industry codes. Descriptions of the 4-digit Census industry codes can be found at the U.S. Census Bureau website.
See Census IO Index for more information on the Census coding systems used by MEPS.
Condensed Occupation Code | Census Occupation Code Range | Description |
---|---|---|
1 | 0010 - 0950 | Management, Business, and Financial Operations Occupations |
2 | 1005 - 3540 | Professional and Related Occupations |
3 | 3600 - 4650 | Service Occupations |
4 | 4700 - 4965 | Sales and Related Occupations |
5 | 5000 - 5940 | Office and Administrative Support Occupations |
6 | 6005 - 6130 | Farming, Fishing, and Forestry Occupations |
7 | 6200 - 7630 | Construction, Extraction, and Maintenance Occupations |
8 | 7700 - 9750 | Production, Transportation, and Material Moving Occupations |
9 | 9840 | Military Specific Occupations |
10 | 9920 | Not in Labor Force |
11 | 9990 | Unclassifiable Occupation |
MEPS uses the 4-digit Census occupation and industry coding systems developed for the Current Population Survey and the American Community Survey.
For occupation coding, MEPS uses the 2010 4-digit Census occupation codes. Descriptions of the 4-digit Census occupation codes can be found at the U.S. Bureau of Labor Statistics website.
See the Census IO Index for more information on the Census coding systems used by MEPS.