Protocol - Ethnicity and Race

Add to My Toolkit

This protocol includes 2 questions that ask about ethnicity and race with answers that are multiple choice. For Ethnicity, the respondent can specify an ‘other’ ethnicity. For Race, the respondent can select one or more answer choice and write in their specific origin.

Specific Instructions

Open ended responses can pose challenges to analyzing and interpreting data. To assist with standardization of reporting, a list of countries is provided here for use with this protocol.

Person 1 refers to the respondent.


NOTE: Please answer BOTH Question 1 about Hispanic origin and Question 2 about race. For this census, Hispanic origins are not races.

1. Is Person 1 of Hispanic, Latino, or Spanish origin?

[ ] No, not of Hispanic, Latino, or Spanish origin

[ ] Yes, Mexican, Mexican Am., Chicano

[ ] Yes, Puerto Rican

[ ] Yes, Cuban

[ ] Yes, another Hispanic, Latino, or Spanish origin – Print, for example, Salvadoran, Dominican, Colombian, Guatemalan, Spaniard, Ecuadorian, etc. __________________ (Recommended specific choices)

2. What is person 1’s race? Mark one or more boxes AND print origins.

[ ] White – Print, for example, German, Irish, English, Italian, Lebanese, Egyptian, etc. ______________ (Recommended specific choices)

[ ] Black or African Am. – Print, for example, African American, Jamaican, Haitian, Nigerian, Ethiopian, Somali, etc. ______________(Recommended specific choices)

[ ] American Indian or Alaska Native – Print name of enrolled or principal tribe(s), for example, Navajo Nation, Blackfeet Tribe, Mayan, Aztec, Native Village of Barrow Inupiat Traditional Government, Nome Eskimo Community, etc. _____________ (Recommended specific choices)

[ ] Chinese

[ ] Filipino

[ ] Asian Indian

[ ] Vietnamese

[ ] Korean

[ ] Japanese

[ ] Other Asian – Print, for example, Pakistani, Cambodian, Hmong, etc. ____________ (Recommended specific choices)

[ ] Native Hawaiian

[ ] Samoan

[ ] Chamorro

[ ] Other Pacific Islander – Print, for example, Tongan, Fijian, Marshallese, etc. _____________ (Recommended specific choices)

[ ] Some other race – Print race of origin. _____________________

Protocol Name from Source

U.S. Census Bureau, Census 2020, Questionnaire



Personnel and Training Required


Equipment Needs

The PhenX Steering Committee acknowledges these questions can be administered in a computerized or noncomputerized format (i.e., paper-and-pencil instrument). Computer software is necessary to develop computer-assisted instruments. The interviewer will require a laptop computer/handheld computer to administer a computer-assisted questionnaire.

Requirement CategoryRequired
Major equipment No
Specialized training No
Specialized requirements for biospecimen collection No
Average time of greater than 15 minutes in an unaffected individual No
Mode of Administration

Self-administered questionnaire

Life Stage

Infant, Toddler, Child, Adolescent, Adult, Senior, Pregnancy


Adults 18 and older

Selection Rationale



English, Spanish

Common Data Elements (CDE) Ethnic Group Category Text 2192217 CDE Browser
Common Data Elements (CDE) Race Category Text 2192199 CDE Browser
Derived Variables


Process and Review

Not Applicable


United States Census 2020 Questionnaire – questions 8 and 9

All of Us Research Program, Participant Provided Information (PPI), Version: December 17, 2018

General References


Protocol ID


Export Variables
Variable Name Variable IDVariable DescriptiondbGaP Mapping
Measure Name

Ethnicity and Race

Release Date

August 1, 2020


This measure includes two items. The first asks if the person considers himself or herself to be Hispanic, Latino or Spanish and, if so, asks an additional question about his or her specific Hispanic origin is asked. One or more Hispanic origin categories may be selected.The second question asks about race, with multiple close-ended answer choices and optional write-in to provide more country of origin.Both questions ask for self-reported information.


Ethnicity is used to stratify study populations and to associate those populations with physical, geographic, biological, social, and cultural characteristics (e.g., Dominican). Ethnicity is a social and epidemiological factor, and individuals of some ethnicity are at greater risk for disease. By capturing the ethnicity of respondents, the researcher will be able to identify those who are or are not of Hispanic origin and will be able to stratify the study population accordingly.Race is used to stratify study populations and to associate those populations with physical, geographic, biological, social, and cultural characteristics (e.g., African Americans). Race is a social and epidemiological factor, and individuals of some races are at greater risk for certain diseases.


U.S. Census Bureau, Census 2020, All of Us, Demographics