Skin Color Protocols

Ensuring diversity of skin pigments in a medical device study has become recognized as increasingly important yet challenging. Our protocol for optimization skin pigment diversity

What is skin color?

There are many terms related to skin color that are often used interchangeably, but in fact may have different meanings. Here are a few key definitions from the terminology section of this toolkit:

Skin color

Human skin color is the result of how light interacts with the skin surface and deeper tissues. It is impacted by multiple chromophores or molecules that absorb light at specific wavelengths and thus effectively reflect/emit color. The main chromophores impacting human skin color include oxyhaemoglobin (red), deoxygenated hemoglobin (dark red/brown), carotenoids (a yellow-orange exogenous pigment), bilirubin (yellow), biliverdin (green) and melanin (brown). The epidermis contains melanin but little hemoglobin while the dermis contains little melanin but significant vascularity (e.g. hemoglobin). Color and pigmentation can be impacted by numerous environmental, pharmacologic, and medical factors. (See also ‘race’ and ‘ethnicity’)

Skin pigment

Human skin pigments include two forms of melanin (eumelanin, and pheomelanin), which are produced and contained in the epidermal or outer layer of the skin. Pheomelanin is associated with light red or yellow colors, while eumelanin is associated with dark brown or black colors. The amount of melanin in the skin impacts color (i.e. more melanin appears darker). Color and pigmentation can be impacted by numerous environmental, pharmacologic, and medical factors. Melanin and hemoglobin strongly absorb visible and ultraviolet light, and show less absorption at near-infrared wavelengths.

Objective skin color measurement methods

Method Name
Konica minolta CM700d
Light source: pulsed xenon lamp with UV cut filter Illuminated area: 6mm vs 11mm depending on aperture Angle: 8* viewing angle, 2* or 10* observer angle Wavelength range: 400-700nm
Eligible ISO 17025 certification L*, a*, b* L*, c*, h* Yxy XYZ Munsell (Hue, Value, Chroma) Color differences
Zero calibration & White calibration functions Spectral reflectance: SD <0.1% Chromaticity value: SD <∆E*ab 0.04 Inter-instrument agreement: <∆E*ab 0.2 (for 8mm aperture) Short-term repeatability: higher with continuous measurements and larger aperture size
Mexameter by Courage & Khazaka
Narrow-band reflectance colorimeter
Melanin index and erythema index (arbitrary units 0-999)
Colorimeter CL 400 by Courage & Khazaka
Full visible spectrum reflectance colorimeter
Light source: 8 LEDs arranged circularly Illuminated area: ~17mm Wavelength range: 440-670nm
skin color RGB L*a*b XYZ ITA
Calibration check function Measurement error: +/- 5% Interobserver reliability: ICC 0.79-0.97 moderate/good (Van der Wal et al 2013)
Delfin SkinColorCatch
Full visible spectrum reflectance colorimeter
Light source: 3 white LEDs arranged circularly Illuminated area: 0.3cm^2 Angle: 45* to minimize gloss RGB range: 25-246 Peak wavelengths: 620/540/460nm
skin color L*a*b ITA L*c*h RGB melanin and erythema indices
Calibration check function Not affected by ambient light due to direct skin contact Optical orifice designed to minimize pressure induced skin blanching

Subjective skin color measurements

Method Name
Intended Use
Baumann skin type
Classify skin type according to 4 different components: dry or oily, sensitive or resistant, pigmented or nonpigmented and prone or tight
Fanous classification
Classify skin type into 6 categories by race and genetic origin: Nordics, Europeans, Mediterraneans, Indo-Pakistanis, Africans and Asians
Fitzpatrick skin type
Visual, self-reported
Scale of I- VI
Limited relevance and reliability among people with darker skin color which are clustered into 1 Fitzpatrick category.
6-point subjective classification system to assess the propensity of the skin to burn during phototherapy
Glogau wrinkle scale
Used to assess photoaging (rhytides and discoloration) in White individuals
Goldman World Classification of Skin Types
Visual, self-reported
Used to assess skin color in response to burning or tanning, and PIH based on race/ethnicity
Kawada Skin Classification System for Japanese Individuals
Used to describe Japanese skin types and their sensitivity to UV light, sunburn, and tanning
Lancer Ethnicity Scale
Used to account for 5 different skin types based on geography and heredity; can be used in conjunction with FST to assess for risk factors prior to treatments such as cosmetic laser surgery or chemical peels
Modified Fizpatrick skin type
Used to assess phototype, skin color and the ability to burn or tan in Indian individuals
Robert Skin Type Classification System
Visual, self-reported
Used to evaluate phototype, hyperpigmentation, photoaging and scarring to identify a patient’s skin type and provide data to predict the skin’s likely response to insult, injury or inflammation
Taylor Hyperpigmentation Scale
15 colored cards
Used to measure pigmentation using 15 uniquely colored plastic cards spanning the full range of skin hues; each card contains 10 bands of increasingly darker gradation
Von Luschan chromatic scale
36 colored tiles
Of the 36 tiles, 2 were found to have the same RGB values
Used to establish race classification by skin color using 36 opaque glass tiles which are compared to the compare the patient’s skin
Willis and Earles scale
Used to classify skin color, UV light reaction and associated pigmentary disorders in people of African descent
Color Bar Tool
Self-reported, visual
Used to determine the amount of melanin pigment in various skin types and assess an individual’s potential to get sunburn
Munsell Color Chart
Chart classifying color based on hue, value and chroma
Originally intended to be used in agriculture to identify soil colors but has been adopted to describe human skin tones
Massey- Martin Scale
11 point scale
Used to classify human skin color without respondent seeing the chart
Monk Skin Tone Scale
10-tone scale
Used to classify a broader range of skin colors and developed to improve skin tone evaluation in machine learning and artificial intelligence

When to measure skin color?

Skin color is a factor associated with many things, ranging from how optically- based technologies work (e.g. pulse oximeters, self-driving cars, wearable health technologies) to a person’s access to education, earning, employment, and health. In some cases this association is causal and in other cases it is not. While much attention is given to race, ethnicity and racism, relatively little attention is given to color and colorism. Skin color should not be conflated with race or ethnicity (Important note: race is not a biological or genetic characteristic). Although some studies may collect data on race or ethnicity as a proxy for skin color, it is critical to first question and understand the purpose of data being collected. In some cases, such as studies related to optically based technologies, it is more precise to measure skin color and not just simply collect race or ethnicity to be used as a surrogate for light or dark skin color.  As a researcher, you should articulate the purpose of each of the study variables being collected in the study design, and study publications. Be sure to state if you are using the variable as a cause or as a marker of some social exposure. If the latter, you should consider addressing the exposure more directly. For example:


  • Good: Does skin pigment interact with red or infrared light of pulse oximeters resulting in performance bias?
  • Good: Are certain conditions that have been associated with race, such as lung disease, vascular disease, environmental exposure, or smoking also linked to performance bias in pulse oximeters?
  • Not Good: Do biological differences between black and white people result in worse performance of pulse oximeters?

If you are unsure which variables are best suited to help answer your research questions or how to categorize variables, engage with broad stakeholders. It is also important to have a diverse study team. Consider inviting researchers and members from the communities you are studying to be involved in the project starting from the planning phase. Different perspectives and community insights can help refine and enhance the objectives and outcomes.

When comparing across studies or datasets, especially older data, be wary of comparing racial categories, which may have changed over time or are inherently different across settings. If the legacy data are no longer relevant to the study questions you are investigating, consider collecting new data that align with the current context. As researchers, this allows us to be responsive and adaptable to new information.

If using race in addition to skin color, consider asking how your study cohort has been racialized (i.e., assigning people to a racial group and utilizing that assignment as the basis for differential treatment). Race is a social construct and not applied in a standardized manner across settings.

  • What are examples when you should consider measuring/assessing skin color?
    • Examples when this was not done are had negative consequence
  • What are examples when you should NOT use skin color/pigment?


How can skin color be characterized?

There are several commonly used methods for characterizing skin color and skin pigment, however, there is no consensus on a standard or validated approach. Not all techniques have been used for the purposes of validating diversity of skin color in cohorts for medical device research and development. Below we discuss some of these methods including evolving evidence and regulatory guidance.

Before selecting a method, it is important to clarify the reason why you want to measure this variable, and specifically is the study interested in color, pigment, or other variables (e.g. colors of deeper structures). It is also worth noting that skin color is usually only a reflection of the outermost layers of skin, and some devices, like pulse oximeters, may be impacted by color (light absorbance/reflection) of tissues that are deeper.


Quantification of skin color for research or device development is a topic with limited consensus and evolving data. There are many methods and many caveats to each. The Project is trying to keep track and pilot several methods in laboratory and clinical trials. Our latest protocols are online and open access.

Download Protocols

Subjective methods

Numerous methods have been used for subjectively characterizing skin color, though not all of these scales were designed or validated for this purpose. Many of these methods are listed in Table 1 below. Important characteristics to consider when selecting a method include:

  1. Standardization of the scale:
  2. Usability/repeatability – impact of lighting
  3. Potential bias
  4. Validation
  5. Intention

Objective methods

Numerous methods have been used for objectively characterizing skin color, though limited consensus exists on how to optimally utilize these methods. Also of note, there is considerable heterogeneity in how these devices/methods work at a technical level as well as what data they report. Many of these methods are listed in Table 2 below. Important characteristics to consider when selecting an objective method include:

  1. Standardization and transparency of reported data: should be observer independent, quantitative. Several studies are ongoing to compare outputs from different devices.
  2. Repeatability – study subject variables may impact repeatability such as age, sex, anatomic site, skin surface properties as well as temporal factors such as sun exposure, orthostatic effect and external variables such as lighting and temperature, and application of the technology like pressure applied
  3. Calibration
  4. Usability – small areas vs larger, pressure
  5. Validation
  6. Intention – interaction with surface pigment or color, vs interaction with deeper structures (e.g. blood, tissue, bone, fat)
  7. Interpreting data generated by objective measures Some but not all metrics used by objective methods are standardized. Below is a summary of the most commonly reported data:
    1. RGB color space – this number is used to characterize color by the intensity of values of red, green, and blue color
    2. CIELAB color space – defined by Commission Internationale de l’Eclairage (CIE) is a three-dimensional color space with three axes defined further below. The L* and b* have been correlated with pigment and the a* correlated to erythema levels. The purpose of CIELAB is to be more uniform than RGB space. The rough idea is that distance in LAB space between colors provides a quantification of how perceptibly different the colors are. The distance in RGB space is thought to not be so linear, so for example, if the distance doubles the colors may not actually become twice as easy to tell apart.
      1. L* = lightness 0 (black) to 100 (white)
      2. a* = red/green on the chroma plane
      3. b* = yellow/blue on the chroma plane
    3. Individual Typology Angle (ITA) – this metric is used to classify skin pigmentation, specifically melanin, using L* and b* according to the formula below. Del Bino et al showed that ITA has a linear relationship with tissue melanin content (originally using ex-vivo samples). The ITA was introduced in 1991 (Chardon et al 1991) with 6 proposed categories of skin color (very light > 55° > light > 41° > intermediate > 28° > tan >10° > brown). However, the original study bins were based on data from only caucasian subjects and only non-sun-exposed skin. A subsequent study by Del Bino et al. in 2013 did include a more diverse population and introduced an additional ITA cut-off of <-30° for darker skin37. It is important to note, that this expanded and most current categorization of skin color by ITA is still based largely on potentially flawed categories from the 1991 study. (Leeb at al, SSRN, 2023). Of note, ITA is not skin color and does not account at all for the red/green color scale. ITA has been the most commonly evaluated and validated metric for melanin.
    4. Melanin index (MI) – this metric is intended to quantify melanin as a single number, sometimes by taking log10 of the reflectance ratio. Not all device manufacturers provide detail on the derivation of MI and thus MI may not be comparable across devices, subjects and studies.
    5. Melanin density – this metric has been reported by subtracting reflectance at 400nm from 420nm (R420-R400), in order to reduce confounding by hemoglobin which has similar absorption at both wavelengths
    6. Erythema Index (EI) – non standardized metric for quantifying amount of erythema present in skin
    7. Illuminant & observer angle – illuminant and observer angle help convert a spectrum to a set of LAB and RGB values
    8. Other proposed strategies
      –  R390 – this is the isosbestic point for oxy and deoxyhemoglobin, only penetrates at shallow depths and thus has been used to characterize epidermal melanin
      –  AU intensity curve 450-615
      –  Slope of absorbance spectrum (620-720nm)

Assessment Strategy

Both subjective and objective assessment methods can be applied in many different ways that impact the reliability of data. It matters if assessment methods are being applied to printed photos, digital photos or real-life people. Factors like lighting, printer color calibration, and monitor color calibration become significant factors.
Some assessment methods were not designed or validated to be used in multiple ways, so be sure to check how your method was intended to be used.


Agache 2017 Measuring the Skin, Chapter 6 “The Measurement of Skin Color”:
Chau et al 2019 Cutaneous Colorimetry as Gold Standard for Skin Color Measurement
Visscher 2017 Skin color and pigmentation in ethnic skin
Wilkes et al 2015 Fitzpatrick Skin Type, Individual Typology Angle, and Melanin Index in an African Population: Steps Toward Universally Applicable Skin Photosensitivity Assessments
Ware et al, Racial Limitations of Fitzpatrick skin type, Cutis, 2020