Chapter 13
Reference Values and Interpretation Strategies
1. Identify the reference set recommended for spirometry testing and the effect of ethnicity.
2. Determine the methodology used in defining the lower 5th percentile.
3. Understand the key parameter in determining airway obstruction.
1. Describe a z score in determining the lower limit of normal.
2. Define the nuances of the nonspecific pattern.
3. Understand grading the degree of obstruction in the mixed pattern.
Selecting and using reference values
Lung Volumes
Identification of normal lung volume values, specifically FRC, RV, and TLC, is more challenging because the amount of published data is limited. Lung volumes are related to body size with height being the most important variable. An additional factor that should be considered when selecting lung volumes reference sets is the testing methodology used in deriving the values (e.g., plethysmography versus dilutional methods). The ATS-ERS did not recommend a specific reference set, but a few popular authors are listed in Table 13-1.
Table 13-1
Common Reference Authors for Lung Volumes
Adult: Author | Year | Journal |
Crapo | 1982 | Bull Eur Physiopathol Respir 1982; 18:419-427 |
Goldman | 1969 | Am Rev Respir Dis 1969; 79:457-467 |
Quanjer | 1993 | Eur Respir J 1993; 6(Suppl 16):5–40 |
Stocks | 1995 | Eur Respir J 1995; 8:492–506 |
Pediatric: | ||
Quanjer | 1989 | Eur Respir J 1989; 1(Suppl 4): 184 S–261 S |
Hsu | 1979 | J Pediatr 1979; 95:14-23 |
Diffusing Capacity
The ATS-ERS did not recommend a specific set of reference equations for diffusing capacity, citing inter-laboratory variability as their reason. Published data have shown even in a well controlled clinical trial that intersession variability can range from 10%-25%. The ATS-ERS statement did recommend that predicted values for alveolar volume (VA), inspired volume (VI), and Dlco should come from the same source. Figure 13-1 demonstrates the difference in a subject between various Dlco predicted equations. A 60-year-old female of average height can have a predicted value ranging from approximately 21-27 mL/min/mm Hg, depending on the reference set selected. Thompson and others published a reference set in middle aged to older subjects (ages 45-71), which complied entirely with the 2005 ATS-ERS recommendations for testing technique and quality assurance. Their equations compared favorably with those previously published by Miller. Several common reference authors are listed in Table 13-2.
Table 13-2
Common Reference Authors for Diffusing Capacity
Adult: Author | Year | Journal |
Crapo | 1986 | Am Rev Respir Dis 1986; 134:856 |
Cotes | 1993 | Eur Respir J 1993; 6(Suppl 16):41–52 |
Knutson | 1987 | Am Rev Respir Dis 1987; 135:805-811 |
Miller | 1983 | Am Rev Respir Dis 1983; 127:270-277 |
Paoletti | 1985 | Am Rev Respir Dis 1985; 132:806-813 |
Thompson | 2008 | Thorax 2008; 63:889-893 |
Pediatric: | ||
Hsu | 1979 | J Pediatr 1979; 95:14-23 |
Nasr | 1991 | Pediatr Pulmonol 1991; 10:267-272 |
Several methods for applying reference values are used:
Establishing what is abnormal
Determining the lower limit of normal (LLN) should be done by analyzing some measure (e.g., FVC, FEV1) in healthy subjects and then determining the variability of that measurement. In clinical medicine, the 5th percentile is often defined as the LLN because it represents the segment of healthy subjects farthest below the average. Even though subjects in the 5th percentile are healthy, they are arbitrarily defined as “abnormal” for clinical purposes. Figure 13-2 depicts the predicted and the LLN for white females from ages 8–80 years (NHANES III). It is noteworthy that the statistical LLN is approximately the same across the adult age range.
Some clinicians use a fixed percentage (measured value divided by the reference value × 100) of the reference value to determine the degree of abnormality. Eighty percent (80%) is often used as the limit of normal. Unfortunately, this method leads to errors because the variability around the predicted value is relatively constant in adults. In other words, the scatter of normal values does not vary with the size of the predicted value. Figure 13-3 illustrates why using fixed percentages, such as 80% of the predicted, can lead to misclassification. In tall, young subjects 80% of the predicted is often less than the 5th percentile; using 80% as the limit can allow a patient who really does have decreased lung function (in the 5th percentile or lower) to be misclassified as normal. This situation is a false-negative result; the patient has disease but the test does not indicate abnormality. Similarly, an elderly patient who is short may have a lung function parameter that is less than 80% of predicted but well within the statistically normal range (above the 5th percentile). This short elderly subject would be misclassified as having lung disease when in fact she is within the “normal” range (i.e., a false-positive result). Using percents of predicted introduces both age and height biases. The situation is slightly different in children because the variability of lung function measures tends to change proportionately with the size of the predicted value. For this reason, percents of predicted values may be appropriate for classifying lung function in children.
A more statistically sound approach for classifying abnormality is to compute the z score or standard deviation score (SDS). If lung function varies in a normal fashion (a Gaussian or bell-shaped distribution curve; Figure 13-4), the mean ± 1.96 standard deviation (SD) defines the 95% confidence interval. Statistically, 95% of the healthy population falls within approximately 2 SD of the mean. The remaining subjects fall into either the highest or lowest 2.5% of the distribution. The z score or SDS can be calculated easily if the variability (residual standard deviation (RSD)) of the reference population is known:
< ?xml:namespace prefix = "mml" />
RSD = residual standard deviation
where 0.468 is the residual standard deviation from the reference population, the z score of −1.53 suggests that this subject is above the 5th percentile and likely has normal lung function. The advantage of z scores is that they can be used for any index that is normally distributed. Because the z score accounts for the variability occurring in healthy subjects, it tells how common, or uncommon, the finding may be in the patient being studied.
There are several areas in which the definition of lung function abnormality may have important clinical consequences. One such area is the use of a fixed ratio to define airway obstruction, as is frequently done with the FEV1/FVC (FEV1/VC). The World Health Organization’s Global Initiative for Obstructive Lung Disease (GOLD) recommends the use of 70% as a cutoff, with ratios less than this value defining the presence of airway obstruction. However, because the FEV1/FVC ratio falls with age (sex, height, and ethnicity also may play a role), using a fixed ratio may misclassify younger subjects as normal (false negative) and older subjects as obstructed (false positive) (Figure 13-5). Similarly, using fixed percentages of predicted (e.g., 80%, 50%) to categorize the severity of obstruction may misclassify subjects who are young and tall or old and short (as discussed in a preceding paragraph). These misapplications of fixed ratios and fixed percents of predicted can have serious consequences for individual patients and for large groups of subjects when research is involved. Misclassifying an elderly subject as having COPD may mean the inappropriate prescription of drugs that can have serious side effects. Using an inappropriate classification, such as an FEV1/FVC ratio of 70%, to exclude subjects from a clinical trial (because they are incorrectly classified as “obstructed”) means that otherwise healthy subjects are not exposed to the treatment or drug being evaluated.
1. Type of equipment used for the reference study: Does equipment comply with the most recent recommendations of the ATS-ERS (See Chapter 11.)
2. Methodology: Were standardized procedures used in the reference study similar to those to be used, particularly for spirometry, lung volumes, and Dlco?
3. Reference population: What were the ranges of ages of the individuals in the reference population? Were both males and females tested? Did the study generate different regressions for different ethnic origins? Did the study include smokers or other “at-risk” individuals as healthy individuals? If a specific group of subjects was studied, are the results applicable to the population in general?
4. Statistical analysis: Are lower limits of normal specifically defined (e.g., 5th percentile, 1.645 × RSD)? Are adequate measures of variability available (RSD, SEE) so that upper or lower limits of normal can be calculated along with the predicted values?
5. Conditions of the study: Was the study performed at a different altitude or under significantly different environmental conditions?
6. Published reference equations: Do reference values generated using the study’s regressions differ markedly from other published references?
Individual laboratories may wish to perform measurements on subjects who represent a healthy cross-section of the population that the laboratory usually tests. Doing this in a statistically meaningful way may require testing a large number of subjects. However, measured values from these individuals can then be compared with expected values using various reference equations. Equations that produce the smallest average differences (measured – predicted) may be preferable. Evaluation of a small number of individuals may not show much difference between equations for FVC and FEV1. However, there may be noticeable discrepancies for Dlco or maximal flows. Equations for spirometry, lung volumes, and Dlco should be taken from a single reference, if possible. If healthy individuals fall outside of the limits of normal, the laboratory should examine its test methods, how the individuals were selected, and the prediction equations being used. Table 13-3 lists “typical” normal values for pulmonary function and blood gas parameters.
Table 13-3
Typical Values for Pulmonary Function Tests
Values are for a healthy young male, 1.7 m2 body surface area.
Test | Value |
Lung Volumes |