Measurement and documentation are critical components in the process of providing patient care. Measurements (numerical or categorical assignments based on testing or measuring) form the basis for deciding intervention strategy and therefore influence patient response to therapeutic interventions.¹ Measurements are also used during treatment sessions to determine rate of progression and appropriateness of exercise prescriptions. Typically, therapists obtain a series of measures and, in combination with those made by other health care professionals, formulate a clinical hypothesis. The hypothesis includes both physical and psychosocial aspects. If parts of the hypothesis are incorrect because of inaccurate measures, interventions may be misdirected, which can result in treatment that is either not effective or unsafe. Consequently, knowledge of the qualities of measurements that relate to the cardiovascular and pulmonary systems is essential for effective patient care.

Documentation, interpretation of measurements, and the patient plan of care are also important for reimbursement and to ensure communication among health care team members. Timely and appropriate sharing of information on physiological responses to activity is often critical for optimal medical management. Documentation must be written clearly and concisely and include objective findings that will facilitate efficient and continuous care from all members of the health care team.

This chapter provides a discussion of types and characteristics of measurements that are common to cardiopulmonary physical therapy, followed by a discussion of the process of selecting and performing tests and measures, and interpreting the results. A discussion of the purposes and recommended terminology for documentation follows, including suggestions for providing objective and outcome-oriented information and supporting skilled and medically necessary physical therapy services in this population.

Characteristics of Measurements and Outcomes

The purpose of performing a measure is to assess or evaluate a characteristic or attribute of an individual. The characteristic to be measured must first be defined, and the purpose of performing each measure must be clear. Therapists can then select the most appropriate method of measuring, given the available resources and their clinical skills.

Levels or Types of Measurements

Measurements can be described according to their type or level of measurement. There are four levels of measurements: nominal, ordinal, interval, and ratio (Table 7-1).² Recognizing the level of measurement aids understanding and interpretation of the result.

Table 7-1

Examples of Commonly Used Measurements and Their Respective Level of Measurement

Patient Characteristic	Test or Other Measure	Level of Measurement
Gender	Male/female	Nominal
Range of motion	Goniometry	Ratio
Muscle strength	Manual muscle testing (MMT)	Ordinal
	Isokinetic dynamometry	Interval
Functional status	Functional independence measure	Ordinal
	Timed Up and Go (TUG)	Ratio
Angina	Angina rating scale	Ordinal
	Borg CR10	Ratio
Dyspnea	MRC scale	Ordinal
	Borg CR10	Ratio
	Visual Analog Scale	Ratio

Nominal

Objects or people are often placed in categories according to specific characteristics. If the categories have no rank or order, then the measurement is considered nominal. An example of a nominal measurement is the classification of patients with pulmonary disease into those with obstructive lung disease, restrictive lung disease, or a combination of obstructive and restrictive disease. The categories are mutually exclusive (i.e., all patients fit into one and only one category). Nominal categories are unranked, in that an individual with obstructive lung disease would not necessarily have a worse prognosis than an individual with restrictive lung disease.

The categories of a nominal measurement scale are defined using objective indicators that are universally understood. For example, the classification of patients with heart failure could be based on the primary cause for the development of the condition (Box 7-1). In each case, the cause would be determined by diagnostic testing such as angiography or echocardiography. Clear descriptions of the criteria for inclusion in each category are necessary to facilitate clinicians’ agreement on the assignment of patients to categories. A high percentage of agreement indicates high interrater reliability.

Box 7-1

Etiology of Congestive Heart Failure

Hypertension

Coronary artery disease (myocardial ischemia)

Cardiac dysrhythmias

Renal insufficiency

Cardiomyopathy

Heart valve abnormality

Pericardial effusion

Pulmonary embolism

Pulmonary hypertension

Spinal cord injury

Age-related changes

From Cahalin LP: Cardiac muscle dysfunction. In Hillegass EA, editor: Essentials of cardiopulmonary physical therapy, ed 3, St Louis, 2011, Saunders.

Ordinal

Ordinal measurements are similar to nominal measurements with the exception that the categories are ordered or ranked. The categories in an ordinal scale indicate more or less of a certain attribute. The scale for rating angina is an example of an ordinal scale (Table 7-2). Each category is defined, and a rating of grade 1 angina is less than a rating of grade 4. In an ordinal scale, the differences between consecutive ratings are not necessarily equal. The difference between grade 1 angina and grade 2 is not necessarily the same as between grade 3 and grade 4 angina. Consequently, if numbers are assigned to categories, they can be used to represent rank but cannot be subjected to mathematical operations. Averaging angina scores is incorrect because averaging assumes that there are equal intervals between categories. A group of ordinal data could be reported as a percentage of each response (i.e., 80% of clients reported exercise-induced angina as 3 before a cardiac rehabilitation program.)

Table 7-2

Angina Rating Scale

Rating	Description
1	Mild, barely noticeable
2	Moderate, bothersome
3	Moderately severe, very uncomfortable
4	Most severe or intense pain ever experienced

From American College of Sports Medicine: ACSM’s guidelines for exercise testing and prescription. Philadelphia, 2010, Lippincott Williams & Wilkins.

Categorical measurements are considered ordinal if being assigned to a specific category is considered better than or worse than being in another category. For example, patients with angina could be classified as having either stable or unstable angina. This measurement would be considered ordinal, because stable angina is considered a better condition to have compared with unstable angina.³

There are various types of validity. Of importance in clinical practice are concurrent, predictive, and prescriptive validity. Concurrent validity is when a measurement accurately reflects measurements made with an accepted standard. Comparing a measurement made with a heart rate monitor with an ECG recording is an example of determining concurrent validity. In this example, the ECG recording would be considered the gold or accepted standard. Another example is using pulse oximetry during exercise testing. Yamaya and colleagues (2002)¹⁰ compared pulse oximetry versus directly measured arterial oxygen saturation (the gold standard) and reported that a forehead sensor was more valid than a finger sensor. Measurements with predictive validity can be used to estimate the probability of occurrence of a future event. Screening tests often involve measurements that are used to predict future events. For example, identifying people with risk factors for coronary artery disease (CAD) leads to a prediction that their likelihood of developing CAD is higher than normal. Measures with prescriptive validity provide guidance to the direction of treatment. The categorical measurement of determining a person’s risk for a future coronary event is a measurement that would need to have prescriptive validity. By classifying patients into high- versus low-risk categories on the basis of results of a diagnostic exercise test, the intensity and rate of progression of treatment is determined.

Sensitivity and Specificity

Accuracy of various types of exercise tests often is described by reporting sensitivity and specificity. Sensitivity is the ability of a measurement to identify individuals who are positive, or who have the characteristic that is being measured. If a test produces a high number of false-positive results, then the sensitivity will be low. A false-positive result means that the test result was positive but the characteristic was absent. Young women often have positive stress test results but do not have coronary artery disease. The consequence of a false-positive test result could be unnecessary treatment or further diagnostic testing. Specificity is the ability of a measurement to identify individuals who are negative, or who do not have the characteristic. A high number of false-negative results would produce a low specificity. A false-negative result has a negative test result even though the disease or characteristic is present. The consequence of a false-negative test result is not receiving treatment when it is indicated. Use of the Homans’ sign to screen for deep vein thrombosis (DVT) is no longer advocated because the test lacks both sensitivity and specificity.11,12

Objective and Subjective Measurements

Measurements vary in degree of subjectivity versus objectivity. Subjective measurements are those that are affected in some way by the person obtaining the measurement (i.e., the measurer must make a judgment as to the value assigned). The assessment of a patient’s breath sounds is influenced by many factors, including the therapist’s choice of terminology for describing the findings, perception of normal breath sounds, and hearing acuity. The grading of functional skills may be influenced by the therapist’s interpretation of what constitutes minimal versus moderate assistance. Because of the influence of the person performing the measure, subjective measurements usually have lower interrater reliability compared with objective measurements.²

Objective measurements are not affected by the person performing the measure (i.e., these measures do not involve judgment of the measurer). Heart rate measured by a computerized ECG system is an example of an objective measurement. Other examples include measuring blood pressure using an intraarterial catheter and oxygen consumption using a metabolic system. Objective measurements are not necessarily accurate but usually have high interrater reliability.²

Clinical Decision Making

The Hypothesis-Oriented Algorithm for Clinicians II (HOAC-II) intertwines the examination, evaluation, and diagnosis elements into an organized algorithm for clinical decision making.¹³ Using the HOAC-II, initial data collection (e.g., data from the medical record and patient interview) during the examination allows for generation of patient-identified problems (PIPs) and an initial set of hypotheses that will guide the formulation of an examination strategy (i.e., selection and ordering of tests and measures). These initial measurements help refine the hypotheses and additional measurements are obtained to help confirm or deny the initial set of hypotheses, leading to an eventual hypothesis/physical therapy diagnosis (i.e., an idea of the underlying cause of the patient’s problem), and/or a decision to consult with other health care practitioners. For instance, early in the examination process the therapist may suspect a proximal DVT; the Wells clinical decision rule can aid in deciding whether or not to refer the patient for further testing.¹⁴

Selecting Tests and Measures

Many factors influence the therapist’s choice, including information obtained from the medical record, patient interview, and knowledge of available treatment options. Selected tests and measures should be relevant as measured by their potential ability to impact the direction of the examination and intervention. Tests and measures should be limited to those that are necessary to establish a clinical hypothesis, make decisions about appropriate interventions, and determine the effectiveness of the intervention. Therapists also must strive for efficiency and not repeat tests that have been performed by other health care professionals. Characteristics or qualities of measurements, such as reliability and validity, also influence the therapist’s decision.

Another factor that influences the selection of tests and measures is the risk-benefit ratio. How do the risks of obtaining a measurement relate to the value of the information gained? Subjecting a patient to a symptom-limited graded exercise test during the acute stage of post-myocardial infarction (MI) could provide information to formulate an exercise prescription; however, the risks of performing this procedure at this phase of the recovery period likely outweigh the benefits.

Measurements selected need to be appropriate to the specific health condition (disorder or disease), severity of the condition, and other contextual (i.e., environmental and personal) factors specific to the patient.¹⁵ Tests and measures that do not help optimize or assess patient-centered outcomes are an inefficient use of the therapist’s time and add unnecessary costs to health care.

Performing Tests and Measures

General Principles

When performing tests and measures, therapists must take care to use procedures that can be replicated for future comparisons (i.e., there must be acceptable intrarater/interrater reliability). Time must be taken to ensure that conditions are optimal and that the patient is informed of his or her part in the activity. For example, measuring blood pressure in a noisy treatment area immediately when the patient arrives for an appointment may not provide an accurate measurement of resting blood pressure. Documenting the conditions in which a measure was made is also important. Conditions may include, but are not limited to, time of day, room temperature, recent activities performed by the patient (including medication administration), and type and model of measuring device (e.g., specific treadmill, sphygmomanometer, pulse oximeter, etc.). Also, if the patient needs supplemental oxygen, it is vital to document the oxygen flow rate or fraction of inspired oxygen and the oxygen delivery device (e.g., nasal cannula, Venturi mask, etc.) for proper test interpretation. For example, a patient’s walking distance may not have improved, but if the patient used less oxygen, then a meaningful outcome was achieved.

Measures should be made with an objective and open mind (i.e., without anticipating the result of the measurement). A measure that is approached with a preconceived idea of the outcome may be affected by the therapist’s expectations. Having confidence in the results of one’s measurements is important and develops as clinical skills develop.

In clinics where more than one therapist is likely to evaluate or treat a patient, written procedures for performing measurements are necessary. Therapists also need to review the written procedures on a regular basis and practice performing the measures as a group. Practicing together is especially important for therapists who are new to the clinic. Interrater reliability for commonly used measures can be determined. If the reliability is low, written procedures may need to be revised to ensure optimal consistency of measurement. Many commonly used tests and measures, such as blood pressure and 6-minute walk test, have standardized methodology.16,17

Examination

Initial data collection from referral information or the medical record regarding the patient’s current medical stability and functional status will help guide the examination strategy. For example, tests and measures will differ for a patient with an acute MI compared to a patient who is 3 weeks post-MI. Other factors to consider include the size of the infarction and associated complications such as dysrhythmias, heart failure, or angina. Other initial data collected during the interview may also guide the examination strategy. For example, if a patient becomes anxious when discussing walking on a treadmill, then perhaps a “field test,” such as the 6-minute walk test (6-MWT), would be more prudent.

The examination begins with and is centered on the patient’s self-identified problems that may reflect specific impairments, limitations in activity, and/or restrictions in patient-identified roles at home, work, and the community. The patient’s goals for physical therapy should be reported. Documentation of examination findings should reflect the physical therapist’s clinical reasoning in determining what impairments may underlie the patient’s limitation in activity.

The history section may include information related to the current and past medical/surgical diagnoses, previous functional levels, lifestyle choices, living environment, medications, laboratory values, or diagnostic imaging results. The patient’s perception of his or her cardiac or pulmonary condition is important, as are descriptions of pain or discomfort that may be associated with either a cardiac event or a pulmonary complication. Other systems, such as the musculoskeletal, neurological, and integumentary systems are briefly examined to identify concerns that may influence the patient’s readiness or ability to participate in certain interventions.

Tests can be categorized as measuring impairments, activity limitations, or participation restrictions as defined by the World Health Organization’s International Classification of Functioning, Disability and Health (ICF) model,¹⁸ which was recently adopted by the American Physical Therapy Association (APTA).¹⁹ A body function or body structure impairment is an abnormality of physiological function or anatomic structure at the tissue, organ, or body system level.¹⁵ Examples of impairments include pain, dyspnea, decreased muscle strength or range of motion, abnormal heart rate and blood pressure values, and impaired aerobic capacity. Measurements of impairments are important because they assist therapists in deciphering the causes or reasons for limitations in activity. Activity limitations are difficulties an individual has in performing a task or action.¹⁵ Activity limitations can be attributed to physical, social, cognitive, or emotional factors. Examples of activity limitations include the inability to dress, transfer, walk long distances, or climb stairs. Improvements in activity levels usually are of primary interest to patients and families and to those who reimburse for health care. Participation restrictions are those problems that an individual may experience in life situations, such as performing a job requirement or playing a sport.¹⁵ Figure 7-1 illustrates a patient’s condition using the ICF model.

Fig. 7-1 Use of the ICF model to document a patient’s condition and demonstrate various levels that need to be measured. The diagram illustrates how a person’s health condition and body structure/function impairments affect participation in the community. Contextual factors (i.e., environmental and personal factors) interact at various levels to facilitate or hinder a person’s health condition and level of function.

When examining patients with cardiopulmonary disorders, therapists often assess responses to activity. Specific information on the activity and on the physiological responses should be included:

Mode of activity (corridor or track walking, lower extremity cycling)

Intensity, work level, or rate of activity (mph, percent grade, estimated MET (metabolic equivalent) level)

Duration of activity at each intensity level

The description of activities should be written clearly so that the workload can be reproduced. Responses to activity include changes in heart rate and rhythm, blood pressure, respiratory rate, oxygen saturation levels, and heart and breath sounds from pre-activity to either during or immediately after activity. Subjective signs of exercise intolerance, such as changes in skin color, decrease in coordination, and sweating, also need to be documented. Whether the patient used oxygen during treatment or required physical assistance should be noted. By objectively recording the activity performed and the physiological responses, therapists can estimate the patient’s activity tolerance.

Evaluation and Interpretation of Findings

The evaluation process involves using the results of the examination to make clinical judgments. Therapists state their “clinical hypothesis,” or explanation of reasons for activity limitations and/or participation restrictions. Evaluation reports provide the foundational support to confirm why the patient requires the skilled services of a physical therapist and why the services are medically necessary at this time. Examination findings may indicate the need for referral of the patient to other health care professionals or to community services.

Interpreting measurements often is challenging. Usually patients’ problems are understood not by reviewing results of a single measure but by viewing relationships between results of several measures. For example, the finding that blood pressure does not increase with activity may not be considered abnormal by itself if the activity level is low or the patient is taking a beta-receptor antagonist medication (e.g., metoprolol); however, the finding of no increase in blood pressure with signs and symptoms of exercise intolerance (e.g., shortness of breath, dizziness, fatigue, etc.) during moderate-level activity in another patient may be indicative of cardiovascular pump dysfunction.

Knowledge of what is “normal” is important to interpret measurements accurately. For some measurements, normal values are well defined. Measurements of resting blood pressure, cholesterol, and blood glucose have defined categories of normal, borderline, and elevated. For other measurements, population normative standards are not specifically defined. For example, what is the normal increase in heart rate when walking at 3.5 mph on a level surface? Values for individuals differ depending on age, medications, fitness level, and walking efficiency. Results must be interpreted by considering these factors and pathological conditions, if present. Each individual has his or her own “normal” or usual response, and variations from this value could be considered abnormal.

Interpreting measurements is similar to putting the pieces of a puzzle together to create a picture of the patient and his or her activity limitations and participation restrictions. Data are collected from several sources, including the medical record, patient interview, and physical therapy examination. Measures performed and interpreted by other health care professionals can be obtained from the medical record; these include measures such as chest radiographs, blood tests, echocardiography, angiography, and ventilation-perfusion scans. During an interview, patients report information about their current and past medical problems and especially specific patient-identified problems. It is important to be sensitive to patient’s feelings about their condition, noting their stage of emotional recovery. Detecting attitudes related to changing lifestyle habits is also important. After the interview, the therapist should have a sense of the patient as a person and begin to plan an intervention strategy for improving body structure and function impairments, and optimizing activity and participation.¹³

Measures made during the physical examination may include physiological responses to activity, breathing patterns, ventilatory capacity, and breath sounds. These measurements are integrated with results collected during the chart review and evaluated in the context of the patient-identified problems and the goals for each problem. As therapists refine their hypothesis, they develop a picture of the severity of the cardiopulmonary condition, stage of recovery, and presence of coexisting conditions. An intervention strategy is developed on the basis of the final clinical hypothesis/diagnosis. The intervention strategy is implemented, and measurements are regularly obtained during treatment sessions to reassess existing problems and goals. Because of the dynamic nature of many of the conditions that affect the cardiovascular and pulmonary systems, each treatment session can be viewed as a reassessment of the hypothesis and progress toward patient goals.

Minimal Clinically Important Difference

To help facilitate evidence-based practice related to interpreting outcome measurements, a minimal clinically important difference (MCID)—that is, “the minimal level of change required in response to an intervention before the outcome would be considered worthwhile in terms of a patient/client’s function or quality of life”²⁰—has been determined for several endurance/aerobic capacity tests and measures used by physical therapists. Examples include the 6-MWT (54 m²¹ or 10% improvement²²), modified shuttle test (40 m²³), and the Borg CR10⁴ and visual analog scale ratings of dyspnea (1 point and 10-20 mm [Table 7-3], respectively²⁴). The MCID for various tests and measures can be used for setting relevant goals, as well as interpreting progress toward outcomes.

Table 7-3

MRC Dyspnea Scale

Grade	Degree of Breathlessness Related to Activities
1	Not troubled by breathlessness except on strenuous exertion
2	Short of breath when hurrying on level ground or walking up a slight hill
3	Walks slower than contemporaries on level ground because of breathlessness, or has to stop for breath after ≈1 mile (or after 15 min) when walking at own pace
4	Stops for breath after walking about 100 yards (or after a few minutes) on level ground
5	Too breathless to leave the house, or breathless after dressing or undressing

Modified by permission from BMJ Publishing Group Limited. From Fletcher CM, Elmes PC, Wood CH: The significance of respiratory symptoms and the diagnosis of chronic bronchitis in a working population, Br Med J 1:257-266, 1959.