Chapter 4 Diagnostic pathology in clinical practice
Laboratory techniques play an important part in the diagnosis and treatment of disease in patients. Many of the tests performed in pathology laboratories are diagnostic, quantitative measurements or prognostic, but these are complemented by expert advice on the interpretation of the results. Microbiologists are also involved in formulating policies designed to prevent spread of infection in hospitals; haematologists have clinical responsibilities for treating patients with haematological malignancies and other disorders.
In this chapter the general principles of diagnostic tests, quantitative measurements and prognostic tests are given and these are then related to the specific roles of clinical chemistry, cytogenetics, cytopathology, haematology, histopathology, immunology, microbiology and autopsies.
TYPES OF LABORATORY TESTS
Diagnostic tests
Diagnostic tests are those that are made on a sample from a patient, the result allocating the case to a diagnostic grouping; an example would be a needle core biopsy of a lesion of the breast which is sent for histopathological examination and classified into a benign or malignant (i.e. cancer) category. Quantitative measurements, such as haemoglobin concentration or arterial blood oxygen tension, may be used in the clinician’s diagnostic process but they do not by themselves assign a patient to a particular diagnostic category. A diagnostic test may be based on:
The ideal diagnostic test would produce complete separation between two diagnostic categories; usually, however, there is some overlap. This problem can be illustrated by taking as an example a screening test for colorectal carcinoma which makes measurements on a sample of faeces (many attempts have been made to devise such a test using measurements of blood contained in the faeces and other parameters). An ideal diagnostic test would produce complete separation of patients with and without colorectal carcinoma (Fig. 4.1). The majority of real diagnostic tests do not provide complete separation between diagnostic categories and there is overlap (Fig. 4.2).
Fig. 4.1 Distribution graph for an ideal diagnostic test. There is complete separation of the population into those with colorectal carcinoma (shaded area) and those without. In this example a measurement of above 70 units would indicate that the subject had colorectal carcinoma and a measurement below 60 units would indicate that the subject did not have colorectal carcinoma.
Fig. 4.2 Distribution graph of a more realistic diagnostic test. In this example there is a range of values between 60 and 80 units where there are subjects with and without colorectal carcinoma.
The effectiveness of a diagnostic test can be expressed using a number of different parameters:
Test result from NCB | ||
---|---|---|
Actual outcome | Benign | Malignant |
Benign | True negative | False positive |
Malignant | False negative | True positive |
These can be combined into the following measures:
The desired values of these for a particular test will vary according to the action taken on the result. A malignant NCB result can result in a surgeon excising the breast (mastectomy), so the specificity and predictive value of a positive result must be as close to 100% as possible. In contrast, if a disease has a relatively safe, non-toxic treatment (such as a course of antibiotics) but the consequences of not detecting the disease can be fatal (e.g. bacterial meningitis), the sensitivity and predictive value of a negative result should be as high as possible. In most situations there is a direct ‘trade-off’ between sensitivity and specificity and a suitable threshold has to be set that will give the best overall performance (Fig. 4.3).
Fig. 4.3 A graph showing the effect of moving the threshold value for a test on its sensitivity and specificity. If the threshold is set at A then there are no false positives so the specificity is 100% but the sensitivity is low at about 60%. If the threshold is moved down to C there are no false negatives so the sensitivity is 100% but the rise in false positives has led to a reduction in the specificity to about 30%. At threshold B the test gives the greatest overall accuracy with three false positives and two false negatives.
In many medical situations a continuous biological spectrum is arbitrarily divided into a number of discrete categories which will always lead to some apparent misclassification but is necessary to give information on which clinicians can base their management decisions (e.g. division of intra-epithelial neoplasia of the uterine cervix into three categories, see Ch.19).
A laboratory’s performance in diagnostic tests should be monitored by a formal audit process and by use of appropriate positive and negative controls in tests.
Quantitative measurements
Many tests in pathology do not categorise results into discrete groups but give a quantitative result which is interpreted in relation to a ‘normal’ range of values. Examples of such tests include measurement of haemoglobin concentration, electrolyte concentrations, and blood oxygen and carbon dioxide levels.
The measures of performance for such tests differ from diagnostic grouping tests. In quantitative tests the accuracy of the measurement (how close the measured value is to the ‘true’ value determined by a more accurate or absolute method) and the reproducibility of the measurement (what variation there is when measuring the same sample many times) are important parameters. These can be assessed by using reference samples with ‘known’ values and putting these through the measurement system at regular intervals; most laboratories will have their own reference samples which are used frequently (internal quality assurance), and graphs of single measurement and running mean values will be used to ensure that the test is performing within expected limits and not showing ‘drift’ away from the central expected value (Fig. 4.4). Many countries also have external quality assurance schemes where reference samples are sent to all participating laboratories to ensure acceptable analytical performance.
Fig. 4.4 Internal quality assurance graph for a quantitative pathological test. A reference sample is used for each test; tests A and B lie outside the acceptable range and the process of the test would have to be investigated for sources of error (e.g. out of date reagents, contamination, etc.).
When a laboratory gives a quantitative result for a parameter that is under physiological control, a reference range is often given to facilitate interpretation of the result. If a parameter shows normal (Gaussian) distribution in the local population, the ‘normal’ range is often given as two standard deviations below the mean to two standard deviations above the mean. If a value lies outside this range then it lies outside 95% of the results for that population (Fig. 4.5) and may be regarded as abnormal, but 2.5% of the healthy population will have values lying outside the range at either end. Thus, all the details of the individual case must be considered, including other measurements, as a number of results at the top end of the ‘normal’ range could be more significant than a single result just above the ‘normal’ range. If the distribution is not Gaussian it may require normalisation by transformation, or non-parametric methods must be used.
Prognostic tests
In many tumours, assignment to a diagnostic category (e.g. adenoma or carcinoma) gives an indication of the prognosis for the individual patient, but within such groupings (e.g. colorectal carcinoma) there may be wide variation in the biological behaviour of the tumour. In order to plan appropriate treatment and to be able to give useful information and counselling to individual patients, many prognostic pathological tests have been developed.
In tumour pathology one of the most predictive prognostic tests is staging of the tumour (extent of spread), which is always assessed in the histopathological examination of specimens. One of the best examples of this is Dukes’ staging of colorectal carcinoma (Ch. 11 and Ch. 15). The histological type of tumour has important prognostic implications, particularly in some organs; subjects with papillary thyroid carcinoma have a life expectancy that is the same as for the rest of the general population without the tumour, whereas subjects with anaplastic thyroid carcinoma have a median survival of a few months. The grade of the tumour, an assessment of its degree of differentiation and proliferative activity, also has predictive value; well-differentiated tumours (closely resembling parent tissue) with few mitoses have a better prognosis.
In tumours that produce substances that enter the blood or urine (e.g. alpha-fetoprotein produced by testicular teratomas, see Ch. 20), measurement of the levels of these at the time of diagnosis may be predictive of prognosis (and can be used in follow-up). As more becomes known of the molecular abnormalities of tumours, the possibilities for specific molecular tests that will have prognostic value increase, but the translation of an apparently significant research result into a routinely used prognostic test is not straightforward. When evaluating any new prognostic test the significance for the individual patient has to be considered; a test that shows a statistically significant difference between two large groups of patients may not assign individual cases to a prognostic category with a sufficient degree of certainty to be useful in management decisions or patient information. One recently developed test that has found usage is the detection of expression of the transmembrane receptor tyrosine kinase KIT, which is defined by the CD117 antigen and is the product of the c-kitproto-oncogene in stromal tumours of the gastrointestinal tract. This can be detected by immunohistochemistry (Fig. 4.6), which, if positive, predicts that the patient’s tumour will respond to treatment with a specific tyrosine kinase inhibitor, imatinib mesylate.
SPECIALISED TESTS
Clinical chemistry
Methods in clinical chemistry detect and measure subcellular substances—usually in the blood but also in other bodily fluids and tissue:
The range of molecules measured is constantly expanding, ranging through electrolytes (such as sodium and potassium), larger inorganic molecules (urea), proteins (including many enzymes) and exogenous molecules (such as carbon monoxide and drugs):
Since many of the tests in clinical chemistry are quantitative, the laboratories have extensive programmes of internal and external quality control, and laboratory reports quote reference ranges. For many tests, ranges appropriate for the age and sex of the patient may be quoted.
As with all pathological tests the clinician with direct responsibility for the patient must decide whether a particular test is an appropriate investigation and what sample is most appropriate for that test. These considerations are especially important in clinical chemistry where large automated machines can measure a wide range of substances on a single sample and, if not used selectively, may generate non-essential data which may be difficult to interpret and lead to unnecessary further investigations.
The type of sample and the circumstances in which it is taken are also important. It is outside the scope of this chapter to give specific recommendations for individual tests but examples of inappropriate samples would be blood taken for glucose analysis shortly after a large carbohydrate-rich meal, blood taken for electrolyte analysis from a vein in an arm receiving an intravenous infusion, and blood taken for a digoxin level immediately after a dose of the drug.
The interpretation of results also requires knowledge about the substances being assayed, and the advice of a specialist clinical chemist is often useful. An example of this is the use of cardiac enzymes measured to determine whether a myocardial infarct has occurred. The enzymes lactate dehydrogenase, aspartate transaminase and creatine phosphokinase normally reside intracellularly in muscle cells; if muscle is damaged, the enzymes gain entry to the blood and elevated levels may be detected. The interpretation of these assays requires knowledge about the time course of the enzyme release and the possible sites of enzyme release. The enzymes are not released immediately when the myocytes become hypoxic because the cell membranes take some time to break down; Figure 4.7