Abstract
ABSTRACT. The ability of the Modification of Renal Disease (MDRD) equation to predict GFR when compared with multiple other prediction equations in healthy subjects without known kidney disease was analyzed. Between May 1995 and December 2001, a total of 117 healthy individuals underwent 125I-iothalamate or 99mTc-diethylenetriamine-pentaacetic acid (DTPA) renal studies as part of a routine kidney donor evaluation at either Brigham and Women’s Hospital or Boston Children’s Hospital. On chart review, 100 individuals had sufficient data for analysis. The MDRD 1, MDRD 2 (simplified MDRD equation), Cockcroft-Gault (CG), Cockcroft-Gault corrected for GFR (CG-GFR), and other equations were tested. The median absolute difference in ml/min per 1.73 m2 between calculated and measured GFR was 28.7 for MDRD 1, 18.5 for MDRD 2, 33.1 for CG, and 28.6 for CG-GFR in the 125I-iothalamate group and was 31.1 for MDRD 1, 38.2 for MDRD 2, 22.0 for CG, and 31.1 for CG-GFR in the 99mTc-DTPA group. Bias was −0.5, −3.3, 25.6, and 5.0 for MDRD 1, MDRD 2, CG, and CG-GFR, respectively, in subjects who received 125I-iothalamate and −33.2, −36.5, 6.0, and −15.0 for MDRD 1, MDRD 2, CG, and CG-GFR, respectively, in those who received 99mTc-DTPA studies. Precision testing, as measured by linear regression, yielded R2 values of 0.04 for CG, 0.05 for CG-GFR, 0.15 for MDRD 1, and 0.14 for MDRD in those who underwent 125I-iothalamate studies and 0.18 for CG, 0.21 for CG-GFR, 0.40 for MDRD 1, and 0.38 for MDRD 2 for those who underwent 99mTc-DTPA studies. The MDRD equations were more accurate within 30 and 50% of the measured GFR compared with the CG and CG-GFR equations. When compared with the CG equation, the MDRD equations are more precise and more accurate for predicting GFR in healthy adults. The MDRD equations, however, consistently underestimate GFR, whereas the CG equations consistently overestimate measured GFR in people with normal renal function. In potential kidney donors, prediction equations may not be sufficient for estimating GFR; radioisotope studies may be needed for a better assessment of GFR. Further studies are needed to derive and assess GFR prediction equations in people with normal or mildly impaired renal function. E-mail: jlin11@partners.org
A noninvasive and accurate estimation of GFR is one of the holy grails of nephrology. Not only are prediction equations crucial for estimating GFR or creatinine clearance (CrCl) in the clinical research setting where only a single blood test is available, but also the new K/DOQI guidelines recommend estimating GFR by the Modification of Renal Disease (MDRD) or Cockcroft-Gault (CG) equations (1). Because GFR determinations by inulin or radioisotope studies on large numbers of patients are impractical, cumbersome, and expensive, clinicians also rely on GFR prediction equations on a daily basis.
In 1999, a new prediction equation derived from 1628 subjects with renal insufficiency enrolled in the MDRD study was published (2). This equation has been subsequently evaluated in (1) blacks with chronic kidney disease (CKD) (3), (2) scleroderma patients (4), (3) potential renal transplant donors with CrCl <80 ml/min by two 24-h urine collections (5), (4) patients with kidney disease and normal serum creatinine (SCr) levels (≤1.5 mg/dl) (6), and (5) 46 normal and 46 individuals with type 1 diabetes without evidence of nephropathy (7). In 2000, a simplified MDRD equation (MDRD 2) was published in abstract form that used SCr as the only serum assay (8). To our knowledge, this is the only published study that tests the MDRD 2 equation in people without renal disease compared with actual GFR measurements.
Materials and Methods
Between May 1995 and December 2001, a total of 117 healthy adults underwent 125I-iothalamate or 99mTc-diethylenetriamine-pentaacetic acid (DTPA) renal clearance studies as part of a routine work-up for potential kidney donation at either Brigham and Women’s Hospital (BWH) or Boston Children’s Hospital (Boston, MA). On chart review, a total of 100 subjects had sufficient clinical and laboratory data to estimate GFR, including age, gender, race, weight, height, and SCr. Those with missing data for blood urea nitrogen (BUN; n = 4) and serum albumin (n = 10) were given the default values of 15 mg/dl and 4.0 g/dl, respectively; the rationale is that these two variables together contribute <1% to the observed variance of the calculations (A.S. Levey, personal communication). Of the 100 subjects included in the study, 55 underwent 125I-iothalamate studies and 45 underwent 99mTc-DTPA.
Subjects who underwent 125I-iothalamate studies at BWH were asked to fast for at least 8 h and given a water load of 10 ml/kg and 5 drops of potassium iodide diluted in 15 ml of water orally (to block thyroid uptake of 125I-iothalamate) at the initiation of the study. Thirty-five micro-Curies of 125I-iothalamate was injected subcutaneously into the upper arm. Blood was drawn and urine was sampled at time 0 (before 125I-iothalamate injection) and at 60, 120, and 180 min. Total urine volume and urinary flow rates were assessed every 60 min. Oral fluid hydration was administered at 500 ml/h as tolerated. GFR measurements for two timed urine collections were averaged and standardized for a body surface area (BSA) of 1.73 m2. Intra-assay coefficients of variation (CV) were ≤10% for 40 of the 55 iothalamate subjects and ≥10% for the remaining 15 subjects. Forty-six subjects had CV of ≤15%. For those with CV ≥15%, the recorded minimum urinary flow rates ranged from 1.1 to 2.8 ml/min. The acceptable minimum urinary flow rate in the original protocol was 3 ml/min; if this was not achieved at time 60 min, then urine was collected at time 90 min and urinary flow rate was calculated at this time.
Individuals who underwent 99mTc-DTPA studies were hydrated orally or intravenously at 10 ml/kg per h for 30 min before study initiation. 99mTc-DTPA dosed at 50 μCi/kg was injected intravenously, and blood was sampled at 120, 180, and 240 min. Three GFR measurements were averaged and standardized for a BSA of 1.73 m2. No information on intra-assay CV was available for 99mTc-DTPA studies.
Forty-one of 45 subjects who had 99mTc-DTPA studies had their SCr levels measured from their primary care provider’s office at a wide variety of laboratories; the remaining four patients had SCr assayed at the BWH laboratory. For subjects who underwent 125I-iothalamate testing, 45 of 55 (82%) SCr levels were assayed at the BWH, and the remainder were obtained through different outside laboratories. The BWH laboratory used alkaline picrate reactions with a DAX96 (Bayer) machine to measure SCr through January 31, 2000, and an Olympus 640/2700 machine from February 1, 2000, to the present. The reference normal values for SCr were 0.8 to 1.6 mg/dl in men and 0.8 to 1.3 mg/dl in women from 1992 through September 1999. This changed to 0.7 to 1.3 mg/dl for both men and women starting in September 1999. The laboratory reports that the CV of the assay is 3% for measurements in the normal range.
The prediction equations that we used are listed as follows:
Cockcroft-Gault (CG) (9): CrCl × BSA/1.73 m2
a. For men: CrCl = [(140 − Age) × Weight (kg)]/SCr × 72
b. For women: CrCl = ([(140 − Age) × Weight (kg)]/SCr × 72) × 0.85
i
CG-GFR estimate: GFR = 0.84 × CrCl by Equation (1)
MDRD 1 (2): GFR = 170 × [SCr]−0.999 × [Age]−0.176 × [0.762 if patient is female] × [1.18 if patient is black] × [BUN]−0.170 × [Alb]0.318
MDRD 2 (8): GFR = 186 × [SCr]−1.154 × [Age]−0.203 × [0.742 if patient is female] × [1.212 if patient is black]
Jellife 1 (× BSA/1.73 m2) (10)
a. For men: (98 − [0.8 × (age − 20)])/SCr
b. For women: (98 − [0.8 × (age − 20)])/SCr × 0.90
Jellife 2 (11)
a. For men: (100/SCr) − 12
b. For women: (80/SCr) − 7
Mawer (12)
a. For men: weight × [29.3 − (0.203 × age)] × [1 − (0.03 × SCr)]
(14.4 × SCr) × (70/weight)
b. For women: weight × [25.3 − (0.175 × age)] × [1 − (0.03 × SCr)]
(14.4 × SCr) × (70/weight)
Bjornsson (13)
a. For men: [27 − (0.173 × age)] × weight × 0/SCr
b. For women: [25 − (0.175 × age)] × weight × 0.07/SCr
Gates (14)
a. For men: (89.4 × SCr−1.2) + (55 − age) × (0.447 × SCr−1.1)
b. For women: (89.4 × SCr−1.2) + (55 − age) × (0.447 × SCr−1.1)
Salazar-Corcoran (15)
a. For men: [137 − age] × [(0.285 × weight) + (12.1 × height2)]/(51 × SCr)
b. For women: [146 − age] × [(0.287 × weight) + (9.74 × height2)]/(60 × SCr)
Fisher’s exact test was used for proportions and t test for comparison of means. A Pearson correlation coefficient was also calculated for the normally distributed GFR data (Shapiro-Wilk test for normality, P = 0.98).
The mean and median absolute differences were calculated from absolute difference = predicted value − measured value. The % absolute difference was calculated as % absolute difference = predicted value − measured value × 100 measured value.
Bias, a measure of systematic error, was defined by the mean prediction error (ME):
where pei = predicted value − true value and n = sample size.
The R2 statistic was derived by simple linear regression (PROC GLM) and reflects the predictive ability of the model. P = 0.05 was considered significant.
Refitting of the log-linear MDRD equation to our data set was performed with the following code using PROC REG in SAS:
For MDRD 1: Model ln(GFR) = ln(SCr) + ln(AGE) + ln(BUN) + ln(albumin) + race + gender
For MDRD 2: Model ln(GFR) = ln(SCr) + ln(AGE) + race + gender
SAS for Windows version 8.0 (Cary, NC) was used for all statistical calculations. Data collection by chart review was approved by the BWH Institutional Review Board.
Results
Clinical and laboratory characteristics of this study population are summarized in Table 1. The mean measured GFR was 112.8 ml/min per 1.73 m2 (range, 70.0 to 169.0 ml/min). When data from subjects who received 125I-iothalamate were compared with those who received 99mTc-DTPA, statistically significant differences were found in distribution of black race, mean weight, albumin, and measured GFR (Table 1).
Table 1. Patient characteristicsa
Performance assessment for several CrCl and GFR equations are presented in Table 2. Mean absolute difference ranged from 26.0 to 49.6, bias from −18.4 to 28.7, median absolute difference from 19.0 to 35.4, median % absolute difference from 17 to 30%, precision from 0.01 to 0.08, and Pearson correlations from 0.14 to 0.28. The percentage of calculated values within 30% of the measured value ranged from 50 to 76%, whereas the percentage within 50% of measured values ranged from 72 to 96%. Results of CG compared with CG-GFR and MDRD 1 compared with MDRD 2 were very similar.
Table 2. Mean calculated CrCl or GFR, mean absolute difference, bias, precision, and accuracy of GFR prediction equations compared with measured GFRa
When the predicted values were compared with measured values stratified by the type of study (125I-iothalamate or 99mTc-DTPA), there was marked improvement in the MDRD predictions (Table 3). For those who received 125I-iothalamate studies, the mean, median, and median % absolute differences did not change greatly, but the bias, precision, Pearson correlation, and accuracy improved with both MDRD equations. For those who received 99mTc-DTPA studies, the mean, median, and median % absolute differences were increased in the MDRD 2 equation, whereas bias, precision, and Pearson correlation increased for both MDRD 1 and 2 equations and accuracy remained approximately the same (Table 3) when compared with the results for the combined 125I-iothalamate and 99mTc-DTPA studies (Table 2).
Table 3. Analysis of CG versus MDRD prediction equations stratified by type of GFR study (125I-Iothalamate or 99mTc-DTPA)
Because there were many overweight people who might have had a supraphysiologic calculated CrCl by CG, we also restricted the analyses to those with body mass index ≤30 (n = 73). No improvements were noted in any of the parameters examined in any of the equations (data not shown).
Refitting of the MDRD equation parameter estimates to our data set resulted in the following prediction equations:
MDRD 1: GFR = 278 × [SCr]−0.107 × [Age]−0.169 × [0.94 if patient is female] × [0.91 if patient is black] × [BUN]−0.089 × [Alb]−0.028 (R2 = 0.11)
MDRD 2: GFR = 214 × [SCr]−0.113 × [Age]−0.174 × [0.96 if patient is female] × [0.92 if patient is black] (R2 = 0.08)
Discussion
The MDRD and CG equations are the most widely recommended and used formulas for assessment of renal clearance; therefore, this discussion focuses primarily on these two equations. On the basis of a comparison of these findings to those in the published literature (Table 4), our first observation is that the MDRD equations perform much more poorly in subjects without kidney disease than in those with chronic kidney disease. This is not surprising since the MDRD equation was derived in MDRD study participants who were selected as having moderate to severe renal failure (measured 125I-iothalamate GFR mean was 39.8 ml/min per 1.73 m2) (2).
Table 4. Comparison of CG and MDRD equations in previously published and current studiesa
Second, the MDRD prediction equations seem to systematically underestimate GFR, as indicated by the high negative bias (Table 3), especially as measured by the 99mTc-DTPA method. Although the 99mTc-DTPA group had a significantly higher mean weight, they also had significantly higher GFR measurements; therefore, these differences between the two groups are unlikely to explain the greater negative bias.
Third, we have confirmed that the simplified MDRD 2 equation loses very little predictive ability when compared with MDRD 1. Because serum albumin and BUN measurements may not be readily available, especially in the research setting, MDRD 2 is the prediction equation of choice over MDRD 1 in these situations. Likewise, estimating GFR from the CG CrCl calculation does not appreciably improve the predictive ability of the CG equation.
Last, 125I-iothalamate and 99mTc-DTPA techniques do not seem to be highly correlated with each other because the degree of bias, precision, linear correlation, and accuracy varied notably when the results were pooled (Table 2) rather than stratified by type of GFR testing (Table 3). The decreased precision observed when all 100 subjects were pooled for analysis may reflect the interassay variability that was conferred by using two different measurements of GFR for reference. However, high correlations of both 125I-iothalamate and 99mTc-DTPA to simultaneous measures of inulin clearance have been reported (r > 0.90) (16), although 125I-iothalamate measurements exceeded inulin measurements by 14.6 to 25.9 ml/min per 1.73 m2 (16) and 99mTc-DTPA overestimated inulin clearance by 3.5 to 13.5 ml/min per 1.73 m2 for (16,17⇓). Therefore, analyses of prediction equations must take into account the type of GFR measurement, and caution should be used when comparing results of such investigations using diverse GFR measurement techniques. In this study, other factors that may have influenced the different correlations between 125I-iothalamate and 99mTc-DTPA include the higher numbers of blacks and lower mean weights in the 125I-iothalamate group.
The MDRD equations had the least bias and highest accuracy when compared with 125I-iothalamate measurements. This is not surprising considering that these equations were derived using 125I-iothalamate GFR measurements. The CG equation is understandably poorer, because it was derived by CrCl calculated by 24-h urinary creatinine collections as the “gold standard” in 249 adults ages 18 to 92 with mean SCr 0.99 to 1.78 mg/dl (9). In this original study, 96% of study subjects were male, and no information on race was given; this raises the issue of the generalizability of the CG prediction equations. The use of 24-h urine collection as the “gold standard” was also suboptimal because multiple investigations have reported the inaccuracies of using 24-h urine collections to measure CrCl, usually from under- or overcollection by the subject (5,18,19⇓⇓). In fact, Coresh et al. (20) concluded that there was no advantage in 24-h urine collection over the CG estimations when compared with GFR determined by 125I-iothalamate clearance.
Hsu et al. (21) reported finding systematic differences in SCr levels measured at BWH among those measured during 1997 and during 1998 when compared with values measured before 1997; therefore, the investigators adjusted their analysis by adding 0.1 mg/dl to SCr values measured in 1997 and 0.3 mg/dl to values measured in 1998. In our data set, 35 of the 45 SCr assayed at BWH in the 125I-iothalamate group were performed in 1998 or later, whereas all four of the SCr assayed at BWH in the 99mTc-DTPA group were performed after 1998. Because the vast majority of 125I-iothalamate patients were assayed between 1998 and 2000 and because there is no information (either published or provided by the BWH laboratory) of how SCr assays after 1998 are related to those performed before 1998, we did not attempt to adjust SCr and recalculate the prediction equations, although this is another likely source of measurement error.
The issue of calibration of SCr laboratory measurement is especially critical when estimating GFR in subjects with normal or mildly impaired renal function because small changes in SCr result in large changes in calculated CrCl and GFR. For example, a 40-yr-old black woman who weighs 70 kg and has a SCr of 1.0 mg/dl in steady state has a calculated CrCl of 82.6 ml/min and a calculated GFR of 78.3 ml/min per 1.73 m2; an increase of SCr by 0.1 mg/dl to 1.1 mg/dl results in a calculated CrCl of 75.1 ml/min and a calculated GFR of 71.2 ml/min per 1.73 m2. In contrast, the same patient with a SCr of 3.1 mg/dl in steady state would have an estimated CrCl of 27.5 ml/min and an estimated GFR of 26.1 ml/min per 1.73 m2; an increase in SCr to 3.2 mg/d results in very small changes in estimates to 26.7 ml/min and 25.3 ml/min per 1.73 m2 for CrCl and GFR, respectively. There is no standard calibration for SCr measurements for laboratories in the United States (22), and investigators have reported that SCr on the same stored serum samples were 0.23 mg/dl higher when assayed by the laboratory used by Third National Health and Nutrition Examination Survey study when compared with the values obtained by the laboratory used by the MDRD study, a magnitude of difference that was common across different laboratories (18).
Other potential sources of measurement error include (1) intraindividual variability in SCr; (2) intraindividual variability in other clinical or laboratory measurements; (3) intraindividual variability in GFR; and (4) intra-assay variability in GFR measurement. Despite the 125I-iothalamate protocol requirements for minimum urinary flow rates of 3 ml/min, this was not achieved in all nine subjects who had CV ≥15%. Intra-assay GFR measurement errors are likely minimized in those who underwent 125I-iothalamate testing with CV ≤10%; however, in those with high CV and those who underwent 99mTc-DTPA studies without reported intra-assay CV, we acknowledge that the questionable quality of some of the GFR measurements may not make it a true and reliable “gold standard.”
In refitting the MDRD coefficients to our data, we found that the intercept terms are different for both MDRD 1 and MDRD 2, suggesting that, as expected, our study population varies considerably from the original cohort in which these equations were derived. Furthermore, the regression coefficients for all laboratory assays (Scr, BUN, and alb) are also considerably different, likely reflecting lack of calibration in a central laboratory and measurement error, whereas the regression coefficients for age and gender are similar. The coefficient for black race is in the opposite direction of those in the MDRD equations but are based on only 10 blacks. Using the MDRD predictors of GFR resulted in models that explain only 8 to 10% of the variance in our data set of normal subjects, which is not surprising in light of the relatively small sample size of 100 subjects and the various sources of measurement error as previously discussed, especially those involving SCr calibration.
Our results are consistent with other published investigations in the literature (Table 4). A study by Bostom et al. (6) of 109 patients with known kidney disease and SCr ≤1.5 mg/dl reported an R2 of 0.31 for MDRD1, 0.29 for MDRD 2, and 0.17 for CG (Table 4). The Bostom study differed from this current study on several points: it included patients with renal disease, there was a narrower range of weights among the subjects, the majority of subjects had some degree of proteinuria, and iohexol was used to determine GFR (Table 4). Despite the “normal serum creatinine levels” (range, 1.0 to 1.3 mg/dl), measured GFR by iohexol was as low as 18 ml/min per 1.73 m2 (25th to 75th percentile range was 88 to 138 ml/min per 1.73 m2). In addition, 25% had iohexol GFR <80 ml/min per 1.73 m2, and 59% had glomerular disease, whereas our cohort were healthy, normal adults.
Another recent study of 46 healthy adults and 46 individuals who had type 1 diabetes without evidence of nephropathy and underwent inulin clearance studies found median absolute differences of 10.7 versus 9.0, respectively, when comparing MDRD 1 and CG equations in healthy subjects; the CG equation was corrected to reflect GFR by either a multiplication factor of 0.9 and the formula Y = −0.004 + 1.54 (Table 4) (7). No analyses of bias, precision, or accuracy were presented (7). On the basis of the higher median absolute differences in the MDRD 1 equation compared with the CG equation for all subjects, the authors concluded that the MDRD 1 equation underestimated GFR, especially in women with diabetes, and that the MDRD 1 equation was less “accurate” than the CG equation. Whereas we also found that the MDRD 1 equation consistently underestimated GFR measurements (most notably in those who underwent 99mTc-DTPA), we observed that the MDRD equations were less biased, more precise, and more accurate than the CG equation.
Several limitations of this study should be noted. First, very few black individuals were included, so the findings may not be generalizable to this group. Second, these analyses were performed on a relatively small sample size with wide variability in clinical and laboratory parameters. Our study participants, however, are likely representative of the general population with normal or mildly decreased kidney function. Third, SCr measurements were not performed at the same time in the same laboratory and were not calibrated with a standardized measurement, although the vast majority (82%) of those who underwent 125I-iothalamate studies had SCr measured in the same BWH laboratory with a low reported CV. Despite these limitations, however, this is the first study to evaluate rigorously these GFR formulas in healthy individuals without chronic kidney disease. Moreover, the lack of calibration of SCr across different laboratories continues to be a reality that physicians face daily in clinical practice.
In addition to the inaccuracies of the formulas themselves, this study underscores how multiple sources of measurement error (including intra-assay SCr variability, intraindividual Scr variability, lack of calibration of Scr assays across different laboratories, intra-assay GFR variability, intraindividual GFR variability, and measurement error of other variables in the prediction equations) can affect the precision and accuracy of renal clearance prediction equations. In the future, researchers and clinicians would greatly benefit from a study of a large and racially diverse cohort of people with normal or mildly impaired kidney function that would allow investigators to derive an improved prediction equation for estimating GFR.
Acknowledgments
This work was supported in part by the DCI Paul Teschan Research Fund.
- © 2003 American Society of Nephrology