You will find elsewhere in these pages of JASN1 a paper by Argyropoulos et al. entitled “Considerations in the Statistical Analysis of Hemodialysis Patient Survival.” The claim of this work—a comparative statistical analysis—is that the authors have proven their a priori hypothesis that the natural history ofdialysis patients proceeds in an accelerating fashion. As such, they say their chosen statistical method, the accelerated failure time model (AFTM), should be used when evaluating survival on dialysis. The evidence and its interpretation, however, are weak, and both should be considered critically and viewed skeptically. I have several reasons for this view.
Contrary to the authors' assertions, there is widespread agreement that dialysis dose is important to the survival of dialysis patients. The National Cooperative Dialysis Study2 proved many years ago that small molecule–directed dialysis is important. Later analyses using those data, but ignoring length of dialysis (t),3 suggested that a Kt/V of 0.9 was a suitable initial threshold. The Hemodialysis Study4 used higher Kt/V values, but even so, there remains the implication that women had worse survival in the low than the high Kt/V arm.4,5 More to the point, there is widespread transnational agreement that dose is important.5–9 Although some might argue about the best formula for describing dose, opinion converges on a Kt/V ratio of 1.2 to 1.4 per session.6–9
Furthermore, the authors' database is quite small1—only 491 patients evaluated—and thus insufficient to prove their point. Nearly 24% of the parent sample lacked a dose measurement any time during their first 3 mo of treatment (Table 1, last row, of Argyropoulos et al.)—Kt/V was the primary target here. This fact suggests either poor patient care or poor data management. The latter possibility is far more likely than the first and raises serious concerns about the quality of the data. Furthermore, patients were followed for up to 9 yr—median follow-up of approximately 4 yr—and it is difficult to imagine, let alone prove, that Kt/V does not change during those interval years. No follow-up dose measurements are included in the paper, although they must have been collected by the dialysis facilities given current regulatory requirements. Those data would allow evaluation of changing dose for use in routine, time-varying, proportional hazards models10,11 or to prove that dose did not change. The current data are inadequate for either purpose.
Next, some of the analyses are suspect. Supplementary Tables 1 and 2 from the study of Argyropoulos et al. show that six AFTMs were evaluated. Kt/V is significantly associated with survival in only two (P = 0.04; last rows). Such “multiple dipping” is usually frowned on, because they lead to claims of new discovery when none exist in statistical fact. Here is a simple example: we have a 1 in 20 chance of randomly drawing a green ball mixed in a bowl with 19 red balls (P = 0.05). The chance of drawing red is 0.95. If one dips into the bowl three times, the chance of drawing all red is 0.953 = 0.86. This means the chance of drawing at least one green ball is 0.14—greater than the magic P value of <0.05 to which we are accustomed. In other words, the chance of encountering a type I error, declaring significance when none exists, becomes large unless one accounts for such multiple sampling. That was not done here. Indeed, one can estimate the chances of finding at least one “nonsignificant” significant P value in six tries like this are approximately 26%. Multiple testing in search of P values to support hypotheses is an unwelcome approach.
One might argue that each of these six tries used a different statistical distribution. Following that line, one would say the complex pathobiology of dialysis patients not only follows an accelerated failure time model but also is constrained to a particular statistical distribution, that is to say, a Log-Normal but not a Weibull distribution, a Log-Logistic distribution, and so forth because Kt/V is not associated with survival for those distributions (Supplementary Tables 1 and 2 from the study of Argyropoulos et al.). That argument may seem reasonable to a statistician but dubious to a clinical nephrologist.
The most favorable AFTM (lowest P value associated with Kt/V) was selected for comparison with the Cox model (Table 2 from the study of Argyropoulos et al.).1 The usual requirement for Cox models—proportionality of hazard over time—is not met here as stated by the authors and shown in Supplementary Table 3 from the study of Argyropoulos et al. Perhaps this is because the hazards of dose are not consistently proportional over time as the authors suggest. Or perhaps Kt/V changes with time in patients so it was not the same in year 4, for example, as during those first 3 mo. That dynamic is clinically quite likely. Time-varying Cox models11 can easily manage both possibilities, as well as obviate the proportionality concern10 raised by these authors.1 Unfortunately, these data are insufficient to the purpose because, as noted earlier, follow-up Kt/V data are not evaluated. The lack of adequate data simply cannot drive conclusions about pathobiology or the relative value of AFTMs versus Cox models.
I have constrained my observations thus far by assuming that Kt/V, and this algebraic estimate of it, is an optimum survival-associated expression of hemodialysis dose. However, Kt/V may be a suboptimum expression of hemodialysis dose,12 in part because of its compound nature, dividing one measure associated with survival (Kt) by another (body size or V). The proposals of the study of Argyropoulos et al.1 become doubly problematic if that argument holds and is deemed true.
Finally, this new paper1 is an excellent subject for academic journal club review. There are many reasons. For example, the urea kinetics paradigm (Kt/V) is not grounded in a model for the pathophysiology of uremia that depends on cumulative exposure to the uremic milieu claimed here as a premise.1 To the contrary, urea kinetics was conceived as a method simply to control blood concentrations of substances in dialysis patients assuming that “the control of blood concentrations of various toxic or inhibiting substances to some maximum (peak) value is required for adequate treatment (emphasis added).”13 There is nothing in the derivation of the urea kinetic equations, either there13 or elsewhere,2–4 suggesting a cumulative or additive damage model for uremic pathobiology.
In summary, it seems that the stated premises, the statistical methods, and the analyses themselves are constructed to support a particular preconception about the pathobiology of dialysis patients. Perhaps the authors are constrained by the lack of follow-up data on dialysis dose or did not seek to later collect the information from medical records to complete their existing database. However, those possible dynamics aside, although it is reasonable to experiment with AFTMs,14 this information that is based only on two P values of P = 0.04 can not yet be used to inform changes in either current concepts about patient physiology on dialysis or the statistical methods used by the renal community.
Disclosures
None.
Footnotes
-
Published online ahead of print. Publication date available at www.jasn.org.
-
See related article, “Considerations in the Statistical Analysis of Hemodialysis Patient Survival,” on pages 2034–2043.
- Copyright © 2009 by the American Society of Nephrology