Urine Biomarkers Predict the Cause of Glomerular Disease
Sanju A. Varghese*,
T. Brian Powell*,
Milos N. Budisavljevic*,,
Jim C. Oates*,
John R. Raymond*,,
Jonas S. Almeida and
John M. Arthur*,
* Department of Medicine, Medical University of South Carolina, and Department of Medicine, Ralph H. Johnson VA Medical Center, Charleston, South Carolina; and Department of Biostatistics and Applied Mathematics, University of Texas MD Anderson Cancer Center, Houston, Texas
Address correspondence to: Dr. John M. Arthur, Department of Medicine, Division of Nephrology, Medical University of South Carolina, 96 Jonathan Lucas Street, P.O. Box 250623, Charleston, SC 29425. Phone: 843-792-4123; Fax: 843-792-8399; E-mail: arthurj{at}musc.edu
Received for publication July 21, 2006.
Accepted for publication December 26, 2006.
Diagnosis of the type of glomerular disease that causes thenephrotic syndrome is necessary for appropriate treatment andtypically requires a renal biopsy. The goal of this study wasto identify candidate protein biomarkers to diagnose glomerulardiseases. Proteomic methods and informatic analysis were usedto identify patterns of urine proteins that are characteristicof the diseases. Urine proteins were separated by two-dimensionalelectrophoresis in 32 patients with FSGS, lupus nephritis, membranousnephropathy, or diabetic nephropathy. Protein abundances from16 patients were used to train an artificial neural networkto create a prediction algorithm. The remaining 16 patientswere used as an external validation set to test the accuracyof the prediction algorithm. In the validation set, the modelpredicted the presence of the diseases with sensitivities between75 and 86% and specificities from 92 to 67%. The probabilityof obtaining these results in the novel set by chance is 5 x108. Twenty-one gel spots were most important for thedifferentiation of the diseases. The spots were cut from thegel, and 20 were identified by mass spectrometry as charge formsof 11 plasma proteins: Orosomucoid, transferrin, -1 microglobulin,zinc -2 glycoprotein, -1 antitrypsin, complement factor B, haptoglobin,transthyretin, plasma retinol binding protein, albumin, andhemopexin. These data show that diseases that cause nephroticsyndrome change glomerular protein permeability in characteristicpatterns. The fingerprint of urine protein charge forms identifiesthe glomerular disease. The identified proteins are candidatebiomarkers that can be tested in assays that are more amenableto clinical testing.
Glomerular diseases such as diabetic nephropathy and FSGS areassociated with proteinuria that is caused by increased glomerularpermeability. Of the 100,000 patients who developed ESRD inthe United States in 2003, more than half had some form of glomerulardisease (1). The glomerulus consists of a tuft of capillariesand associated cells that are responsible for filtration ofsmall molecules while preventing the loss of larger molecules.The glomerular wall contains three layers: Endothelial cells,basement membrane, and epithelial cells. Much of the selectivityof filtration occurs in the basement membrane, where the barrierexcludes proteins on the basis of both their size and theircharge. Uncharged molecules pass through the basement membranemore readily than negatively charged proteins of a similar size(2,3). Selective permeability related to the size of the moleculeand its charge has been shown with charged dextrans (4). Thepermeability barrier is damaged in glomerular diseases thatlead to proteinuria. Because treatment is disease specific,it requires knowledge of the underlying process. A renal biopsyis needed to make a definitive diagnosis of the cause of thedisease. The utility of renal biopsy is limited by several factors.Because of comorbid conditions such as bleeding disorders andobesity, some patients are not suitable or are at higher riskfor a biopsy (5,6). Because the biopsy obtains only a smallportion of the kidney, in some cases it may not accurately portraythe disease if the affected portion of the kidney is not sampled.In other cases, the disease may be so far advanced that diagnosticfeatures are obscured. Urine testing for biomarkers could replacerenal biopsy as a simple, safe, and accurate test that couldbe repeated to follow progression of the disease and monitorresponse to therapy.
Potential urine markers for diagnosis of glomerular diseasehave been proposed, but none has been confirmed to differentiatebetween causes of glomerular diseases. Many of the proposedmarkers are urinary cytokines. Levels of monocyte chemoattractantprotein-1 (MCP-1) (7), IL-6 (8), vascular cellular adhesionmolecule-1 (9), the complement degradation product C3d (10),and urinary free light chains (11) have been proposed as markersof renal activity of lupus. Urinary concentrations of vascularendothelial growth factor (1215), IL receptor 1 antagonist(16), IL-17 (17), TNF- (18), and CD46 (19) have been proposedas markers of specific glomerular diseases. Urinary macrophages(20,21), podocytes, and associated proteins are also potentialmarkers for glomerular diseases (2224). An expressionratio of two genes that are expressed in podocytes, podocinand synaptopodin differentiated patients with FSGS from thosewith minimal-change disease (25). Although many studies havecompared levels of candidate markers between two diseases, nonehas reliably used urine protein markers to differentiate betweena group of glomerular diseases. The presence of plasma proteinsin urine of patients with nephrotic syndrome provides an opportunityfor discovery of new biomarkers. Differences in the charge ofproteins such as ferritin and horseradish peroxidase are knownto affect their permeability at the glomerulus (26,27). In nephroticsyndrome, the glomerular permeability barrier is altered, resultingin proteinuria. If changes in size and charge permeability occurindependently and are specific to the disease, then these changescould be used to predict the cause of glomerular diseases.
The pathway from biomarker discovery to clinically useful assayhas three phases. The first phase is the discovery of candidatebiomarkers. An unbiased approach that is not limited to knowncandidates is best for this stage. The false discovery ratein this phase may be high because of noise in the system, anecessarily small sample size, and introduction of systematicbias during analysis. Unlike the first stage, in which manyproteins are measured simultaneously, in the second phase, onlythe candidate biomarkers are measured. Because the number ofanalytes measured is smaller, a more reproducible, rapid andaccurate assay can be used. In this phase, some candidate markersmay be discarded because they are not useful. Finally, in thethird phase, the successful biomarkers (and the algorithm tointerpret them) can be tested in a large, novel set of patientsunder true clinical conditions.
Proteomic techniques such as two-dimensional gel electrophoresis(2DE) are well suited to the discovery phase of biomarker identification.2DE is a high-resolution separation technique that can be coupledwith protein identification by mass spectrometry. A particularstrength of 2DE for urine biomarker discovery is the abilityto visualize differences in posttranslational modificationswhen these changes alter the isoelectric point of the proteins.Posttranslational modifications affect the charge on the proteinand the ability of a given protein to pass through the glomerularpermeability barrier. Because many plasma proteins exist asglycosylation forms with different charges, they may be filtereddifferentially. We used 2DE to identify patterns of candidatebiomarkers that can differentiate from among four glomerulardisease. The candidate markers can predict the disease witha relatively high degree of sensitivity and specificity in apopulation of patients who were not used to derive the algorithm.
Sample Collection and Protein Separation
We examined urine from 32 patients with proteinuria of >3g/d. Twenty-seven of the patients, including three with diabeticnephropathy, had a renal biopsy to confirm the diagnosis. Becausepatients with typical diabetic nephropathy are not routinelybiopsied at our center, we obtained urine from five additionalpatients with a typical presentation of diabetic nephropathyand without suggestion of any other disease process. These patientsall had diabetes for at least 15 yr, diabetic retinopathy witha history of laser photocoagulation therapy, an absence of microscopichematuria, and negative serologies for hepatitis and HIV. Sampleswere collected at the Medical University of South Carolina andRalph H. Johnson VA hospitals under a protocol that was approvedby the appropriate institutional review boards. Urine sampleswere collected immediately before biopsy. Since the urine wascollected at the time of biopsy, samples were retained in thebladder for variable periods of time. The samples were centrifugedat 1000 x g for 10 min to remove cellular and particulate matterand immediately frozen at 80°C until processing.No protease inhibitors were added. These are conditions thatwe have refined in our laboratory to optimize reproducibility(unpublished data). Two milliliters of urine was injected intoa Biologic Duo Flow HPLC system (Bio-Rad, Hercules, CA), andbuffer exchange with 100 mM ammonium acetate at a flow rateof 5 ml/min was done. The sample was passed through a HiTrapDesalting column (Amersham, Uppsala, Sweden), and a fractionof 1.75 ml was collected with desalting verified by conductivitytracings. The sample was then frozen at 80°C andlyophilized. Urine protein concentration was adjusted to 100µg in 185 µl with a buffer that contained 9 M urea,4% NP-40, 0.2% 3 to 10 ampholytes, and 50 mM dithiothreitol.The samples were centrifuged at 100,000 x g for 30 min. Thesupernatant was applied to Bio-Rad IPG strips (11 cm, pH 4 to7). Strips were incubated at room temperature for 1 h. After1 h, the strips were covered with mineral oil and rehydratedfor an additional 12 h at room temperature. After focusing,strips were equilibrated sequentially in buffers that containeddithiothreitol and iodoacetamide and separated by SDS-PAGE onan 8 to 16% gradient gel using a Criterion Doceca cell (Bio-Rad).Proteins were stained with Sypro Ruby, the gels were imaged,and individual spots were aligned across the gel using PDQuest(Bio-Rad). Urine concentrations of IL-6, IL-8, and MCP-1 weremeasured with a Bio-Plex System (Bio-Rad) using Luminex xMAPtechnology according to the manufacturers instructions.
Exploratory Multivariate Statistical Analysis
To determine whether unsupervised groupings of patients on thebasis of the urine protein spots present would correlate withthe disease process that caused the nephrotic syndrome, we performedan exploratory analysis using clustering by the bottom-up approachof unsupervised simultaneous clustering of gels and spots byunweighted pair group average. The clustering was performedusing code that was written in Matlab. Clustering of patientsamples was compared with the order of sample collection, disease,race, age, and serum creatinine to determine potential sourcesof variability within the samples.
Artificial Neural Network Analysis
Protein spot intensities were ranked by intensity and expressedas quantiles as described previously (28). Four patients witheach disease were randomly selected for the training set usingthe random-number-generator function in Microsoft Excel. Theartificial neural network (ANN) algorithm was trained on urineconcentrations of IL-6, IL-8, MCP-1, and the set of ranked proteinspot abundance data from 16 patients. The data from the remaining16 patients (external validation set) were not seen by the ANNduring the training phase. An input was assigned for each ofthe four diseases, where 0 was disease absent and 1 was diseasepresent. The identification of ANN models was performed by Matlabcode that was written along the guidelines previously proposed(29), which includes bootstrapped cross-validation as an earlystop criterion and screening for optimal topology. The predictivevalue of each spot was evaluated by sensitivity analysis bydetermining, for each ith spot in each jth gel/patient, Si,j= (dOj/dIi,j) x (Ii,j/Oj). A cross-validation scheme with leaveone ninth out was used in which every ninth sample was usedfor internal validation. The median performing ANN was selectedfrom each run.
The external data set was kept completely independent from thetraining procedures with the purpose of having an unbiased assessmentof model predictability. Data from the external validation setwere analyzed using the network that was obtained with the trainingset, and an output for each disease category for each patientwas obtained. The output was a number between 0 and 1. A thresholdvalue for each disease was chosen such that values that weregreater than the threshold were disease positive and valuesthat were less than the threshold were disease negative. Becauseeach prediction of disease was independent, each patient couldbe predicted to be disease positive in zero to four diseases.The prediction was compared with the known disease for eachpatient, and each prediction was determined to be true positive,false positive, true negative, or false negative.
To determine the importance of the number of spots includedin the analysis, we sequentially removed spots from the analysis.Spots were divided into two groups: The 21 spots that providedthe most sensitivity to the analysis and the remaining 103 spots.Spots in the 103 group were ordered using a random-number generatorand removed from the analysis in groups of five spots. For eachremoval of five spots, a new ANN was trained using the trainingset. Each network was trained only once with the data for eachnumber of spots. The network was used to analyze the externaltest set. Threshold values for each test were set at the valuethat would best optimize sensitivity. After the initial 103spots were removed, the same process was done with the final21 spots. These spots were removed one at a time in the orderof increasing sensitivity as determined by the initial analysis.Total accuracy of the test was calculated as the number of correctpredictions divided by the number of tests (n = 64). Sensitivitywas calculated as the number of correct predictions from thetests of the disease that was present (true positives).
The initial evaluation of the output gave a true or false valuefor each of the four diseases but did not give a single diseaseoutput. To determine the accuracy of the test to be able topredict one disease, we designed a simple voting scheme so thata single disease output could be obtained. In patients for whomonly one of the four test results was positive, the positivetest was chosen to predict the disease. In cases in which morethan one test was positive, the ANN output value was comparedwith the threshold value. The test for which the differencebetween the threshold value and the output value was greatestwas chosen as the predicted disease.
Protein Identification
Protein spots were picked from the gels and digested with trypsinas described previously (30). Digests were concentrated usingC18 Zip Tips (Millipore, Billerica, MA). Proteins were identifiedusing a matrix-assisted laser desorption ionizationtimeof flight (MALDI-TOF-MS) mass spectrometer (MS), an ABI 4700MALDI-TOF/TOF MS, or a Finnigan LTQ linear ion trap MS as describedpreviously (31). Initial protein identification was done usingthe Mascot search engine, whereby the Mascot score is a descriptorof the quality of the match of the spectra to a protein. Identificationby peptide mass fingerprinting required a Mascot score >62.Protein coverage was calculated as the percentage of the totalnumber of amino acids in the protein that were accounted forby the predicted matches. Proteins that were identified withMascot scores <90 were confirmed by tandem MS with a totalion score of at least 80. Protein identification from the iontrap MS was done using the Turbo-SEQUEST algorithm. Criteriafor identification of peptides was XCorr >1.5 for singlycharged ions, >2.0 for doubly charged, and >2.5 for triplycharged ions. All proteins that were identified by the linearion trap had at least three peptides.
Urine samples were collected from 32 patients. Demographic andclinical information of the patients in the study is shown inTable 1. Patients with lupus nephritis were younger and morelikely to be female. Patients with diabetic nephropathy hadhigher serum creatinine values. In the group with membranousnephropathy, there was a trend toward increased proteinuriathat did not reach statistical significance. All of the patientsexcept six were on either an angiotensin-converting enzyme inhibitoror an angiotensin receptor blocker at the time urine was collected(diabetes six of eight; lupus nephritis eight of 11; membranousnephropathy four of five; FSGS eight of eight). Levels of thethree urinary cytokines (IL-6, IL-8, and MCP-1) were not differentbetween groups.
Table 1. Characteristics of patients in the disease groupsa
Urine proteins were separated by 2D gel electrophoresis (Figure 1),and the abundance of the protein spots was determined. Proteinabundance was compared between groups to identify differencesin protein expression. No differences in protein expressionof single spots that could differentiate all four diseases werefound. To identify aggregate variation in the samples and todetermine whether clinical diagnosis or specific clinical anddemographic characteristics contributed to the overall variability,we performed a double-cluster analysis of the gels and spots.To identify associations with the variability, we compared samplecollection order, diagnosis, race, age, and serum creatininewith the position in the cluster analysis (Figure 2). No correlationswere observed, demonstrating that none of these factors wasthe major influence on variability within the samples. Becauseanalysis of individual spot abundances was unable to identifybiomarkers that could differentiate the diseases, we used ANNto find differences in patterns of spots and to identify markers.Particular attention was given to data analysis safeguards againstoverfitting in the test set. The external validation set (16patients) was selected once and was analyzed once, with no inclusionin any internal cross-validation procedure. Data about fourpatients from each of the four groups were used to train theself-configurable ANN to predict the disease. The external validationset was used to test the accuracy of its predictions. The thresholdwas set as the value that minimized false discovery rate. Theoutput value from the ANN for the external validation set wascompared with the threshold value, and the prediction was determined.The output values and predictions for the validation set areshown in Table 2. Each row shows a patients disease andthe output value for the test for each of the four diseases.The true disease is in the column on the left side of the table,and the output value for diagnosis of each of the diseases isshown in the remaining columns. The output value for each ofthe four diseases was compared with the threshold value at thebottom of each column to obtain the prediction of disease positiveor disease negative. False predictions are shown in bold. TheInternational Society of Nephrology/Renal Pathology Societydisease classification is shown for patients with lupus nephritis.The area under the receiver operating characteristics curvefor the test data was found to be 0.69 for FSGS, 0.84 for lupusnephritis, and 0.73 for diabetic nephropathy. The area underthe receiver operating characteristics curve for membranousnephropathy is not reported because there was only one casein the test set. Sixty-four predictions were made (16 patientsfor each of four diseases), 11 of which were incorrect (accuracy83%). Two incorrect predictions each were made for FSGS, systemiclupus erythematosus, and diabetic nephropathy and five for membranousnephropathy. Sensitivity of the assay in patients in the externaltest set ranged from 75 to 86%, and specificity ranged from92 to 67% (Figure 3).
Figure 1. Proteins that are necessary for diagnosis of the cause of the nephrotic syndrome. Representative gel from a patient with the nephrotic syndrome. Proteins were separated in two dimensions by isoelectric point and molecular weight. The 21 numbered spots provided the most sensitivity to the analysis of the cause of the glomerular disease by artificial neural network (ANN). Numbers correspond to protein identifications in Table 3.
Figure 2. Unsupervised cluster analysis of protein expression in patients with glomerular diseases. Patterns of clustering did not occur on the basis of collection order, disease, race, age, or serum creatinine of patients. The colored boxes represent disease, race, age, and serum creatinine values. The creatinine values are color coded for those above or below the median value. Numbers in the line above the disease represent the sequential order in which the samples were collected.
Figure 3. Sensitivity and specificity of biomarkers to predict four glomerular diseases. Calculations were made for patients in the set of patients who were not used to train the ANN. Sensitivity for membranous nephropathy is not reported because only one patient was tested. The legend shows the number of true positives/patients with the disease for the sensitivity bars and the number of true negatives/number of patients without the disease for specificity bars.
To determine the importance of individual spots to the analysis,we sequentially removed spot values from the data set and retrainedthe network. As in the initial analysis, the network was alwaystrained with the training set and the results were observedfrom the test set that was not used to train the network. Thenumber of correct diagnoses was determined, and accuracy andsensitivity of the assay were determined for each number ofspots. The results are shown in Figure 4. Total accuracy ofthe test began to decline when fewer than 50 spots were includedand decreased more rapidly toward 50% (chance) when fewer than20 spots were included. Sensitivity of the assay was maintainedat >70% until fewer than five spots were included.
Figure 4. Relationship of number of spots included in the analysis to the sensitivity and total accuracy for the assay. An ANN was trained for each sequential removal of spots from the data set. Sensitivity was calculated as the percentage of true-positive diagnoses from 16. Total accuracy was calculated as the percentage of correct test from all 64 possible.
Because the tests provided only a true/false prediction forthe presence of each disease and did not give a final predictionof which single disease was present, we used the differencebetween threshold values and the ANN output to predict whichof the four diseases was present. From the 16 patients in thetest set, eight had only the expected test greater than thresholdand three had only tests for diseases that were not presentgreater than threshold. The remaining five had tests for twodiseases that were greater than threshold (positive), and thevoting scheme was used. Two of the five votes predicted thecorrect disease, and three predicted the incorrect disease.The final predicted disease is shown in the last column of Table 1.Of the 16 patients, the test correctly predicted the singledisease present in 10. It is interesting that the test was alwayscorrect when it predicted lupus nephritis (four of four) andusually correct for FSGS (three of four) and diabetic nephropathy(two of three) but poor for prediction of membranous nephropathy(one of five).
We determined from the ANN the amount of sensitivity that eachspot provided to the analysis. The top 11 inputs for each diseasewere chosen for identification. These spots included all inputsthat provided >2.5% of the sensitivity for any diagnosis.None of the cytokine inputs was among the top 11 sensitivities.Twenty-one protein spots accounted for all of the inputs inthe top 11 because several spots were among the top sensitivitiesfor more than one disease. These spots are shown on the gelin Figure 1. For determination of the identity of the proteins,spots were cut from the gel, digested with trypsin, and analyzedby mass spectrometry. We identified 20 of the 21 proteins. Table 3shows the protein identification and the molecular weight andisoelectric point of the protein spot on the gel. Mascot scoresare shown for proteins that were identified by peptide massfingerprinting and total ions scores for proteins that wereidentified by MALDI-TOF MS/MS. Three proteins were identifiedby electrospray MS/MS on a linear ion trap instrument (LTQ).It is interesting that all of the identified proteins are plasmaproteins and many are present as multiple charge forms on thegel. Many of these proteins are known to contain glycosylationposttranslational modifications that produce variable chargesdepending on the specific glycosylation present, including zinc-2 glycoprotein, -1 antitrypsin, haptoglobin, transferrin, albumin,and -1 microglobulin. Two of the spots are protein fragmentsbecause they are present at a lower molecular weight on thegel than the predicted size of the protein (6210 [albumin] and7209 [orosomucoid]).
Finally, we examined the differences in mean quantile valuesbetween the diseases to determine whether differences in individualproteins could be diagnostic themselves. As expected from thedata shown in Figure 4, large differences in individual spotabundance between diseases were not seen. Quantile values arethe ranked spot abundances expressed as a number between 0 and1. The smallest mean differences in abundance was seen for spot0506 (orosomucoid), for which the mean value for all four diseaseswas within 0.05 of each other, to spot 8908 (transferrin), forwhich the maximum and minimum differed by 0.35.
We identified a set of protein spots that differentiate betweenpatients with one of four common renal glomerular diseases witha relatively high degree of sensitivity and specificity. Theseproteins are candidate biomarkers because they have not yetbeen tested in a more reproducible assay on a larger set ofpatients. Nevertheless, the markers were confirmed in an independentset of patients in whom they showed an impressive level of accuracydespite the variability present in 2DE. The analytic abilityof the candidate biomarkers in this relatively small set ofpatients with heterogeneity within diseases demonstrates thatthere is a strong diagnostic signal in these markers.
Several studies have used proteomics to characterize candidateurine markers for glomerular diseases. Capillary electrophoresiscoupled to mass spectrometry has been used to identify candidatepolypeptides in diabetic nephropathy (3234) and otherglomerular diseases (3538). These studies have definedpatterns of polypeptides that are associated with glomerulardiseases but have not yet confirmed the validity of the patternsin an independent set. Two-dimensional gel electrophoresis hasbeen used to define a reference map of proteins that are differentbetween healthy subjects and those with IgA nephropathy butdid not characterize differences between diseases (39). Ourprevious study in lupus nephritis and a study from Thongboonkerdet al. (40) are the most similar to this study. We identifieda set of urine that can differentiate between classes of lupusnephritis (31). We did not confirm the results in a new setof patients however. Thongboonkerd et al. (40) used 2DE to identifya growth hormone and a protein that they were unable to identifyas differentially expressed in FSGS compared with healthy subjectsand those with lupus nephritis or diabetic nephropathy. Theydid not characterize the diagnostic ability of these proteinsor confirm the finding in an independent set. This study isthe first to find markers for glomerular diseases, identifythe proteins, and confirm that the pattern of candidate markersis valid in an independent set.
In this study, a single marker was not sufficient to distinguishfrom among the diseases. In fact, unsupervised clustering, asshown in Figure 2, did not segregate the diseases, demonstratingthat the relationship between the diagnostic markers is complex.Only when we used the ANN analysis were we able to differentiatethe diseases from each other. An important concern when usingANN for analysis is the concern of overfitting (29). Overfittingoccurs when the network is trained to identify not just thesignal but also the noise within the signal. We have taken severalapproaches to avoid this. The ANN that we used has been writtenand implemented by one of us (J.S.A.) and includes bootstrappedcross-validation as an early stop criterion and screening foroptimal topology to minimize overfitting. More important, wehave tested the algorithm using a set of patients who were notused to train the network, and patients were placed into thetraining and test sets randomly. Despite these safeguards, itis possible that a systematic factor is introduced by collectionor some other phenomenon that is associated with the samplesfrom patients with a disease but not with the disease itself.To test whether we could find such a factor, we performed theunsupervised clustering analysis that is shown in Figure 2.There do not seem to be any factors that are associated witha given disease. The final support for the relevance of theseproteins is that there is a physiologic rationale for why theymay be biomarkers for glomerular diseases.
The proteins that we identified as diagnostic markers are plasmaproteins that are filtered at the glomerulus. It is interestingthat many proteins were identified from multiple spots at differentisoelectric points on the gel. It is possible that these chargeforms represent modifications of the proteins as they transitthe tubule because this is known to occur. However, these proteinsalso exist in plasma in multiple charge forms, whereby the differencesin isoelectric point are related to variable glycosylation ofthe proteins (41). This suggests that the reason that urineproteins can predict specific glomerular diseases is that thereare differences in the relationship between the glomerular sizeand charge permeability barriers that are specific to a givendisease. Independent changes for size and charge permeabilityin glomerulonephritis have previously been reported (42,43).We recently showed that charge forms of plasma proteins in theurine can differentiate between classes of lupus nephritis (31).Changes in permeability have previously been reported in a singledisease but have not been shown to have diagnostic specificityamong a group of diseases. This study demonstrates two importantadvantages of 2D gel electrophoresis as a biomarker discoverytool. First, not only can it discover spots that are candidatebiomarkers, but also the proteins that make up the spots canbe identified, so the proteins can be used for the developmentof diagnostic assays. Second, it can differentiate between posttranslationallymodified proteins when they result in differences in the isoelectricpoint.
The assay that we used performs four tests for each patient:A true or false test for each of the four diseases. When a largernumber of patients are tested in a higher throughput test, adecision tree analysis could be used to predict the single diseasepresent. However, the approach that we used allows diagnosisof more than one disease simultaneously. We identified amongpatients with nephrotic syndrome diagnostic markers that correctlyidentified the presence or absence of the disease in 53 of 64tests because each of the 16 patients had an independent determinationof the presence of each of the four diseases. Although the accuracyis not yet as good as it needs to be for a diagnostic assay,the probability of obtaining a result this good or better ina novel set of patients by chance is 5 x 108. This isthe probability of any combination of 53 correct results of64: {[64!/(53! x 11!)]/2 [64] = 4.03 x 108}. By addingto that the probability of higher numbers of correct resultsthat diminish and become smaller (8.2 x 109, 1.5 x 109,2.3 x 1010, etc), one obtains the cumulative probabilityof a result this good or better. These results demonstrate thepower of this combination of techniques and analysis to uncovera diagnostic signal behind the noise that is produced by biologicand technical variability. They suggest a high probability ofsuccess when the candidate markers are assayed with a more reproducibleassay. Even more exciting is that the assay was done in a groupof patients with a variety of coexisting diseases and heterogeneitywithin the diseases; for instance, among the patients with lupuswere patients with three International Society of Nephrology/RenalPathology Society classes of lupus nephritis (Table 1). Thisdemonstrates that the assay was able to identify patterns thatare common to the disease despite the heterogeneity. The accuracyof the test is likely to increase when the candidate markersare tested using a more reproducible assay. In addition to the64 true or false answers for the individual diseases, we useda voting scheme to predict a single disease. Although this approachis premature, until a better test is developed using the candidatemarkers, the correct identification of 10 of 16 diseases demonstratesthe promise of the approach. A potential shortcoming of thestudy was the inclusion of five patients among the eight patientswith diabetic nephropathy who did not have renal biopsies. Weincluded these patients because they have a high probabilityof having diabetic nephropathy because they had a typical presentation,long history of diabetes, and the presence of diabetic retinopathy.The presence of diabetic retinopathy strongly suggests the presenceof diabetic nephropathy among patients with proteinuria. Ina study of nephrotic syndrome among patients with diabetes anddiabetic retinopathy, all of the patients with retinopathy haddiabetic nephropathy (44). We cannot rule out the possibility,however, that the patients had a second renal disease in additionto diabetic nephropathy. Despite this question, this study hasidentified candidate markers that can identify the patientswho have diabetic nephropathy (with or without an additionaldisease). In the next phase of biomarker discovery, the biomarkerscan be tested in a more facile format for their ability to differentiatediabetic nephropathy from diabetic nephropathy plus anotherdisease.
These findings not only have potential significance for understandingthe pathophysiology of glomerular changes in specific diseasesbut also are a promising way to diagnose specific glomerulardiseases without a renal biopsy. The candidate biomarkers thatwe identified can be used to develop a multiplexed assay thatcan identify the glomerular disease in patients with the nephroticsyndrome. Addition of markers that can differentiate minimal-changedisease from the other diseases included here will be importantfor a clinically meaningful assay. Confirmation in larger setsof patients and development of a useful clinical assay willrequire development of tests that can differentiate betweencharge forms of a protein. The test will use protein-bindingmolecules such as antibodies. An assay that is based on theseproteins should have greater accuracy because of the increasedsensitivity, specificity, and dynamic range of antibodies andthe larger number of patients who could be used to train analgorithm. The development of this assay presents a challengebecause many of the proteins are identical but the charge formis different. Potential approaches to development of this assaywill include use of antibodies that recognize the specific modificationof the protein, the combination of an antibody that recognizesall forms of the protein with a substance such as a lectin thatrecognizes a specific glycosylation, or the combination of anantibody with a chip that contains features with affinitiesfor many different modifications of proteins.
Support for this project came from the Medical University ofSouth Carolina (MUSC) General Clinical Research Center (RR01070);Department of Veterans Affairs; and grants from Dialysis Clinics,Inc., and the National Institutes of Health (R21 AR051719).The informatics pipeline for data analysis was developed withfederal funds as part of the National Heart, Lung, and BloodInstitute Proteomics Initiative, National Institutes of Health,under contract N01-HV-28181.
Tandem mass spectrometry was done in the MUSC mass spectrometryfacility with the assistance of Dr. Kevin Schey and JenniferBethard. We are grateful to Tim Taylor and the MUSC nephrologyfellows for help with collection of samples and identificationof patients. The Bio-Plex was purchased with funds from theInflammatory Mediators of Glomerular Diseases Research EnhancementAward Program from the Department of Veterans Affairs.
Footnotes
Published online ahead of print. Publication date availableat www.jasn.org.
US Renal Data System:
USRDS 2004 Annual Data Report: Atlas of End-Stage Renal Disease in the United States, Bethesda, National Institutes of Health, National Institute of Diabetes and Digestive and Kidney Diseases, 2004
Brenner BM, Hostetter TH, Humes HD: Glomerular permselectivity: Barrier function based on discrimination of molecular size and charge.
Am J Physiol 234
: F455
F460, 1978
Brenner BM, Hostetter TH, Humes HD: Molecular basis of proteinuria of glomerular origin.
N Engl J Med 298
: 826
833, 1978[Medline]
Caulfield JP, Farquhar MG: The permeability of glomerular capillaries to graded dextrans. Identification of the basement membrane as the primary filtration barrier.
J Cell Biol 63
: 883
903, 1974[Abstract/Free Full Text]
Eiro M, Katoh T, Watanabe T: Risk factors for bleeding complications in percutaneous renal biopsy.
Clin Exp Nephrol 9
: 40
45, 2005[CrossRef][Medline]
Stiles KP, Yuan CM, Chung EM, Lyon RD, Lane JD, Abbott KC: Renal biopsy in high-risk patients with medical diseases of the kidney.
Am J Kidney Dis 36
: 419
433, 2000[Medline]
Rovin BH, Song H, Birmingham DJ, Hebert LA, Yu CY, Nagaraja HN: Urine chemokines as biomarkers of human systemic lupus erythematosus activity.
J Am Soc Nephrol 16
: 467
473, 2005[Abstract/Free Full Text]
Peterson E, Robertson AD, Emlen W: Serum and urinary interleukin-6 in systemic lupus erythematosus.
Lupus 5
: 571
575, 1996[Abstract/Free Full Text]
Molad Y, Miroshnik E, Sulkes J, Pitlik S, Weinberger A, Monselise Y: Urinary soluble VCAM-1 in systemic lupus erythematosus: A clinical marker for monitoring disease activity and damage.
Clin Exp Rheumatol 20
: 403
406, 2002[Medline]
Negi VS, Aggarwal A, Dayal R, Naik S, Misra R: Complement degradation product C3d in urine: Marker of lupus nephritis.
J Rheumatol 27
: 380
383, 2000[Medline]
Hopper JE, Sequeira W, Martellotto J, Papagiannes E, Perna L, Skosey JL: Clinical relapse in systemic lupus erythematosus: Correlation with antecedent elevation of urinary free light-chain immunoglobulin.
J Clin Immunol 9
: 338
350, 1989[CrossRef][Medline]
Honkanen E, von Willebrand E, Koskinen P, Teppo AM, Tornroth T, Ruutu M, Gronhagen-Riska C: Decreased expression of vascular endothelial growth factor in idiopathic membranous glomerulonephritis: Relationships to clinical course.
Am J Kidney Dis 42
: 1139
1148, 2003[CrossRef][Medline]
Honkanen EO, Teppo AM, Gronhagen-Riska C: Decreased urinary excretion of vascular endothelial growth factor in idiopathic membranous glomerulonephritis.
Kidney Int 57
: 2343
2349, 2000[CrossRef][Medline]
Matsumoto K, Kanmatsuse K: Elevated vascular endothelial growth factor levels in the urine of patients with minimal-change nephrotic syndrome.
Clin Nephrol 55
: 269
274, 2001[Medline]
Cha DR, Kang YS, Han SY, Jee YH, Han KH, Han JY, Kim YS, Kim NH: Vascular endothelial growth factor is increased during early stage of diabetic nephropathy in type II diabetic rats.
J Endocrinol 183
: 183
194, 2004[Abstract/Free Full Text]
Rauta V, Teppo AM, Tornroth T, Honkanen E, Gronhagen-Riska C: Lower urinary-interleukin-1 receptor-antagonist excretion in IgA nephropathy than in Henoch-Schonlein nephritis.
Nephrol Dial Transplant 18
: 1785
1791, 2003[Abstract/Free Full Text]
Matsumoto K, Kanmatsuse K: Increased urinary excretion of interleukin-17 in nephrotic patients.
Nephron 91
: 243
249, 2002[CrossRef][Medline]
Kalantarinia K, Awad AS, Siragy HM: Urinary and renal interstitial concentrations of TNF-alpha increase prior to the rise in albuminuria in diabetic rats.
Kidney Int 64
: 1208
1213, 2003[CrossRef][Medline]
Shoji T, Nakanishi I, Kunitou K, Tsubakihara Y, Hirooka Y, Kishi Y, Hatanaka M, Matsumoto M, Toyoshima K, Seya T: Urine levels of CD46 (membrane cofactor protein) are increased in patients with glomerular diseases.
Clin Immunol 95
: 163
169, 2000[CrossRef][Medline]
Hotta O, Yusa N, Kitamura H, Taguma Y: Urinary macrophages as activity markers of renal injury.
Clin Chim Acta 297
: 123
133, 2000[CrossRef][Medline]
Hotta O, Yusa N, Ooyama M, Unno K, Furuta T, Taguma Y: Detection of urinary macrophages expressing the CD16 (Fc gamma RIII) molecule: A novel marker of acute inflammatory glomerular injury.
Kidney Int 55
: 1927
1934, 1999[CrossRef][Medline]
Hara M, Yanagihara T, Takada T, Itoh M, Matsuno M, Yamamoto T, Kihara I: Urinary excretion of podocytes reflects disease activity in children with glomerulonephritis.
Am J Nephrol 18
: 35
41, 1998[CrossRef][Medline]
Vogelmann SU, Nelson WJ, Myers BD, Lemley KV: Urinary excretion of viable podocytes in health and renal disease.
Am J Physiol Renal Physiol 285
: F40
F48, 2003[Abstract/Free Full Text]
Hara M, Yanagihara T, Itoh M, Matsuno M, Kihara I: Immunohistochemical and urinary markers of podocyte injury.
Pediatr Nephrol 12
: 43
48, 1998[CrossRef][Medline]
Schmid H, Henger A, Cohen CD, Frach K, Grone HJ, Schlondorff D, Kretzler M: Gene expression profiles of podocyte-associated molecules as diagnostic markers in acquired proteinuric diseases.
J Am Soc Nephrol 14
: 2958
2966, 2003[Abstract/Free Full Text]
Rennke HG, Patel Y, Venkatachalam MA: Glomerular filtration of proteins: Clearance of anionic, neutral, and cationic horseradish peroxidase in the rat.
Kidney Int 13
: 278
288, 1978[Medline]
Rennke HG, Venkatachalam MA: Glomerular permeability: In vivo tracer studies with polyanionic and polycationic ferritins.
Kidney Int 11
: 44
53, 1977[Medline]
Almeida JS, Stanislaus R, Krug EL, Arthur JM: Normalization and analysis of residual variation in 2D gel electrophoresis for quantitative differential proteomics.
Proteomics 5
: 1242
1249, 2005[CrossRef][Medline]
Almeida JS: Predictive non-linear modeling of complex data by artificial neural networks.
Curr Opin Biotechnol 13
: 72
76, 2002[CrossRef][Medline]
Lefler DM, Pafford RG, Black NA, Raymond JR, Arthur JM: Identification of proteins in slow continuous ultrafiltrate by reversed-phase chromatography and proteomics.
J Proteome Res 3
: 1254
1260, 2004[CrossRef][Medline]
Oates JC, Varghese SA, Bland AM, Taylor TP, Self SE, Stanislaus R, Almeida JS, Arthur JM: Prediction of urinary protein markers in lupus nephritis.
Kidney Int 65
: 2588
2592, 2005
Rossing K, Mischak H, Parving HH, Christensen PK, Walden M, Hillmann M, Kaiser T: Impact of diabetic nephropathy and angiotensin II receptor blockade on urinary polypeptide patterns.
Kidney Int 68
: 193
205, 2005[CrossRef][Medline]
Mischak H, Kaiser T, Walden M, Hillmann M, Wittke S, Herrmann A, Knueppel S, Haller H, Fliser D: Proteomic analysis for the assessment of diabetic renal damage in humans.
Clin Sci (Lond) 107
: 485
495, 2004[Medline]
Meier M, Kaiser T, Herrmann A, Knueppel S, Hillmann M, Koester P, Danne T, Haller H, Fliser D, Mischak H: Identification of urinary protein pattern in type 1 diabetic adolescents with early diabetic nephropathy by a novel combined proteome analysis.
J Diabetes Complications 19
: 223
232, 2005[CrossRef][Medline]
Haubitz M, Wittke S, Weissinger EM, Walden M, Rupprecht HD, Floege J, Haller H, Mischak H: Urine protein patterns can serve as diagnostic tools in patients with IgA nephropathy.
Kidney Int 67
: 2313
2320, 2005[CrossRef][Medline]
Weissinger EM, Wittke S, Kaiser T, Haller H, Bartel S, Krebs R, Golovko I, Rupprecht HD, Haubitz M, Hecker H, Mischak H, Fliser D: Proteomic patterns established with capillary electrophoresis and mass spectrometry for diagnostic purposes.
Kidney Int 65
: 2426
2434, 2004[CrossRef][Medline]
Wittke S, Mischak H, Walden M, Kolch W, Radler T, Wiedemann K: Discovery of biomarkers in human urine and cerebrospinal fluid by capillary electrophoresis coupled to mass spectrometry: Towards new diagnostic and therapeutic approaches.
Electrophoresis 26
: 1476
1487, 2005[CrossRef][Medline]
Wittke S, Fliser D, Haubitz M, Bartel S, Krebs R, Hausadel F, Hillmann M, Golovko I, Koester P, Haller H, Kaiser T, Mischak H, Weissinger EM: Determination of peptides and proteins in human urine with capillary electrophoresis-mass spectrometry, a suitable tool for the establishment of new diagnostic markers.
J Chromatogr A 1013
: 173
181, 2003
Park MR, Wang EH, Jin DC, Cha JH, Lee KH, Yang CW, Kang CS, Choi YJ: Establishment of a 2-D human urinary proteomic map in IgA nephropathy.
Proteomics 6
: 1066
1076, 2006[CrossRef][Medline]
Thongboonkerd V, Klein JB, Jevans AW, McLeish KR: Urinary proteomics and biomarker discovery for glomerular diseases.
Contrib Nephrol 141
: 292
307, 2004[Medline]
Wilson NL, Schulz BL, Karlsson NG, Packer NH: Sequential analysis of N- and O-linked glycosylation of 2D-PAGE separated glycoproteins.
J Proteome Res 1
: 521
529, 2002[CrossRef][Medline]
Machii R, Sakatume M, Kubota R, Kobayashi S, Gejyo F, Shiba K: Examination of the molecular diversity of alpha1 antitrypsin in urine: Deficit of an alpha1 globulin fraction on cellulose acetate membrane electrophoresis.
J Clin Lab Anal 19
: 16
21, 2005[CrossRef][Medline]
Myers BD, Okarma TB, Friedman S, Bridges C, Ross J, Asseff S, Deen WM: Mechanisms of proteinuria in human glomerulonephritis.
J Clin Invest 70
: 732
746, 1982[Medline]
Parving HH, Gall MA, Skott P, Jorgensen HE, Lokkegaard H, Jorgensen F, Nielsen B, Larsen S: Prevalence and causes of albuminuria in non-insulin-dependent diabetic patients.
Kidney Int 41
: 758
762, 1992[Medline]
Related Article
Cumulative Excretion of Urinary Podocytes Reflects Disease Progression in IgA Nephropathy and Schönlein-Henoch Purpura Nephritis
Masanori Hara, Toshio Yanagihara, and Itaru Kihara
Clin. J. Am. Soc. Nephrol. 2007 2: 231-238.
[Abstract][Full Text][PDF]
This article has been cited by other articles:
B. H. Rovin and X. Zhang Biomarkers for Lupus Nephritis: The Quest Continues
Clin. J. Am. Soc. Nephrol.,
November 1, 2009;
4(11):
1858 - 1865.
[Abstract][Full Text][PDF]
K. Bramham, H.D. Mistry, L. Poston, L.C. Chappell, and A.J. Thompson The non-invasive biopsy--will urinary proteomics make the renal tissue biopsy redundant?
QJM,
August 1, 2009;
102(8):
523 - 538.
[Abstract][Full Text][PDF]
A. A. Soler-Garcia, D. Johnson, Y. Hathout, and P. E. Ray Iron-Related Proteins: Candidate Urine Biomarkers in Childhood HIV-Associated Renal Diseases
Clin. J. Am. Soc. Nephrol.,
April 1, 2009;
4(4):
763 - 771.
[Abstract][Full Text][PDF]
W. G. Miller, D. E. Bruns, G. L. Hortin, S. Sandberg, K. M. Aakre, M. J. McQueen, Y. Itoh, J. C. Lieske, D. W. Seccombe, G. Jones, et al. Current Issues in Measurement and Reporting of Urinary Albumin Excretion
Clin. Chem.,
January 1, 2009;
55(1):
24 - 38.
[Abstract][Full Text][PDF]
N. Tangri, D. Ansell, and D. Naimark Predicting technique survival in peritoneal dialysis patients: comparing artificial neural networks and logistic regression
Nephrol. Dial. Transplant.,
September 1, 2008;
23(9):
2972 - 2981.
[Abstract][Full Text][PDF]
Md. I. Hassan, A. Waheed, S. Yadav, T. P. Singh, and F. Ahmad Zinc {alpha}2-Glycoprotein: A Multidisciplinary Protein
Mol. Cancer Res.,
June 1, 2008;
6(6):
892 - 906.
[Abstract][Full Text][PDF]
H.-A. Shui, T.-H. Huang, S.-M. Ka, P.-H. Chen, Y.-F. Lin, and A. Chen Urinary proteome and potential biomarkers associated with serial pathogenesis steps of focal segmental glomerulosclerosis
Nephrol. Dial. Transplant.,
January 1, 2008;
23(1):
176 - 185.
[Abstract][Full Text][PDF]