Genome-Wide Association Scan for Survival on Dialysis in African-Americans with Type 2 Diabetes
Murea M.ᵃ · Lu L.ᵇ · Ma L.ᶜ · Hicks P.J.ᵈ · Divers J.ᵇ · McDonough C.W.ᵈ · Langefeld C.D.ᵇ · Bowden D.W.ᵃ,ᵈ · Freedman B.I.ᵃ
Departments of ᵃInternal Medicine (Section on Nephrology), ᵇBiostatistical Sciences, ᶜInternal Medicine (Section on Endocrinology) and ᵈBiochemistry, Wake Forest University Baptist Medical Center, Winston-Salem, N.C., USA
Background: African-Americans (AAs) with diabetes have high incidence rates of end-stage renal disease (ESRD) with associated high mortality. Genetic factors modulating the risk of mortality on dialysis are poorly understood.
Methods: A genome-wide association study was performed in 610 AAs with type 2 diabetes (T2D) and ESRD on dialysis, using the Affymetrix 6.0 platform (868,155 SNPs). Time to death was assessed using a Cox proportional hazards model adjusting for ancestry and other confounding variables. Cases were censored at kidney transplant or (if living) at study conclusion.
Results: Mean follow-up was 5.4 ± 3.5 years; 434 deaths were recorded. Five SNPs were associated with time to death at p < 1.00 × 10⁻⁶: rs2681019 (HR = 2.58, P_REC = 8.00 × 10⁻⁸), rs815815 in CALM2 (HR = 1.51, P_ADD = 6.50 × 10⁻⁷), rs926392 (HR = 2.37, P_REC = 4.80 × 10⁻⁷), and rs926391 (HR = 2.30, P_REC = 7.30 × 10⁻⁷) near DHX35, and rs11128347 in PDZRN3 (HR = 0.57, P_ADD = 6.00 × 10⁻⁷). Other SNPs had nominal associations with time to death (p < 1.00 × 10⁻⁵).
Conclusion: Genetic variation may modify the risk of death on dialysis. SNPs in proximity to genes regulating vascular extracellular matrix, cardiac ventricular repolarization, and smoking cessation are associated with dialysis survival in AAs with T2D. These results warrant replication in other cohorts and races.
Copyright © 2011 S. Karger AG, Basel
Despite advances in technique and medications, alarmingly high mortality rates continue to be observed in the dialysis population, particularly in patients with diabetes (http://www.usrds.org; last accessed September, 2010). Factors associated with accelerated mortality in the dialysis population include traditional cardiovascular disease risk factors as well as non-traditional inflammatory markers. Cardiovascular disease accounts for the largest proportion of deaths, related in part to pre-dialysis co-morbidity from diabetes, hypertension, and metabolic syndrome. Derangements in mineral metabolism, anemia and hemoglobin variability, malnutrition, BMI and fat composition, residual renal function, dialysis access type, biocompatibility of dialysis membranes, intensity of pre-dialysis care, and frailty are independent determinants of life expectancy on dialysis.
Striking racial differences exist in the incidence and prevalence rates of end-stage renal disease (ESRD) and in mortality rates on dialysis (http://www.usrds.org; last accessed September, 2010). Compared with Caucasians, African-Americans (AAs) have a markedly increased risk of ESRD, yet manifest a paradoxical lower risk of death on dialysis [3,4]. The survival advantage in AAs with kidney failure stands in stark contrast to the higher prevalence of negative prognosticators of survival and the higher mortality rate observed in the general (non-renal disease) AA population [3,5].
Genetic factors have been demonstrated to underlie part of the risk for ESRD in AAs, and may also play a role in survival after initiation of dialysis. Several studies have explored genetic determinants for racial differences in atherosclerosis and heart failure mortality [7,8,9]. However, no such analysis has been performed in dialysis patients. In this study, an unbiased genome-wide association scan was performed to detect genetic variants associated with survival on dialysis in AAs with type 2 diabetes (T2D). Using genome-wide data from a local AA cohort with T2D, coupled with the dialysis survival span, we sought to detect genetic variants associated with survival on dialysis that may provide novel insights into the molecular mechanisms that underlie death in patients on dialysis and set the stage for future functional studies.
Research Design and Methods
Self-reported AAs initiating renal replacement therapy between February 1985 and September 2009 in eight ESRD Network 6 facilities comprised the study cohort. Participants were enrolled in a study of genetic factors in AAs with type 2 diabetic ESRD, and had ESRD clinically attributed to diabetic nephropathy on the Centers for Medicare and Medicaid Services 2728 form. T2D was diagnosed with age at diabetes onset >30 years and/or use of oral hypoglycemic agents (not insulin alone) 1 year after onset. T2D-associated ESRD was diagnosed in the presence of either: (1) T2D preceding ESRD by >5 years, in the absence of other causes of nephropathy, (2) diabetic retinopathy, or (3) dipstick proteinuria ≧100 mg/dl (or 500 mg/day). Exclusion criteria included non-AA race and evidence of recovery of renal function or renal transplant within 6 months of initiating dialysis. Baseline measures of serum albumin and serum hemoglobin were obtained at the start of dialysis and BMI was computed. Proportions, mean values, and standard deviations were calculated for baseline variables.
The primary end point was death from any cause. Survival time on dialysis was computed as the time between the start of dialysis and the date of death. Cases lost to follow-up or undergoing kidney transplantation were censored at their final follow-up date or transplant date, respectively. Follow-up in surviving participants ended on January 1, 2010. Data involving deaths were collected by means of primary cause of death documentation in the Death Notification Form (form 2746) required for every death occurring in the US ESRD population. Death events were categorized into 5 broad groups. Deaths coded as any of the following were defined as ‘cardiovascular’: atherosclerotic heart disease, cardiac arrest, cardiac arrhythmia, cardiomyopathy, congestive heart failure, acute myocardial infarction, valvular heart disease, ischemic stroke, or ischemic brain damage/anoxic encephalopathy due to cardiac event. Deaths coded as any of the following causes were classified as ‘infectious’: abdominal infection, perforated bowel, diverticulitis, gallbladder infection, endocarditis, pulmonary infection, or septicemia. Deaths ascribed to any of the following causes were grouped as ‘other’: accident related or unrelated to treatment, acidosis, cachexia, intracranial hemorrhage, chronic obstructive lung disease, cirrhosis, dementia (including dialysis dementia), Alzheimer’s, gastrointestinal hemorrhage, hemorrhage from vascular access, hyperkalemia, malignant disease, viral hepatitis, pancreatitis, pulmonary embolus, or seizures. Withdrawal from dialysis and unknown cause of death constituted the 2 remaining groups.
Study approvals were obtained from the Wake Forest University Baptist Medical Center Institutional Review Board, and all subjects provided written informed consent.
DNA extraction from whole blood was performed using the PureGene system (Gentra Systems, Minneapolis, Minn., USA). Genotyping was performed at the Center for Inherited Disease Research using 1 µg of genomic DNA (diluted in 1× TE buffer and at 50 ng/µl) on Affymetrix Genome-Wide Human SNP array 6.0. DNA from cases was interleaved on 96-well master plates. To confirm sample identity, a SNP barcode (96 SNPs) was generated prior to genotyping on the Affymetrix array and confirmed on downstream released genotyping data. Genotypes were called using Birdseed version 2; APT 1.10.0 by grouping samples by DNA plate to determine the genotype cluster boundaries. All autosomal SNPs (n = 868,157) were included in the analysis, but classified on data quality with primary inference drawn from polymorphic SNPs (minor allele frequency > 0.05) with <5% missing data (Hardy-Weinberg Equilibrium p values >0.0001) resulting in a total of 832,357 SNPs for analysis. The average sample call rate was 99.16% for all autosomal SNPs. Forty-six blind duplicates were included in genotyping and had a concordance rate of 99.59%. In addition, one individual was removed whose self-reported gender was inconsistent with X chromosome genotype data. Relatedness was estimated using the identity-by-descent analysis implemented in the PLINK analysis software package (http://pngu.mgh.harvard.edu/purcell/plink/). One duplicate pair and 110 first-degree relative pairs were identified, and a single individual from each pair was retained for analysis based on the completeness of the phenotypic data.
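The SNP-level quality filters described above (minor allele frequency > 0.05, <5% missing data, Hardy-Weinberg equilibrium p > 0.0001) can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' pipeline: the function names are hypothetical, and the Hardy-Weinberg test shown is a 1-degree-of-freedom chi-square test, which the text does not specify.

```python
import math


def hwe_chi2_pvalue(n_aa, n_ab, n_bb):
    """Chi-square (1 df) Hardy-Weinberg test from genotype counts.

    Assumption: the paper does not say which HWE test was used;
    the chi-square form is shown only as a common choice.
    """
    n = n_aa + n_ab + n_bb
    p = (2 * n_aa + n_ab) / (2 * n)  # frequency of allele A
    q = 1.0 - p
    expected = (n * p * p, 2 * n * p * q, n * q * q)
    if min(expected) == 0:
        return 1.0  # monomorphic SNP: no departure can be tested
    observed = (n_aa, n_ab, n_bb)
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
    # Survival function of a chi-square variable with 1 df.
    return math.erfc(math.sqrt(chi2 / 2.0))


def passes_qc(n_aa, n_ab, n_bb, n_missing,
              maf_min=0.05, missing_max=0.05, hwe_min=1e-4):
    """Apply the three thresholds quoted in the text to one SNP."""
    called = n_aa + n_ab + n_bb
    if called == 0:
        return False
    freq_a = (2 * n_aa + n_ab) / (2 * called)
    maf = min(freq_a, 1.0 - freq_a)
    missing_rate = n_missing / (called + n_missing)
    return (maf > maf_min
            and missing_rate < missing_max
            and hwe_chi2_pvalue(n_aa, n_ab, n_bb) > hwe_min)
```

For example, a SNP with genotype counts 360/210/30 and 10 missing calls would pass all three filters, whereas one with counts 590/10/0 would fail on minor allele frequency.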
To account for admixture, ancestral allele frequencies were estimated from the results of 70 ancestry informative markers genotyped in 44 Yoruba Nigerians and 39 European Americans. Individual ancestral proportions were generated for each subject using FRAPPE, an expectation maximization algorithm, under a two-population model.
To test for association between each SNP and survival, adjusting for covariates, a Cox proportional hazards semi-parametric regression model was fitted. The proportional hazards assumption for each covariate was examined before allowing that covariate into the model. Time to all-cause death was tested for each SNP using dominant, recessive, and additive models. Primary inference was based on the additive genetic model unless significant departure from additivity was observed (p < 0.05). All genetic models were defined relative to the minor allele: the dominant model tests whether the presence of the minor allele influences survival, the additive model tests for a dose effect of the number of minor alleles (0, 1, or 2), and the recessive model tests whether 2 copies of the minor allele are associated with survival. The recessive and additive genetic models required at least 30 and 10 individuals homozygous for the minor allele, respectively. All analyses adjusted for population structure using the ancestral proportions noted above. Quantile-quantile (Q-Q) plots and test inflation factors were computed to assure that the experiment had appropriate family-wise type 1 error rates. Participants were censored when lost to follow-up due to transfer to another ESRD Network, at receipt of a kidney transplant, or (if living) on January 1, 2010. Because improvements in dialysis prescription and medications with potential impact on dialysis survival were introduced in approximately the year 2000, we adjusted the analysis using a binary variable denoting pre- and post-2000 initiation of dialysis. Model 1 adjusted for ancestry, age at start of dialysis, sex, BMI, pre-dialysis diabetes duration, and incident year of dialysis; model 2 adjusted for these factors plus serum albumin and hemoglobin at the initiation of dialysis. HRs with 95% CIs are presented.
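The three genetic models defined relative to the minor allele amount to three different codings of the same genotype before it enters the Cox model as a covariate. The sketch below illustrates that coding and the minimum homozygote counts quoted above. The function and constant names are hypothetical, and treating the dominant model as always testable is an assumption, since the text states no minimum count for it.

```python
def encode_genotype(minor_allele_count, model):
    """Code a genotype (0, 1, or 2 minor alleles) under a genetic model."""
    if minor_allele_count not in (0, 1, 2):
        raise ValueError("minor_allele_count must be 0, 1, or 2")
    if model == "additive":
        return minor_allele_count            # dose effect: 0, 1, or 2
    if model == "dominant":
        return 1 if minor_allele_count >= 1 else 0   # any minor allele
    if model == "recessive":
        return 1 if minor_allele_count == 2 else 0   # two copies only
    raise ValueError("unknown model: " + model)


# Minimum numbers of minor-allele homozygotes quoted in the text.
MIN_MINOR_HOMOZYGOTES = {"additive": 10, "recessive": 30}


def models_to_test(genotypes):
    """Return the models that satisfy the homozygote-count requirement.

    Assumption: the dominant model has no minimum count.
    """
    n_homozygous = sum(1 for g in genotypes if g == 2)
    models = ["dominant"]
    for model in ("additive", "recessive"):
        if n_homozygous >= MIN_MINOR_HOMOZYGOTES[model]:
            models.append(model)
    return models
```

A SNP with only 12 minor-allele homozygotes in the cohort would therefore be tested under the dominant and additive models but not the recessive one.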
We estimated study power using the Genetic Power Calculator; assuming dominant models with minor allele frequencies of 0.07, our sample size had 95% power to detect a minimal HR of 1.6 for death while on dialysis at α = 0.05.
Results
A total of 647 participants were evaluated for inclusion. Of these, 5 recovered renal function, 4 had temporary discontinuation of dialysis, 2 were transplanted within 6 months of dialysis onset, and 26 did not have the date of first dialysis recorded; these patients were withdrawn. The remaining 610 subjects were included in the final analysis. Table 1 summarizes the baseline characteristics of participants. Hemodialysis was the predominant initial mode of renal replacement therapy (93% of patients), and we observed an overall 10% rate of modality conversion from any initial to an alternative renal replacement modality during the period of follow-up. After mean follow-up of 5.4 ± 3.5 years, 434 deaths, 44 kidney transplants, and 2 cases lost to follow-up were recorded (table 2). The mean survival on dialysis for the 434 subjects who died was 5.2 ± 3.1 years. A total of 130 subjects remained alive at the last follow-up. Table 2 summarizes the major causes of death. Sex, BMI, incident albumin, incident hemoglobin, and admixture were not individually significantly associated with survival, while age at dialysis inception (5-year HR = 1.2; p < 0.0001) and incident year of dialysis (dialysis vintage, 5-year HR = 1.16; p = 0.0008) were significantly associated with mortality (data not shown).
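The covariate effects above are reported per 5-year increment. Assuming the covariate enters the Cox model linearly on the log-hazard scale (the usual form for a continuous covariate, although the text does not state it explicitly), an HR can be rescaled to a different increment as in this small sketch; the function name is hypothetical.

```python
import math


def rescale_hazard_ratio(hr, from_units, to_units):
    """Convert an HR per `from_units` of a covariate to an HR per `to_units`.

    Assumes a log-linear covariate effect: HR(k units) = exp(beta * k).
    """
    beta_per_unit = math.log(hr) / from_units
    return math.exp(beta_per_unit * to_units)


# Age at dialysis inception: 5-year HR = 1.2 as reported in the text.
hr_per_year = rescale_hazard_ratio(1.2, from_units=5, to_units=1)      # about 1.037
hr_per_decade = rescale_hazard_ratio(1.2, from_units=5, to_units=10)   # 1.2 squared, i.e. 1.44
```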
|Table 1. Demographics at dialysis inception|
|Table 2. Events at follow-up|
A number of genomic regions showed evidence of association with survival at p < 1.00 × 10⁻⁵, with 5 of the landmark SNPs associated at p < 1.00 × 10⁻⁶ (fig. 1). Overall, there was no evidence of inflation of the tests of association, and there was appropriate fit to the expected overall distribution of association (fig. 2).
|Fig. 1. GWAS results for death on dialysis. The genome-wide distribution of −log₁₀ p values from the adjusted trend is shown across the chromosomes. The dotted line indicates the genome-wide significance threshold (p < 1 × 10⁻⁶). Annotated are the top five SNPs with genome-wide significance for dialysis survival at p < 1 × 10⁻⁶.|
|Fig. 2. Q-Q plot from the genome-wide association with dialysis survival analysis. The black bold line represents the observed p values; the red line is the expected line under the null distribution.|
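A Q-Q plot of this kind and the accompanying test inflation factor can be computed from the list of association p values alone. The sketch below shows one common way to do so, assuming the inflation factor is the genomic-control ratio of the observed median 1-degree-of-freedom chi-square statistic to its null median; the text does not say which estimator was used, and the function names are hypothetical.

```python
import math
from statistics import NormalDist, median

_STD_NORMAL = NormalDist()


def p_to_chi2(p):
    """Convert a two-sided p value in (0, 1] to a 1-df chi-square statistic."""
    z = _STD_NORMAL.inv_cdf(p / 2.0)  # lower-tail quantile, so z <= 0
    return z * z


def genomic_inflation_factor(p_values):
    """Median observed chi-square divided by the null median (about 0.455)."""
    observed_median = median(p_to_chi2(p) for p in p_values)
    expected_median = p_to_chi2(0.5)
    return observed_median / expected_median


def qq_points(p_values):
    """Return (expected, observed) -log10(p) pairs in ascending order."""
    n = len(p_values)
    observed = sorted(-math.log10(p) for p in p_values)
    # Expected quantiles of a uniform(0, 1) sample of size n.
    expected = sorted(-math.log10((i - 0.5) / n) for i in range(1, n + 1))
    return list(zip(expected, observed))
```

A factor close to 1 with points lying on the diagonal is what "no evidence of inflation" corresponds to; values well above 1 would suggest residual population structure or other systematic bias.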
After adjustment for ancestry, age at dialysis, sex, BMI, diabetes duration, and incident year factor (model 1), 30 SNPs provided evidence for association (p < 1.00 × 10⁻⁵); 12 were located in 10 genes (table 3) and 18 were located in intergenic regions (table 4). One of the top hits (rs2412980, chr22q12.2, HR = 1.65, p = 3.67 × 10⁻⁶) was located in LOC729980, a gene encoding a protein of unknown function. Two of the top hits (rs6546886, chr2p13.1, HR = 2.13, p = 3.28 × 10⁻⁶; rs9921518, chr16q12.2, HR = 2.11, p = 8.62 × 10⁻⁶) were located in regions of reported copy number variation. Of note, several SNPs clustered around A Disintegrin And Metalloproteinase with Thrombospondin Motifs (ADAMTS) and Iroquois (IRX) genes (rs6816344, chr4q13.3, HR = 1.7, p = 1.37 × 10⁻⁶; rs1452093, chr21q21.3, HR = 1.59, p = 2.31 × 10⁻⁶; rs9977499, chr21q21.3, HR = 1.57, p = 4.36 × 10⁻⁶; rs1817114, chr21q21.3, HR = 1.57, p = 4.36 × 10⁻⁶; rs2830881, chr6q22.31, HR = 1.57, p = 4.67 × 10⁻⁶; and rs9921518, chr16q12.2, HR = 2.11, p = 8.62 × 10⁻⁶).
|Table 3. Summary results for top gene SNPs associated with all-cause death on dialysis|
|Table 4. Summary results for top intergenic SNPs associated with all-cause death on dialysis|
Additional covariate adjustments in model 2 (incident serum albumin and hemoglobin) maintained comparable HRs, but diminished the p values modestly (data not shown).
Discussion
This report represents the first genome-wide search for variation influencing dialysis survival in AAs with T2D. We found 30 SNPs that had strong statistical associations (p < 1.00 × 10⁻⁵) with survival on dialysis, even after adjustment for important covariates known to impact dialysis survival (model 1). The lack of change in the HR with modest reduction of statistical significance (10-fold drop) in model 2 reflected a reduction in statistical power due to the reduced sample size resulting from a lack of measured covariates in some participants. Of the top 30 SNPs, some indicated a negative effect on survival by association with a higher propensity for death (HR > 1.0), while other SNPs indicated a protective effect by association with a lower rate of death (HR < 1.0). These associations tag potentially interesting genomic regions. The functional consequences of the top SNPs relating to death on dialysis are unknown, but may prove to be important based on their detection using unbiased methodologies. Replication, subsequent fine mapping, and functional studies are necessary to more accurately narrow down regions of interest and establish loci as influencing survival. Genome-wide association study (GWAS) results from the entire cohort have been entered in the Database of Genotypes and Phenotypes (dbGaP).
The top SNPs correlating with dialysis survival were located primarily in or near genes with mechanistic roles that can be categorized in 3 major groups: regulation of extracellular matrix (ECM) composition and turnover, myocardial cell development and repolarization, and neurobiological regulation of smoking cessation. At this stage, these results are hypothesis-generating and require replication in other cohorts and races. If replicated, they merit experimental analysis to define the molecular pathways that impact survival.
Accelerated atherosclerosis, a multi-factorial process involving inflammation and altered matrix turnover and composition in vascular walls, is often present in patients with diabetes and kidney disease. Proteases from the ADAMTS family regulate ECM turnover in atherosclerotic plaque, and hyaluronic acid (synthesized by hyaluronan synthase 2, encoded by the HAS2 gene) is a prominent constituent of the ECM in atherosclerotic vascular lesions. Our analysis found several SNPs clustered near the ADAMTS5 gene (rs6816344, rs1452093, rs9977499, rs1817114, rs2830881) and a single SNP located near the HAS2 gene (rs17232789) to have significant correlations with survival. These SNPs may impact the rate of atherosclerosis, either pre-dialysis or on dialysis. Further studies are needed to replicate these findings and delineate their pathophysiology.
Sudden cardiac death due to arrhythmia is the leading cause of death in patients on dialysis. Several genes are known to control myocardial development and repolarization. The protein tyrosine phosphatase receptor type M (PTPRM) gene and the PDZ domain containing ring finger 3 (PDZRN3) gene play important roles in myocardial development, myocardial cell resistance to ischemia-reperfusion injury, and electrolyte-induced depolarization [17,18]. Calmodulin regulates intracellular calcium homeostasis in myocardial and vascular endothelial cells, and the Ca²⁺/calmodulin kinase complex plays a major role in ventricular repolarization. Polymorphisms in the calmodulin 2 (CALM2) gene (rs815815), PTPRM gene (rs8098064, rs7243299, rs9953514), and PDZRN3 gene (rs11128347) were among the top associations with dialysis survival, suggesting a role for differential myocardial cell response to triggers of cell depolarization. Experimental studies of the morphologic and functional impacts of genomic variation detected in this study could identify arrhythmogenic pathways in myocardial cells that contribute to survival on dialysis.
Association was detected between dialysis survival and polymorphisms in and near genes thought to predict successful smoking cessation: PDZRN3, PTPRM, neuregulin 1 (NRG1), SH3-domain binding protein 4 (SH3BP4), kelch-like 29 (KLHL29), DEAH (Asp-Glu-Ala-His) box polypeptide 35 (DHX35), ATP-binding cassette sub-family A member 4 (ABCA4), EPH receptor A7 (EPHA7), proprotein convertase subtilisin/kexin type 2 (PCSK2), tet oncogene family member 3 (TET3), G patch domain-containing 2 (GPATCH2), estrogen-related receptor gamma (ESRRG), Rap guanine nucleotide exchange factor 5 (RAPGEF5), and Ras and Rab interactor 3 (RIN3) [20,21]. Unfortunately, our dataset lacks information pertaining to smoking history.
Unique strengths of this GWAS include the novel phenotype (dialysis survival) and use of an unbiased genetic approach in a population enriched for cardiovascular disease risk factors. Previous genetic studies of dialysis survival included mainly Caucasian and non-diabetic individuals, and applied a priori single gene analysis (e.g. Klotho, CCR5, Fetuin A) [22,23,24]. The current GWAS did not reveal polymorphisms in or near these genes, potentially due to race-dependent and environment-conducive genetic variation. These confounding factors were minimized in our analysis by virtue of homogeneous ethnicity and cause of ESRD.
GWAS analyses have been applied in large study cohorts, including the Wellcome Trust Case Control Consortium (WTCCC) and Atherosclerosis Risk in Communities (ARIC) studies, to detect genetic variants associated with cardiovascular events. These analyses detected several SNPs associated with incident heart failure and coronary heart disease in African-ancestry populations, but there was no overlap with risk variants in our study [7,9]. This is not surprising, since the mechanisms of cardiovascular death in dialysis patients are likely to differ from those in the general population. Since few studies with ESRD patients have undergone GWAS, it will be necessary to evaluate dialysis survival as a phenotype in other GWAS. Although the genetic mechanisms underlying various etiologies of death on dialysis may differ, we elected not to perform separate analyses by cause of death, since the sample sizes were small and inconsistencies might exist in the attributed cause of death. However, it is clear that the majority of deaths in our cohort were related to cardiovascular events – an expected observation.
Limitations of this study include (1) a relatively small cohort, (2) inability to adjust for being on hemodialysis versus peritoneal dialysis, (3) inability to assess the type of hemodialysis access used, and (4) inability to ascertain that T2D was the actual cause of ESRD in clinically diagnosed subjects due to lack of kidney biopsies. An inherent limitation of the study is that the statistical association only tags regions of interest; thus, replication and fine mapping studies will be necessary. Frequent changes in dialysis modality and access type are commonly observed, making adjustment for these factors difficult. The possibility of false-positive results relating to our relatively small sample size cannot be excluded; however, we detected suggestive evidence of genome-wide association. An unadjusted p value of <1 × 10⁻⁸ is regarded as significant in genome-wide association studies. Several SNPs in this discovery cohort had p values <1 × 10⁻⁶; it is possible that these may play important roles in survival on dialysis or may be false-positive results. A replication cohort to validate these signals and exclude genomic noise is critical. The relatively large number of deaths was not unexpected in this high-risk sample of subjects with diabetes and ESRD. Known biological connections between the associated genes and their pathways suggest a plausible mechanism for effects on survival.
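To put the reported p values in context, the sketch below computes a Bonferroni-corrected threshold for the 832,357 SNPs retained after quality control and compares the five top p values from the abstract against it. Bonferroni correction at α = 0.05 is an assumption made here for illustration; it is not the threshold stated in the text, and because neighbouring SNPs are correlated it is typically conservative.

```python
def bonferroni_threshold(alpha, n_tests):
    """Per-test significance threshold under Bonferroni correction."""
    return alpha / n_tests


N_ANALYZED_SNPS = 832_357  # SNPs passing QC, as reported in the Methods

# Top five p values as reported in the abstract.
TOP_P_VALUES = {
    "rs2681019": 8.00e-8,
    "rs815815": 6.50e-7,
    "rs926392": 4.80e-7,
    "rs926391": 7.30e-7,
    "rs11128347": 6.00e-7,
}

threshold = bonferroni_threshold(0.05, N_ANALYZED_SNPS)  # roughly 6.0e-8

# SNPs that would survive the illustrative Bonferroni threshold.
passing_bonferroni = [snp for snp, p in TOP_P_VALUES.items() if p < threshold]

# SNPs below the 1e-6 cut-off used for the top hits in this report.
passing_reported_cutoff = [snp for snp, p in TOP_P_VALUES.items() if p < 1e-6]
```

Under this illustrative correction none of the five SNPs crosses the threshold, although all five fall below the 1 × 10⁻⁶ cut-off, which is consistent with the authors' description of the evidence as suggestive and in need of replication.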
In summary, a GWAS employing 868,155 SNPs identified 30 genetic variants that appeared to influence the risk of death on dialysis in AAs with T2D and ESRD. These results highlight several important pathways and processes involved in cardiac and ECM physiology, and provide insights into the potential genetic architecture of dialysis survival as a polygenic trait. This is a hypothesis-generating GWAS that presents a preliminary set of SNPs associated with survival in AA ESRD patients with T2D. In the future, GWAS in other racial groups and dialysis cohorts, replication of these findings, fine-mapping and functional studies of these loci in myocardial, vascular, and/or neuronal cells will be necessary to validate these results and determine the role played by these polymorphisms in the pathogenesis of dialysis survival.
We wish to thank the patients and the Southeastern Kidney Council, Inc./ESRD Network 6 for their participation. This work was supported by NIH grants R01 DK066358 (D.W.B.), R01 DK053591 (D.W.B.), R01 HL56266 (B.I.F.), R01 DK070941 (B.I.F.) and in part by the General Clinical Research Center of the Wake Forest University School of Medicine grant M01 RR07122.
The authors report no conflicts of interest.
Mariana Murea, MD
Department of Internal Medicine, Section on Nephrology
Wake Forest University School of Medicine, Medical Center Boulevard
Winston-Salem, NC 27157-1053 (USA)
Tel. +1 336 716 2406
Received: March 4, 2011
Accepted: March 17, 2011
Published online: May 5, 2011
Number of Print Pages : 8
Number of Figures : 2, Number of Tables : 4, Number of References : 24
American Journal of Nephrology
Vol. 33, No. 6, Year 2011 (Cover Date: June 2011)
Journal Editor: Bakris G. (Chicago, Ill.)
ISSN: 0250-8095 (Print), eISSN: 1421-9670 (Online)