Concept and design of a genome-wide association genotyping array tailored for transplantation-specific studies

Li, Yun R.; van Setten, Jessica; Verma, Shefali S.; Lu, Yontao; Holmes, Michael V.; Gao, Hui; Lek, Monkol; Nair, Nikhil; Chandrupatla, Hareesh; Chang, Baoli; Karczewski, Konrad J.; Wong, Chanel; Mohebnasab, Maede; Mukhtar, Eyas; Phillips, Randy; Tragante, Vinicius; Hou, Cuiping; Steel, Laura; Lee, Takesha; Garifallou, James; Guettouche, Toumy; Cao, Hongzhi; Guan, Weihua; Himes, Aubree; van Houten, Jacob; Pasquier, Andrew; Yu, Reina; Carrigan, Elena; Miller, Michael B.; Schladt, David; Akdere, Abdullah; Gonzalez, Ana; Llyod, Kelsey M.; McGinn, Daniel; Gangasani, Abhinav; Michaud, Zach; Colasacco, Abigail; Snyder, James; Thomas, Kelly; Wang, Tiancheng; Wu, Baolin; Alzahrani, Alhusain J.; Al-Ali, Amein K.; Al-Muhanna, Fahad A.; Al-Rubaish, Abdullah M.; Al-Mueilo, Samir; Monos, Dimitri S.; Murphy, Barbara; Olthoff, Kim M.; Wijmenga, Cisca; Webster, Teresa; Kamoun, Malek; Balasubramanian, Suganthi; Lanktree, Matthew B.; Oetting, William S.; Garcia-Pavia, Pablo; MacArthur, Daniel G.; de Bakker, Paul I W; Hakonarson, Hakon; Birdwell, Kelly A.; Jacobson, Pamala A.; Ritchie, Marylyn D.; Asselbergs, Folkert W.; Israni, Ajay K.; Shaked, Abraham; Keating, Brendan J.

doi:10.1186/s13073-015-0211-x

Research
Open access
Published: 01 October 2015

Concept and design of a genome-wide association genotyping array tailored for transplantation-specific studies

Yun R. Li^1,2,
Jessica van Setten³,
Shefali S. Verma⁶,
Yontao Lu⁴,
Michael V. Holmes⁵,
Hui Gao^2,5,
Monkol Lek^7,8,
Nikhil Nair^2,5,
Hareesh Chandrupatla^2,5,
Baoli Chang^2,5,
Konrad J. Karczewski^7,8,
Chanel Wong^2,5,
Maede Mohebnasab²,
Eyas Mukhtar^2,5,
Randy Phillips^2,5,
Vinicius Tragante³,
Cuiping Hou²,
Laura Steel^2,5,
Takesha Lee^2,5,
James Garifallou²,
Toumy Guettouche²,
Hongzhi Cao^10,11,
Weihua Guan¹²,
Aubree Himes^2,5,
Jacob van Houten²,
Andrew Pasquier²,
Reina Yu²,
Elena Carrigan²,
Michael B. Miller¹³,
David Schladt²⁶,
Abdullah Akdere¹,
Ana Gonzalez¹,
Kelsey M. Llyod¹,
Daniel McGinn¹,
Abhinav Gangasani¹,
Zach Michaud¹,
Abigail Colasacco¹,
James Snyder²,
Kelly Thomas²,
Tiancheng Wang²,
Baolin Wu¹²,
Alhusain J. Alzahrani²⁵,
Amein K. Al-Ali¹⁴,
Fahad A. Al-Muhanna¹⁴,
Abdullah M. Al-Rubaish¹⁴,
Samir Al-Mueilo¹⁴,
Dimitri S. Monos^15,2,
Barbara Murphy¹⁶,
Kim M. Olthoff⁵,
Cisca Wijmenga¹⁷,
Teresa Webster⁴,
Malek Kamoun¹⁵,
Suganthi Balasubramanian¹⁸,
Matthew B. Lanktree⁵,
William S. Oetting¹⁹,
Pablo Garcia-Pavia²⁰,
Daniel G. MacArthur^7,8,
Paul I W de Bakker⁹,
Hakon Hakonarson²,
Kelly A. Birdwell²¹,
Pamala A. Jacobson²²,
Marylyn D. Ritchie⁶,
Folkert W. Asselbergs^3,23,24,
Ajay K. Israni²⁷,
Abraham Shaked⁵ &
…
Brendan J. Keating^5,28,2,29

Genome Medicine volume 7, Article number: 90 (2015) Cite this article

9321 Accesses
42 Citations
61 Altmetric
Metrics details

Abstract

Background

In addition to HLA genetic incompatibility, non-HLA difference between donor and recipients of transplantation leading to allograft rejection are now becoming evident. We aimed to create a unique genome-wide platform to facilitate genomic research studies in transplant-related studies. We designed a genome-wide genotyping tool based on the most recent human genomic reference datasets, and included customization for known and potentially relevant metabolic and pharmacological loci relevant to transplantation.

Methods

We describe here the design and implementation of a customized genome-wide genotyping array, the ‘TxArray’, comprising approximately 782,000 markers with tailored content for deeper capture of variants across HLA, KIR, pharmacogenomic, and metabolic loci important in transplantation. To test concordance and genotyping quality, we genotyped 85 HapMap samples on the array, including eight trios.

Results

We show low Mendelian error rates and high concordance rates for HapMap samples (average parent-parent-child heritability of 0.997, and concordance of 0.996). We performed genotype imputation across autosomal regions, masking directly genotyped SNPs to assess imputation accuracy and report an accuracy of >0.962 for directly genotyped SNPs. We demonstrate much higher capture of the natural killer cell immunoglobulin-like receptor (KIR) region versus comparable platforms. Overall, we show that the genotyping quality and coverage of the TxArray is very high when compared to reference samples and to other genome-wide genotyping platforms.

Conclusions

We have designed a comprehensive genome-wide genotyping tool which enables accurate association testing and imputation of ungenotyped SNPs, facilitating powerful and cost-effective large-scale genotyping of transplant-related studies.

Background

Since the Organ Procurement and Transplantation Network (OPTN) began its registry in 1987 until mid-2014 over 575,000 solid organ transplantations have been performed in the United States [1]. Although there have been considerable improvements in patient treatment pre- and post-transplant surgery and immunosuppressant therapies (IST), various grades of rejection are observed in up to 40 % of transplanted individuals within the first year post transplant [2], and affects approximately 60 % of transplanted individuals over the course of the graft lifetime thereby representing a major risk factor for graft damage and eventual graft loss [3, 4]. There are also significant risks post transplant ranging from severe adverse events to side-effects of ISTs including nephrotoxicity, hyperlipidemia, and new onset of diabetes after transplantation (NODAT) [5, 6].

Recent advances in genomic technologies and large-scale human reference maps such as the International HapMap Project have led to the development of genome-wide association studies (GWAS) utilizing cost-effective arrays that allow for the rapid interrogation of several hundreds of thousands single nucleotide polymorphisms (SNPs) and copy number variants (CNV) across the human genome [7–9]. Large scale whole genome sequencing studies show that approximately 3.5 million and approximately 10 million common and rare polymorphisms are typically observed between two unrelated individuals of European and African ancestries, respectively [10]. Each donor-recipient (D-R) pair of genomes contains vast permutations of non-synonymous amino-acid differences and other potential sources for allogenicity, beyond the highly characterized human leukocyte antigen (HLA) region, conventionally considered to contain the main genetic factors underpinning allograft rejection. Rejection is observed in following HLA-matched transplantations between full sibling, suggesting that histocompatibility may depend on non-HLA genetic differences. This includes a number of minor histocompatibility antigens, such as the H-Y antigens [11], which have been studied in the context of renal transplantation. Such findings from these studies identifying non-HLA histocompatibility loci suggest that non-HLA genetic disparities exist between D-Rs, and that these differences may manifest as the presentation of polymorphic peptides that the recipient’s immune system recognizes as non-self even in the presence of IST. Indeed, analyses of overall 10-year kidney graft failure rates for cadaver donors showed that 18 % of graft failures were due to HLA factors, as observed through mismatched living donor grafts; and 43 % were attributable to non-immunological factors, and 38 % of the failures were due to immunological reactions against non-HLA factors as seen in HLA-identical sibling grafts [4]. The natural killer cell immunoglobulin-like receptor (KIR) region comprising a family of 13 genes on chr19q13.46 are known to interact with HLA Class I molecules, and many unique KIR haplotypes identified have been linked to transplantation outcomes [12, 13] Additional non-HLA/KIR polymorphisms have also been shown to impact transplantation outcomes since through the generation histo-incompatibilities [14–16]. Investigations of non-HLA genetic determinants of clinical outcomes following organ transplantation have yet to be performed in any systematic well powered fashion to date.

A recent genome-wide study of NODAT was conducted in a prospective cohort of 529 kidney transplant recipients, 57 of whom developed NODAT with 26 SNPs identified in the discovery stage (P <1 × 10⁻⁵), eight of which retained association on replication, of which seven intriguingly are in loci known to have a role in Beta-cell apoptosis [17]. A number of genetic variants impacting uptake, metabolism, and excretion of immunosuppressant drugs have been identified [18]. While there are examples of robust associations in a number of these studies, validation of a large number of other putative associations in independent studies are often not observed [19]. This is likely to contribute to publication bias, underpowered discovery cohorts, and failure to adjust for population stratification.

The use of current sequencing and dense genotyping data from reference populations also makes it feasible to further infer, or impute, tens of millions of additional genotypes, which were not directly genotyped on the initial platform [20–22], by the use of whole genome imputation using highly characterized genomic reference datasets such as the 1000 genomes project (1KGP) and the Genomes of the Netherlands (GoNL) [23, 24]. Array-based genotyping technologies that have enabled conventional GWAS analyses also permit flexibility in choosing the scope and density of SNPs for disease or trait-specific arrays geared toward particular research communities. Such arrays include platforms such as the ‘cardiochip’ [25] and more recently the Immunochip and Cardio-Metabochip arrays [26, 27] have unveiled hundreds of new genetic associations leading to deeper understanding of the genetic architecture of new regions underpinning biological and disease processes. These newest arrays, including the Axiom Biobank and the UK Biobank genotyping arrays enable more comprehensive capture of genetic diversity across populations [28].

To create a unique genome-wide platform to facilitate genomic research studies in transplant-related studies, we designed a genome-wide genotyping tool customized for known and potentially relevant loci in metabolic and pharmacological aspects of transplantation including content relevant for D-R genomic incompatibility. We describe here the design and implementation of a genome-wide 782,000 marker array herein termed the ‘TxArray’ with tailored deeper capture of variants in HLA, KIR, pharmacogenomic, and metabolic genes/loci important in transplantation while still allowing conventional hypothesis-free GWAS to be performed. The genome-wide coverage of this array was created using content from conventional GWAS arrays [29, 30] with transplant-specific content informed from a range of sources including comprehensive literature searching and by expert opinions on priority pharmacogenomic loci. Our targeted customized modules are also designed to provide improved coverage of functional variations based on updated content from the 1KGP [10] and powerful analyses of over 32,000 exomes [31, 32].

DNA from over 16,000 DNA samples has thus far been genotyped using this array allowing for more robustly powered in silico replication as well analyses of rare variants and loss of function (LoF) variants ablating all or parts or a given gene, and cross-cohort meta-analyses in diverse populations. The majority of these samples are contributed as a part of International Genetics & Translational Research in Transplantation Network (iGeneTRAiN), a major international collaboration on the genomics of transplantation [33]. The objectives of forming this consortium are: (1) to pool expertise for selection of genes and SNPs; (2) to reduce costs by producing a standardized genome-wide genotyping platform; (3) to facilitate ease of cross cohort meta-analyses and replication for a large set of SNPs in high priority candidate genes; and (4) to bolster statistical power by combining as many of these appropriately harmonized datasets to discover new genes involved in a range of phenotypes and outcomes relating to solid organ and hematopoietic stem cell transplantation (HCT). Here we formally describe the rationale, design and content of our transplant genotyping array, and describe the imputation process as well as evaluate its performance in capturing variation across major populations.

Methods

Affymetrix genotyping platform and assay technology

The Axiom genotyping platform utilizes a two-color, ligation-based assay using 30-mer Oligonucleotide probes synthesized in situ onto a microarray substrate. There are approximately 1.38 million features available for experimental content with each feature approximately 3 μm² with each SNP feature contains a unique oligonucleotide sequence complementary to the sequence flanking the polymorphic site on either the forward or the reverse strand. Solution probes bearing attachment sites for one of two dyes, depending on the 3' (SNP-site) base (A or T, versus C or G) are hybridized to the target complex, followed by ligation for specificity.

Array design and variant selection

The transplant-specific modules and genome-wide content for the TxArray was designed based on a tiered system built on the main Affymetrix GWAS imputation grids [30] for the major human populations as defined by the Hapmap Project [9] and subsequent high density population reference studies yielding high density genomic datasets including representative individuals of European ancestry (Utah residents with ancestry from Northern and Western Europe (CEU)), of Asian descent (Japanese from Tokyo, Japan (JPT)), and Han Chinese from Beijing, China (CHB)), and of African ancestry (Yoruba in Ibadan, Nigeria (YRI)) and Americans of African Ancestry in SouthWest, USA (ASW)).

In addition to this core content, additional modules of SNPs were added sequentially so that maximal economy of markers was retained by ensuring no redundant SNPs were added. We describe the tiers sequentially below:

A. Cross-platform ‘cosmopolitan’ genome-wide coverage markers (approximately 350,000 markers)

Genome-wide imputation grid (approximately 296K markers): The TxArray’s core imputation grid consists of genome-wide approximately 296K SNPs shared in common with the conventional Affymetrix Biobank Array. These include a set of 246K SNPs, also included in the UK Biobank array, that provide high-density coverage (mean r² >0.81 and 0.90) across European populations (CEU) at minor allele frequencies (MAFs) >1 % and 5 %, respectively.
Additional coverage for non-European populations: An additional set of approximately 50K SNPs, covered in the 1KGP Phase I reference panel, were additionally extracted from the Affymetrix-Biobank array to improve the mean coverage achieved in African and other populations. These SNPs were chosen with the goal of achieving comprehensive overlap with already existing UK Biobank Axiom Array and the Axiom Biobank Array, to facilitate additional collaborative efforts where joint or meta-analyses of samples genotyped across these platforms and other conventional GWAS platforms are required [29, 30].
Compatibility markers (approximately 18K markers): This module was designed to optimize and standardize genotyping quality control (QC) and sample validation through the use of: Polymorphisms capturing Ancestry informative markers (AIMs); fingerprinting panels; mitochondrial, Y-chromosome; and miRNA binding sites or targets regions were included.

B. Module-specific content from the UK Biobank core array (approximately 36K markers)

These constitute markers identified based on reported GWAS signals and candidate gene associations across pharmacogenomic and metabolic phenotypes. Again, to enable cross-platform analysis, where feasible, we also included markers directly overlapping the UK-Biobank array and additional markers for the transplant-specific content. The following UK-Biobank array modules were included; see Fig. 1 and the UK-Biobank Consortium for details of included variants [29]:

1.
HLA and KIR region markers (7,348 and 1,546 variants, respectively)
2.
Known phenotype associations curated by the National Human Genome Research Institute (NHGRI) GWAS Catalog [34, 35], (8,136 variants)
3.
Known CNVs (2,369 variants)
4.
Expression-quantitative trail loci, or eQTLs (17,115 variants)
5.
Lung-tissue specific or pulmonary function-associated markers (8,645 variants)

Targeted MHC and transplant-specific modules

Specific modular content incorporated in the array dedicated to address transplant community research goals. Aside from the above-described modules overlapping with the UK-Biobank array, we expanded modules dedicated to non-HLA MHC region markers, deep coverage of known and predicted LoF variants, and untranslated regions (UTR)-specific module. Note that all positions and variants referenced herein are based on the human genome builds hg19/build37 (Fig. 2).

MHC and KIR content for fine-mapping and imputation

The TxArray provides the most current and densest coverage of the extended MHC (Chr 6:25.5MB to 34MB hg19/build37) [36, 37]. While the UK-Biobank array includes dense HLA-specific coverage, a number of MHC genes and markers mapping variants outside of the HLA-encoding regions are critical players in immune function and some have known roles in histocompatibility (for example, MICA, MICB). Thus, we included a comprehensive set of MHC markers in addition to the conventional HLA-coding regions (Fig. 2a).

Additionally, given the important role of KIR in allo-recognition through its interaction with HLA, we included additional KIR SNPs to enable fine-mapping, imputation, and structural variation association analysis, as well as interaction analyses across KIR and HLA Class I, which has a known role in histocompatibility in HCT, as well as other MHC loci.

To build this content and attempt to preserve significant overlap with state-of-the-art, popular genotyping platforms, we curated and included in our design content from the following resources and platforms (Fig. 2a and Additional file 1: Table S1):

1.
UK-Biobank array (8,894 total variants), including 7,348 HLA markers and 1,546 KIR markers.
2.
Multiethnic HLA haplotype tagging SNPs [36] (421 SNPs).
3.
The Type 1 Diabetes Genetic Consortium (T1DGC) Imputation panel (4,794 SNPs included directly tiling or tagging by LD those SNPs in the HLA imputation panel for SNP2HLA [38]).
4.
Non-redundant MHC validated SNPs from existing genotyping platforms used in large-scale studies: (1) Metabochip (1,123 SNPs) and (2) Immunochip (12,609 variants) [26, 27].

The content above includes 10,820 non-redundant SNP markers. We maximized the coverage of this content using a non-redundant set of best-tagging variants to achieve satisfactory tagging of the major HapMap continental populations including African (ASW and YRI), European (CEU), and Asian (CHD, JPT, CHB).

C. Transplant-specific content

Pharmacogenomic

Drug absorption, metabolism, excretion and toxicity markers, n = approximately 7,500 SNPs including markers derived from PharmGKB [39]. As these SNPs were of key relevance to this array, we also included at least one or more tagging SNPs to cover those common variants present in the 1KGP database. Literature searching was also performed (see below) for serious adverse events and pharmacogenomics studies related to IST and other therapeutics relating to transplantation. Previous candidate gene/pathway genotyping results from the Deterioration of Kidney Allograft Function (DeKAF) study were also included (n = approximately 2,000 SNPs) [40–42].

Candidate genes associated with transplant outcomes

Over 600 transplantation-related genetic association studies were manually curated from PubMed using the following search string: ‘transplant + DNA + donor + recipient AND (liver OR hepatic OR lung OR pulmonary OR heart OR cardiac OR kidney OR renal) AND (SNP OR polymorphism OR variant)’. Key information including PMID number, size and population examined, loci and SNPs studied (including the respective rsID numbers), and number of donors and recipient subjects were collated. An emphasis was placed on sample size, data quality and strength of the described associations to facilitate more powerful meta-analyses with data from existing publications.

To maximize the coverage in the CEU, YRI, and ASN populations, we selected an additional non-redundant set of 23.8K variants to boost coverage the total of 91.9K polymorphic sites included in these loci. The SNPs were chosen based on an algorithm that attempts to maximize the expected mean coverage across all three key populations simultaneously instead of one at a time. This was performed by selecting the tagging SNP marker that tags most SNP markers from all three populations first; this strategy enables identification of minimal SNP sets for maximal cross-ethnic coverage (see Additional file 1: Table S2).

In our comprehensive literature search we identified primary research and review articles across each of the major solid transplant organs, including heart, liver, kidney, lung, among others as well as hematopoietic stem cell transplantation. In addition to considering measurements of graft survival and all cause, or organ failure related mortality, we also considered genes previously implicated in transplant associated complications, such as new-onset diabetes after transplantation (NODAT) and response to transplant-related medications.

We included those previously identified gene candidates and wherever no specific candidate gene has been fine-mapped or independent replicated or validated, we mapped known SNP associations to nearby coding loci and included tagging variants and variants in LD to boost local coverage. We considered a number of recent studies that attempted to replicate GWAS findings [17, 19, 43], as well as a number of recent reviewers [44–47].

Full details of all the polymorphisms on the array including their chromosome position and additional annotations are outlined in [33].

Functional variants modules

Aside from the modules noted as being shared with the UK-Biobank, the following categories of variants were included in the design for this component:

Affymetrix Biobank array content: We considered a total of approximately 250,000 SNPs from the Axiom Biobank Genotyping Array, including 86,000 putative exonic SNVs and putative LoF variants. As not all of these have been validated and many are not polymorphic in the general populations, we used one of the largest whole-exome sequencing reference datasets available at the time of the design, comprising over 32,000 samples, to annotate and filter these variants based on the observed minor allele counts (MACs). We included only those variants with MACs greater than five observations in this database, which yielded approximately 168K exonic or coding variants and over 16K putative LoF variants. A total of 178,680 unique variants were selected in this module.
Human Gene Mutation Database (HGMD): We curated variants of The HGMD LoF database (up until 1 August 2013). Again, as above, we only included MAC observed greater than five times, for a total of 3,571 variants (See Additional file 1: Table S3).
Additional LoF variants : Using the above-noted approximately 32,000 exome database, we identified additional putative LoFs included in the Affymetrix Biobank Genotyping Array, UK Biobank Axiom Array, or HGMD databases; again, filtering the observed SNVs and indels from analysis across over 32,000 human exomes [32] for at least MACs greater than 5, we obtained a conservative set of 8,557 unique putative exonic SNVs and/or putative LoF variants (See Additional file 1: Table S3).
Untranslated Region (UTR) Coverage: To provide maximal coverage of SNPs that may affect functional gene expression, we additionally focused on the coverage of 5’ (and 3’) UTRs defined as the exonic region between the transcriptional start (stop) and translational start (stop) sites as defined by either the RefSeq or ENSEMBLE human genome (hg19) reference sequences in June 2013. Using a MAF cutoff for inclusion of >1 % or 5 % in CEU and AFR (ASW + YRI) populations, respectively, we included a total of approximately 184,000 SNPs as shown in Additional file 1: Table S4 and described in the Supplementary Material.
A priori associations: To focus on known phenotypes, 8,136 SNPs that reached a conventional GWS threshold at P <5 × 10⁻⁸ (December 2012) for both quantitative traits and disease-specific reported in NHGRI GWAS Catalog were included.

Copy number variations (CNVs) and polymorphisms (CNPs)

CNP tagging and regional coverage

To cover common genomic structural elements by SNP-tagging we included 5,410 markers (See Additional file 1: Table S5A) and we used an additional 21,960 variants to cover approximately 2,200 manually curated CNV regions as described in the Supplementary Materials.

E. GWAS booster

The GWAS ‘booster module’ includes a set of additional markers (on top of the markers included in the main modules) selected by identifying the minimal set of markers that will provide the optimal added coverage value (with regard to the best overall coverage for whole genome imputation). Since the goal is to fill the array and gain additional coverage with the minimal number of markers, we used LD-based pair-wise tagging. Specifically, we focused on improving the coverage of common variants (MAF >2 % in CEU and MAF >5 % in AFR populations) by selectively adding additional variants resulting in selection a total of 135,363 additional markers based on the projected improvements in the overall coverage (Additional file 2: Figure S1). The online resource [33] outlines a comprehensive list of SNPs and genes for the TxArray.

This study conformed to the Helsinki Declaration as well as to local legislation. Informed and written consent was obtained independently for each iGeneTRAIN study participant, with appropriate oversight and approvals from respective local institutional review boards/Research Ethics Committees to use either summary-level or anonymized individual-level data. A number of our GWA studies are mandated to release their datasets into dbGAP under their funding conditions and subject to the ethical consents in place. We will update these dbGAP uploads on the [33] site every 3 months.

Results

Quality control

Assays for approximately 782,000 markers were manufactured following recommendations based on the Affymetrix Best Practices protocol [48] for performing genotype marker QC using a merged set of 4,885 DNA samples including those from the DeKAF study site and samples genotyped expressly for the purpose of quality control assessment. The latter consists of 85 samples from the HapMap project. Genotype clustering efficiency was also performed per manufacturer’s recommendations based on unique parameters established for this specific custom-design array, which consists of a high number of markers covering loss of function and copy number variable regions that may not be polymorphic in the vast majority of the population.

Genotype concordance with HapMap, 1KGP panels, and duplicate samples

We genotyped 85 HapMap samples on the TxArray to test concordance and genotyping quality. We included 48, 24, and 13 samples of European (CEU), Asian (JPT, CHB), and African (YRI) ancestry, respectively. All analyses were performed with PLINK (1.07/1.9) [49]. First, we examined eight trios (four CEU, four YRI) for Mendelian errors. All 767,203 genotyped SNPs passing QC based on manufacturer’s metrics were included and Mendelian inconsistency was calculated based on the number of total instances where a child’s genotypes at a given SNP position are not attributable to that of either parent, for instance if both parents have an AA genotype, while the child has AB. The number of SNPs errors for each of the eight families varied between 264 and 4,672, which corresponds to a parent-parent-child (P-P-C) heritability greater than 0.993 and an average of 0.997 (Table 1).

Table 1 Mendelian consistencies among HapMap family samples genotyped

Full size table

Next, we investigated concordance between 279,061 genotypes overlapping between our genotyping array and HapMap2 (r22, b36). We tested 22,944,075 sample-SNP combinations for concordance and observed a concordance rate of 0.996. Concordance rates for the three populations (African, Asian, and European) were very similar (Table 2). As our array is specifically set up to test MHC and X chromosome SNPs, we also tested SNPs in these two regions. Results are comparable to the overall concordance: The concordance rate for the MHC SNPs is 0.994, and 0.998 for SNPs on the X chromosome (Table 2). We also performed this analysis using data from the 1KGP reference panel and observed comparably high concordance rates (Table 2). Overall, we show that the genotyping quality of the TxArray is high, which enables accurate association testing and imputation of ungenotyped SNPs using reference panels such as 1KGP and Go-NL consortia.

Table 2 Genotyping concordance rates across HapMap and 1000 Genomes Panel samples genotyped on the TxArray

Full size table

Fifty duplicate pairs across 12 cohorts were genotyped using the TxArray. To assess the quality of the genotyping array, we tested concordance of all SNPs that were non-missing in both samples. In assessments of between approximately 742,000 and 765,000 SNPs we observed that on average 99.657 % of SNPs were fully concordant (that is, both alleles correspond), while 0.341 % of SNPs had a single concordant allele and only 0.002 % of SNPs were fully discordant.

Comparisons with conventional GWAS platforms

Coverage of the 1KGP panel markers

We compared the mean coverage (based on maximum achievable r2) across common markers (either MAF >0.05 or 0.01) in the 1KGP achieved by either the marker content designed on the TxArray versus that by a number of conventional genotyping platforms (that is, Infinium 1M and 660K Beadchips (Illumina), and Genome-Wide Human SNP Array 6.0 (Affymetrix)).

Figure 3a and b shows the composite coverage of markers in the 1KGP panel by the markers genotyped on the TxArray versus other conventional GWAS products for European (CEU and Toscani in Italia (TSI)), African (ASW/YRI), Admixed American (AMR) (Colombians from Medellin, Colombia (CLM), Mexican Ancestry from Los Angeles USA (MXL), and Puerto Ricans from Puerto Rico (PUR)), and Asian (ASN) (CHB, Southern Han Chinese (CHS), and JPT) individuals using MAF cutoffs of >0.01 and >0.05 for the full range of r² cutoff thresholds (from r² = 0 to 1). The TxArray performed comparably next to these other genotyping SNP chips, which were designed to provide optimal genome-wide coverage even though the TxArray devoted a significant number of markers to transplant-specific, rare loss of function and MHC/KIR specific content.

Coverage of exonic, MHC, and KIR locus markers

The TxArray also provided efficient coverage of markers across the exonic, KIR, and MHC regions when compared to the commonly-used Illumina 1M platform (Fig. 4a, b, and c, respectively). While mean expected coverage is comparable for the exonic and MHC regions, the TxArray provides a significantly improved coverage of markers across the KIR locus, which has been a region that has arguably received insufficient attention in most transplant association studies.

Imputation

We performed genotype imputation across all autosomal regions for 12 iGeneTRAIN studies (n = 12,048 post-QC GWAS samples) using ShapeIT2/ IMPUTE2 with the 1KGP reference panel (v3), resulting in approximately 38 million variant calls. We masked 0.2 % directly genotyped SNPs, with separate imputation performed with and without these 0.2 % SNPs to assess proxy genome-wide imputation accuracy. We report an accuracy in the range of 96.24 % to 97.71 % for directly genotyped SNPs across the genome for the 12,048 GWAS datasets imputed.

We looked at number of SNPs per MAF bins (0.01 intervals) in all imputed data. We observed that INFO score (quality metric to estimate uncertainty in imputation) for variants below MAF 0.05 declines but as MAF increases, INFO score also increases. In most cases, all variants above MAF 0.05 have INFO scores greater than 0.8 (data not shown). From masked analysis, we looked at concordance among each masked SNP where INFO scores were greater than 0.8, and results indicated very high concordance among all masked SNPs. Comparison of imputation accuracy/metrics from two independent pipelines (both using ShapeIT/IMPUTE2 with 1KGP as the reference population) was performed for The Genomics of Chronic Renal Allograft Rejection (Go-CAR) Study at Penn State and Mount Sinai and >99.998 % of imputed SNPs were concordant.

We looked at number of SNPs per MAF bins (0.01 intervals) in all imputed data as well as after performing masked analysis where 0.1 % of genotyped markers were removed and imputed again to assess the accuracy of our imputation. We observed that INFO score (quality metric to estimate uncertainty in imputation) for variants below MAF 0.05 declines but as MAF increases, info score also increases. In most cases, all variants above MAF 0.05 have info score greater than 0.8.

Using Beagle version 3.0.4, we imputed classical alleles and amino acid polymorphisms in HLA-A, HLA-B, HLA-C, HLA-DPA1, HLA-DPB1, HLA-DQA1, HLA-DQB1, and HLA-DRB1 at a four-digit resolution, as well as an additional 3,117 MHC SNPs. We used data collected by the T1DGC as a reference panel, which include 5,225 individuals of European descent. Methods have been described previously in more detail [38, 50].

Discussion

We have designed and implemented a genome-wide SNP array tailored for deeper capture of variation in loci of high priority in transplantation. Our primary goal in the array design was to generate a low-cost genome-wide array while maximizing coverage of known or putative transplant-related content. In attempts to unveil allogenicity between D-R genomes we also augmented the custom content from all available resources for rarer LoF variants that may not be identified using traditional association studies based on imputation. Flexibility in SNP selection afforded the ability to: (1) ensure selective and consistent coverage for a range of prioritized loci across multiple ancestries; (2) provide deeper coverage beyond conventional HapMap populations; (3) to directly assay specific SNPs derived from previously published transplant studies; and (4) updated HLA and KIR, pharmacogenomic, and LoF variants. We demonstrate much deeper coverage in high priority regions such as KIR and tag SNPs in these regions provide much better coverage for populations of African ancestry relative to existing GWAS products.

With the recent reports of bona fide associations transplant outcomes and/or pharmacokinetics of immunosuppression medication with genetic polymorphisms in transplant-related genes (for example, APOL1, IL28B, CYP3A4/5) [51–54], one of the current challenges will be to determine how these variants and loci at a molecular and mechanistic level and how they interact with other variants with drugs used in therapy and prevention, towards intermediate and clinical phenotypes.

In designing the transplant SNP v1 array we have also targeted all nsSNPs >MAF 0.01 using information from both HapMap and 1KGP and have tagged to MAFs >0.02 for a large number of key loci related to key transplantation outcomes such as pharmacogenomic and metabolic-related traits. We have also updated the HLA and KIR content with the most-extensive content known to date. As with other recent custom genotyping arrays in cardiovascular diseases and metabolism as well as in autoimmune diseases, the TxArray facilitates conduct of powerful and cost-effective large-scale genotyping of transplant-related studies. Such platform enable integration with additional ‘omics’ datasets such as transcriptomics, proteomics, and metabolomics to provide richer analyses in a number of the studies using this tool. Additionally, as other transplant related cohorts are utilizing this TxArray for GWAS, validation of results and comprehensive meta-analyses will be much more robust. The TxArray achieves dramatic reduction compared to designing single-trait follow-up reagents, and provides the opportunity for transplant community researchers to perform unbiased genome-wide analysis and cross-consortium independent replications.

Conclusions

We report the design and implementation of a state-of-the-art, powerful oligonucleotide array that is optimized to interrogate the genome for associations with transplant-related phenotypes and outcomes. This array, the TxArray, includes independent modules that encompass fine-mapping SNPs mapping across the MHC useful for HLA imputation, in drug-response associated loci for the study of pharmacogenomics and adverse drug response, previously-reported genes associated with transplant-related outcomes, among others, that are of high priority among the transplant community. With the advent of this array and the formation of the iGeneTRAIN consortium, it is our aim that the downstream application of such genomics technologies can ultimately generate associations which will be applied as personalized and precision-oriented genomic tools to solve clinical questions and improving patient outcomes in transplantation.

Abbreviations

1KGP:: 1000 genome project
APOL1:: Apolipoprotein L1
CNV:: copy number variants
CYP3A:: Cytochrome P450 3A
D-Rs:: donor-recipient pairs
GoNL:: Genomes of the Netherlands
GWAS:: genome-wide association studies
HCT:: hematopoietic cell transplantation
HGMD:: Human Gene Mutation Database
IL28B:: Interleukin 28B
ISTs:: Immunosuppression therapies
KIR:: natural killer cell immunoglobulin-like receptor
LoF:: loss of function
MAFs:: minor allele frequencies
MICA and MICB:: MHC class I polypeptide-related sequence-A and –B
NODAT:: new onset of diabetes after transplantation
OPTN:: Organ Procurement and Transplantation Network
SAE:: severe adverse events
SNPs:: single nucleotide polymorphisms
T1DGC:: Type 1 Diabetes Genetic Consortium
UTR:: Untranslated Region

References

Organ Procurement and Transplantation Network. Available at: http://optn.transplant.hrsa.gov.
Stehlik J, Edwards LB, Kucheryavaya AY, Aurora P, Christie JD, Kirk R, et al. The Registry of the International Society for Heart and Lung Transplantation: twenty-seventh official adult heart transplant report--2010. J Heart Lung Transplant. 2010;29:1089–103.
Article PubMed Google Scholar
Burton CM, Iversen M, Carlsen J, Mortensen J, Andersen CB, Steinbrüchel D, et al. Acute cellular rejection is a risk factor for bronchiolitis obliterans syndrome independent of post-transplant baseline FEV1. J Heart Lung Transplant. 2009;28:888–93.
Article PubMed Google Scholar
Terasaki PI. Deduction of the fraction of immunologic and non-immunologic failure in cadaver donor transplants. Clin Transpl. 2003;449–52.
Kaplan B, Qazi Y, Wellen JR. Strategies for the management of adverse events associated with mTOR inhibitors. Transplant Rev (Orlando). 2014;28:126–33.
Article Google Scholar
Hornum M, Lindahl JP, von Zur-Mühlen B, Jenssen T, Feldt-Rasmussen B. Diagnosis, management and treatment of glucometabolic disorders emerging after kidney transplantation: a position statement from the Nordic Transplantation Societies. Transpl Int. 2013;26:1049–60.
Article PubMed Google Scholar
McCarthy MI, Hirschhorn JN. Genome-wide association studies: potential next steps on a genetic journey. Hum Mol Genet. 2008;17:R156–65.
Article PubMed Central CAS PubMed Google Scholar
Peiffer DA, Gunderson KL. Design of tag SNP whole genome genotyping arrays. Methods Mol Biol. 2009;529:51–61.
Article CAS PubMed Google Scholar
International HapMap Consortium. The International HapMap Project. Nature. 2003;426:789–96.
Article Google Scholar
1000 Genomes Project Consortium, Abecasis GR, Altshuler D, Auton A, Brooks LD, Durbin RM, et al. A map of human genome variation from population-scale sequencing. Nature. 2010;467:1061–73.
Article Google Scholar
Gratwohl A, Döhler B, Stern M, Opelz G. H-Y as a minor histocompatibility antigen in kidney transplantation: a retrospective cohort study. Lancet. 2008;372:49–53.
Article CAS PubMed Google Scholar
Venstrom JM, Pittari G, Gooley TA, Chewning JH, Spellman S, Haagenson M, et al. HLA-C-dependent prevention of leukemia relapse by donor activating KIR2DS1. N Engl J Med. 2015;367:805–16.
Article Google Scholar
Vampa ML, Norman PJ, Burnapp L, Vaughan RW, Sacks SH, Wong W. Natural killer-cell activity after human renal transplantation in relation to killer immunoglobulin-like receptors and human leukocyte antigen mismatch. Transplantation. 2003;76:1220–8.
Article CAS PubMed Google Scholar
Tan JC, Kim JP, Chertow GM, Grumet FC, Desai M. Donor-recipient sex mismatch in kidney transplantation. Gend Med. 2012;9:335–47. e2.
Article PubMed Central PubMed Google Scholar
Sigdel TK, Sarwal MM. Moving beyond HLA: a review of nHLA antibodies in organ transplantation. Hum Immunol. 2013;74:1486–90.
Article CAS PubMed Google Scholar
McCarroll SA, Bradner JE, Turpeinen H, Volin L, Martin PJ, Chilewski SD, et al. Donor-recipient mismatch for common gene deletion polymorphisms in graft-versus-host disease. Nat Genet. 2009;41:1341–4.
Article PubMed Central CAS PubMed Google Scholar
McCaughan JA, McKnight AJ, Maxwell AP. Genetics of new-onset diabetes after transplantation. J Am Soc Nephrol. 2014;25:1037–49.
Article PubMed Central CAS PubMed Google Scholar
Birdwell KA, Grady B, Choi L, Xu H, Bian A, Denny JC, et al. The use of a DNA biobank linked to electronic medical records to characterize pharmacogenomic predictors of tacrolimus dose requirement in kidney transplant recipients. Pharmacogenet Genomics. 2012;22:32–42.
Article PubMed Central CAS PubMed Google Scholar
Oetting WS, Schladt DP, Leduc RE, Jacobson PA, Guan W, Matas AJ, et al. Validation of single nucleotide polymorphisms associated with acute rejection in kidney transplant recipients using a large multi-center cohort. Transpl Int. 2011;24:1231–8.
Article PubMed Central CAS PubMed Google Scholar
Marchini J, Howie B. Genotype imputation for genome-wide association studies. Nat Rev Genet. 2010;11:499–511.
Article CAS PubMed Google Scholar
Howie B, Marchini J, Stephens M. Genotype imputation with thousands of genomes. G3 (Bethesda). 2011;1:457–70.
Article Google Scholar
Browning SR, Browning BL. Rapid and accurate haplotype phasing and missing-data inference for whole-genome association studies by use of localized haplotype clustering. Am J Hum Genet. 2007;81:1084–97.
Article PubMed Central CAS PubMed Google Scholar
The Genome of the Netherlands Consortium. Whole-genome sequence variation, population structure and demographic history of the Dutch population. Nat Genet. 2014;46:818–25.
Article Google Scholar
Abecasis GR, Auton A, Brooks LD, DePristo MA, Durbin RM, Handsaker RE, et al. An integrated map of genetic variation from 1,092 human genomes. Nature. 2012;491:56–65.
Article PubMed Google Scholar
Keating BJ, Tischfield S, Murray SS, Bhangale T, Price TS, Glessner JT, et al. Concept, design and implementation of a cardiovascular gene-centric 50 k SNP array for large-scale genomic association studies. PLoS One. 2008;3:e3583.
Article PubMed Central PubMed Google Scholar
Cortes A, Brown MA. Promise and pitfalls of the Immunochip. Arthritis Res Ther. 2011;13:101.
Article PubMed Central PubMed Google Scholar
Voight BF, Kang HM, Ding J, Palmer CD, Sidore C, Chines PS, et al. The metabochip, a custom genotyping array for genetic studies of metabolic, cardiovascular, and anthropometric traits. PLoS Genet. 2012;8:e1002793.
Article PubMed Central CAS PubMed Google Scholar
Hoffmann TJ, Kvale MN, Hesselson SE, Zhan Y, Aquino C, Cao Y, et al. Next generation genome-wide association tool: design and coverage of a high-throughput European-optimized SNP array. Genomics. 2011;98:79–89.
Article PubMed Central CAS PubMed Google Scholar
UK Biobank Array Design Group. UK Biobank Axiom Array Datasheet. 2014. Available from: http://www.ukbiobank.ac.uk/wp-content/uploads/2014/04/UK-Biobank-Axiom-Array-Datasheet-2014.pdf.
Affymetrix Inc. Axiom Biobank Genotyping Arrays. 2014. Available from: http://media.affymetrix.com/support/technical/datasheets/axiom_biobank_genotyping_arrays_datasheet.pdf
MacArthur DG, Balasubramanian S, Frankish A, Huang N, Morris J, Walter K, et al. A Systematic survey of loss-of-function variants in human protein-coding genes. Science. 2012;335:823–8.
Article PubMed Central CAS PubMed Google Scholar
ExAC Browser. Available from: http://exac.broadinstitute.org.
iGeneTrain. Available from: www.igenetrain.org.
Hindorff LA, Sethupathy P, Junkins HA, Ramos EM, Mehta JP, Collins FS, et al. Potential etiologic and functional implications of genome-wide association loci for human diseases and traits. Proc Natl Acad Sci U S A. 2009;106:9362–7.
Article PubMed Central CAS PubMed Google Scholar
Welter D, MacArthur J, Morales J, Burdett T, Hall P, Junkins H, et al. The NHGRI GWAS Catalog, a curated resource of SNP-trait associations. Nucleic Acids Res. 2014;42:D1001–6.
Article PubMed Central CAS PubMed Google Scholar
T G, de Bakker PIW, McVean G, Sabeti PC, Miretti MM, Green T, et al. A high-resolution HLA and SNP haplotype map for disease association studies in the extended human MHC. Nat Genet. 2006;38:1166.
Article Google Scholar
Horton R, Wilming L, Rand V, Lovering RC, Bruford EA, Khodiyar VK, et al. Gene map of the extended human MHC. Nat Rev Genet. 2004;5:889.
Article CAS PubMed Google Scholar
Jia X, Han B, Onengut-Gumuscu S, Chen W-M, Concannon PJ, Rich SS, et al. Imputing amino acid polymorphisms in human leukocyte antigens. PLoS One. 2013;8:e64683.
Article PubMed Central CAS PubMed Google Scholar
Hewett M, Oliver DE, Rubin DL, Easton KL, Stuart JM, Altman RB, et al. PharmGKB: the Pharmacogenetics Knowledge Base. Nucleic Acids Res. 2002;30:163–5.
Article PubMed Central CAS PubMed Google Scholar
Jacobson PA, Schladt D, Oetting WS, Leduc R, Guan W, Matas AJ, et al. Genetic determinants of mycophenolate-related anemia and leukopenia after transplantation. Transplantation. 2011;91:309–16.
Article PubMed Central CAS PubMed Google Scholar
Jacobson PA, Oetting WS, Brearley AM, Leduc R, Guan W, Schladt D, et al. Novel polymorphisms associated with tacrolimus trough concentrations: results from a multicenter kidney transplant consortium. Transplantation. 2011;91:300–8.
Article PubMed Central CAS PubMed Google Scholar
Jacobson PA, Schladt D, Israni A, Oetting WS, Lin YC, Leduc R, et al. Genetic and clinical determinants of early, acute calcineurin inhibitor-related nephrotoxicity: results from a kidney transplant consortium. Transplantation. 2012;93:624–31.
Article PubMed Central CAS PubMed Google Scholar
O’Brien RP, Phelan PJ, Conroy J, O’Kelly P, Green A, Keogan M, et al. A genome-wide association study of recipient genotype and medium-term kidney allograft function. Clin Transplant. 2013;27:379–87.
Article PubMed Google Scholar
Marder B, Schröppel B, Murphy B. Genetic variability and transplantation. Curr Opin Urol. 2003;13:81–9.
Article PubMed Google Scholar
Goldfarb-Rumyantzev AS, Naiman N. Genetic prediction of renal transplant outcome. Curr Opin Nephrol Hypertens. 2008;17:573–9.
Article CAS PubMed Google Scholar
Krüger B, Schröppel B, Murphy BT. Genetic polymorphisms and the fate of the transplanted organ. Transplant Rev (Orlando). 2008;22:131–40.
Article Google Scholar
Nickerson P. The impact of immune gene polymorphisms in kidney and liver transplantation. Clin Lab Med. 2008;28:455–68.
Article PubMed Google Scholar
Affymetrix. Available from: www.affymetrix.com.
Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MAR, Bender D, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007;81:559–75.
Article PubMed Central CAS PubMed Google Scholar
Sherman deBakker, P. J. TIDGC Immunochip HLA reference Panel [Internet]. Repository NC, editor. 2013. Available from: https://www.niddkrepository.org/studies/t1dgc-special/
Charlton MR, Thompson A, Veldt BJ, Watt K, Tillmann H, Poterucha JJ, et al. Interleukin-28B polymorphisms are associated with histological recurrence and treatment response following liver transplantation in patients with hepatitis C virus infection. Hepatology. 2011;53:317–24.
Article CAS PubMed Google Scholar
Reeves-Daniel AM, DePalma JA, Bleyer AJ, Rocco MV, Murea M, Adams PL, et al. The APOL1 gene and allograft survival after kidney transplantation. Am J Transplant. 2011;11:1025–30.
Article PubMed Central CAS PubMed Google Scholar
Pallet N, Jannot A-S, El Bahri M, Etienne I, Buchler M, de Ligny BH, et al. Kidney transplant recipients carrying the CYP3A4*22 allelic variant have reduced tacrolimus clearance and often reach supratherapeutic tacrolimus concentrations. Am J Transplant. 2015;15:800–5.
Article CAS PubMed Google Scholar
Zuo X, Ng CM, Barrett JS, Luo A, Zhang B, Deng C, et al. Effects of CYP3A4 and CYP3A5 polymorphisms on tacrolimus pharmacokinetics in Chinese adult renal transplant recipients: a population pharmacokinetic analysis. Pharmacogenet Genomics. 2013;23:251–61.
Article CAS PubMed Google Scholar

Download references

Acknowledgements

We would like to thank the patients and their families for their participation in the genotyping studies. We are also very thankful for the contributions of the all of the study coordinators and clinicians from the respective studies that made collection of these DNA samples and phenotyping possible. This project was funded in part by Fundación Mutua Madrileña, Spain. Part of this work is also supported by the Dutch PLN Foundation (www.stichtingpln.nl). Folkert W. Asselbergs is supported by UCL Hospitals NIHR Biomedical Research Centre and by a Dekker scholarship-Junior Staff Member 2014T001 – Netherlands Heart Foundation. Partial funding was also provided by the Deanship of Scientific Research at King Saud University, Riyadh. We also acknowledge funding from the following NIH grants: U01 HG006830; U01-DK062494; UM1AI109565; U01-AI63589 and U19-AI070119.

Author information

Authors and Affiliations

Medical Scientist Training Program, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
Yun R. Li, Abdullah Akdere, Ana Gonzalez, Kelsey M. Llyod, Daniel McGinn, Abhinav Gangasani, Zach Michaud & Abigail Colasacco
The Children’s Hospital of Philadelphia, Philadelphia, PA, USA
Yun R. Li, Hui Gao, Nikhil Nair, Hareesh Chandrupatla, Baoli Chang, Chanel Wong, Maede Mohebnasab, Eyas Mukhtar, Randy Phillips, Cuiping Hou, Laura Steel, Takesha Lee, James Garifallou, Toumy Guettouche, Aubree Himes, Jacob van Houten, Andrew Pasquier, Reina Yu, Elena Carrigan, James Snyder, Kelly Thomas, Tiancheng Wang, Dimitri S. Monos, Hakon Hakonarson & Brendan J. Keating
Department of Cardiology, Division of Heart and Lungs, University Medical Center Utrecht, Utrecht, The Netherlands
Jessica van Setten, Vinicius Tragante & Folkert W. Asselbergs
Affymetrix Incorporated, Santa Clara, CA, USA
Yontao Lu & Teresa Webster
Penn Transplant Institute, Hospital of the University of Pennsylvania, Philadelphia, PA, USA
Michael V. Holmes, Hui Gao, Nikhil Nair, Hareesh Chandrupatla, Baoli Chang, Chanel Wong, Eyas Mukhtar, Randy Phillips, Laura Steel, Takesha Lee, Aubree Himes, Kim M. Olthoff, Matthew B. Lanktree, Abraham Shaked & Brendan J. Keating
Center for Systems Genomics, The Pennsylvania State University, University Park, PA, USA
Shefali S. Verma & Marylyn D. Ritchie
Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
Monkol Lek, Konrad J. Karczewski & Daniel G. MacArthur
Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, MA, USA
Monkol Lek, Konrad J. Karczewski & Daniel G. MacArthur
Department of Medical Genetics, Center for Molecular Medicine and Department of Epidemiology, Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht, The Netherlands
Paul I W de Bakker
BGI-Shenzhen, Shenzhen, China
Hongzhi Cao
Department of Biology, University of Copenhagen, Copenhagen, Denmark
Hongzhi Cao
Division of Biostatistics, University of Minnesota, Minneapolis, MN, USA
Weihua Guan & Baolin Wu
Department of Psychology, University of Minnesota, Minneapolis, MN, USA
Michael B. Miller
College of Medicine, University of Dammam, Dammam, Kingdom of Saudi Arabia
Amein K. Al-Ali, Fahad A. Al-Muhanna, Abdullah M. Al-Rubaish & Samir Al-Mueilo
Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania and the Children’s Hospital of Philadelphia, Philadelphia, PA, USA
Dimitri S. Monos & Malek Kamoun
Division of Nephrology and Department of Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA
Barbara Murphy
Department of Genetics, The University Medical Center Groningen, Groningen, The Netherlands
Cisca Wijmenga
Program in Computational Biology and Bioinformatics, and Molecular Biophysics and Biochemistry Department, Yale University, New Haven, CT, 06520, USA
Suganthi Balasubramanian
Experimental and Clinical Pharmacology, University of Minnesota, Minneapolis, MN, USA
William S. Oetting
Heart Failure and Inherited Cardiac Diseases Unit, Department of Cardiology, Hospital Universitario Puerta de Hierro Majadahonda, Madrid, Spain
Pablo Garcia-Pavia
School of Medicine, Vanderbilt University, Nashville, TN, USA
Kelly A. Birdwell
College of Pharmacy, University of Minnesota, Minneapolis, USA
Pamala A. Jacobson
Durrer Center for Cardiogenetic Research, ICIN-Netherlands Heart Institute, Utrecht, The Netherlands
Folkert W. Asselbergs
Institute of Cardiovascular Science, faculty of Population Health Sciences, University College London, London, UK
Folkert W. Asselbergs
Department of Clinical Laboratories Sciences, College of Applied Medical Sciences, King Saud University, Riyadh, Saudi Arabia
Alhusain J. Alzahrani
Minneapolis Medical Research Foundation, Hennepin County Medical Center, Minneapolis, MN, USA
David Schladt
Hennepin County Medical Center, University of Minneosta, Minneapolis, MN, USA
Ajay K. Israni
Department of Pediatrics, University of Pennsylvania, Philadelphia, PA, USA
Brendan J. Keating
Division of Transplantation, 2 Dulles, Hospital of the University of Pennsylvania, 3400 Spruce Street, Philadelphia, PA, 19104, USA
Brendan J. Keating

Authors

Yun R. Li
View author publications
You can also search for this author in PubMed Google Scholar
Jessica van Setten
View author publications
You can also search for this author in PubMed Google Scholar
Shefali S. Verma
View author publications
You can also search for this author in PubMed Google Scholar
Yontao Lu
View author publications
You can also search for this author in PubMed Google Scholar
Michael V. Holmes
View author publications
You can also search for this author in PubMed Google Scholar
Hui Gao
View author publications
You can also search for this author in PubMed Google Scholar
Monkol Lek
View author publications
You can also search for this author in PubMed Google Scholar
Nikhil Nair
View author publications
You can also search for this author in PubMed Google Scholar
Hareesh Chandrupatla
View author publications
You can also search for this author in PubMed Google Scholar
Baoli Chang
View author publications
You can also search for this author in PubMed Google Scholar
Konrad J. Karczewski
View author publications
You can also search for this author in PubMed Google Scholar
Chanel Wong
View author publications
You can also search for this author in PubMed Google Scholar
Maede Mohebnasab
View author publications
You can also search for this author in PubMed Google Scholar
Eyas Mukhtar
View author publications
You can also search for this author in PubMed Google Scholar
Randy Phillips
View author publications
You can also search for this author in PubMed Google Scholar
Vinicius Tragante
View author publications
You can also search for this author in PubMed Google Scholar
Cuiping Hou
View author publications
You can also search for this author in PubMed Google Scholar
Laura Steel
View author publications
You can also search for this author in PubMed Google Scholar
Takesha Lee
View author publications
You can also search for this author in PubMed Google Scholar
James Garifallou
View author publications
You can also search for this author in PubMed Google Scholar
Toumy Guettouche
View author publications
You can also search for this author in PubMed Google Scholar
Hongzhi Cao
View author publications
You can also search for this author in PubMed Google Scholar
Weihua Guan
View author publications
You can also search for this author in PubMed Google Scholar
Aubree Himes
View author publications
You can also search for this author in PubMed Google Scholar
Jacob van Houten
View author publications
You can also search for this author in PubMed Google Scholar
Andrew Pasquier
View author publications
You can also search for this author in PubMed Google Scholar
Reina Yu
View author publications
You can also search for this author in PubMed Google Scholar
Elena Carrigan
View author publications
You can also search for this author in PubMed Google Scholar
Michael B. Miller
View author publications
You can also search for this author in PubMed Google Scholar
David Schladt
View author publications
You can also search for this author in PubMed Google Scholar
Abdullah Akdere
View author publications
You can also search for this author in PubMed Google Scholar
Ana Gonzalez
View author publications
You can also search for this author in PubMed Google Scholar
Kelsey M. Llyod
View author publications
You can also search for this author in PubMed Google Scholar
Daniel McGinn
View author publications
You can also search for this author in PubMed Google Scholar
Abhinav Gangasani
View author publications
You can also search for this author in PubMed Google Scholar
Zach Michaud
View author publications
You can also search for this author in PubMed Google Scholar
Abigail Colasacco
View author publications
You can also search for this author in PubMed Google Scholar
James Snyder
View author publications
You can also search for this author in PubMed Google Scholar
Kelly Thomas
View author publications
You can also search for this author in PubMed Google Scholar
Tiancheng Wang
View author publications
You can also search for this author in PubMed Google Scholar
Baolin Wu
View author publications
You can also search for this author in PubMed Google Scholar
Alhusain J. Alzahrani
View author publications
You can also search for this author in PubMed Google Scholar
Amein K. Al-Ali
View author publications
You can also search for this author in PubMed Google Scholar
Fahad A. Al-Muhanna
View author publications
You can also search for this author in PubMed Google Scholar
Abdullah M. Al-Rubaish
View author publications
You can also search for this author in PubMed Google Scholar
Samir Al-Mueilo
View author publications
You can also search for this author in PubMed Google Scholar
Dimitri S. Monos
View author publications
You can also search for this author in PubMed Google Scholar
Barbara Murphy
View author publications
You can also search for this author in PubMed Google Scholar
Kim M. Olthoff
View author publications
You can also search for this author in PubMed Google Scholar
Cisca Wijmenga
View author publications
You can also search for this author in PubMed Google Scholar
Teresa Webster
View author publications
You can also search for this author in PubMed Google Scholar
Malek Kamoun
View author publications
You can also search for this author in PubMed Google Scholar
Suganthi Balasubramanian
View author publications
You can also search for this author in PubMed Google Scholar
Matthew B. Lanktree
View author publications
You can also search for this author in PubMed Google Scholar
William S. Oetting
View author publications
You can also search for this author in PubMed Google Scholar
Pablo Garcia-Pavia
View author publications
You can also search for this author in PubMed Google Scholar
Daniel G. MacArthur
View author publications
You can also search for this author in PubMed Google Scholar
Paul I W de Bakker
View author publications
You can also search for this author in PubMed Google Scholar
Hakon Hakonarson
View author publications
You can also search for this author in PubMed Google Scholar
Kelly A. Birdwell
View author publications
You can also search for this author in PubMed Google Scholar
Pamala A. Jacobson
View author publications
You can also search for this author in PubMed Google Scholar
Marylyn D. Ritchie
View author publications
You can also search for this author in PubMed Google Scholar
Folkert W. Asselbergs
View author publications
You can also search for this author in PubMed Google Scholar
Ajay K. Israni
View author publications
You can also search for this author in PubMed Google Scholar
Abraham Shaked
View author publications
You can also search for this author in PubMed Google Scholar
Brendan J. Keating
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Brendan J. Keating.

Additional information

Competing interests

Yontao Li and Theresa Webster are employees in the SNP Array Research Division of Affymetrix, CA, USA. All other co-authors declare that they have no competing interests.

Authors’ contributions

FWA, AKI, WO, PAJ, AS, and BJK conceived the study. YRL, JvS, YL, MVH, SSV, ML, NN, HG, HC, KJK, CW, MM, EM, RP, VT, LS, TL, JG , HC, AH, JvH, AP, RU, EC, AAA, FAA, AMA, BM KMO, CW, TW, MK, SB, MBL, WSO, PGP, DGMA, PAJ, FWA, AKI, AS, and BJK were involved in design of the array. YRL, YL, MVH, SSV, ML, NN, HG, HC, KJK, HC, WG, TG, DSM, TW, DGMA, HH, KB, MDR, PAJ, FWA, AKI, AS, and BJK were involved in acquisition of data, analysis and/or interpretation of data. YRL, JvS, YL, MVH, KB, FWA, AKI, AS, and BJK were involved in drafting the manuscript and critical revisions. All authors read and approved the final manuscript, and are accountable for all aspects and integrity of the work.

Additional files

Additional file 1: Table S1.

Tagging and coverage of MHC region markers. Table S2: Tagging and coverage of Tx-specific genes. Table S3: Untranslated regions (UTRs) considered in the TxArray design. Table S4: Loss-of-function variants included in the TxArray. Table S5: Copy number polymorphisms (CNPs) and variations (CNVs) included in the TxArray. (DOCX 54 kb)

Additional file 2: Figure S1.

TxArray transplant-specific modular contents. (PDF 140 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Cite this article

Li, Y.R., van Setten, J., Verma, S.S. et al. Concept and design of a genome-wide association genotyping array tailored for transplantation-specific studies. Genome Med 7, 90 (2015). https://doi.org/10.1186/s13073-015-0211-x

Download citation

Received: 10 February 2015
Accepted: 28 July 2015
Published: 01 October 2015
DOI: https://doi.org/10.1186/s13073-015-0211-x

Concept and design of a genome-wide association genotyping array tailored for transplantation-specific studies

Abstract

Background

Methods

Results

Conclusions

Background

Methods

Affymetrix genotyping platform and assay technology

Array design and variant selection

A. Cross-platform ‘cosmopolitan’ genome-wide coverage markers (approximately 350,000 markers)

B. Module-specific content from the UK Biobank core array (approximately 36K markers)

Targeted MHC and transplant-specific modules

MHC and KIR content for fine-mapping and imputation

C. Transplant-specific content

Pharmacogenomic

Candidate genes associated with transplant outcomes

Functional variants modules

Copy number variations (CNVs) and polymorphisms (CNPs)

CNP tagging and regional coverage

E. GWAS booster

Results

Quality control

Genotype concordance with HapMap, 1KGP panels, and duplicate samples

Comparisons with conventional GWAS platforms

Coverage of the 1KGP panel markers

Coverage of exonic, MHC, and KIR locus markers

Imputation

Discussion

Conclusions

Abbreviations

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Competing interests

Authors’ contributions

Additional files

Additional file 1: Table S1.

Additional file 2: Figure S1.

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Genome Medicine

Contact us