Author Correction: Leveraging genomic diversity for discovery in an electronic health record linked biobank: the UCLA ATLAS Community Health Initiative

Johnson, Ruth; Ding, Yi; Venkateswaran, Vidhya; Bhattacharya, Arjun; Boulier, Kristin; Chiu, Alec; Knyazev, Sergey; Schwarz, Tommer; Freund, Malika; Zhan, Lingyu; Burch, Kathryn S.; Caggiano, Christa; Hill, Brian; Rakocz, Nadav; Balliu, Brunilda; Denny, Christopher T.; Sul, Jae Hoon; Zaitlen, Noah; Arboleda, Valerie A.; Halperin, Eran; Sankararaman, Sriram; Butte, Manish J.; Lajonchere, Clara; Geschwind, Daniel H.; Pasaniuc, Bogdan

doi:10.1186/s13073-022-01128-5

Author Correction
Open access
Published: 16 November 2022

Author Correction: Leveraging genomic diversity for discovery in an electronic health record linked biobank: the UCLA ATLAS Community Health Initiative

Ruth Johnson ORCID: orcid.org/0000-0002-1929-0998^1,2,
Yi Ding^2,3,
Vidhya Venkateswaran^2,4,
Arjun Bhattacharya^2,5,
Kristin Boulier^3,6,
Alec Chiu³,
Sergey Knyazev^2,5,
Tommer Schwarz^2,3,
Malika Freund^7,8,
Lingyu Zhan⁹,
Kathryn S. Burch^2,3,
Christa Caggiano^2,10,
Brian Hill¹,
Nadav Rakocz¹,
Brunilda Balliu¹¹,
Christopher T. Denny^12,13,14,
Jae Hoon Sul¹⁵,
Noah Zaitlen^10,11,
Valerie A. Arboleda^2,7,11,
Eran Halperin^1,11,16,
Sriram Sankararaman^1,7,11,
Manish J. Butte¹⁷,
UCLA Precision Health Data Discovery Repository Working Group, UCLA Precision Health ATLAS Working Group,
Clara Lajonchere^10,18,
Daniel H. Geschwind^7,10,18 &
…
Bogdan Pasaniuc^2,3,7,11,18

Genome Medicine volume 14, Article number: 128 (2022) Cite this article

2516 Accesses
1 Altmetric
Metrics details

The Original Article was published on 09 September 2022

Correction: Genome Med 14, 104 (2022)

https://doi.org/10.1186/s13073-022-01106-x

The original publication of this article [1] contained incorrect figure panels/labels in Figs. 2 and 5 and Additional file 1.

The correct figures are available in this correction article. The original article has been updated to correct these errors.

Reference

Johnson R, et al. Leveraging genomic diversity for discovery in an electronic health record linked biobank: the UCLA ATLAS Community Health Initiative. Genome Med. 2022;14:104. https://doi.org/10.1186/s13073-022-01106-x.
Article PubMed PubMed Central Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, University of California, Los Angeles, Los Angeles, CA, 90095, USA
Ruth Johnson, Brian Hill, Nadav Rakocz, Eran Halperin & Sriram Sankararaman
Department of Pathology and Laboratory Medicine, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, 90095, USA
Ruth Johnson, Yi Ding, Vidhya Venkateswaran, Arjun Bhattacharya, Sergey Knyazev, Tommer Schwarz, Kathryn S. Burch, Christa Caggiano, Valerie A. Arboleda & Bogdan Pasaniuc
Bioinformatics Interdepartmental Program, University of California, Los Angeles, Los Angeles, CA, 90095, USA
Yi Ding, Kristin Boulier, Alec Chiu, Tommer Schwarz, Kathryn S. Burch & Bogdan Pasaniuc
Department of Oral Biology, School of Dentistry, University of California, Los Angeles, Los Angeles, CA, 90095, USA
Vidhya Venkateswaran
Institute for Quantitative and Computational Biosciences, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, 90095, USA
Arjun Bhattacharya & Sergey Knyazev
Department of Medicine, Division of Cardiology, University of California, Los Angeles, Los Angeles, CA, 90095, USA
Kristin Boulier
Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, 90095, USA
Malika Freund, Valerie A. Arboleda, Sriram Sankararaman, Daniel H. Geschwind & Bogdan Pasaniuc
Department of Genetics, Stanford School of Medicine, Stanford, CA, 94305, USA
Malika Freund
Molecular Biology Institute, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, 90095, USA
Lingyu Zhan
Program in Neurogenetics, Department of Neurology, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, 90095, USA
Christa Caggiano, Noah Zaitlen, Clara Lajonchere & Daniel H. Geschwind
Department of Computational Medicine, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, 90095, USA
Brunilda Balliu, Noah Zaitlen, Valerie A. Arboleda, Eran Halperin, Sriram Sankararaman & Bogdan Pasaniuc
Division of Hematology/Oncology, Department of Pediatrics, Gwynne Hazen Cherry Memorial Laboratories, University of California, Los Angeles, Los Angeles, CA, 90095, USA
Christopher T. Denny
Molecular Biology Institute, University of California, Los Angeles, Los Angeles, CA, 90095, USA
Christopher T. Denny
Jonsson Comprehensive Cancer Center, University of California, Los Angeles, Los Angeles, CA, 90095, USA
Christopher T. Denny
Department of Psychiatry and Biobehavioral Sciences, University of California, Los Angeles, Los Angeles, CA, 90095, USA
Jae Hoon Sul
Department of Anesthesiology and Perioperative Medicine, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, 90095, USA
Eran Halperin
Department of Pediatrics, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, 90095, USA
Manish J. Butte
Institute of Precision Health, University of California, Los Angeles, Los Angeles, CA, 90095, USA
Clara Lajonchere, Daniel H. Geschwind & Bogdan Pasaniuc

Authors

Ruth Johnson
View author publications
You can also search for this author in PubMed Google Scholar
Yi Ding
View author publications
You can also search for this author in PubMed Google Scholar
Vidhya Venkateswaran
View author publications
You can also search for this author in PubMed Google Scholar
Arjun Bhattacharya
View author publications
You can also search for this author in PubMed Google Scholar
Kristin Boulier
View author publications
You can also search for this author in PubMed Google Scholar
Alec Chiu
View author publications
You can also search for this author in PubMed Google Scholar
Sergey Knyazev
View author publications
You can also search for this author in PubMed Google Scholar
Tommer Schwarz
View author publications
You can also search for this author in PubMed Google Scholar
Malika Freund
View author publications
You can also search for this author in PubMed Google Scholar
Lingyu Zhan
View author publications
You can also search for this author in PubMed Google Scholar
Kathryn S. Burch
View author publications
You can also search for this author in PubMed Google Scholar
Christa Caggiano
View author publications
You can also search for this author in PubMed Google Scholar
Brian Hill
View author publications
You can also search for this author in PubMed Google Scholar
Nadav Rakocz
View author publications
You can also search for this author in PubMed Google Scholar
Brunilda Balliu
View author publications
You can also search for this author in PubMed Google Scholar
Christopher T. Denny
View author publications
You can also search for this author in PubMed Google Scholar
Jae Hoon Sul
View author publications
You can also search for this author in PubMed Google Scholar
Noah Zaitlen
View author publications
You can also search for this author in PubMed Google Scholar
Valerie A. Arboleda
View author publications
You can also search for this author in PubMed Google Scholar
Eran Halperin
View author publications
You can also search for this author in PubMed Google Scholar
Sriram Sankararaman
View author publications
You can also search for this author in PubMed Google Scholar
Manish J. Butte
View author publications
You can also search for this author in PubMed Google Scholar
Clara Lajonchere
View author publications
You can also search for this author in PubMed Google Scholar
Daniel H. Geschwind
View author publications
You can also search for this author in PubMed Google Scholar
Bogdan Pasaniuc
View author publications
You can also search for this author in PubMed Google Scholar

Consortia

UCLA Precision Health Data Discovery Repository Working Group, UCLA Precision Health ATLAS Working Group

Corresponding authors

Correspondence to Ruth Johnson or Bogdan Pasaniuc.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1: Figure S1.

Self-identified race/ethnicity (SIRE) and genetically inferred ancestry (GIA) capture distinct information. We show the percentage breakdown of (A) SIREs and (B) continental genetic ancestry for all individuals in ATLAS (N=36,736). We exclude individuals whose self-identify race and/or ethnicity are unknown. Figure S2. Clustering individuals by continental GIA using PCA and K-nearest neighbors clustering. Genetic PCs 1-6 of ATLAS participants (N=36,736) and individuals in 1000 Genomes stratified by genetic ancestry groups. Figure S3. PCA in the East Asian American GIA group by self-identified race and language. (A) Genetic PCs 3 and 4 from principal component analysis performed on East Asian American GIA group (N=3,331) colored by self-identified race and (B) self-identified preferred language. Only languages with >10 responses are assigned a color. Figure S4. East Asian American GIA subclusters. Principal component analysis on the East Asian American GIA group in ATLAS with East Asian ancestry samples from 1000 Genomes. (A) shows the projection of PCs 3 and 4 and subclusters identified from performing K-nearest neighbors using population labels from 1000 Genomes to define clusters and (B) self-identified race information from ATLAS as cluster labels. Figure S5. PCA in the European American GIA group by self-identified race and language. (A) Genetic PCs 3 and 5 from principal component analysis performed within the European American GIA group (N=22,380) colored by self-identified race and (B) self-identified preferred language. Only languages with >10 responses are assigned a color. Figure S6. PCA in the Hispanic Latino American GIA group by self-identified ethnicity, language, and inferred ancestry proportions. (A) Genetic PCs 1 and 2 from principal component analysis performed within the Hispanic Latino American GIA group (N=6,073) colored by self-identified ethnicity and (B) self-identified preferred language. Only languages with >10 responses are assigned a color. (C) and (D) show the PCs shaded according to the estimated proportion of European and Native American genetic ancestry inferred from ADMIXTURE. Figure S7. PCA in the African American GIA group by genetic ancestry, self-identified race, language, and inferred ancestry proportions. Genetic PCs 1 and 2 from principal component analysis performed within the Hispanic Latino American GIA group (N=1995) colored by (A) genetic ancestry of individuals from 1000 Genomes, (B) self-identified race and (B) self-identified preferred language. Only languages with >10 responses are assigned a color. (C) and (D) show the PCs shaded according to the estimated proportion of European and African genetic ancestry inferred from ADMIXTURE. Figure S8. Individual admixture proportions vary across and within SIRE. Admixture proportions for ATLAS participants (N=36,736) were estimated using ADMIXTURE with k=4, 5, or 6 ancestral populations. Within each SIRE, we visualize the proportions of each ancestry as a vertical bar for each individual. Individuals are ordered on the x-axis by global ancestry proportions. For k=4, the respective components correspond to European, African, East Asian, and Native American ancestries. Figure S9. Disease associations vary across continental GIA groups in ATLAS even after adjusting for SIRE. We show the odds ratio computed from associating each phenotype with individuals’ genetically inferred ancestry in ATLAS (N=36,736) under a logistic regression model after accounting for each individual’s SIRE category. Error bars represent 95% confidence intervals. Figure S10. Disease associations vary across subcontinental groups within the East Asian American GIA group. For individuals in the East Asian American GIA group in ATLAS (N=3,331), we show the odds ratio computed from associating each phenotype with individuals’ subcontinental GIA group under a logistic regression model. We limit analyses to phenotypes with N>20 cases; for this reason, the analysis for skin cancer has been omitted. Error bars represent 95% confidence intervals. Figure S11. Global ancestry correlates with disease prevalence in admixed individuals. Individuals by SIRE who have had a diagnosis of (A) skin cancer, (B) chronic kidney disease, or (C) heart disease are binned by their proportions of either European, African, or Native American ancestry estimated using ADMIXTURE. Within each bin, we plot the prevalence of the diagnoses and provide standard errors (+/-1.96 SE) of the computed frequencies. Figure S12. Manhattan plot for ancestry-specific analysis for skin cancer. GWAS Manhattan plot for skin cancer in the European American GIA group. The red dashed line denotes genome-wide significance (p-value<5× 10^-8). Figure S13. Manhattan plots for ancestry-specific and multi-ancestry meta-analysis for chronic nonalcoholic liver disease. GWAS Manhattan plots for chronic nonalcoholic liver disease in the (A) European American, (B) African American, (C) Hispanic Latino American, (D) East Asian American GIA groups, and (E) the meta-analysis across all GIA groups. The red dashed line denotes genome-wide significance (p-value<5× 10^-8). Figure S14. Manhattan plots for ancestry-specific and multi-ancestry meta-analysis for ischemic heart disease. GWAS Manhattan plots for ischemic heart disease in the (A) European American, (B) African American, (C) Hispanic Latino American, and (D) East Asian American GIA groups, and (E) the meta-analysis across all GIA groups. The red dashed line denotes genome-wide significance (p-value<5× 10^-8). Figure S15. Manhattan plot for ancestry-specific analysis for uterine leiomyoma. GWAS Manhattan plots for uterine leiomyoma in the African American GIA group. The red dashed line denotes genome-wide significance (p-value<5× 10^-8). Figure S16. Manhattan plots for ancestry-specific and multi-ancestry meta-analysis for liver/intrahepatic bile duct cancer. GWAS Manhattan plots for liver/intrahepatic bile duct cancer in the (A) Hispanic Latino American, (B) East Asian American GIA groups, and (C) the meta-analysis across both GIA groups. The red dashed line denotes genome-wide significance (p-value<5× 10^-8). Figure S17. Manhattan plots for ancestry-specific and multi-ancestry meta-analysis for chronic kidney disease. GWAS Manhattan plots for chronic kidney disease in the (A) European American, (B) African American, and (C) the meta-analysis across GIA groups. The red dashed line denotes genome-wide significance (p-value<5× 10^-8). Figure S18. PheWAS at top GWAS associations. We show a PheWAS plot at rs12203592 (chr6:396321) and rs1333045 (chr9:22119196) computed within the European American GIA group. The red dashed line denotes p-value=4.09× 10^-5, the significance threshold after adjusting for the number of tested phenotypes. The red dotted line denotes the significance threshold after correcting for both genome-wide significance and the number of tested phenotypes (p-value=4.09× 10^-11). Figure S19. Role of phecode occurrences for defining cases. We show the percentage of cases retained while varying the number of required phecode occurrences (x-axis) for 6 phenotypes. In A), phecodes are derived from all types of encounters. In B), phecodes are only derived only from appointments and office, hospital, or procedure visits. The y-axis ranges from 0.95 to 1.0.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article

Johnson, R., Ding, Y., Venkateswaran, V. et al. Author Correction: Leveraging genomic diversity for discovery in an electronic health record linked biobank: the UCLA ATLAS Community Health Initiative. Genome Med 14, 128 (2022). https://doi.org/10.1186/s13073-022-01128-5

Download citation

Published: 16 November 2022
DOI: https://doi.org/10.1186/s13073-022-01128-5

Author Correction: Leveraging genomic diversity for discovery in an electronic health record linked biobank: the UCLA ATLAS Community Health Initiative

Reference

Author information

Authors and Affiliations

Consortia

UCLA Precision Health Data Discovery Repository Working Group, UCLA Precision Health ATLAS Working Group

Corresponding authors

Additional information

Publisher’s Note

Supplementary Information

Additional file 1: Figure S1.

Rights and permissions

About this article

Cite this article

Share this article

Genome Medicine

Contact us