Go to JCI Insight
  • About
  • Editors
  • Consulting Editors
  • For authors
  • Publication ethics
  • Publication alerts by email
  • Advertising
  • Job board
  • Contact
  • Clinical Research and Public Health
  • Current issue
  • Past issues
  • By specialty
    • COVID-19
    • Cardiology
    • Gastroenterology
    • Immunology
    • Metabolism
    • Nephrology
    • Neuroscience
    • Oncology
    • Pulmonology
    • Vascular biology
    • All ...
  • Videos
    • Conversations with Giants in Medicine
    • Video Abstracts
  • Reviews
    • View all reviews ...
    • Complement Biology and Therapeutics (May 2025)
    • Evolving insights into MASLD and MASH pathogenesis and treatment (Apr 2025)
    • Microbiome in Health and Disease (Feb 2025)
    • Substance Use Disorders (Oct 2024)
    • Clonal Hematopoiesis (Oct 2024)
    • Sex Differences in Medicine (Sep 2024)
    • Vascular Malformations (Apr 2024)
    • View all review series ...
  • Viewpoint
  • Collections
    • In-Press Preview
    • Clinical Research and Public Health
    • Research Letters
    • Letters to the Editor
    • Editorials
    • Commentaries
    • Editor's notes
    • Reviews
    • Viewpoints
    • 100th anniversary
    • Top read articles

  • Current issue
  • Past issues
  • Specialties
  • Reviews
  • Review series
  • Conversations with Giants in Medicine
  • Video Abstracts
  • In-Press Preview
  • Clinical Research and Public Health
  • Research Letters
  • Letters to the Editor
  • Editorials
  • Commentaries
  • Editor's notes
  • Reviews
  • Viewpoints
  • 100th anniversary
  • Top read articles
  • About
  • Editors
  • Consulting Editors
  • For authors
  • Publication ethics
  • Publication alerts by email
  • Advertising
  • Job board
  • Contact
Top
  • View PDF
  • Download citation information
  • Send a comment
  • Terms of use
  • Standard abbreviations
  • Need help? Email the journal
  • Top
  • Footnotes
  • References
  • Version history
  • Article usage
  • Citations to this article

Advertisement

Editor's note Open Access | 10.1172/JCI184350

The importance of diverse multiomics datasets and analyses

Laura J. Rasmussen-Torvik, Associate Editor

Find articles by Rasmussen-Torvik, L. in: PubMed | Google Scholar |

Published November 1, 2024 - More info

Published in Volume 134, Issue 21 on November 1, 2024
J Clin Invest. 2024;134(21):e184350. https://doi.org/10.1172/JCI184350.
© 2024 Rasmussen-Torvik This work is licensed under the Creative Commons Attribution 4.0 International License. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
Published November 1, 2024 - Version history
View PDF

Related article:

Proteogenomic analysis integrated with electronic health records data reveals disease-associated variants in Black Americans
Usman A. Tahir, … , Laura M. Raffield, Robert E. Gerszten
Usman A. Tahir, … , Laura M. Raffield, Robert E. Gerszten
Our study integrates genetic and protein associations with large electronic health record datasets in diverse populations to enhance biological insights in clinical disease.
Clinical Research and Public Health Genetics Immunology

Proteogenomic analysis integrated with electronic health records data reveals disease-associated variants in Black Americans

  • Text
  • PDF
Abstract

BACKGROUND Most GWAS of plasma proteomics have focused on White individuals of European ancestry, limiting biological insight from other ancestry-enriched protein quantitative loci (pQTL).METHODS We conducted a discovery GWAS of approximately 3,000 plasma proteins measured by the antibody-based Olink platform in 1,054 Black adults from the Jackson Heart Study (JHS) and validated our findings in the Multi-Ethnic Study of Atherosclerosis (MESA). The genetic architecture of identified pQTLs was further explored through fine mapping and admixture association analysis. Finally, using our pQTL findings, we performed a phenome-wide association study (PheWAS) across 2 large multiethnic electronic health record (EHR) systems in All of Us and BioMe.RESULTS We identified 1,002 pQTLs for 925 protein assays. Fine mapping and admixture analyses suggested allelic heterogeneity of the plasma proteome across diverse populations. We identified associations for variants enriched in African ancestry, many in diseases that lack precise biomarkers, including cis-pQTLs for cathepsin L (CTSL) and Siglec-9, which were linked with sarcoidosis and non-Hodgkin’s lymphoma, respectively. We found concordant associations across clinical diagnoses and laboratory measurements, elucidating disease pathways, including a cis-pQTL associated with circulating CD58, WBC count, and multiple sclerosis.CONCLUSIONS Our findings emphasize the value of leveraging diverse populations to enhance biological insights from proteomics GWAS, and we have made this resource readily available as an interactive web portal.FUNDING NIH K08 HL161445-01A1; 5T32HL160522-03; HHSN268201600034I; HL133870.

Authors

Usman A. Tahir, Jacob L. Barber, Daniel E. Cruz, Meltem Ece Kars, Shuliang Deng, Bjoernar Tuftin, Madeline G. Gillman, Mark D. Benson, Jeremy M. Robbins, Zsu-Zsu Chen, Prashant Rao, Daniel H. Katz, Laurie Farrell, Tamar Sofer, Michael E. Hall, Lynette Ekunwe, Russell P. Tracy, Peter Durda, Kent D. Taylor, Yongmei Liu, W. Craig Johnson, Xiuqing Guo, Yii-Der Ida Chen, Ani W. Manichaikul, Deepti Jain, NHLBI Trans-Omics for Precision Medicine Consortium, Thomas J. Wang, Alex P. Reiner, Pradeep Natarajan, Yuval Itan, Stephen S. Rich, Jerome I. Rotter, James G. Wilson, Laura M. Raffield, Robert E. Gerszten

×

The past 15 years have seen enormous technical advances resulting in researchers having the ability to efficiently and economically generate large-scale -omics data (genetic, methylation, and proteomic data, to name just a few) on hundreds or thousands of samples provided by volunteers in many different types of research studies. Simultaneously, efforts by the NIH and other funders have made this data rapidly available to researchers all over the world through expanding data repositories such as https://sharing.nih.gov/accessing-data/accessing-genomic-data/accessing-genomic-data-from-nih-repositories, https://www.proteomicsdb.org/ and https://ngdc.cncb.ac.cn/methbank These databases have enabled critical analyses exploring pathways of disease. However, some analyses arising from public databases are plagued with issues such as lack of generalizability and lack of replication. A particular challenge is that the easiest to use datasets (those with the lowest barriers to access or the best documentation for use) may not always include the best data to address a given hypothesis. The UK Biobank, for example, has facilitated large numbers of important discoveries, but there are limitations, such as, the ability to analyze outcomes that require information outside of hospital inpatient data (1). Furthermore, the relative paucity of genetic data generated in non-White populations in these databases has led to in far too few genetic analyses published in non-White populations (2), resulting in disparities that affect genetic risk prediction in non-White populations (3, 4) and, fundamentally, limiting discovery opportunities (5).

In this issue of the JCI, Tahir and colleagues (6) do not, by any means, take the easiest approach to large-scale protein quantitative loci (pQTL) and phenome-wide association study (PheWAS) analyses. Instead, they thoughtfully utilized data from two different cohorts (Jackson Heart Study and Multi-Ethnic Study of Atherosclerosis) and two different biobanks (All of Us and BioMe) to assemble an analytic dataset including large numbers of individuals with African ancestry. Using assembled cohort data, they identified cis-pQTLs (some among variants enriched in those with African ancestry) and then, using assembled biobank data, they performed a PheWAS and discovered important associations between identified cis-pQTLs and clinical diagnoses and laboratory measurements. Equally importantly, the summary statistics from their GWAS and PheWAS analyses have been made available to the broader scientific community at https://bidmc-cardiology-2024.shinyapps.io/pqtl_phewas_explorer/, allowing other researchers to benefit from the authors’ hard work in assembling and analyzing multiple databases. The scientific community must continue to recognize and value the efforts of researchers doing the work required to publish results of population science analyses in diverse populations.

Footnotes

Conflict of interest: The author has declared that no conflict of interest exists.

Copyright: © 2024, Rasmussen-Torvik. This is an open access article published under the terms of the Creative Commons Attribution 4.0 International License.

Reference information: J Clin Invest. 2024;134(21):e184350. https://doi.org/10.1172/JCI184350.

See the related article at Proteogenomic analysis integrated with electronic health records data reveals disease-associated variants in Black Americans.

References
  1. Bassett E, et al. Inconsistency in UK Biobank event definitions from different data sources and its impact on bias and generalizability: a case study of venous thromboembolism. Am J Epidemiol. 2024;193(5):787–797.
    View this article via: CrossRef PubMed Google Scholar
  2. Wang Y, et al. Aspiring toward equitable benefits from genomic advances to individuals of ancestrally diverse backgrounds. Am J Hum Genet. 2024;111(5):809–824.
    View this article via: CrossRef PubMed Google Scholar
  3. Martin AR, et al. Clinical use of current polygenic risk scores may exacerbate health disparities. Nat Genet. 2019;51(4):584–591.
    View this article via: CrossRef PubMed Google Scholar
  4. Caswell-Jin JL, et al. Racial/ethnic differences in multiple-gene sequencing results for hereditary cancer risk. Genet Med. 2018;20(2):234–239.
    View this article via: CrossRef PubMed Google Scholar
  5. Verma A, et al. Diversity and scale: genetic architecture of 2068 traits in the VA Million Veteran Program. Science. 2024;385(6706):eadj1182.
    View this article via: CrossRef PubMed Google Scholar
  6. Tahir U, et al. Proteogenomic analysis integrated with electronic health records data reveals disease-associated variants in Black Americans. J Clin Invest. 2024;134(21):e181802.
    View this article via: JCI PubMed CrossRef Google Scholar
Version history
  • Version 1 (November 1, 2024): Electronic publication

Article tools

  • View PDF
  • Download citation information
  • Send a comment
  • Terms of use
  • Standard abbreviations
  • Need help? Email the journal

Metrics

  • Article usage
  • Citations to this article

Go to

  • Top
  • Footnotes
  • References
  • Version history
Advertisement
Advertisement

Copyright © 2025 American Society for Clinical Investigation
ISSN: 0021-9738 (print), 1558-8238 (online)

Sign up for email alerts