Advertisement
Editor's note
Open Access | 10.1172/JCI184350
Find articles by
Rasmussen-Torvik, L.
in:
PubMed
|
Google Scholar
|
Published November 1, 2024 - More info
BACKGROUND Most GWAS of plasma proteomics have focused on White individuals of European ancestry, limiting biological insight from other ancestry-enriched protein quantitative loci (pQTL).METHODS We conducted a discovery GWAS of approximately 3,000 plasma proteins measured by the antibody-based Olink platform in 1,054 Black adults from the Jackson Heart Study (JHS) and validated our findings in the Multi-Ethnic Study of Atherosclerosis (MESA). The genetic architecture of identified pQTLs was further explored through fine mapping and admixture association analysis. Finally, using our pQTL findings, we performed a phenome-wide association study (PheWAS) across 2 large multiethnic electronic health record (EHR) systems in All of Us and BioMe.RESULTS We identified 1,002 pQTLs for 925 protein assays. Fine mapping and admixture analyses suggested allelic heterogeneity of the plasma proteome across diverse populations. We identified associations for variants enriched in African ancestry, many in diseases that lack precise biomarkers, including cis-pQTLs for cathepsin L (CTSL) and Siglec-9, which were linked with sarcoidosis and non-Hodgkin’s lymphoma, respectively. We found concordant associations across clinical diagnoses and laboratory measurements, elucidating disease pathways, including a cis-pQTL associated with circulating CD58, WBC count, and multiple sclerosis.CONCLUSIONS Our findings emphasize the value of leveraging diverse populations to enhance biological insights from proteomics GWAS, and we have made this resource readily available as an interactive web portal.FUNDING NIH K08 HL161445-01A1; 5T32HL160522-03; HHSN268201600034I; HL133870.
Usman A. Tahir, Jacob L. Barber, Daniel E. Cruz, Meltem Ece Kars, Shuliang Deng, Bjoernar Tuftin, Madeline G. Gillman, Mark D. Benson, Jeremy M. Robbins, Zsu-Zsu Chen, Prashant Rao, Daniel H. Katz, Laurie Farrell, Tamar Sofer, Michael E. Hall, Lynette Ekunwe, Russell P. Tracy, Peter Durda, Kent D. Taylor, Yongmei Liu, W. Craig Johnson, Xiuqing Guo, Yii-Der Ida Chen, Ani W. Manichaikul, Deepti Jain, NHLBI Trans-Omics for Precision Medicine Consortium, Thomas J. Wang, Alex P. Reiner, Pradeep Natarajan, Yuval Itan, Stephen S. Rich, Jerome I. Rotter, James G. Wilson, Laura M. Raffield, Robert E. Gerszten
The past 15 years have seen enormous technical advances resulting in researchers having the ability to efficiently and economically generate large-scale -omics data (genetic, methylation, and proteomic data, to name just a few) on hundreds or thousands of samples provided by volunteers in many different types of research studies. Simultaneously, efforts by the NIH and other funders have made this data rapidly available to researchers all over the world through expanding data repositories such as https://sharing.nih.gov/accessing-data/accessing-genomic-data/accessing-genomic-data-from-nih-repositories, https://www.proteomicsdb.org/ and https://ngdc.cncb.ac.cn/methbank These databases have enabled critical analyses exploring pathways of disease. However, some analyses arising from public databases are plagued with issues such as lack of generalizability and lack of replication. A particular challenge is that the easiest to use datasets (those with the lowest barriers to access or the best documentation for use) may not always include the best data to address a given hypothesis. The UK Biobank, for example, has facilitated large numbers of important discoveries, but there are limitations, such as, the ability to analyze outcomes that require information outside of hospital inpatient data (1). Furthermore, the relative paucity of genetic data generated in non-White populations in these databases has led to in far too few genetic analyses published in non-White populations (2), resulting in disparities that affect genetic risk prediction in non-White populations (3, 4) and, fundamentally, limiting discovery opportunities (5).
In this issue of the JCI, Tahir and colleagues (6) do not, by any means, take the easiest approach to large-scale protein quantitative loci (pQTL) and phenome-wide association study (PheWAS) analyses. Instead, they thoughtfully utilized data from two different cohorts (Jackson Heart Study and Multi-Ethnic Study of Atherosclerosis) and two different biobanks (All of Us and BioMe) to assemble an analytic dataset including large numbers of individuals with African ancestry. Using assembled cohort data, they identified cis-pQTLs (some among variants enriched in those with African ancestry) and then, using assembled biobank data, they performed a PheWAS and discovered important associations between identified cis-pQTLs and clinical diagnoses and laboratory measurements. Equally importantly, the summary statistics from their GWAS and PheWAS analyses have been made available to the broader scientific community at https://bidmc-cardiology-2024.shinyapps.io/pqtl_phewas_explorer/, allowing other researchers to benefit from the authors’ hard work in assembling and analyzing multiple databases. The scientific community must continue to recognize and value the efforts of researchers doing the work required to publish results of population science analyses in diverse populations.
Conflict of interest: The author has declared that no conflict of interest exists.
Copyright: © 2024, Rasmussen-Torvik. This is an open access article published under the terms of the Creative Commons Attribution 4.0 International License.
Reference information: J Clin Invest. 2024;134(21):e184350. https://doi.org/10.1172/JCI184350.
See the related article at Proteogenomic analysis integrated with electronic health records data reveals disease-associated variants in Black Americans.