1.
A fully joint Bayesian quantitative trait locus mapping of human protein abundance in plasma.
Ruffieux, H, Carayol, J, Popescu, R, Harper, ME, Dent, R, Saris, WHM, Astrup, A, Hager, J, Davison, AC, Valsesia, A
PLoS computational biology. 2020;(6):e1007882
Abstract
Molecular quantitative trait locus (QTL) analyses are increasingly popular to explore the genetic architecture of complex traits, but existing studies do not leverage shared regulatory patterns and suffer from a large multiplicity burden, which hampers the detection of weak signals such as trans associations. Here, we present a fully multivariate proteomic QTL (pQTL) analysis performed with our recently proposed Bayesian method LOCUS on data from two clinical cohorts, with plasma protein levels quantified by mass-spectrometry and aptamer-based assays. Our two-stage study identifies 136 pQTL associations in the first cohort, of which >80% replicate in the second independent cohort and have significant enrichment with functional genomic elements and disease risk loci. Moreover, 78% of the pQTLs whose protein abundance was quantified by both proteomic techniques are confirmed across assays. Our thorough comparisons with standard univariate QTL mapping on (1) these data and (2) synthetic data emulating the real data show how LOCUS borrows strength across correlated protein levels and markers on a genome-wide scale to effectively increase statistical power. Notably, 15% of the pQTLs uncovered by LOCUS would be missed by the univariate approach, including several trans and pleiotropic hits with successful independent validation. Finally, the analysis of extensive clinical data from the two cohorts indicates that the genetically-driven proteins identified by LOCUS are enriched in associations with low-grade inflammation, insulin resistance and dyslipidemia and might therefore act as endophenotypes for metabolic diseases. While considerations on the clinical role of the pQTLs are beyond the scope of our work, these findings generate useful hypotheses to be explored in future research; all results are accessible online from our searchable database. Thanks to its efficient variational Bayes implementation, LOCUS can analyze jointly thousands of traits and millions of markers. Its applicability goes beyond pQTL studies, opening new perspectives for large-scale genome-wide association and QTL analyses. Diet, Obesity and Genes (DiOGenes) trial registration number: NCT00390637.
2.
Analysis of 1508 Plasma Samples by Capillary-Flow Data-Independent Acquisition Profiles Proteomics of Weight Loss and Maintenance.
Bruderer, R, Muntel, J, Müller, S, Bernhardt, OM, Gandhi, T, Cominetti, O, Macron, C, Carayol, J, Rinner, O, Astrup, A, et al
Molecular & cellular proteomics : MCP. 2019;(6):1242-1254
Abstract
Comprehensive, high throughput analysis of the plasma proteome has the potential to enable holistic analysis of the health state of an individual. Based on our own experience and the evaluation of recent large-scale plasma mass spectrometry (MS) based proteomic studies, we identified two outstanding challenges: slow and delicate nano-flow liquid chromatography (LC) and irreproducibility of identification of data-dependent acquisition (DDA). We determined an optimal solution reducing these limitations with robust capillary-flow data-independent acquisition (DIA) MS. This platform can measure 31 plasma proteomes per day. Using this setup, we acquired a large-scale plasma study of the diet, obesity and genes dietary (DiOGenes) comprising 1508 samples. Proving the robustness, the complete acquisition was achieved on a single analytical column. Totally, 565 proteins (459 identified with two or more peptide sequences) were profiled with 74% data set completeness. On average 408 proteins (5246 peptides) were identified per acquisition (319 proteins in 90% of all acquisitions). The workflow reproducibility was assessed using 34 quality control pools acquired at regular intervals, resulting in 92% data set completeness with CVs for protein measurements of 10.9%.The profiles of 20 apolipoproteins could be profiled revealing distinct changes. The weight loss and weight maintenance resulted in sustained effects on low-grade inflammation, as well as steroid hormone and lipid metabolism, indicating beneficial effects. Comparison to other large-scale plasma weight loss studies demonstrated high robustness and quality of biomarker candidates identified. Tracking of nonenzymatic glycation indicated a delayed, slight reduction of glycation in the weight maintenance phase. Using stable-isotope-references, we could directly and absolutely quantify 60 proteins in the DIA.In conclusion, we present herein the first large-scale plasma DIA study and one of the largest clinical research proteomic studies to date. Application of this fast and robust workflow has great potential to advance biomarker discovery in plasma.