-
1.
Genome-based identification and comparative analysis of enzymes for carotenoid biosynthesis in microalgae.
Narang, PK, Dey, J, Mahapatra, SR, Roy, R, Kushwaha, GS, Misra, N, Suar, M, Raina, V
World journal of microbiology & biotechnology. 2021;(1):8
Abstract
Microalgae are potential feedstocks for the commercial production of carotenoids, however, the metabolic pathways for carotenoid biosynthesis across algal lineage are largely unexplored. This work is the first to provide a comprehensive survey of genes and enzymes associated with the less studied methylerythritol 4-phosphate/1-deoxy-D-xylulose 5-phosphate pathway as well as the carotenoid biosynthetic pathway in microalgae through bioinformatics and comparative genomics approach. Candidate genes/enzymes were subsequently analyzed across 22 microalgae species of lineages Chlorophyta, Rhodophyta, Heterokonta, Haptophyta, Cryptophyta, and known Arabidopsis homologs in order to study the evolutional divergence in terms of sequence-structure properties. A total of 403 enzymes playing a vital role in carotene, lutein, zeaxanthin, violaxanthin, canthaxanthin, and astaxanthin were unraveled. Of these, 85 were hypothetical proteins whose biological roles are not yet experimentally characterized. Putative functions to these hypothetical proteins were successfully assigned through a comprehensive investigation of the protein family, motifs, intrinsic physicochemical features, subcellular localization, pathway analysis, etc. Furthermore, these enzymes were categorized into major classes as per the conserved domain and gene ontology. Functional signature sequences were also identified which were observed conserved across microalgal genomes. Additionally, the structural modeling and active site architecture of three vital enzymes, DXR, PSY, and ZDS catalyzing the vital rate-limiting steps in Dunaliella salina were achieved. The enzymes were confirmed to be stereochemically reliable and stable as revealed during molecular dynamics simulation of 100 ns. The detailed functional information about individual vital enzymes will certainly help to design genetically modified algal strains with enhanced carotenoid contents.
-
2.
Genotyping-By-Sequencing diversity analysis of international Vanilla collections uncovers hidden diversity and enables plant improvement.
Chambers, A, Cibrián-Jaramillo, A, Karremans, AP, Moreno Martinez, D, Hernandez-Hernandez, J, Brym, M, Resende, MFR, Moloney, R, Sierra, SN, Hasing, T, et al
Plant science : an international journal of experimental plant biology. 2021;:111019
Abstract
Genomics-based diversity analysis of natural vanilla populations is important in order to guide conservation efforts and genetic improvement through plant breeding. Vanilla is a cultivated, undomesticated spice that originated in Mesoamerica prior to spreading globally through vegetative cuttings. Vanilla extract from the commercial species, mainly V. planifolia and V. × tahitensis, is used around the world as an ingredient in foods, beverages, cosmetics, and pharmaceuticals. The global reliance on descendants of a few foundational clones in commercial production has resulted in an industry at heightened risk of catastrophic failure due to extremely narrow genetic diversity. Conversely, national and institutional collections including those near the center of cultivation contain previously undiscovered diversity that could bolster the genetic improvement of vanilla and guide conservation efforts. Towards this goal, an international vanilla genotyping effort generated and analyzed 431,204 single nucleotide polymorphisms among 412 accessions and 27 species from eight collections. Phylogenetic and STRUCTURE analysis sorted vanilla by species and identified hybrid accessions. Principal Component Analysis and the Fixation Index (FST) were used to refine relationships among accessions and showed differentiation among species. Analysis of the commercial species split V. planifolia into three types with all V. × tahitensis accessions being most similar to V. planifolia type 2. Finally, an in-depth analysis of V. × tahitensis identified seven V. planifolia and six V. odorata accessions as most similar to the estimated parental genotypes providing additional data in support of the current hybrid theory. The prevalence of probable V. × tahitensis parental accessions from Belize suggests that V. × tahitensis could have originated from this area and highlights the need for vanilla conservation throughout Central and South America. The genetic groupings among accessions, particularly for V. planifolia, can now be used to focus breeding efforts on fewer accessions that capture the greatest diversity.
-
3.
Comparative Genomics and Transcriptome Profiling in Primary Aldosteronism.
Aristizabal Prada, ET, Castellano, I, Sušnik, E, Yang, Y, Meyer, LS, Tetti, M, Beuschlein, F, Reincke, M, Williams, TA
International journal of molecular sciences. 2018;(4)
Abstract
Primary aldosteronism is the most common form of endocrine hypertension with a prevalence of 6% in the general population with hypertension. The genetic basis of the four familial forms of primary aldosteronism (familial hyperaldosteronism FH types I-IV) and the majority of sporadic unilateral aldosterone-producing adenomas has now been resolved. Familial forms of hyperaldosteronism are, however, rare. The sporadic forms of the disease prevail and these are usually caused by either a unilateral aldosterone-producing adenoma or bilateral adrenal hyperplasia. Aldosterone-producing adenomas frequently carry a causative somatic mutation in either of a number of genes with the KCNJ5 gene, encoding an inwardly rectifying potassium channel, a recurrent target harboring mutations at a prevalence of more than 40% worldwide. Other than genetic variations, gene expression profiling of aldosterone-producing adenomas has shed light on the genes and intracellular signalling pathways that may play a role in the pathogenesis and pathophysiology of these tumors.
-
4.
Assessment of transfer methods for comparative genomics of regulatory networks in bacteria.
Kılıç, S, Erill, I
BMC bioinformatics. 2016;(Suppl 8):277
Abstract
BACKGROUND Comparative genomics can leverage the vast amount of available genomic sequences to reconstruct and analyze transcriptional regulatory networks in Bacteria, but the efficacy of this approach hinges on the ability to transfer regulatory network information from reference species to the genomes under analysis. Several methods have been proposed to transfer regulatory information between bacterial species, but the paucity and distributed nature of experimental information on bacterial transcriptional networks have prevented their systematic evaluation. RESULTS We report the compilation of a large catalog of transcription factor-binding sites across Bacteria and its use to systematically benchmark proposed transfer methods across pairs of bacterial species. We evaluate motif- and accuracy-based metrics to assess the results of regulatory network transfer and we identify the precision-recall area-under-the-curve as the best metric for this purpose due to the large class-imbalanced nature of the problem. Methods assuming conservation of the transcription factor-binding motif (motif-based) are shown to substantially outperform those assuming conservation of regulon composition (network-based), even though their efficiency can decrease sharply with increasing phylogenetic distance. Variations of the basic motif-based transfer method do not yield significant improvements in transfer accuracy. Our results indicate that detection of a large enough number of regulated orthologs is critical for network-based transfer methods, but that relaxing orthology requirements does not improve results. Using the transcriptional regulators LexA and Fur as case examples, we also show how DNA-binding domain sequence similarity can yield confounding results as an indicator of transfer efficiency for motif-based methods. CONCLUSIONS Counter to standard practice, our evaluation of metrics to assess the efficiency of methods for regulatory network information transfer reveals that the area under precision-recall (PR) curves is a more precise and informative metric than that of receiver-operating-characteristic (ROC) curves, confirming similar findings in other class-imbalanced settings. Our systematic assessment of transfer methods reveals that simple approaches to both motif- and network-based transfer of regulatory information provide equal or better results than more elaborate methods. We also show that there are not effective predictors of transfer efficacy, substantiating the long-standing practice of manual curation in comparative genomics analyses.
-
5.
Inferring primary tumor sites from mutation spectra: a meta-analysis of histology-specific aberrations in cancer-derived cell lines.
Dietlein, F, Eschner, W
Human molecular genetics. 2014;(6):1527-37
Abstract
Next-generation sequencing technologies have led to profound characterization of mutation spectra for several cancer types. Hence, we sought to systematically compare genomic aberrations between primary tumors and cancer lines. For this, we compiled publically available sequencing data of 1651 genes across 905 cell lines. We used them to characterize 23 distinct primary tumor sites by a novel approach that is based on Bayesian spam-filtering techniques. Thereby, we confirmed the strong overall similarity of alterations between patient samples and cell culture. However, we also identified several suspicious mutations, which had not been associated with their cancer types before. Based on these characterizations, we developed the inferring cancer origins from mutation spectra (ICOMS) tool. On our cell line collection, the algorithm reached a prediction specificity rate of 79%, which strongly variegated between primary cancer sites. On an independent validation cohort of 431 primary tumor samples, we observed a similar accuracy of 71%. Additionally, we found that ICOMS could be employed to deduce further attributes from mutation spectra, including sub-histology and compound sensitivity. Thus, thorough classification of site-specific mutation spectra for cell lines may decipher further genome-phenotype associations in cancer.
-
6.
Dandruff/seborrhoeic dermatitis is characterized by an inflammatory genomic signature and possible immune dysfunction: transcriptional analysis of the condition and treatment effects of zinc pyrithione.
Mills, KJ, Hu, P, Henry, J, Tamura, M, Tiesman, JP, Xu, J
The British journal of dermatology. 2012;:33-40
Abstract
BACKGROUND Dandruff/seborrhoeic dermatitis is a common scalp condition that is characterized by flakes, pruritus and sometimes mild erythema. These symptoms reflect tissue level events that are poorly understood at the molecular level. OBJECTIVES The purpose of this work was: (i) to compare gene expression profiles in subjects with dandruff vs. those of subjects without dandruff to determine the key physiological disruptions manifest in the condition; and (ii) to determine the effect on this profile of treatment with a shampoo containing potentiated zinc pyrithione (ZPT). METHODS In study 1, scalp biopsies were taken from 16 normal subjects and from involved and uninvolved sites in 15 subjects with dandruff. In study 2, 30 subjects with dandruff were treated for 3 weeks with a commercial ZPT shampoo (n = 15) or a vehicle (n = 15), and scalp lesional biopsies were collected at baseline and end of study for transcriptomic analysis. RNA was extracted from all biopsies and Affymetrix gene chips were used to analyse transcriptomic profiles, followed by bioinformatic analysis. RESULTS Analysis of study 1 biopsies revealed more than 7000 individual probes differentially regulated in dandruff lesional skin relative to normal. Enriched Gene Ontology categories included: lipid metabolism, immune response, response to stimulus, apoptosis, cell proliferation, and epidermal development. The most striking feature of lesional skin relative to normal was the reciprocal expression of induced inflammatory genes and repressed lipid metabolism genes. Induced inflammatory genes were also enriched in dandruff uninvolved skin, suggesting the existence of predisposing factors associated with inflammation. Many genes increased in lesional skin were increased at the level of protein in stratum corneum samples (e.g. IL-1RA, S100A8, S100A9, S100A11, IL-8). Under conditions known to improve overall scalp condition, the ZPT shampoo treatment in study 2 produced a transcriptomic profile resembling that of normal scalp skin. CONCLUSIONS These data provide novel insights into the nature of dandruff and the therapeutic action of potentiated ZPT-containing shampoo, and provide a basis to explore many new mechanistic questions related to these topics.
-
7.
Genomic profiling in CEPH cell lines distinguishes between the camptothecins and indenoisoquinolines.
Watson, VG, Hardison, NE, Harris, T, Motsinger-Reif, A, McLeod, HL
Molecular cancer therapeutics. 2011;(10):1839-45
-
-
Free full text
-
Abstract
We have attempted to use a familial genetics strategy to study mechanisms of topoisomerase 1 (Top1) inhibition. Investigations have steadily been chipping away at the pathways involved in cellular response following Top1 inhibition for more than 20 years. Our system-wide approach, which phenotypes a collection of genotyped human cell lines for sensitivity to compounds and interrogates all genes and molecular pathways simultaneously. Previously, we characterized the in vitro sensitivity of 15 families of Centre d'Etude Polymorphisme Humain (CEPH) cell lines (n = 142) to 9 camptothecin analogues. Linkage analysis revealed a pattern of 7 quantitative trait loci (QTL) shared by all of the camptothecins. To identify which, if any, QTLs are related to the general mechanism of Top1 inhibition or should be considered camptothecin specific, we characterized the in vitro sensitivity of the same panel of CEPH cell lines to the indenisoquinolones, a structurally distinct class of Top1 inhibitors. Four QTLs on chromosomes 1, 5, 11, and 16 were shared by both the camptothecins and the indenoisoquinolines and are considered associated with the general mechanism of Top1 inhibition. The remaining 3 QTLs (chromosomes 6 and 20) are considered specific to camptothecin-induced cytotoxicity. Finally, 8 QTLs were identified, which were unique to the indenoisoquinolines.
-
8.
QTL detection with bidirectional and unidirectional selective genotyping: marker-based and trait-based analyses.
Navabi, A, Mather, DE, Bernier, J, Spaner, DM, Atlin, GN
TAG. Theoretical and applied genetics. Theoretische und angewandte Genetik. 2009;(2):347-58
Abstract
Selective genotyping of one or both phenotypic extremes of a population can be used to detect linkage between markers and quantitative trait loci (QTL) in situations in which full-population genotyping is too costly or not feasible, or where the objective is to rapidly screen large numbers of potential donors for useful alleles with large effects. Data may be subjected to 'trait-based' analysis, in which marker allele frequencies are compared between classes of progeny defined based on trait values, or to 'marker-based' analysis, in which trait means are compared between progeny classes defined based on marker genotypes. Here, bidirectional and unidirectional selective genotyping were simulated, using population sizes and selection intensities relevant to cereal breeding. Control of Type I error was usually adequate with marker-based analysis of variance or trait-based testing using the normal approximation of the binomial distribution. Bidirectional selective genotyping was more powerful than unidirectional. Trait-based analysis and marker-based analysis of variance were about equally powerful. With genotyping of the best 30 out of 500 lines (6%), a QTL explaining 15% of the phenotypic variance could be detected with a power of 0.8 when tests were conducted at a marker 10 cM from the QTL. With bidirectional selective genotyping, QTL with smaller effects and (or) QTL farther from the nearest marker could be detected. Similar QTL detection approaches were applied to data from a population of 436 recombinant inbred rice lines segregating for a large-effect QTL affecting grain yield under drought stress. That QTL was reliably detected by genotyping as few as 20 selected lines (4.5%). In experimental populations, selective genotyping can reduce costs of QTL detection, allowing larger numbers of potential donors to be screened for useful alleles with effects across different backgrounds. In plant breeding programs, selective genotyping can make it possible to detect QTL using even a limited number of progeny that have been retained after selection.
-
9.
Methods and tools for comparative genomics of foodborne pathogens.
Bhagwat, AA, Bhagwat, M
Foodborne pathogens and disease. 2008;(4):487-97
-
-
Free full text
-
Abstract
A comparison of genome sequences and of encoded proteins with the database of existing annotated sequences is a useful approach to understand the information at the genome level. Here we demonstrate the utility of several DNA and protein sequence comparison tools to interpret the information obtained from several genome projects. Comparisons are presented between closely related strains of Escherichia coli commensal isolates, different isolates of O157:H7, and Shigella spp. It is expected that comparative genome analysis will generate a wealth of data to compare pathogenic isolates with varying levels of pathogenicity, which in turn may reveal mechanisms by which the pathogen may adapt to a particular nutrient supply in certain foods. These genome sequence analysis tools will strengthen foodborne pathogen surveillance and subsequent risk assessment to enhance the safety of the food supply.
-
10.
Identification of genes encoding tRNA modification enzymes by comparative genomics.
de Crécy-Lagard, V
Methods in enzymology. 2007;:153-83
-
-
Free full text
-
Abstract
As the molecular adapters between codons and amino acids, transfer-RNAs are pivotal molecules of the genetic code. The coding properties of a tRNA molecule do not reside only in its primary sequence. Posttranscriptional nucleoside modifications, particularly in the anticodon loop, can modify cognate codon recognition, affect aminoacylation properties, or stabilize the codon-anticodon wobble base pairing to prevent ribosomal frameshifting. Despite a wealth of biophysical and structural knowledge of the tRNA modifications themselves, their pathways of biosynthesis had been until recently only partially characterized. This discrepancy was mainly due to the lack of obvious phenotypes for tRNA modification-deficient strains and to the difficulty of the biochemical assays used to detect tRNA modifications. However, the availability of hundreds of whole-genome sequences has allowed the identification of many of these missing tRNA-modification genes. This chapter reviews the methods that were used to identify these genes with a special emphasis on the comparative genomic approaches. Methods that link gene and function but do not rely on sequence homology will be detailed, with examples taken from the tRNA modification field.