1.
Current status and future perspectives of computational studies on human-virus protein-protein interactions.
Lian, X, Yang, X, Yang, S, Zhang, Z
Briefings in bioinformatics. 2021;(5)
Abstract
The protein-protein interactions (PPIs) between human and viruses mediate viral infection and host immunity processes. Therefore, the study of human-virus PPIs can help us understand the principles of human-virus relationships and can thus guide the development of highly effective drugs to break the transmission of viral infectious diseases. Recent years have witnessed the rapid accumulation of experimentally identified human-virus PPI data, which provides an unprecedented opportunity for bioinformatics studies revolving around human-virus PPIs. In this article, we provide a comprehensive overview of computational studies on human-virus PPIs, especially focusing on the method development for human-virus PPI predictions. We briefly introduce the experimental detection methods and existing database resources of human-virus PPIs, and then discuss the research progress in the development of computational prediction methods. In particular, we elaborate the machine learning-based prediction methods and highlight the need to embrace state-of-the-art deep-learning algorithms and new feature engineering techniques (e.g. the protein embedding technique derived from natural language processing). To further advance the understanding in this research topic, we also outline the practical applications of the human-virus interactome in fundamental biological discovery and new antiviral therapy development.
2.
Role of tyrosine hot-spot residues at the interface of colicin E9 and immunity protein 9: a comparative free energy simulation study.
Luitz, MP, Zacharias, M
Proteins. 2013;(3):461-8
Abstract
The endonuclease activity of the bacterial colicin 9 enzyme is controlled by the specific and high-affinity binding of immunity protein 9 (Im9). Molecular dynamics simulation studies in explicit solvent were used to investigate the free energy change associated with the mutation of two hot-spot interface residues [tyrosine (Tyr): Tyr54 and Tyr55] of Im9 to Ala. In addition, the effect of several other mutations (Leu33Ala, Leu52Ala, Val34Ala, Val37Ala, Ser48Ala, and Ile53Ala) with smaller influence on binding affinity was also studied. Good qualitative agreement of calculated free energy changes and experimental data on binding affinity of the mutations was observed. The simulation studies can help to elucidate the molecular details on how the mutations influence protein-protein binding affinity. The role of solvent and conformational flexibility of the partner proteins was studied by comparing the results in the presence or absence of solvent and with or without positional restraints. Restriction of the conformational mobility of protein partners resulted in significant changes of the calculated free energies but of similar magnitude for isolated Im9 and for the complex and therefore in only modest changes of binding free energy differences. Although the overall binding free energy change was similar for the two Tyr-Ala mutations, the physical origin appeared to be different with solvation changes contributing significantly to the Tyr55Ala mutation and to a loss of direct protein-protein interactions dominating the free energy change due to the Tyr54Ala mutation.
3.
Structural similarity-based predictions of protein interactions between HIV-1 and Homo sapiens.
Doolittle, JM, Gomez, SM
Virology journal. 2010;:82
Abstract
BACKGROUND In the course of infection, viruses such as HIV-1 must enter a cell, travel to sites where they can hijack host machinery to transcribe their genes and translate their proteins, assemble, and then leave the cell again, all while evading the host immune system. Thus, successful infection depends on the pathogen's ability to manipulate the biological pathways and processes of the organism it infects. Interactions between HIV-encoded and human proteins provide one means by which HIV-1 can connect into cellular pathways to carry out these survival processes. RESULTS We developed and applied a computational approach to predict interactions between HIV and human proteins based on structural similarity of 9 HIV-1 proteins to human proteins having known interactions. Using functional data from RNAi studies as a filter, we generated over 2000 interaction predictions between HIV proteins and 406 unique human proteins. Additional filtering based on Gene Ontology cellular component annotation reduced the number of predictions to 502 interactions involving 137 human proteins. We find numerous known interactions as well as novel interactions showing significant functional relevance based on supporting Gene Ontology and literature evidence. CONCLUSIONS Understanding the interplay between HIV-1 and its human host will help in understanding the viral lifecycle and the ways in which this virus is able to manipulate its host. The results shown here provide a potential set of interactions that are amenable to further experimental manipulation as well as potential targets for therapeutic intervention.
4.
Prediction of interactions between HIV-1 and human proteins by information integration.
Tastan, O, Qi, Y, Carbonell, JG, Klein-Seetharaman, J
Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing. 2009;:516-27
Abstract
Human immunodeficiency virus-1 (HIV-1) in acquired immune deficiency syndrome (AIDS) relies on human host cell proteins in virtually every aspect of its life cycle. Knowledge of the set of interacting human and viral proteins would greatly contribute to our understanding of the mechanisms of infection and subsequently to the design of new therapeutic approaches. This work is the first attempt to predict the global set of interactions between HIV-1 and human host cellular proteins. We propose a supervised learning framework, where multiple information data sources are utilized, including co-occurrence of functional motifs and their interaction domains and protein classes, gene ontology annotations, posttranslational modifications, tissue distributions and gene expression profiles, topological properties of the human protein in the interaction network and the similarity of HIV-1 proteins to human proteins' known binding partners. We trained and tested a Random Forest (RF) classifier with this extensive feature set. The model's predictions achieved an average Mean Average Precision (MAP) score of 23%. Among the predicted interactions was for example the pair, HIV-1 protein tat and human vitamin D receptor. This interaction had recently been independently validated experimentally. The rank-ordered lists of predicted interacting pairs are a rich source for generating biological hypotheses. Amongst the novel predictions, transcription regulator activity, immune system process and macromolecular complex were the top most significant molecular function, process and cellular compartments, respectively. Supplementary material is available at URL www.cs.cmu.edu/õznur/hiv/hivPPI.html
5.
Quantitative predictions of peptide binding to any HLA-DR molecule of known sequence: NetMHCIIpan.
Nielsen, M, Lundegaard, C, Blicher, T, Peters, B, Sette, A, Justesen, S, Buus, S, Lund, O
PLoS computational biology. 2008;(7):e1000107
Abstract
CD4 positive T helper cells control many aspects of specific immunity. These cells are specific for peptides derived from protein antigens and presented by molecules of the extremely polymorphic major histocompatibility complex (MHC) class II system. The identification of peptides that bind to MHC class II molecules is therefore of pivotal importance for rational discovery of immune epitopes. HLA-DR is a prominent example of a human MHC class II. Here, we present a method, NetMHCIIpan, that allows for pan-specific predictions of peptide binding to any HLA-DR molecule of known sequence. The method is derived from a large compilation of quantitative HLA-DR binding events covering 14 of the more than 500 known HLA-DR alleles. Taking both peptide and HLA sequence information into account, the method can generalize and predict peptide binding also for HLA-DR molecules where experimental data is absent. Validation of the method includes identification of endogenously derived HLA class II ligands, cross-validation, leave-one-molecule-out, and binding motif identification for hitherto uncharacterized HLA-DR molecules. The validation shows that the method can successfully predict binding for HLA-DR molecules-even in the absence of specific data for the particular molecule in question. Moreover, when compared to TEPITOPE, currently the only other publicly available prediction method aiming at providing broad HLA-DR allelic coverage, NetMHCIIpan performs equivalently for alleles included in the training of TEPITOPE while outperforming TEPITOPE on novel alleles. We propose that the method can be used to identify those hitherto uncharacterized alleles, which should be addressed experimentally in future updates of the method to cover the polymorphism of HLA-DR most efficiently. We thus conclude that the presented method meets the challenge of keeping up with the MHC polymorphism discovery rate and that it can be used to sample the MHC "space," enabling a highly efficient iterative process for improving MHC class II binding predictions.