Investigating the impact of gene cofunctionality in predicting gene mutations of E. coli

Loading...
Thumbnail Image
Authors
Okwori, Michael
Eslami, Ali
Issue Date
2020-09-14
Type
Article
Language
en_US
Keywords
Predictive models , Proteins , Training , Organisms , Bioinformatics , Immune system , Genomics
Research Projects
Organizational Units
Journal Issue
Alternative Title
Abstract

Machine learning algorithms (MLAs) have recently been applied to predict gene mutations of Escherichia coli (E. coli) under different exposure conditions, with room for improvement in performance. In a bid to improve performance, we hypothesize that incorporating the interactions between genes will help MLAs make better predictions. To investigate this, we integrated protein-coding gene cofunctional networks into a mutation dataset of E. coli exposed to different conditions. Also, we proposed a feature-selection algorithm based on gene cofunctional networks to pick the most relevant exposure conditions. Then, we used the extended dataset to train a support vector classifier, an artificial neural network, and an ensemble of both MLAs. Separate models were trained for each of the protein-coding genes. Validation results showed that our approach improved both the area under the receiver operating characteristic (ROC) curve (AUC) and the area under the precision-recall curve (AUPRC). A peak increase of 8.20% in AUPRC was observed. A similar analysis on selected genes, with ten or more mutation points for each gene, also showed improvement in the general performance of the MLAs. Out-of-sample testing on adaptive laboratory evolution experiments curated from the literature provided further evidence of an enhanced mutation-prediction performance, where a maximum 8.74% boost in the AUC was observed. Finally, we highlighted the genes with the most improved and most degraded predictions due to the additional information of the cofunctional genes. This work suggests that the functional relationship between genes may play a role in gene mutation and illustrates how the relationships might help to improve mutation prediction.

Description
This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/
Citation
Okwori, M., & Eslami, A. (2020). Investigating the impact of gene cofunctionality in predicting gene mutations of E. coli. IEEE Access, 8, 167397-167410. doi:10.1109/ACCESS.2020.3023662
Publisher
IEEE
License
Journal
Volume
Issue
PubMed ID
DOI
ISSN
2169-3536
EISSN