About this item:

582 Views | 526 Downloads

Author Notes:

Correspondence: Yongqun He; yongqunh@med.umich.edu

Authors' Contributions: ZX implemented the GenoMesh algorithm and developed the GenoMesh web program as the primary software developer and database administrator.

TQ supported data analysis. .

ZSQ provided statistics expertise and participated in project design and result interpretation.

YH co-designed the project, performed data analysis, served as a secondary software developer and database administrator, and wrote the draft.

All authors participated in manuscript editing and discussions.

Acknowledgments: We thank Drs. Fan Meng and David States for their valuable insight and suggestions.

The critical review and editing of this manuscript by Dr. George W. Jourdian and Ms. Rebecca Racz from the University of Michigan Medical School is gratefully acknowledged.

Disclosures: The authors declare that they have no competing interests.

Subjects:

Research Funding:

ZX and the GenoMesh server were supported by a NIH-NIAID grant 1R01AI081062 to YH.

Development of GenoMesh was supported by a pilot research grant to Drs. Qin and He from the Center for Computational Medicine and Biology (CCMB) at the University of Michigan.

A genome-wide MeSH-based literature mining system predicts implicit gene-to-gene relationships and networks

Tools:

Journal Title:

BMC Systems Biology

Volume:

Volume 7, Number 3

Publisher:

, Pages S9-S9

Type of Work:

Article | Final Publisher PDF

Abstract:

Background The large amount of literature in the post-genomics era enables the study of gene interactions and networks using all available articles published for a specific organism. MeSH is a controlled vocabulary of medical and scientific terms that is used by biomedical scientists to manually index articles in the PubMed literature database. We hypothesized that genome-wide gene-MeSH term associations from the PubMed literature database could be used to predict implicit gene-to-gene relationships and networks. While the gene-MeSH associations have been used to detect gene-gene interactions in some studies, different methods have not been well compared, and such a strategy has not been evaluated for a genome-wide literature analysis. Genome-wide literature mining of gene-to-gene interactions allows ranking of the best gene interactions and investigation of comprehensive biological networks at a genome level. Results The genome-wide GenoMesh literature mining algorithm was developed by sequentially generating a gene-article matrix, a normalized gene-MeSH term matrix, and a gene-gene matrix. The gene-gene matrix relies on the calculation of pairwise gene dissimilarities based on gene-MeSH relationships. An optimized dissimilarity score was identified from six well-studied functions based on a receiver operating characteristic (ROC) analysis. Based on the studies with well-studied Escherichia coli and less-studied Brucella spp., GenoMesh was found to accurately identify gene functions using weighted MeSH terms, predict gene-gene interactions not reported in the literature, and cluster all the genes studied from an organism using the MeSH-based gene-gene matrix. A web-based GenoMesh literature mining program is also available at: http://genomesh.hegroup.org. GenoMesh also predicts gene interactions and networks among genes associated with specific MeSH terms or user-selected gene lists. Conclusions The GenoMesh algorithm and web program provide the first genome-wide, MeSH-based literature mining system that effectively predicts implicit gene-gene interaction relationships and networks in a genome-wide scope.

Copyright information:

© 2013 Xiang et al.

This is an Open Access work distributed under the terms of the Creative Commons Attribution 2.0 Generic License (http://creativecommons.org/licenses/by/2.0/).

Creative Commons License

Export to EndNote