About this item:

73 Views | 31 Downloads

Author Notes:

Yunyun Zhou, Email: zhouy6@chop.edu

All authors made material contributions to the execution of this study. JP, YZ and KW designed the study. JP led the implementation of the web framework, while DX, RL and SX participated in data collection, data processing, data analysis, and application development. JP drafted the manuscript, and YZ and KW revised the manuscript. All authors read and approved the final manuscript.

The authors would like to thank Wang lab members for testing the web framework and offering advice on knowledge integration from other data sources. We also thank the support from the CHOP/Penn Intellectual and Developmental Disabilities Research Center—NIH/NICHD P50 HD105354.

The authors declare that they have no competing interests.

Subjects:

Research Funding:

The study is supported by NIH/NLM/NHGRI grant LM012895, Penn Undergraduate Research Mentoring Program, and the CHOP Research Institute. The funding bodies had no role in the design of the study and collection, analysis, and interpretation of data and in writing the manuscript. Publication costs are funded by NIH/NLM/NHGRI grant LM012895.

Keywords:

  • Science & Technology
  • Life Sciences & Biomedicine
  • Medical Informatics
  • Knowledge graph
  • COVID-19
  • Information extraction
  • Data visualization
  • Knowledge discovery
  • BIOMEDICAL TEXT
  • GENES
  • UMLS

Expediting knowledge acquisition by a web framework for Knowledge Graph Exploration and Visualization (KGEV): case studies on COVID-19 and Human Phenotype Ontology

Tools:

Journal Title:

BMC MEDICAL INFORMATICS AND DECISION MAKING

Volume:

Volume 22, Number SUPPL 2

Publisher:

, Pages 147-147

Type of Work:

Article | Final Publisher PDF

Abstract:

Background: Knowledges graphs (KGs) serve as a convenient framework for structuring knowledge. A number of computational methods have been developed to generate KGs from biomedical literature and use them for downstream tasks such as link prediction and question answering. However, there is a lack of computational tools or web frameworks to support the exploration and visualization of the KG themselves, which would facilitate interactive knowledge discovery and formulation of novel biological hypotheses. Method: We developed a web framework for Knowledge Graph Exploration and Visualization (KGEV), to construct and visualize KGs in five stages: triple extraction, triple filtration, metadata preparation, knowledge integration, and graph database preparation. The application has convenient user interface tools, such as node and edge search and filtering, data source filtering, neighborhood retrieval, and shortest path calculation, that work by querying a backend graph database. Unlike other KGs, our framework allows fast retrieval of relevant texts supporting the relationships in the KG, thus allowing human reviewers to judge the reliability of the knowledge extracted. Results: We demonstrated a case study of using the KGEV framework to perform research on COVID-19. The COVID-19 pandemic resulted in an explosion of relevant literature, making it challenging to make full use of the vast and heterogenous sources of information. We generated a COVID-19 KG with heterogenous information, including literature information from the CORD-19 dataset, as well as other existing knowledge from eight data sources. We showed the utility of KGEV in three intuitive case studies to explore and query knowledge on COVID-19. A demo of this web application can be accessed at http://covid19nlp.wglab.org. Finally, we also demonstrated a turn-key adaption of the KGEV framework to study clinical phenotypic presentation of human diseases by Human Phenotype Ontology (HPO), illustrating the versatility of the framework. Conclusion: In an era of literature explosion, the KGEV framework can be applied to many emerging diseases to support structured navigation of the vast amount of newly published biomedical literature and other existing biological knowledge in various databases. It can be also used as a general-purpose tool to explore and query gene-phenotype-disease-drug relationships interactively.

Copyright information:

© The Author(s) 2022

This is an Open Access work distributed under the terms of the Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/rdf).
Export to EndNote