Loading…

SympGAN: A systematic knowledge integration system for symptom–gene associations network

Phenotypes (i.e., symptoms and clinical signs) are essential for clinical diagnosis and research related to symptom science and precision health. As clinical observational manifestations of a disease, symptoms are clinically significant because they act as direct causes for patients to seek medical...

Full description

Saved in:
Bibliographic Details
Published in:Knowledge-based systems 2023-09, Vol.276, p.110752, Article 110752
Main Authors: Lu, Kezhi, Yang, Kuo, Sun, Hailong, Zhang, Qian, Zheng, Qiguang, Xu, Kuan, Chen, Jianxin, Zhou, Xuezhong
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Phenotypes (i.e., symptoms and clinical signs) are essential for clinical diagnosis and research related to symptom science and precision health. As clinical observational manifestations of a disease, symptoms are clinically significant because they act as direct causes for patients to seek medical care and the primary indicators for clinicians to provide diagnosis/treatments. However, a comprehensive phenotypic knowledge base and high-quality symptom–gene associations are lacking. Therefore, a thorough understanding of the relationships between symptoms and other entities is urgently needed to support scientific research and clinical health care. In this paper, we constructed a systematic, large-scale, and high-quality symptom-gene associations network system named SympGAN (accessible at http://www.sympgan.org/). We provide access to the database with millions of associations between symptoms, genes, diseases, and drugs, as well as the system for users to search, analyze, knowledge inference, and present data visualization. We utilize state-of-the-art machine learning and deep learning algorithms as the backbone to form the final dataset. In addition, we utilize the RoBERTa-PubMed neural network for name entity recognition to assist in data screening. The knowledge graph is adopted to organize the relationships between different entities. We adopt ConvE, TuckER, and HypER methods for knowledge completion experiments to validate the quality of final knowledge graph triples. Based on the results, we provide online automatic knowledge inference interfaces. The system, SympGAN, has promising value for disease diagnosis, decision support in health care, precision health, and scientific research, as researchers and practitioners can easily access information about symptoms, diseases, targets, gene ontology, and drugs. [Display omitted] SympGAN is a comprehensive framework designed for the integration of symptom phenotypes, utilizing neural network embeddings and deep information extraction models. We have developed an integrative framework that establishes connections between symptoms and genes. This framework encompasses relationship inference through deep network embedding, literature mining using named entity recognition methods, and manual curation. Consequently, we have created a robust database and knowledge graph containing millions of associations between symptoms, genes, diseases, and drugs. SympGAN is readily accessible at http://www.sympgan.org/, pro
ISSN:0950-7051
1872-7409
DOI:10.1016/j.knosys.2023.110752