Accepted for/Published in: JMIR Medical Informatics
Date Submitted: Dec 30, 2019
Date Accepted: Mar 19, 2020
Date Submitted to PubMed: Apr 29, 2020
Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.
Chemical-Protein Interaction Extraction: A Method Based on Graph Convolutional Network
ABSTRACT
Background:
Extracting the interactions between chemicals and proteins from the biomedical literature is important for many biomedical tasks such as drug discovery, medicine precision and knowledge graph construction. Several computational methods have been proposed for automatic chemical-protein interaction (CPI) extraction. However, most of the proposed models could not effectively learn semantic and syntactic information from complex sentences in biomedical texts.
Objective:
To relieve the problem, we propose a method to effectively encode syntactic information from long text for CPI extraction.
Methods:
Due to the ability of capturing syntactic information from dependency graphs, graph convolutional networks (GCNs) have recently drawn increasing attention in natural language processing. To investigate the performance of the GCN on CPI extraction, this paper proposes a novel model based on the GCN. The model can effectively capture sequential information and long-range syntactic relations between words by using dependency structure of input sentences.
Results:
We evaluated our model on the ChemProt corpus released by BioCreative VI and it achieves an F-score of 65.17%, which is 1.07% higher than that of the state-of-the-art system.
Conclusions:
Our model can obtain more information from dependency graph than previous proposed models. Experimental results suggest that it is competitive to the state-of-the-art methods and significantly outperforms other methods on the ChemProt corpus, which is the benchmark dataset for CPI extraction.
Citation
Request queued. Please wait while the file is being generated. It may take some time.
Copyright
© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.