Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: JMIR Medical Informatics

Date Submitted: Dec 30, 2019
Date Accepted: Mar 19, 2020
Date Submitted to PubMed: Apr 29, 2020

The final, peer-reviewed published version of this preprint can be found here:

A Graph Convolutional Network–Based Method for Chemical-Protein Interaction Extraction: Algorithm Development

Wang E, Wang F, Yang Z, Wang L, Zhang Y, Lin H, Wang J

A Graph Convolutional Network–Based Method for Chemical-Protein Interaction Extraction: Algorithm Development

JMIR Med Inform 2020;8(5):e17643

DOI: 10.2196/17643

PMID: 32348257

PMCID: 7267994

Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.

Chemical-Protein Interaction Extraction: A Method Based on Graph Convolutional Network

  • Erniu Wang; 
  • Fan Wang; 
  • Zhihao Yang; 
  • Lei Wang; 
  • Yin Zhang; 
  • Hongfei Lin; 
  • Jian Wang

ABSTRACT

Background:

Extracting the interactions between chemicals and proteins from the biomedical literature is important for many biomedical tasks such as drug discovery, medicine precision and knowledge graph construction. Several computational methods have been proposed for automatic chemical-protein interaction (CPI) extraction. However, most of the proposed models could not effectively learn semantic and syntactic information from complex sentences in biomedical texts.

Objective:

To relieve the problem, we propose a method to effectively encode syntactic information from long text for CPI extraction.

Methods:

Due to the ability of capturing syntactic information from dependency graphs, graph convolutional networks (GCNs) have recently drawn increasing attention in natural language processing. To investigate the performance of the GCN on CPI extraction, this paper proposes a novel model based on the GCN. The model can effectively capture sequential information and long-range syntactic relations between words by using dependency structure of input sentences.

Results:

We evaluated our model on the ChemProt corpus released by BioCreative VI and it achieves an F-score of 65.17%, which is 1.07% higher than that of the state-of-the-art system.

Conclusions:

Our model can obtain more information from dependency graph than previous proposed models. Experimental results suggest that it is competitive to the state-of-the-art methods and significantly outperforms other methods on the ChemProt corpus, which is the benchmark dataset for CPI extraction.


 Citation

Please cite as:

Wang E, Wang F, Yang Z, Wang L, Zhang Y, Lin H, Wang J

A Graph Convolutional Network–Based Method for Chemical-Protein Interaction Extraction: Algorithm Development

JMIR Med Inform 2020;8(5):e17643

DOI: 10.2196/17643

PMID: 32348257

PMCID: 7267994

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.