Comparative experiments on learning information extractors for proteins and their interactions

作者:

Highlights:

摘要

Objective:Automatically extracting information from biomedical text holds the promise of easily consolidating large amounts of biological knowledge in computer-accessible form. This strategy is particularly attractive for extracting data relevant to genes of the human genome from the 11 million abstracts in Medline. However, extraction efforts have been frustrated by the lack of conventions for describing human genes and proteins. We have developed and evaluated a variety of learned information extraction systems for identifying human protein names in Medline abstracts and subsequently extracting information on interactions between the proteins.

论文关键词:Information extraction,Text mining,Machine learning,Protein interactions,Medline

论文评审过程:Received 16 December 2002, Revised 14 July 2004, Accepted 16 July 2004, Available online 7 December 2004.

论文官网地址:https://doi.org/10.1016/j.artmed.2004.07.016