Machine learning classification of surgical pathology reports and chunk recognition for information extraction noise reduction

作者:

Highlights:

• We employ machine learning to improve text mining of cancer pathology reports.

• Machine learning can detect structure in free-text surgical pathology reports.

• ML algorithms can locate fragments containing cancer staging information.

• These algorithms are sensitive to the degree of structure in the documents.

摘要

•We employ machine learning to improve text mining of cancer pathology reports.•Machine learning can detect structure in free-text surgical pathology reports.•ML algorithms can locate fragments containing cancer staging information.•These algorithms are sensitive to the degree of structure in the documents.

论文关键词:Natural language processing,Information extraction,Supervised machine learning,Surgical pathology report,Cancer staging

论文评审过程:Received 27 July 2015, Revised 3 June 2016, Accepted 7 June 2016, Available online 8 June 2016, Version of Record 21 June 2016.

论文官网地址:https://doi.org/10.1016/j.artmed.2016.06.001