Bio2X: a rule-based approach for semi-automatic transformation of semi-structured biological data to XML
作者:
Highlights:
•
摘要
Data integration of geographically dispersed, heterogeneous, complex biological databases is a key research area. One of the key features of a successful data integration system is to have a simple self-describing data exchange format. However, many of the biological databases provide data in flat files which are poor data exchange formats. Fortunately, XML can be viewed as a powerful data model and better data exchange format. In this paper, we present the Bio2X system that transforms flat file data into highly hierarchical XML data using rule-based machine learning technique. Bio2X has been fully implemented using Java. Our experiments to transform real world biological data demonstrate the effectiveness of the Bio2X approach.
论文关键词:Flat files,Rule base,Machine learning,XML,Transformer
论文评审过程:Received 21 May 2004, Revised 21 May 2004, Accepted 21 May 2004, Available online 1 July 2004.
论文官网地址:https://doi.org/10.1016/j.datak.2004.05.008