Morphological compression of Arabic text

作者:

Highlights:

摘要

The morphological compression of Arabic text is a compression technique that replaces some words in the original text by their roots and morphological patterns. This method is studied by developing a new method to reduce Arabic words to their roots and patterns, and by a compression algorithm that encodes reducible words into a three byte format. The technique is implemented and tested by utilizing different texts. The results indicate a reduction ratio of 20% to 30% due to the morphological property of the language alone. However, 40% reduction is attainable if the morphological compression is used in conjunction with space elimination from the original text.

论文关键词:

论文评审过程:Received 15 March 1989, Accepted 15 March 1989, Available online 19 July 2002.

论文官网地址:https://doi.org/10.1016/0306-4573(90)90033-X