Chinese Morphological Analyzer
Basis Technology's Chinese Morphological Analyzer is a portable, high-performance segmentation engine for Chinese text, combined with a Chinese dictionary. Chinese text is normally written without spaces between words, so it must be broken into words before it can be indexed, searched by keyword, or otherwise manipulated. The Chinese Morphological Analyzer segments Chinese text into words using a dictionary of over 350,000 words, each annotated with part-of-speech and pinyin pronunciation information. When run over random web page text, its error rate is only about 1.3%. The source code library is robust, carefully debugged, and portable, running on platforms that range from low-spec 386 PCs to large-scale multi-CPU web servers processing hundreds of documents per minute.
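To illustrate the general idea of dictionary-based segmentation described above, here is a minimal Python sketch of greedy forward maximum matching. This is a common textbook technique and is not Basis Technology's algorithm or API; the function name `segment`, the `max_len` parameter, and the tiny `TOY_DICTIONARY` are all hypothetical and exist only for illustration. A real analyzer would also use part-of-speech information and handle ambiguity far more carefully.

```python
def segment(text, dictionary, max_len=None):
    """Split text into words by greedy forward maximum matching.

    Illustrative only: at each position, take the longest dictionary
    entry that matches; fall back to a single character if none does.
    """
    if max_len is None:
        # Longest entry in the dictionary bounds the candidate length.
        max_len = max((len(w) for w in dictionary), default=1)
    words = []
    i = 0
    while i < len(text):
        # Try the longest candidate first, shrinking until a match is found.
        for size in range(min(max_len, len(text) - i), 0, -1):
            candidate = text[i:i + size]
            if size == 1 or candidate in dictionary:
                words.append(candidate)
                i += size
                break
    return words


# Hypothetical toy dictionary; the real product ships over 350,000 entries.
TOY_DICTIONARY = {"中文", "文本", "需要", "分词", "搜索", "关键词"}

if __name__ == "__main__":
    print(segment("中文文本需要分词", TOY_DICTIONARY))
```

Greedy matching is simple and fast, but it can pick the wrong split when words overlap, which is one reason production segmenters rely on large annotated dictionaries and additional disambiguation.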
Current Version: ??
License Type: Commercial
Home Site:
Source Code Availability: Yes
Available Binary Packages:
Targeted Platforms:
Software/Hardware Requirements:
Other Links:
Mailing Lists/USENET News Groups:
User Comments: