
Chinese Morphological Analyzer

Basis Technology's Chinese Morphological Analyzer is a portable, high-performance segmentation engine for Chinese text, combined with a Chinese dictionary. Because Chinese is normally written without spaces between words, text must be broken into words before it can be indexed, searched by keyword, or otherwise manipulated.

The Chinese Morphological Analyzer segments Chinese text into words using a dictionary of over 350,000 words, each marked with part-of-speech and pinyin pronunciation information. When run over random web page text, its error rate is only about 1.3%. This robust, carefully debugged source code library is portable, running on platforms that range from low-spec 386 PCs to large-scale multi-CPU web servers processing hundreds of documents per minute.
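To illustrate what dictionary-based segmentation involves, here is a minimal sketch in Python using forward maximum matching, a common textbook technique. It is not the product's actual algorithm or API, which this listing does not describe. The `Entry` type, the `TOY_DICT` contents, and the `segment` function are all hypothetical names chosen for the example, and the five-entry dictionary stands in for the 350,000-word one.

```python
from typing import Dict, List, NamedTuple


class Entry(NamedTuple):
    """Hypothetical dictionary record: part of speech plus pinyin."""
    pos: str
    pinyin: str


# Toy dictionary invented for this example only.
TOY_DICT: Dict[str, Entry] = {
    "我": Entry("pronoun", "wǒ"),
    "喜欢": Entry("verb", "xǐhuan"),
    "中文": Entry("noun", "Zhōngwén"),
    "中": Entry("noun", "zhōng"),
    "文": Entry("noun", "wén"),
}


def segment(text: str, dictionary: Dict[str, Entry]) -> List[str]:
    """Split text into words by forward maximum matching.

    At each position, take the longest dictionary word that starts
    there; if none matches, emit the single character as-is.
    """
    max_len = max((len(word) for word in dictionary), default=1)
    words: List[str] = []
    i = 0
    while i < len(text):
        # Try the longest candidate first, shrinking down to one character.
        for size in range(min(max_len, len(text) - i), 0, -1):
            candidate = text[i:i + size]
            if size == 1 or candidate in dictionary:
                words.append(candidate)
                i += size
                break
    return words


if __name__ == "__main__":
    for word in segment("我喜欢中文", TOY_DICT):
        entry = TOY_DICT.get(word)
        print(word, entry.pos if entry else "?", entry.pinyin if entry else "?")
```

Greedy longest-match is simple but can choose the wrong split when words overlap ambiguously, which is one reason production segmenters typically combine a large dictionary with additional disambiguation.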

Current Version:   ??

License Type:   Commercial

Home Site:

Source Code Availability:   Yes

Available Binary Packages:

  • Debian Package:   No
  • RedHat RPM Package:   No
  • Other Packages:   ??

Targeted Platforms:

Licensed as a royalty-free software development kit (SDK) or as a source code distribution for Win32 (as a DLL or static library), Solaris, HP-UX, Irix, Digital Unix, and Linux x86.

Software/Hardware Requirements:


Other Links:

Mailing Lists/USENET News Groups:


User Comments:

  • None



Comments? SAL@KachinaTech.COM
Copyright © 1995-2001 by Herng-Jeng Jou
Copyright © 1997-2001 by Kachina Technologies, Inc.
All rights reserved.