Showing revision 1

2009-02-26

Nepali support for text search !

Looks like we did first working patch for nepali language (Devanagari script) support in text search ! We introduced Virama/Halanta support and Spacing Combining category. A lot of Unicode documents readings :)

postgres=# set client_encoding to UTF8;
SET
Time: 0.119 ms
<¥रस्कार पुस्तकालय');                    ;
 tokid |  token  
-------+---------
     2 | मदन
    12 |  
     2 | पुरस्कार
    12 |  
     2 | पुस्तकालय
(5 rows)

Next step is to port nepali stemmer, so we can provide default text search configuration for nepali.

Also, we need to improve hunspell support, so nepali ispell dictionaries can be used with text search !

This project is a volunteer work to support PostgreSQL promotion in Nepal (btw, elephants are there). I will visit Nepal this april and will establish more close connections with nepali developers.