The Text Categorization tool 2007 version is the first offical public rlease.It was developed in pure Java, capable of handling UTF-8. Belows are some specificatio nof this tool.
Provides iscripts for command line tools
Provides options configurable by the Configuration file
Provides Java API classes
Embedded in the project by using HyperSql database