NLTK, the Natural Language Toolkit, is a suite of open source Python modules, linguistic data, and documentation for research and development in natural language processing. It supports dozens of NLP tasks and is distributed for Windows, Mac OS X, and Linux.
MARF is an open-source research platform and a collection of voice/sound/speech/text and natural language processing (NLP) algorithms written in Java and arranged into a modular, extensible framework that facilitates the addition of new algorithms. MARF can run distributed over the network and may act ...
Mac users: please get the latest version from GitHub instead: http://github.com/lukhnos/openvanilla-oranje/downloads and see the release announcement for version 0.9.0a1: http://github.com/lukhnos/openvanilla-oranje/blob/master/Documents/20090826-Announcement.markdown The source tree for the Mac ...
Affisix is a program for automatic recognition of affixes. It takes a large number of words and, according to the user's settings, tries to determine which segments of these words are prefixes.
A high-level interface to the CMU Link Grammar. This binding wraps the link-grammar shared library provided by the AbiWord project for their grammar-checker.
Sylli divides text transcripts into syllables. It can syllabify strings, text files, TIMIT files and directories.
NTextCat is a free language identification API for .NET (C#) with 280+ languages available out of the box. It recognizes the language and encoding (UTF-8, Windows-1252, Big5, etc.) of text and is Mono compatible.