Select a tag to browse associated projects and drill deeper into the tag cloud.
SunPinyin is an opensource'd (in CDDL/LGPLv2.1) and SLM (Statistical Language Model) based Chinese PinYin input method engine. Currently, it's available on all UNIX platforms including MacOSX.
NLTK — the Natural Language Toolkit — is a suite of open source Python modules, linguistic data and documentation for research and development in natural language processing, supporting dozens of NLP tasks, with distributions for Windows, Mac OSX and Linux.
GATE (General Architecture for Text Engineering) is an architecture, framework and development environment for developing, evaluating and embedding Human Language Technology
TreeTagger for Java is a Java wrapper around the popular TreeTagger package by Helmut Schmid. It was written with a focus on platform-independence and easy integration into applications. It is written in Java 5 and has been tested on OS X, Ubuntu Linux, and Windows.
Apertium is an open-source machine translation platform, aimed at related-language pairs but expanded to deal with more divergent language pairs. The platform provides 1. a language-independent machine translation engine 2. tools to manage the linguistic data necessary to build a machine ... [More]
The Open Cognition Framework (OpenCog) is software for the collaborative development of safe and beneficial Artificial General Intelligence. OpenCog provides research scientists and software developers with a common platform to build and share artificial intelligence programs. Programs written ... [More]
DKPro is a collection of software components for natural language processing (NLP) based on the Apache UIMA framework. Many powerful and state-of-the-art NLP components are already freely available in the NLP research community. New and improved components are being developed and released ... [More]
bamboo is a chinese natrual language processing system. Currently, it includes chinese word tokenization, part of speech tagging and name entity recognition. bamboo是一个中文语言处理系统。目前包括中文分词、词性标注和命名实体识别。
LanguageTool is an Open Source language checker for English, German, Polish, Dutch, and other languages. It's rule based, i.e. it will find errors for which a rule is defined in an XML configuration files. Rules for more complicated errors can be written in Java.