Projects tagged ‘computational_linguistics’


[14 total ]

19 Users
 

NLTK — the Natural Language Toolkit — is a suite of open source Python modules, linguistic data and documentation for research and development in natural language processing, supporting dozens of ... [More] NLP tasks, with distributions for Windows, Mac OSX and Linux. [Less]
Created over 3 years ago.

7 Users

Apertium is an open-source machine translation platform, aimed at related-language pairs but expanded to deal with more divergent language pairs. The platform provides 1. a language-independent ... [More] machine translation engine 2. tools to manage the linguistic data necessary to build a machine translation system for a given language pair and 3. linguistic data for a growing number of language pairs. Apertium uses a shallow-transfer machine translation engine which processes the input text in stages, as in an assembly line: de-formatting, morphological analysis, part-of-speech disambiguation, shallow structural transfer, lexical transfer, morphological generation, and re-formatting. [Less]
Created about 1 year ago.

2 Users
 

SimMetrics is a Similarity Metric Library, e.g. from edit distance's (Levenshtein, Gotoh, Jaro etc) to other metrics, (e.g Soundex, Chapman). Work provided by UK Sheffield University funded by (AKT) ... [More] an IRC sponsored by EPSRC, grant number GR/N15764/01. [Less]
Created about 1 year ago.

2 Users

RelEx is an English-language semantic relationship extractor, built on the Carnegie-Mellon link parser. It can identify subject, object, indirect object and many other relationships between words in a ... [More] sentence. It can also provide part-of-speech tagging, noun-number tagging, verb tense tagging, gender tagging, and so on. Relex includes a basic implementation of the Hobbs anaphora (pronoun) resolution algorithm. Optionally, it can use GATE for entity detection. RelEx also provides semantic relationship framing, similar to that of FrameNet. [Less]
Created about 1 year ago.

2 Users

The Open Cognition Framework (OpenCog) is software for the collaborative development of safe and beneficial Artificial General Intelligence. OpenCog provides research scientists and software ... [More] developers with a common platform to build and share artificial intelligence programs. Programs written or adapted for OpenCog may be combined and used in concert with one another for experimentation or to achieve better results compared to their stand-alone counterparts. OpenCog is under active development, with the first release expected in 4Q-2008. [Less]
Created about 1 year ago.

2 Users

Machine translation engine based on a dependency grammar and XML interchange format. The Spanish-Basque (es-eu) translation direction is currently supported.
Created about 1 year ago.

1 Users

TestEl is a Java-based learning analyzer for HTML (and possibly other) structured documents. It can be trained to detect structures in such documents and renders hits in XML.
Created about 1 year ago.

1 Users

A high-level interface to the CMU Link Grammar. This binding wraps the link-grammar shared library provided by the AbiWord project for their grammar-checker.
Created about 1 year ago.

1 Users
 

Ruby interface to the CRM114 Controllable Regex Mutilator, an advanced and fast text classifier that uses sparse binary polynomial matching with a Bayesian Chain Rule evaluator and a hidden Markov ... [More] model to categorize data with up to a 99.87% accuracy. [Less]
Created over 2 years ago.

1 Users
   

Affisix is a program for automatic recognition of affixes. It takes large amount of words and according to the user setting it tries to determine which segments of these words are prefixes.
Created about 1 year ago.