Projects tagged ‘information_retrieval’ and ‘natural_language’


[6 total ]

19 Users
 

NLTK — the Natural Language Toolkit — is a suite of open source Python modules, linguistic data and documentation for research and development in natural language processing, supporting dozens of ... [More] NLP tasks, with distributions for Windows, Mac OSX and Linux. [Less]
Created over 3 years ago.

7 Users

GATE (General Architecture for Text Engineering) is an architecture, framework and development environment for developing, evaluating and embedding Human Language Technology
Created over 3 years ago.

1 Users
 

OpenEphyra is the first open source question answering (QA) system. It retrieves answers to natural language questions from the Web and other sources - just type in your questions and get back ... [More] answers. OpenEphyra comes with implementations of algorithms that proved effective in Carnegie Mellon's Ephyra system, which participated in the TREC evaluations. It is platform independent and can be set up in just a few minutes. The goal of this project is to give researchers the opportunity to develop new QA techniques without worrying about the end-to-end system. OpenEphyra also facilitates evaluations and comparisons of different approaches by providing a common platform for experiments. In addition, OpenEphyra can be used for educational purposes, such as for computer science course projects. [Less]
Created about 1 year ago.

1 Users

TestEl is a Java-based learning analyzer for HTML (and possibly other) structured documents. It can be trained to detect structures in such documents and renders hits in XML.
Created 12 months ago.

1 Users

A high-level interface to the CMU Link Grammar. This binding wraps the link-grammar shared library provided by the AbiWord project for their grammar-checker.
Created about 1 year ago.

0 Users
 

webXcreta sucks down the latest entries for the currently most popular blogs on the Intarweb. It then parses each weblog entry using natural language processing (NLTK) and figures out what words are ... [More] verbs, nouns, adjectives, definite articles, etc. Next, it creates weighted values based on how high-ranking each blog is (higher ranking blogs have a greater influence over sentence count, word order, and vocabulary). The reassembled bits get spit out and posted here. [Less]
Created about 1 year ago.