Browsing projects by Tag(s)

Select a tag to browse associated projects and drill deeper into the tag cloud.

Showing page 1 of 23

SunPinyin is an opensource'd (in CDDL/LGPLv2.1) and SLM (Statistical Language Model) based Chinese PinYin input method engine. Currently, it's available on all UNIX platforms including MacOSX.

4.76923
   
  1 review  |  102 users  |  42,758 lines of code  |  9 current contributors  |  Analyzed 7 months ago
 
 

NLTK — the Natural Language Toolkit — is a suite of open source Python modules, linguistic data and documentation for research and development in natural language processing, supporting dozens of NLP tasks, with distributions for Windows, Mac OSX and Linux.

5.0
 
  0 reviews  |  40 users  |  214,336 lines of code  |  43 current contributors  |  Analyzed 4 days ago
 
 
Compare

GATE (General Architecture for Text Engineering) is an architecture, framework and development environment for developing, evaluating and embedding Human Language Technology

5.0
 
  0 reviews  |  12 users  |  613,598 lines of code  |  12 current contributors  |  Analyzed 1 day ago
 
 

TreeTagger for Java is a Java wrapper around the popular TreeTagger package by Helmut Schmid. It was written with a focus on platform-independence and easy integration into applications. It is written in Java 5 and has been tested on OS X, Ubuntu Linux, and Windows.

5.0
 
  0 reviews  |  11 users  |  2,460 lines of code  |  1 current contributor  |  Analyzed 7 days ago
 
 

Apache OpenNLP is a Java machine learning toolkit for natural language processing (NLP).

5.0
 
  0 reviews  |  10 users  |  467,231 lines of code  |  5 current contributors  |  Analyzed 4 days ago
 
 

Apertium is an open-source machine translation platform, aimed at related-language pairs but expanded to deal with more divergent language pairs. The platform provides 1. a language-independent machine translation engine 2. tools to manage the linguistic data necessary to build a machine ... [More] translation system for a given language pair and 3. linguistic data for a growing number of language pairs. Apertium uses a shallow-transfer machine translation engine which processes the input text in stages, as in an assembly line: de-formatting, morphological analysis, part-of-speech disambiguation, shallow structural transfer, lexical transfer, morphological generation, and re-formatting. [Less]

5.0
 
  0 reviews  |  10 users  |  16,830,653 lines of code  |  41 current contributors  |  Analyzed 6 days ago
 
 

The Open Cognition Framework (OpenCog) is software for the collaborative development of safe and beneficial Artificial General Intelligence. OpenCog provides research scientists and software developers with a common platform to build and share artificial intelligence programs. Programs written ... [More] or adapted for OpenCog may be combined and used in concert with one another for experimentation or to achieve better results compared to their stand-alone counterparts. OpenCog is under active development, but doesn't yet have a official release. It is currently best suited for machine learning developers, but have an interest in making more accessible to new comers. [Less]

5.0
 
  0 reviews  |  8 users  |  318,356 lines of code  |  32 current contributors  |  Analyzed 5 days ago
 
 

DKPro is a collection of software components for natural language processing (NLP) based on the Apache UIMA framework. Many powerful and state-of-the-art NLP components are already freely available in the NLP research community. New and improved components are being developed and released ... [More] continuously. The components cover the whole range of NLP-related processing tasks. DKPro provides wrappers for such third-party tool as well as original NLP components. DKPro builds heavily on uimaFIT which allows for rapid and easy development of NLP processing pipelines. [Less]

4.75
   
  0 reviews  |  7 users  |  162,207 lines of code  |  13 current contributors  |  Analyzed 11 days ago
 
 

bamboo is a chinese natrual language processing system. Currently, it includes chinese word tokenization, part of speech tagging and name entity recognition. bamboo是一个中文语言处理系统。目前包括中文分词、词性标注和命名实体识别。

5.0
 
  0 reviews  |  6 users  |  49,842 lines of code  |  0 current contributors  |  Analyzed 3 days ago
 
 

LanguageTool is an Open Source language checker for English, German, Polish, Dutch, and other languages. It's rule based, i.e. it will find errors for which a rule is defined in an XML configuration files. Rules for more complicated errors can be written in Java.

4.0
   
  0 reviews  |  6 users  |  297,879 lines of code  |  15 current contributors  |  Analyzed about 20 hours ago
 
 
 
 

Creative Commons License Copyright © 2013 Black Duck Software, Inc. and its contributors, Some Rights Reserved. Unless otherwise marked, this work is licensed under a Creative Commons Attribution 3.0 Unported License . Ohloh ® and the Ohloh logo are trademarks of Black Duck Software, Inc. in the United States and/or other jurisdictions. All other trademarks are the property of their respective holders.