Browsing projects by Tag(s)


Apache OpenNLP is a Java machine learning toolkit for natural language processing (NLP).

5.0
 
  0 reviews  |  11 users  |  467,326 lines of code  |  5 current contributors  |  Analyzed about 7 hours ago
 
 

DKPro is a collection of software components for natural language processing (NLP) based on the Apache UIMA framework. Many powerful, state-of-the-art NLP components are already freely available in the NLP research community, and new and improved components are continuously being developed and released. The components cover the whole range of NLP-related processing tasks. DKPro provides wrappers for such third-party tools as well as original NLP components. DKPro builds heavily on uimaFIT, which allows for rapid and easy development of NLP processing pipelines.

4.75
   
  0 reviews  |  6 users  |  166,955 lines of code  |  12 current contributors  |  Analyzed 5 days ago
 
 

jTokeniser is a set of classes that provide a variety of tokenisers for your Java projects. Simple tokenisers such as WhiteSpaceTokeniser or StringTokeniser provide basic token extraction, whereas RegexTokeniser and BreakIteratorTokeniser offer more advanced options for more thorough tokenisers that also discard punctuation. Recent additions include RegexSeparatorTokeniser, which allows complex definitions of token delimiters. A SentenceTokeniser is also provided for segmenting text into sentences, and a GUI front end lets you experiment without writing code.

0
 
  0 reviews  |  0 users  |  2,443 lines of code  |  0 current contributors  |  Analyzed 1 day ago
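
The tokenisation styles named in the jTokeniser description (whitespace splitting, regex-based extraction that discards punctuation, and sentence segmentation) can be illustrated with plain JDK classes. The sketch below does not use jTokeniser's own API, which is not shown in this listing; the class name TokeniserSketch and its method names are hypothetical and chosen only for illustration.

```java
import java.text.BreakIterator;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Locale;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical illustration class; not part of jTokeniser.
final class TokeniserSketch {

    // Whitespace tokenisation: split on runs of whitespace.
    // Punctuation stays attached to the neighbouring word.
    public static List<String> whitespaceTokens(String text) {
        String trimmed = text.trim();
        if (trimmed.isEmpty()) {
            return new ArrayList<>();
        }
        return new ArrayList<>(Arrays.asList(trimmed.split("\\s+")));
    }

    // Regex tokenisation: keep only runs of letters and digits,
    // which discards punctuation entirely.
    public static List<String> regexTokens(String text) {
        List<String> tokens = new ArrayList<>();
        Matcher m = Pattern.compile("[\\p{L}\\p{N}]+").matcher(text);
        while (m.find()) {
            tokens.add(m.group());
        }
        return tokens;
    }

    // Sentence segmentation using the JDK's BreakIterator.
    public static List<String> sentences(String text) {
        List<String> result = new ArrayList<>();
        BreakIterator it = BreakIterator.getSentenceInstance(Locale.ENGLISH);
        it.setText(text);
        int start = it.first();
        for (int end = it.next(); end != BreakIterator.DONE; start = end, end = it.next()) {
            String sentence = text.substring(start, end).trim();
            if (!sentence.isEmpty()) {
                result.add(sentence);
            }
        }
        return result;
    }

    public static void main(String[] args) {
        String text = "Hello, world! This is a test.";
        System.out.println(whitespaceTokens(text));
        System.out.println(regexTokens(text));
        System.out.println(sentences(text));
    }
}
```

The whitespace variant is the cheapest but leaves commas and full stops glued to tokens, while the regex variant trades that for a fixed notion of what counts as a word character. The sentence variant delegates boundary detection to the JDK's locale-aware rules.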
 
 
 
 

Copyright © 2013 Black Duck Software, Inc. and its contributors, Some Rights Reserved. Unless otherwise marked, this work is licensed under a Creative Commons Attribution 3.0 Unported License. Ohloh® and the Ohloh logo are trademarks of Black Duck Software, Inc. in the United States and/or other jurisdictions. All other trademarks are the property of their respective holders.