jWeb1T is an open source Java tool for efficiently searching n-gram data in the Web 1T 5-gram corpus format. It is based on a binary search algorithm that finds the n-grams and returns their frequency counts in logarithmic time. As the corpus is stored in many files a simple index is used to retrieve the files containing the n-grams.
DKPro Spelling is a collection of software components for spelling correction, especially for correcting real-word spelling errors. It is based on DKPro Core and the Apache UIMA.
DKPro Relatedness is a collection of software components for relatedness computation between texts of any length. It is based on DKPro Core and Apache UIMA.
DKPro Similarity is a collection of software components and experiments for computating the similarity between texts of any length (i.e. also between words). It is based on DKPro Core and Apache UIMA.
Copyright © 2013 Black Duck Software, Inc. and its contributors, Some Rights Reserved. Unless otherwise marked, this work is licensed under a Creative Commons Attribution 3.0 Unported License . Ohloh ® and the Ohloh logo are trademarks of Black Duck Software, Inc. in the United States and/or other jurisdictions. All other trademarks are the property of their respective holders.