jWeb1T is an open source Java tool for efficiently searching n-gram data in the Web 1T 5-gram corpus format. It is based on a binary search algorithm that finds the n-grams and returns their frequency counts in logarithmic time. As the corpus is stored in many files a simple index is used to retrieve the files containing the n-grams.
DKPro Spelling is a collection of software components for spelling correction, especially for correcting real-word spelling errors. It is based on DKPro Core and the Apache UIMA.
DKPro Relatedness is a collection of software components for relatedness computation between texts of any length. It is based on DKPro Core and Apache UIMA.
DKPro Similarity is a collection of software components and experiments for computating the similarity between texts of any length (i.e. also between words). It is based on DKPro Core and Apache UIMA.
Copyright
©
2013
Black Duck Software, Inc.
and its contributors, Some Rights Reserved. Unless otherwise marked, this work is licensed under a
Creative Commons Attribution 3.0 Unported License
. Ohloh
®
and the Ohloh logo are trademarks of
Black Duck Software, Inc.
in the United States and/or other jurisdictions. All other trademarks are the property of their respective holders.