Browsing projects by Tag(s)

Raingrams is a flexible, general-purpose n-gram library written in Ruby. It supports n-gram sizes greater than 1, text and non-text grams, multiple parsing styles, and open and closed vocabulary models.

0 reviews  |  1 user  |  1,601 lines of code  |  0 current contributors  |  Analyzed 4 days ago

PyGrams converts text to n-grams. Conversion is a three-step process:

1) Extract all possible n-grams. Run "form_candidates.py" to create a file containing all possible n-grams.
2) Filter possible n-grams. Run "filter_candidates.py" to find just the n-grams that appear sufficiently frequently in relation to the frequency of their components.
3) Convert documents to n-grams. Run "convert_docs.py" to convert documents into approved n-grams.

Sample Input: We introduce a family of rings of symmetric functions depending on an infinite sequence of parameters.
Sample Output: introduc famili ring symmetr_function depend infinit_sequenc paramet

Additional documentation appears in the README file. Note that this software depends on the porter_stemmer.py module, which is available from http://tartarus.org/~martin/PorterStemmer/python.txt. PyGrams has been tested with Python 2.5 on Linux and was developed with support from The Bibliographic Knowledge Network.
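The three steps described for PyGrams can be illustrated with a minimal sketch. This is not PyGrams' own code: the function names only echo the script names, the filtering rule (n-gram count divided by the count of its rarest component, with hypothetical `min_ratio` and `min_count` thresholds) is an assumption about what "sufficiently frequently in relation to the frequency of their components" might mean, and the input is assumed to be already stemmed and stripped of stop words.

```python
from collections import Counter


def form_candidates(docs, n=2):
    """Step 1: count every contiguous n-gram (as a tuple) across tokenized docs."""
    counts = Counter()
    for tokens in docs:
        for i in range(len(tokens) - n + 1):
            counts[tuple(tokens[i:i + n])] += 1
    return counts


def filter_candidates(ngram_counts, unigram_counts, min_ratio=0.5, min_count=2):
    """Step 2: keep n-grams that are frequent relative to their rarest component.

    The ratio test and both thresholds are illustrative assumptions,
    not the criterion PyGrams actually uses.
    """
    approved = set()
    for gram, count in ngram_counts.items():
        rarest = min(unigram_counts[word] for word in gram)
        if count >= min_count and count / rarest >= min_ratio:
            approved.add(gram)
    return approved


def convert_doc(tokens, approved, n=2):
    """Step 3: scan left to right, joining approved n-grams with underscores."""
    out, i = [], 0
    while i < len(tokens):
        gram = tuple(tokens[i:i + n])
        if len(gram) == n and gram in approved:
            out.append("_".join(gram))
            i += n
        else:
            out.append(tokens[i])
            i += 1
    return out


# Toy corpus of already-stemmed tokens (hypothetical data for illustration).
docs = [
    ["introduc", "famili", "ring", "symmetr", "function"],
    ["symmetr", "function", "depend", "infinit", "sequenc"],
    ["infinit", "sequenc", "paramet"],
]
unigrams = Counter(word for doc in docs for word in doc)
bigrams = form_candidates(docs, n=2)
approved = filter_candidates(bigrams, unigrams)
converted = [convert_doc(doc, approved) for doc in docs]
print(converted)
```

In this toy corpus only "symmetr function" and "infinit sequenc" occur twice, so they are the only bigrams joined in the converted documents; every other token is passed through unchanged.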

0 reviews  |  0 users  |  148 lines of code  |  0 current contributors  |  Analyzed 2 days ago

Mulm is a state-of-the-art Hidden Markov Model toolkit.

0 reviews  |  0 users  |  5,046 lines of code  |  2 current contributors  |  Analyzed 8 days ago

Copyright © 2013 Black Duck Software, Inc. and its contributors, Some Rights Reserved. Unless otherwise marked, this work is licensed under a Creative Commons Attribution 3.0 Unported License. Ohloh® and the Ohloh logo are trademarks of Black Duck Software, Inc. in the United States and/or other jurisdictions. All other trademarks are the property of their respective holders.