Projects tagged ‘bayesian’ and ‘classifier’


[4 total]

1 Users
 

Ruby interface to the CRM114 Controllable Regex Mutilator, an advanced and fast text classifier that uses sparse binary polynomial matching with a Bayesian Chain Rule evaluator and a hidden Markov model to categorize data with up to 99.87% accuracy.
Created over 3 years ago.

0 Users

This is a naive Bayesian text classifier that, given a piece of text, can tell you the posterior probability (returned as a log likelihood) that it comes from each of the standard scientific article sections (Introduction, Methods, Results, Discussion). A piece of text can be anything you want, from a few words to a whole article; you decide on the boundaries.

You can use it to:
- Pull out all the pieces of text in an article that come from your chosen section.
- Score each piece of text in 4 dimensions (i.e. how introductory, methodological, results-based or conclusionary it is), which may be useful for finding similar text.
- Create training data, etc.

This project contains:
- A local version of the classifier, for better performance (see Downloads, or choose a version from the downloads list to the right)
- Links and info on how to access the web service interface to the classifier (see below)
- Example client software in Ruby and Perl for accessing the web service from your code (see Downloads)
- Some Taverna workflows that demonstrate how to use the classifier web service (coming soon)
- The training data, useful for error analyses (datastore.dump)
- A user guide for the web service (coming soon)
- Some javadoc describing the code for the web service interface and its associated bean classes (ArticleSectionClassifier-1.0-javadoc.zip)

Test the classifier
You can test the classifier through your browser here.

Download the classifier
Downloads are all listed here, or you can see the featured downloads on the right hand side of this page.

SOAP/WSDL Web Service
The WSDL document for the web service is here, and you can test it directly through your browser here. You can also download local clients in Java, Perl and Ruby for accessing the web service.

Using the classifier in Java
First, you must include the ArticleSectionClassifer.jar file in your classpath. Secondly, it is best to increase the memory allocated to the Java executable using the -Xmx Java VM argument; I usually suggest -Xmx256m. Then you can use the classifier in a few lines of code:

    String textToClassify = "classify this text";
    ArticleSectionClassifier classifier = new ArticleSectionClassifier();
    String classifiedAs = classifier.classifyText(textToClassify);
    System.out.println(classifiedAs);

If you want more detailed output, try:

    String textToClassify = "classify this text";
    ClassificationInput input = new ClassificationInput(textToClassify);
    ArticleSectionClassifier classifier = new ArticleSectionClassifier();
    ClassificationResult result = classifier.classifyText(input);
    System.out.println(result);

Source code
Details on getting the source code for this project can be found by clicking on the 'Source' tab above or by clicking here. You can also download an archive of the source from here. The source is uploaded as part of a Netbeans project, which can be opened directly in Netbeans or imported into Eclipse or most other IDEs.

Problems
If something isn't working, then please post an issue. I will be notified by email and I'll sort it out as soon as I can. It's very likely that something won't work, but it's also likely that I've encountered it before and will know how to fix it, so if you have a problem, let me know.
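
For readers unfamiliar with how a naive Bayesian classifier arrives at a log-likelihood score per section, the following is a minimal sketch of the general technique. It is not the ArticleSectionClassifier code: the class NaiveBayesSketch, its methods, the tokenizer, the add-one (Laplace) smoothing and the tiny training sentences are all assumptions made for illustration. The scores it returns are unnormalized log posteriors (log prior plus summed log word likelihoods), so only their relative order is meaningful.

```java
import java.util.HashMap;
import java.util.HashSet;
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.Set;

// Hypothetical illustration only; not the ArticleSectionClassifier implementation.
final class NaiveBayesSketch {
    private final Map<String, Map<String, Integer>> wordCounts = new LinkedHashMap<>();
    private final Map<String, Integer> totalWords = new HashMap<>();
    private final Map<String, Integer> docCounts = new HashMap<>();
    private final Set<String> vocabulary = new HashSet<>();
    private int totalDocs = 0;

    // Assumed tokenizer: lowercase, split on anything that is not a letter or digit.
    private static String[] tokenize(String text) {
        return text.toLowerCase().split("[^a-z0-9]+");
    }

    // Record one labelled training text.
    void train(String label, String text) {
        Map<String, Integer> counts = wordCounts.computeIfAbsent(label, k -> new HashMap<>());
        for (String word : tokenize(text)) {
            if (word.isEmpty()) continue;
            counts.merge(word, 1, Integer::sum);
            totalWords.merge(label, 1, Integer::sum);
            vocabulary.add(word);
        }
        docCounts.merge(label, 1, Integer::sum);
        totalDocs++;
    }

    // Unnormalized log posterior per label: log P(label) + sum of log P(word | label),
    // with add-one smoothing so unseen words do not produce log(0).
    Map<String, Double> logScores(String text) {
        Map<String, Double> scores = new LinkedHashMap<>();
        for (String label : wordCounts.keySet()) {
            double score = Math.log((double) docCounts.get(label) / totalDocs);
            Map<String, Integer> counts = wordCounts.get(label);
            int total = totalWords.getOrDefault(label, 0);
            for (String word : tokenize(text)) {
                if (word.isEmpty()) continue;
                int count = counts.getOrDefault(word, 0);
                score += Math.log((count + 1.0) / (total + vocabulary.size()));
            }
            scores.put(label, score);
        }
        return scores;
    }

    // Label with the highest score, or null if nothing has been trained.
    String classify(String text) {
        return logScores(text).entrySet().stream()
                .max(Map.Entry.comparingByValue())
                .map(Map.Entry::getKey)
                .orElse(null);
    }

    public static void main(String[] args) {
        NaiveBayesSketch nb = new NaiveBayesSketch();
        nb.train("Introduction", "previous studies have suggested that");
        nb.train("Methods", "samples were incubated and centrifuged");
        nb.train("Results", "we observed a significant increase");
        nb.train("Discussion", "these findings suggest a possible role");
        String text = "samples were centrifuged";
        System.out.println(nb.logScores(text));
        System.out.println(nb.classify(text));
    }
}
```

Scoring every section, rather than returning only the winner, is what makes the "4 dimensions" use described above possible: each piece of text gets one score per section.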
Created about 1 year ago.

0 Users

Casual Encounters of the Third Kind is a Bayesian classifier for Craigslist Casual Encounters (in particular, w4m). Since the w4m section is the most abused section of Casual Encounters, this project aims to filter out posts that are likely spam and show only posts that are relevant.
Created 7 months ago.

0 Users

naivebayesianclassifier
Created about 1 year ago.