Browsing projects by Tag(s)

Select a tag to browse associated projects and drill deeper into the tag cloud.

Showing page 1 of 1

Adaptive Information Extraction (ALP)The ALP package implements an information extraction algorithm, the Learning Pattern by Language Processing (LP) algorithm as described in F. Ciravegna, (LP)2, an Adaptive Algorithm for Information Extraction from Web-related Texts. Simplified does the software ... [More] its best to find rules to detect the start and the end of some text (also known as Named Entity Recognition). For example finding the Person Peter Vankman in the Text "Peter Venkman, Ph.D. is a fictional scientist and member of the Ghostbusters, appearing in the films Ghostbusters and Ghostbusters II" What you need to have to start the rule learning process: A somehow pre NLP'ized gate document (containg Part of Speech, LEMMA, or Gazetteer informations) Some manually annotated Tokens in the Document as true Positives (such as PERSON, ORGANIZATION) If everything works as expected you should get ready to use gate rules for the provided examples. Reports/statistics regarding precision, recall and b-fmeasure values are generated too. The code uses but does not depend, on the gate framework (http://www.gate.ac.uk). The framework is therefore included in binary form in the java archiva in the distribution file. The current version is 1.0-SNAPSHOT and available in the svn trunk. The older versions are deprecated. Georg Öttl [Less]

0
 
  0 reviews  |  0 users  |  249,336 lines of code  |  0 current contributors  |  Analyzed 1 day ago
 
 

CPSC 315 Team Project 3: Information Retrieval and Visualization

0
 
  0 reviews  |  0 users  |  2,879 lines of code  |  0 current contributors  |  Analyzed 3 days ago
 
 

ThraxR - Information Retrieval é um software que realiza a indexação de arquivos-texto e permite a realização de buscas nos arquivos indexados. ThraxR - Information Retrieval is a software that performs the indexing of text files and allows the searching of the indexed files. Notes: ... [More] http://www.google.com/notebook/public/03954281157737682467/BDSedSgoQ2u34zsUk Blog: http://thraxr.blogspot.com/ [Less]

0
 
  0 reviews  |  0 users  |  0 current contributors  |  Analyzed 5 days ago
 
 
 
 

Creative Commons License Copyright © 2013 Black Duck Software, Inc. and its contributors, Some Rights Reserved. Unless otherwise marked, this work is licensed under a Creative Commons Attribution 3.0 Unported License . Ohloh ® and the Ohloh logo are trademarks of Black Duck Software, Inc. in the United States and/or other jurisdictions. All other trademarks are the property of their respective holders.