Browsing projects by Tag(s)

An open source full-text search engine and crawler based on established open source technologies: lucene, zkoss, tomcat, poi, pdfbox. Features include multilingual lemmatization, spellcheck, stop words, synonyms, facets, filters, a web crawler, a database crawler, a local and remote file system crawler, document indexing with OCR, and REST (XML or JSON) and SOAP APIs. It is a stable, high-performance, modern search engine and a suite of full-text search algorithms.

5.0
 
  0 reviews  |  3 users  |  69,029 lines of code  |  2 current contributors  |  Analyzed 6 days ago
 
 

Hyper Estraier is a full-text search system. It works like Google, but is based on a peer-to-peer architecture. With Hyper Estraier, a large-scale search engine can be built from inexpensive computers.

5.0
 
  0 reviews  |  3 users  |  0 current contributors
 
 

Key features: Support for http, https, ftp, nntp and news URL schemes. htdb virtual URL scheme for indexing SQL databases. Indexes text/html, text/xml, text/plain, audio/mpeg (MP3) and image/gif MIME types natively. External parser support for other document types, including Microsoft Word, Excel, RTF, PowerPoint, Adobe Acrobat PDF and Flash. Can index multilingual sites using content negotiation. Searches all word forms using ispell affixes and dictionaries. Synonym, acronym and abbreviation query expansion based on editable dictionaries, specified by language and charset. Stop-word, synonym and acronym lists. Options to query with all words, all words near each other, any words, or Boolean queries. A subset of VQL (Verity Query Language) is supported. Popularity Rank ba...

5.0
 
  0 reviews  |  3 users  |  276,070 lines of code  |  2 current contributors  |  Analyzed 5 days ago
 
 

BlackRay is a relational database system designed to offer performance features commonly associated with search engines. It offers SQL support and sophisticated operational and management features. Load balancing and operational stability by means of N+1 redundancy are included. BlackRay is called a "Data Engine" since it combines traditional relational database features and SQL with the power and flexibility of search engines. It is a true hybrid, offering transaction support, data-versioned snapshots, and sophisticated function-based indices. Wildcard, phonetic, and fuzzy logic searches are supported as well. Commercial support is available, and the project is released under the GPLv2 license.

0
 
  0 reviews  |  3 users  |  119,867 lines of code  |  2 current contributors  |  Analyzed over 1 year ago
 
 

KinoSearch is a loose port of the Java search engine library Apache Lucene, written in Perl and C. The archetypal application is website search, but it can be put to many different uses.

4.0
   
  0 reviews  |  3 users  |  61,380 lines of code  |  0 current contributors  |  Analyzed 10 days ago
 
 

The "xappy" Python module is an easy-to-use interface to the Xapian search engine. Xapian provides a low-level interface that deals with terms and documents, but does not concern itself with where terms come from or how to build searches that match the way the data has been indexed. In contrast, "xappy" lets you design a field structure, specifying what kind of information is held in particular fields, and then uses this field structure to index data appropriately and to build and perform searches.

0
 
  0 reviews  |  2 users  |  17,660 lines of code  |  0 current contributors  |  Analyzed 9 days ago
 
 

SCAN (Smart Content Aggregation and Navigation) is a text analysis, search, tagging and metadata management tool for personal document collections and other information resources.

5.0
 
  0 reviews  |  2 users  |  31,808 lines of code  |  0 current contributors  |  Analyzed about 2 hours ago
 
 

eZ Find is an enterprise-ready search plugin for eZ Publish, making it possible to search multiple eZ Publish installations simultaneously.

0
 
  0 reviews  |  2 users  |  34,266 lines of code  |  26 current contributors  |  Analyzed 6 days ago
 
 

IntelliGID is an open source enterprise content management (ECM) system developed by Doculibre.
- Document Management
- Record Management
- Enterprise Search
- Digital Assets Management
- Workflows (Activiti)
- Portal integration with Liferay

5.0
 
  0 reviews  |  1 user  |  242,061 lines of code  |  8 current contributors  |  Analyzed 3 days ago
 
 

Pyndexter provides a uniform API for accessing a variety of full-text search and indexing engines. It aims to be to full-text indexing systems what the Python DB API is to databases. It presents a uniform query syntax to the user, with support for quoted search terms, boolean operations, sub-expressions and attribute (metadata) querying. Supported indexers are a basic but functional pure-Python indexer and adapters for Hype, Hyperestraier, Lucene, Lupy, Pyndex, Swish-e and Xapian.

0
 
  0 reviews  |  1 user  |  1,908 lines of code  |  0 current contributors  |  Analyzed almost 2 years ago
 
 
 
 

Copyright © 2013 Black Duck Software, Inc. and its contributors, Some Rights Reserved. Unless otherwise marked, this work is licensed under a Creative Commons Attribution 3.0 Unported License. Ohloh® and the Ohloh logo are trademarks of Black Duck Software, Inc. in the United States and/or other jurisdictions. All other trademarks are the property of their respective holders.