Projects tagged ‘information_retrieval’ and ‘search’


[21 total]

198 Users
   

Lucene is an information retrieval API originally implemented in Java. Lucene has been ported to other programming languages including Perl, C#, C++, Python, Ruby and PHP.
Created over 3 years ago.

58 Users
   

Lucene.Net is a source-code, class-per-class, API-per-API and algorithmic port of the Java Lucene search engine to C# and the .NET platform, using the Microsoft .NET Framework.
Created over 2 years ago.

14 Users
   

Xapian is an Open Source Search Engine Library, released under the GPL. It's written in C++, with bindings to allow use from Perl, Python, PHP, Java, Tcl, C#, and Ruby (so far!). Xapian is a highly adaptable toolkit which allows developers to easily add advanced indexing and search facilities to their own applications. It supports the Probabilistic Information Retrieval model and also supports a rich set of boolean query operators.
Created over 3 years ago.

12 Users
   

Strigi is an information extraction and indexing library that comes with a daemon, which uses a very fast and efficient crawler that can index data on your hard drive. Indexing operations are performed without hammering your system, which makes Strigi the fastest and smallest desktop searching program.
Created over 3 years ago.

8 Users

Sphinx is a standalone full-text search engine, meant to provide fast, size-efficient and relevant full-text search functions to other applications. Sphinx was specially designed to integrate well with SQL databases and scripting languages.
Created about 1 year ago.

3 Users

KinoSearch is a loose port of the Java search engine library Apache Lucene, written in Perl and C. The archetypal application is website search, but it can be put to many different uses.
Created about 1 year ago.

3 Users
 

Hyper Estraier is a full-text search system. It works like Google, but is based on a peer-to-peer architecture. Using Hyper Estraier, we can construct a large-scale search engine with cheap computers.
Created over 3 years ago.

2 Users

An open source search engine based on the best open source technologies: Lucene, zkoss, Tomcat, POI, TagSoup. A stable, high-performance piece of software. It is both a modern search engine and a suite of high-powered full-text search algorithms.
Created 4 months ago.

2 Users

The "xappy" Python module is an easy-to-use interface to the Xapian search engine. Xapian provides a low-level interface, dealing with terms and documents, but not really worrying about where terms come from, or how to build searches to match the way in which data has been indexed. In contrast, "xappy" allows you to design a field structure, specifying what kind of information is held in particular fields, and then uses this field structure to index data appropriately, and to build and perform searches.
Created over 2 years ago.

2 Users

Key features: Support for http, https, ftp, nntp and news URL schemes. htdb virtual URL scheme for indexing SQL databases. Indexes text/html, text/xml, text/plain, audio/mpeg (MP3) and image/gif MIME types natively. External parser support for other document types, including Microsoft Word, Excel, RTF, PowerPoint, Adobe Acrobat PDF and Flash. Can index multilingual sites using content negotiation. Searching all of the word forms using ispell affixes and dictionaries. Synonym, acronym and abbreviation query expansion based on editable dictionaries, specified by language and charset. Stop-word, synonym and acronym lists. Options to query with all words, all words near each other, any words, or Boolean queries. A subset of VQL (Verity Query Language) is supported. Popularity Rank ba
Created about 1 year ago.