[17 total ]
Lucene is an information retrieval API originally implemented in Java. Lucene has been ported to other programming languages including Perl, C#, C++, Python, Ruby and PHP.
Lucene.Net is a source code, class-per-class, API-per-API and algorithmatic port of the Java Lucene search engine to the C# and .NET platform utilizing Microsoft .NET Framework.
Xapian is an Open Source Search Engine Library, released under the GPL. It's written in C++, with bindings to allow use from Perl, Python, PHP, Java, Tcl, C#, and Ruby (so far!)
Xapian is a highly adaptable toolkit which allows developers to
... [More]
easily add advanced indexing and search facilities to their own applications. It supports the Probabilistic Information Retrieval model and also supports a rich set of boolean query operators.
Note that ohloh's automated summary below is confused by some ".st" data files which it assumes are uncommented smalltalk. So Xapian isn't written in smalltalk at all (it's mostly C++), and is actually quite well commented (27.6% comments compared to ohloh's 22% average for C++ projects). [Less]
Strigi is an information extraction and indexing library, that comes wih a daemon which uses a very fast and efficient crawler that can index data on your harddrive. Indexing operations are performed without hammering your system, this makes Strigi the fastest and smallest desktop searching program.
Hibernate Search brings the power of full text search engines to the persistence domain model and Hibernate experience, through transparent configuration (Hibernate Annotations) and a common API.
Full text search engines like Apache Lucene(tm)
... [More]
allow applications to execute free-text search queries. However, it becomes increasingly more difficult to index a more complex object domain model - keeping the index up to date, dealing with the mismatch between the index structure and the domain model, querying mismatches, and so on. [Less]
Hyper Estraier is a full-text search system. It works as with Google, but based on peer-to-peer architecture. Using Hyper Estraier, we can construct a large-scaled search engine with cheap computers.
The "xappy" python module is an easy-to-use interface to the Xapian search engine. Xapian provides a low level interface, dealing with terms and documents, but not really worrying about where terms come from, or how to build searches to match the way
... [More]
in which data has been indexed. In contrast, "xappy" allows you to design a field structure, specifying what kind of information is held in particular fields, and then uses this field structure to index data appropriately, and to build and perform searches. [Less]
eZ Find is an enterprise-ready search plugin for eZ Publish, making it possible to search multiple eZ Publish installations simultaneously.
Sphinx is a full-text search engine, it's a standalone search engine, meant to provide fast, size-efficient and relevant full-text search functions to other applications. Sphinx was specially designed to integrate well with SQL databases and scripting languages.
mnoGoSearch is a full-featured SQL based web search engine.
Pinot is :
- a D-Bus service that crawls, indexes your documents and monitors them for changes.
- a GTK-based user interface that enables to query the index built by the service or your favourite Web engine, and display and analyze the results.
It
... [More]
features :
* advanced queries
* language detection
* editing of metadata
* documents labeling
* stored queries
* results ranking history
* results lists export
* dynamic document summaries
* a D-Bus interface
* support for common file types.
* ability to query remote indexes [Less]
SCAN (Smart Content Aggregation and Navigation) is a text analysis, search, tagging and metadata management tool for personal document collections and other information resources.
Flax is a project to develop an open source enterprise search engine application based on the Xapian search engine library. It also contains a clean-and-simple Python interface suitable for many users of Xapian, built on the standard Xapian Python interface, together with various other add-ons such as performance testing utilities.
KinoSearch is a loose port of the Java search engine library Apache Lucene, written in Perl and C. The archetypal application is website search, but it can be put to many different uses.
Pyndexter provides a uniform API for accessing a variety of full-text search and indexing engines. It aims to be to full-text indexing systems what the Python DB API is to databases.
It presents a uniform query syntax to the user, with support for
... [More]
quoted search terms, boolean operations, sub-expressions and attribute (metadata) querying.
Indexers supported are a basic but functional pure-Python indexer, adapters for Hype, Hyperestraier, Lucene, Lupy, Pyndex, Swish-e and Xapian. [Less]
CS2 stands for C# Code Search. It's an academic project developed for the course of Information Retrieval at Università di Modena e Reggio Emilia, Italy.
Namazu is a full-text search engine intended for easy use. Not only does it work as a small or medium scale Web search engine, but also as a personal search system for email or other files.