[13 total ]
Lucene.Net is a source code, class-per-class, API-per-API and algorithmatic port of the Java Lucene search engine to the C# and .NET platform utilizing Microsoft .NET Framework.
The ht://Dig system is a complete WWW indexing and searching system for a domain or intranet. This system is not meant to replace the need for internet-wide search systems like Lycos, Infoseek, Google, and AltaVista. Instead, it is meant to cover the search needs for a single company, campus, or even a particular sub-section of a Web site.
Xapian is an Open Source Search Engine Library, released under the GPL. It's written in C++, with bindings to allow use from Perl, Python, PHP, Java, Tcl, C#, and Ruby (so far!)
Xapian is a highly adaptable toolkit which allows developers to
... [More]
easily add advanced indexing and search facilities to their own applications. It supports the Probabilistic Information Retrieval model and also supports a rich set of boolean query operators.
Note that ohloh's automated summary below is confused by some ".st" data files which it assumes are uncommented smalltalk. So Xapian isn't written in smalltalk at all (it's mostly C++), and is actually quite well commented (27.6% comments compared to ohloh's 22% average for C++ projects). [Less]
Senna is an embeddable fulltext search engine, which you can use in conjunction with various scripting languages and databases. Senna is an inverted index based engine, and combines the best of n-gram indexing and word indexing to achieve fast
... [More]
, precise searches. While senna codebase is rather compact it is scalable enough to handle large amounts of data and queries. [Less]
The "xappy" python module is an easy-to-use interface to the Xapian search engine. Xapian provides a low level interface, dealing with terms and documents, but not really worrying about where terms come from, or how to build searches to match the way
... [More]
in which data has been indexed. In contrast, "xappy" allows you to design a field structure, specifying what kind of information is held in particular fields, and then uses this field structure to index data appropriately, and to build and perform searches. [Less]
Sphinx is a full-text search engine, it's a standalone search engine, meant to provide fast, size-efficient and relevant full-text search functions to other applications. Sphinx was specially designed to integrate well with SQL databases and scripting languages.
Pyndexter provides a uniform API for accessing a variety of full-text search and indexing engines. It aims to be to full-text indexing systems what the Python DB API is to databases.
It presents a uniform query syntax to the user, with support for
... [More]
quoted search terms, boolean operations, sub-expressions and attribute (metadata) querying.
Indexers supported are a basic but functional pure-Python indexer, adapters for Hype, Hyperestraier, Lucene, Lupy, Pyndex, Swish-e and Xapian. [Less]
eZ Find is an enterprise-ready search plugin for eZ Publish, making it possible to search multiple eZ Publish installations simultaneously.
Flax is a project to develop an open source enterprise search engine application based on the Xapian search engine library. It also contains a clean-and-simple Python interface suitable for many users of Xapian, built on the standard Xapian Python interface, together with various other add-ons such as performance testing utilities.
mnoGoSearch is a full-featured SQL based web search engine.
Sphider is a lightweight web spider and search engine written in PHP, using MySQL as its back end database. It is a great tool for adding search functionality to your web site or building your custom search engine. Sphider is small, easy to set up
... [More]
and modify, and is used in thousands of websites across the world.
Sphider supports all standard search options, but also includes a plethora of advanced features such as word autocompletion, spelling suggestions etc. The sophisticated adminstration interface makes administering the system easy. [Less]
CS2 stands for C# Code Search. It's an academic project developed for the course of Information Retrieval at Università di Modena e Reggio Emilia, Italy.
Namazu is a full-text search engine intended for easy use. Not only does it work as a small or medium scale Web search engine, but also as a personal search system for email or other files.