Select a tag to browse associated projects and drill deeper into the tag cloud.
Apache Lucene is an information retrieval API originally implemented in Java. It is a technology suitable for nearly any application that requires full-text search, especially cross-platform. Lucene has been ported to other programming languages including Perl, C#, C++, Python, Ruby and PHP.
Lucene.Net is a source code, class-per-class, API-per-API and algorithmatic port of the Java Lucene search engine to the C# and .NET platform utilizing Microsoft .NET Framework.
Beagle is a desktop-independent service for indexing and searching your data. The Beagle daemon transparently monitors your data and updates the index to reflect any changes. On an inotify-enabled system, these updates (e.g. new files, emails, chat) happen more-or-less in real time. Beagle ... [More]
Tracker is a first class object database, extensible tag/metadata database, search tool and indexer. It can trawl through your hard drive and index existing files and data stores It has been designed from the ground up to be very lightweight (the tracker daemon consumes ~4MB of RAM in typical ... [More]
NLTK — the Natural Language Toolkit — is a suite of open source Python modules, linguistic data and documentation for research and development in natural language processing, supporting dozens of NLP tasks, with distributions for Windows, Mac OSX and Linux.
Sphinx is a full-text search engine, it's a standalone search engine, meant to provide fast, size-efficient and relevant full-text search functions to other applications. Sphinx was specially designed to integrate well with SQL databases and scripting languages.
Xapian is an Open Source Search Engine Library, released under the GPL. It's written in C++, with bindings to allow use from Perl, Python, PHP, Java, Tcl, C#, and Ruby (so far!) Xapian is a highly adaptable toolkit which allows developers to easily add advanced indexing and search facilities ... [More]
Hibernate Search brings the power of full text search engines to the persistence domain model and Hibernate experience, through transparent configuration via annotations and a common API. Full text search engines like Apache Lucene(tm) allow applications to execute free-text search queries. ... [More]
GATE (General Architecture for Text Engineering) is an architecture, framework and development environment for developing, evaluating and embedding Human Language Technology