Browsing projects by Tag(s)

Select a tag to browse associated projects and drill deeper into the tag cloud.

Showing page 1 of 2

Apache Lucene is an information retrieval API originally implemented in Java. It is a technology suitable for nearly any application that requires full-text search, especially cross-platform. Lucene has been ported to other programming languages including Perl, C#, C++, Python, Ruby and PHP.

4.54054
   
  0 reviews  |  298 users  |  436,612 lines of code  |  29 current contributors  |  Analyzed 9 days ago
 
 

Lucene.Net is a source code, class-per-class, API-per-API and algorithmatic port of the Java Lucene search engine to the C# and .NET platform utilizing Microsoft .NET Framework.

4.33333
   
  0 reviews  |  75 users  |  297,148 lines of code  |  7 current contributors  |  Analyzed 25 minutes ago
 
 

Beagle is a desktop-independent service for indexing and searching your data. The Beagle daemon transparently monitors your data and updates the index to reflect any changes. On an inotify-enabled system, these updates (e.g. new files, emails, chat) happen more-or-less in real time. Beagle ... [More] supports many different file formats and can index files/email/browsing history/chat logs/RSS feeds etc. [Less]

4.1875
   
  0 reviews  |  52 users  |  231,120 lines of code  |  1 current contributor  |  Analyzed 7 days ago
 
 

Tracker is a first class object database, extensible tag/metadata database, search tool and indexer. It can trawl through your hard drive and index existing files and data stores It has been designed from the ground up to be very lightweight (the tracker daemon consumes ~4MB of RAM in typical ... [More] use) yet at the same time very fast too. It provides a comprehensive, persistent and extensible storage system that can store and index almost any object. These objects can also have extensible user defined metadata and tags to create rich first class objects. [Less]

3.875
   
  0 reviews  |  46 users  |  147,268 lines of code  |  43 current contributors  |  Analyzed 1 day ago
 
 

Sphinx is a full-text search engine, it's a standalone search engine, meant to provide fast, size-efficient and relevant full-text search functions to other applications. Sphinx was specially designed to integrate well with SQL databases and scripting languages.

4.4
   
  0 reviews  |  20 users  |  136,597 lines of code  |  5 current contributors  |  Analyzed almost 3 years ago
 
 

Xapian is an Open Source Search Engine Library, released under the GPL. It's written in C++, with bindings to allow use from Perl, Python, PHP, Java, Tcl, C#, and Ruby (so far!) Xapian is a highly adaptable toolkit which allows developers to easily add advanced indexing and search facilities ... [More] to their own applications. It supports the Probabilistic Information Retrieval model and also supports a rich set of boolean query operators. [Less]

4.44444
   
  0 reviews  |  16 users  |  112,508 lines of code  |  2 current contributors  |  Analyzed 3 days ago
 
 

Senna is an embeddable fulltext search engine, which you can use in conjunction with various scripting languages and databases. Senna is an inverted index based engine, and combines the best of n-gram indexing and word indexing to achieve fast, precise searches. While senna codebase is rather ... [More] compact it is scalable enough to handle large amounts of data and queries. [Less]

5.0
 
  0 reviews  |  4 users  |  146,818 lines of code  |  0 current contributors  |  Analyzed 10 days ago
 
 

BlackRay is a relational database system designed to offer performance features commonly associated with search engines. It offers SQL support and sophisticated operational and management features. Load-balancing and operational stability by means of N+1 redundance are included. BlackRay is ... [More] called a "Data Engine" since it combines traditional, relational database features and SQL with the power and flexibility of search engines. It is a true hybrid, offering transaction support, data-versioned snapshots, and sophisticated function-based indices. Wildcards, phonetic, and fuzzy logic searches are supported, as well. Commercial support is available, and the project is released under a the GPLv2 license. [Less]

0
 
  0 reviews  |  3 users  |  119,867 lines of code  |  2 current contributors  |  Analyzed over 1 year ago
 
 

Key features: Support for http, https, ftp, nntp and news URL schemes. htdb virtual URL scheme for indexing SQL databases. Indexes text/html, text/xml, text/plain, audio/mpeg (MP3) and image/gif mime types natively. External parsers support for other document types, including Microsoft Word ... [More] , Excel, RTF, PowerPoint, Adobe Acrobat PDF and Flash. Can index multilingual sites using content negotiation. Searching all of the word forms using ispell affixes and dictionaries. Synonym, acronym and abbreviation query expansion based on editable dictionaries, specified by language and charset. Stop-words, synonyms and acronyms lists. Options to query with all words, all words near to each others, any words, or Boolean queries. A subset of VQL (Verity Query Language) is supported. Popularity Rank ba [Less]

5.0
 
  0 reviews  |  3 users  |  276,070 lines of code  |  2 current contributors  |  Analyzed about 8 hours ago
 
 

An open source full-text search engine and crawler based on best open source technologies: lucene, zkoss, tomcat, poi, pdfbox. Multilingual lemmatization, spellcheck, stop words, synonyms, facet, filters, web crawler, database crawler, local and remote file system crawler, documents indexation ... [More] with OCR, REST with XML or JSON and SOAP API. A stable, high-performance piece of software. It is a modern search engine and a suite of high-powered full text search algorithms. [Less]

5.0
 
  0 reviews  |  3 users  |  69,029 lines of code  |  2 current contributors  |  Analyzed 7 days ago
 
 
 
 

Creative Commons License Copyright © 2013 Black Duck Software, Inc. and its contributors, Some Rights Reserved. Unless otherwise marked, this work is licensed under a Creative Commons Attribution 3.0 Unported License . Ohloh ® and the Ohloh logo are trademarks of Black Duck Software, Inc. in the United States and/or other jurisdictions. All other trademarks are the property of their respective holders.