Select a tag to browse associated projects and drill deeper into the tag cloud.
ZFS is a new kind of file system that provides simple administration, transactional semantics, end-to-end data integrity, and immense scalability. ZFS is not an incremental improvement to existing technology; it is a fundamentally new approach to data management. We've blown away 20 years of ... [More]
FSlint is a toolkit to find various forms of lint on a filesystem. At the moment it reports duplicate files, bad symbolic links, troublesome file names, empty directories, non stripped executables, temporary files, duplicate/conflicting (binary) names, and unused ext2 directory blocks. The package contains both a GTK GUI and a CLI interface.
Memventi is a Venti daemon. It speaks the same Venti protocol as the real Venti in Plan 9 from Bell Labs. It is a storage server that stores data blocks up to 56KB using its SHA-1 hash (called its score) to address it. It keeps a mapping of score to disk location in memory (in a memory-efficient ... [More]
An improvement upon Linux's memory merging support enabling transparent full system scan at ultra speed.
Cyphertite is a tar-like secure remote archiver. It deduplicates, compresses, and encrypts data prior to transmission, providing total privacy while reducing unnecessary wire traffic. It seamlessly supports IPv6 and IPv4 on a variety of platforms.
Ventisrv is a venti daemon for inferno, written in limbo. It has an in-memory index. Vcache is a (in-memory) venti cache. This package also contains simple tools to write to/read from a venti server. This code is the partial result of a (successful) google summer of code 2007 project, for the ... [More]
Duke is a fast record linkage and deduplication engine written in Java. It provides both an API and a command-line interface, and supports incremental processing. Duke is based on Lucene.
HADU is an acronnym for "I Hate Duplicates". It is intended to find and delete duplicate files. This will free space in your hard disk. It is made by inserting the "files" in a B+ tree and searching for duplicates following some criteria.
GAPS is an image sorting and viewing application written to deal with enormous folders full of images. It can easily handle hundreds of thousands of images per folder and continue to operate quickly and stably. GAPS' DuperFinder feature can scan all of your image folders for duplicates, shows ... [More]
Arrow is a backup system that combines hashing, error-correction, and rsync-like searching to provide a versioned, deduplicating, verifiable, and safe data backup system. The core of Arrow was done as a part of Casey Marshall's Master's project in computer science at the University of ... [More]
Copyright © 2013 Black Duck Software, Inc. and its contributors, Some Rights Reserved. Unless otherwise marked, this work is licensed under a Creative Commons Attribution 3.0 Unported License . Ohloh ® and the Ohloh logo are trademarks of Black Duck Software, Inc. in the United States and/or other jurisdictions. All other trademarks are the property of their respective holders.