Select a tag to browse associated projects and drill deeper into the tag cloud.
K.E.T.T.L.E (Kettle ETTL Environment) is a meta-data driven ETTL tool. (ETTL: Extraction, Transformation, Transportation & Loading) This means that no code has to be written to perform complex data transformations. Environment means that it is possible to create plugins to do custom ... [More]
OpenJUMP is an open source GIS software written in Java. It is based on JUMP GIS by Vivid Solutions. It is a Vector GIS that can read rasters as well. It is not just another free demo viewer, but you can edit, save, analyze etc. with OpenJUMP. It works, even with medium size datasets, and with ... [More]
Apatar is an open source data integration application to help join your desktop data with the web. Apatar effectively integrates data and applications, and provides visual job designer and mapping, joins, filtering, data cleansing and validation capabilities. Connectors include MySQL, PostgreSQL ... [More]
Teiid is The Enterprise Information Integration (virtual) Database. Teiid is a data virtualization system that allows applications to use data from hetergenous data sources. Teiid is comprised of tools, components and services for creating and executing bi-directional data services. Through ... [More]
BioMart is a query-oriented data management system developed jointly by the European Bioinformatics Institute (EBI) and Cold Spring Harbor Laboratory (CSHL). The system can be used with any type of data and comes with a range of query interfaces and administration tools, including 'out of the ... [More]
ChronicleDroid assist in creating ETL mappings for tracicking changes on a existing OLTP Schema. ChronicleDroid implements The Data Vault architecture. Currently in early development stage
The Distributed Annotation System (DAS) defines a communication protocol used to exchange biological sequence annotations. DAS is a client-server system in which a single client integrates data from multiple servers. Data distribution, performed by DAS servers, is separated from visualization ... [More]
A collection of general purpose maven-driven TOS components with various intended uses, from social network analysis to webservice connectors to tweet parsing.
Jitterbit is an open source client and server designed to give end users a quick and easy way to design, configure, test, and deploy integration solutions. Organizations can use Jitterbit to connect data from ERP and CRM applications, data warehouses, online marketplaces, and much more. Jitterbit ... [More]
An extension package to Pentaho Data Integration (a.k.a. Kettle), providing plug-ins. Steps/job entries can be downloaded independently and each comes with source code in the .zip file. All are licensed as LGPL or GPL. Steps currently available: Asciify, TrimCut, Date Generator