Projects tagged ‘data’


[72 total ]

50USERS
   

gnuplot plots 2d and 3d graphs, from a data file or with a formula. It has an interactive mode with online help, or it can be used non-interactively. gnuplot does function fitting to data sets, and it does output to many terminals, among which are PostScript, X11 display, PNG, and GIF (via the old gd library).

43USERS
   

R is a language and environment for statistical computing and graphics. It is a GNU project which is similar to the S language and environment which was developed at Bell Laboratories (formerly AT&T, now Lucent Technologies) by John Chambers and ... [More] colleagues. R can be considered as a different implementation of S. There are some important differences, but much code written for S runs unaltered under R. R provides a wide variety of statistical (linear and nonlinear modelling, classical statistical tests, time-series analysis, classification, clustering, ...) and graphical techniques, and is highly extensible. The S language is often the vehicle of choice for research in statistical methodology, and R provides an Open Source route to participation in that activity. [Less]

26USERS
   

iReport is a visual reporting tool based on JasperReports and written in pure Java. You can manage charts, images, sub-reports, etc. Data can be retrieved using JDBC, TableModels, JavaBeans, and XML. It supports output in PDF, XML, XSL, CSV, HTML, and Java2D.

20USERS
   

Official datapack for L2J Project.

16USERS
 

SubSonic is a toolset that helps a website build itself. It brings some of the fun and elegance of Ruby on Rails to ASP.NET.

14USERS
   

Npgsql is a .NET data provider for PostgreSQL. It allows any program developed for .NET framework to access database server. It is implemented in 100% C# code. Works with PostgreSQL 7.x and 8.x.

12USERS
   

Weka is a collection of machine learning algorithms for solving real-world data mining problems. It is written in Java and runs on almost any platform. The algorithms can either be applied directly to a dataset or called from your own Java code.

9USERS
   

System.Data.SQLite is an enhanced version of the original SQLite database engine. It is a complete drop-in replacement for the original sqlite3.dll (you can even rename it to sqlite3.dll). It has no linker dependency on the .NET runtime so it can ... [More] be distributed independently of .NET, yet embedded in the binary is a complete ADO.NET 2.0 provider for full managed development. [Less]

8USERS
 

RapidMiner (formerly YALE) is the most comprehensive open-source software for intelligent data analysis, data mining, knowledge discovery, machine learning, predictive analytics, forecasting, and analytics in business intelligence (BI). RapidMiner ... [More] provides more than 400 data mining operators, a graphical user interface (GUI), an online tutorial with hands-on data mining applications, a comprehensive PDF tutorial, many visualization schemes for data sets and data mining results, many different learning and meta-learning schemes ranging from decision tree and rule learners to neural networks, SVMs, ensemble methods, etc. RapidMiner is implemented in Java and available under GPL (GNU General Public License) as well as under a developer license (OEM license) for closed-source developers [Less]

8USERS
   

A core event, loop, abstraction, IPC, Data structure etc. library used to share out commonly useful routines that lots of applications need/use.

6USERS
   

Apache Xindice is a database designed from the ground up to store XML data or what is more commonly referred to as a native XML database.

5USERS
   

DataCleaner is an open source project concerned with creating a data quality solutions for business and organizations wishing to measure and increase the quality of their data. DataCleaner includes functionality to profile and compare data, to ... [More] validate data against business rules and to monitor the progression of these measurements over time. To do this we provide you with both a standalone desktop-application for exploring and defining the data quality effort and a web application for continuous data quality deployments. [Less]

5USERS

Cleversafe provides a dispersed data storage solution.

4USERS
 

Stxxl - Standard Template Library for Extra Large Data Sets. Stxxl is an implementation of the C++ standard template library STL for external memory (out-of-core) computations, containers, and algorithms that can process huge volumes of data that only fit on disks.

3USERS

The JGoodies Data Binding framework provides an elegant means to present object properties in a user interface. It reduces the code necessary for object presentation, stream-lines the development process for data binding, provides advanced features for automatic update notification and assists you in separating the domain and presentation layers.

3USERS
 

PyTables is a package for managing hierarchical datasets and designed to efficiently and easily cope with extremely large amounts of data. PyTables is built on top of the HDF5 library, using the Python language and the NumPy package (it also ... [More] supports numarray and Numeric right out-of-the-box). It features an object-oriented interface that, combined with C extensions for the peformance-critical parts of the code (generated using Pyrex), makes it a fast, yet extremely easy to use tool for interactively saving and retrieving very large amounts of data. One important feature of PyTables is that it optimizes memory and disk resources so that data takes much less space (specially if on-flight compression is used) than other solutions such as relational or object oriented databases. [Less]

2USERS
 

CRM114 is a system to examine incoming e-mail, system log streams, data files or other data streams, and to sort, filter, or alter the incoming files or data streams according to the user's wildest desires. Criteria for categorization of data can be ... [More] via a host of methods, including regexes, approximate regexes, a Hidden Markov Model, Orthogonal Sparse Bigrams, WINNOW, Correllation, KNN/Hyperspace, or Bit Entropy ( or by other means- it's all programmable). [Less]

2USERS
   

Clover.ETL is an Open Source, Java based ETL framework which can be used to transform structured data. While using JAVA technology it allows for platform independence. It can be used standalone - as a server application or can be embedded in other applications as a library.

2USERS

pChart is a PHP class oriented framework designed to create aliased charts. Data can be retrieved from SQL queries, CSV files, or manually provided.

2USERS

The iBATIS Data Mapper framework makes it easier to use a database with Java and .NET applications. iBATIS couples objects with stored procedures or SQL statements using a XML descriptor. Simplicity is the biggest advantage of the iBATIS Data Mapper over object relational mapping tools.

2USERS
   

Orange is a component-based data mining software. It includes a range of data visualization, exploration, preprocessing and modelling techniques. It can be used through a nice and intuitive user interface or, for more advanced users, as a module for Python programming language.

2USERS
   

GGobi is an open source visualization program for exploring high-dimensional data. It provides highly dynamic and interactive graphics such as tours, as well as familiar graphics such as the scatterplot, barchart and parallel coordinates plots. Plots are interactive and linked with brushing and identification.

1USERS

A set of data analysis tools.

1USERS
   

GNU ddrescue is a data recovery tool. It copies data from one file or block device (hard disc, cdrom, etc) to another, trying hard to rescue data in case of read errors. The basic operation of ddrescue is fully automatic. That is, you don't have ... [More] to wait for an error, stop the program, read the log, run it in reverse mode, etc. If you use the logfile feature of ddrescue, the data is rescued very efficiently (only the needed blocks are read). Also you can interrupt the rescue at any time and resume it later at the same point. [Less]

1USERS
 

XmlCatalog is an application written for Windows operating systems. It is designed to help you maintain and organize a catalog of your computer media collection, including CD-ROMs, DVDs, diskettes, removable drives, hard drives, or any other present ... [More] or future storage media. For the catalog database XML is used and all data is stored in UTF-8 format. XmlCatalog application is written in C++. It is free and open source software for everyone. [Less]

1USERS
 

RDV is a visualization environment for scientific and engineering data. Built on top of the RBNB dynamic data server, it supports local and remote data sources and can visualize numeric, video, image, and audio data in a number of different ways.

1USERS

A minimalistic data analysis application

1USERS

Magic Rescue scans a block device for file types it knows how to recover and calls an external program to extract them. It looks at "magic bytes" in file contents, so it can be used both as an undelete utility and for recovering a corrupted drive or partition. As long as the file data is there, it will find it.

1USERS

Free Entity Relationship and Reverse Engineering Tool