[72 total ]
gnuplot plots 2d and 3d graphs, from a data file or with a formula. It has an interactive mode with online help, or it can be used non-interactively. gnuplot does function fitting to data sets, and it does output to many terminals, among which are PostScript, X11 display, PNG, and GIF (via the old gd library).
R is a language and environment for statistical computing and graphics. It is a GNU project which is similar to the S language and environment which was developed at Bell Laboratories (formerly AT&T, now Lucent Technologies) by John Chambers and
... [More]
colleagues. R can be considered as a different implementation of S. There are some important differences, but much code written for S runs unaltered under R.
R provides a wide variety of statistical (linear and nonlinear modelling, classical statistical tests, time-series analysis, classification, clustering, ...) and graphical techniques, and is highly extensible. The S language is often the vehicle of choice for research in statistical methodology, and R provides an Open Source route to participation in that activity. [Less]
iReport is a visual reporting tool based on JasperReports and written in pure Java. You can manage charts, images, sub-reports, etc. Data can be retrieved using JDBC, TableModels, JavaBeans, and XML. It supports output in PDF, XML, XSL, CSV, HTML, and Java2D.
Official datapack for L2J Project.
SubSonic is a toolset that helps a website build itself. It brings some of the fun and elegance of Ruby on Rails to ASP.NET.
Npgsql is a .NET data provider for PostgreSQL. It allows any program developed for .NET framework to access database server. It is implemented in 100% C# code. Works with PostgreSQL 7.x and 8.x.
Weka is a collection of machine learning algorithms for solving real-world data mining problems. It is written in Java and runs on almost any platform. The algorithms can either be applied directly to a dataset or called from your own Java code.
System.Data.SQLite is an enhanced version of the original SQLite database engine. It is a complete drop-in replacement for the original sqlite3.dll (you can even rename it to sqlite3.dll). It has no linker dependency on the .NET runtime so it can
... [More]
be distributed independently of .NET, yet embedded in the binary is a complete ADO.NET 2.0 provider for full managed development. [Less]
RapidMiner (formerly YALE) is the most comprehensive open-source software for intelligent data analysis, data mining, knowledge discovery, machine learning, predictive analytics, forecasting, and analytics in business intelligence (BI). RapidMiner
... [More]
provides more than 400 data mining operators, a graphical user interface (GUI), an online tutorial with hands-on data mining applications, a comprehensive PDF tutorial, many visualization schemes for data sets and data mining results, many different learning and meta-learning schemes ranging from decision tree and rule learners to neural networks, SVMs, ensemble methods, etc.
RapidMiner is implemented in Java and available under GPL (GNU General Public License) as well as under a developer license (OEM license) for closed-source developers [Less]
A core event, loop, abstraction, IPC, Data structure etc. library used to share out commonly useful routines that lots of applications need/use.
Apache Xindice is a database designed from the ground up to store XML data or what is more commonly referred to as a native XML database.
DataCleaner is an open source project concerned with creating a data quality solutions for business and organizations wishing to measure and increase the quality of their data. DataCleaner includes functionality to profile and compare data, to
... [More]
validate data against business rules and to monitor the progression of these measurements over time. To do this we provide you with both a standalone desktop-application for exploring and defining the data quality effort and a web application for continuous data quality deployments. [Less]
Cleversafe provides a dispersed data storage solution.
Stxxl - Standard Template Library for Extra Large Data Sets.
Stxxl is an implementation of the C++ standard template library STL for external memory (out-of-core) computations, containers, and algorithms that can process huge volumes of data that only fit on disks.
The JGoodies Data Binding framework provides an elegant means to present object properties in a user interface. It reduces the code necessary for object presentation, stream-lines the development process for data binding, provides advanced features for automatic update notification and assists you in separating the domain and presentation layers.
PyTables is a package for managing hierarchical datasets and designed to efficiently and easily cope with extremely large amounts of data.
PyTables is built on top of the HDF5 library, using the Python language and the NumPy package (it also
... [More]
supports numarray and Numeric right out-of-the-box). It features an object-oriented interface that, combined with C extensions for the peformance-critical parts of the code (generated using Pyrex), makes it a fast, yet extremely easy to use tool for interactively saving and retrieving very large amounts of data. One important feature of PyTables is that it optimizes memory and disk resources so that data takes much less space (specially if on-flight compression is used) than other solutions such as relational or object oriented databases. [Less]
CRM114 is a system to examine incoming e-mail, system log streams, data files or other data streams, and to sort, filter, or alter the incoming files or data streams according to the user's wildest desires. Criteria for categorization of data can be
... [More]
via a host of methods, including regexes, approximate regexes, a Hidden Markov Model, Orthogonal Sparse Bigrams, WINNOW, Correllation, KNN/Hyperspace, or Bit Entropy ( or by other means- it's all programmable). [Less]
Clover.ETL is an Open Source, Java based ETL framework which can be used to transform structured data. While using JAVA technology it allows for platform independence. It can be used standalone - as a server application or can be embedded in other applications as a library.
pChart is a PHP class oriented framework designed to create aliased charts. Data can be retrieved from SQL queries, CSV files, or manually provided.
The iBATIS Data Mapper framework makes it easier to use a database with Java and .NET applications. iBATIS couples objects with stored procedures or SQL statements using a XML descriptor. Simplicity is the biggest advantage of the iBATIS Data Mapper over object relational mapping tools.
Orange is a component-based data mining software. It includes a range of data visualization, exploration, preprocessing and modelling techniques. It can be used through a nice and intuitive user interface or, for more advanced users, as a module for Python programming language.
GGobi is an open source visualization program for exploring high-dimensional data. It provides highly dynamic and interactive graphics such as tours, as well as familiar graphics such as the scatterplot, barchart and parallel coordinates plots. Plots are interactive and linked with brushing and identification.
A set of data analysis tools.
GNU ddrescue is a data recovery tool. It copies data from one file or block device (hard disc, cdrom, etc) to another, trying hard to rescue data in case of read errors.
The basic operation of ddrescue is fully automatic. That is, you don't have
... [More]
to wait for an error, stop the program, read the log, run it in reverse mode, etc.
If you use the logfile feature of ddrescue, the data is rescued very efficiently (only the needed blocks are read). Also you can interrupt the rescue at any time and resume it later at the same point. [Less]
XmlCatalog is an application written for Windows operating systems. It is designed to help you maintain and organize a catalog of your computer media collection, including CD-ROMs, DVDs, diskettes, removable drives, hard drives, or any other present
... [More]
or future storage media. For the catalog database XML is used and all data is stored in UTF-8 format. XmlCatalog application is written in C++. It is free and open source software for everyone. [Less]
RDV is a visualization environment for scientific and engineering data. Built on top of the RBNB dynamic data server, it supports local and remote data sources and can visualize numeric, video, image, and audio data in a number of different ways.
A minimalistic data analysis application
Magic Rescue scans a block device for file types it knows how to recover and calls an external program to extract them. It looks at "magic bytes" in file contents, so it can be used both as an undelete utility and for recovering a corrupted drive or partition. As long as the file data is there, it will find it.
Free Entity Relationship and Reverse Engineering Tool