Posted 18 days ago by kasper
We've released yet another version of MetaModel, namely version 1.1.6.
This release contains very few changes to the 1.1.5 release:
A convenience method was added to the Query class: select(FunctionType, Column).
Upgrading the Apache POI version in MetaModel introduced a few bugs that we did not discover in the 1.1.5 milestone. In 1.1.6 we fixed these bugs and significantly improved the unit testing of this part of the code to prevent new bugs from emerging.
We hope you enjoy this release and excuse the hectic release schedule. The aforementioned bug fixes were critical, and we hope that you appreciate the quick response from the community.
Posted 23 days ago by kasper
We have just released the newest version of MetaModel, 1.1.5. This release is a minor release which means no API changes, but a few upgrades in terms of performance, flexibility and ease of distribution (full list):
The most important upgrade has been to CSV performance. We encountered a bug when querying this type of datastore that caused the whole DataSet to be held in memory while it was in use. This code has been refactored so that rows are now streamed as expected, keeping the door open for very large CSV files.
A minor change to the column naming scheme has been implemented for the Excel-based DataContexts: if the first row of a spreadsheet contains only blank fields, we automatically assign the names "[column 1]", "[column 2]" etc.
The downloadable zip or tar.gz file now contains a "MetaModel-1.1.5-all.jar" file, an assembled jar containing the classes of all MetaModel modules (core, csv, jdbc, excel etc.). This should substantially ease deployment of the framework.
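To illustrate the Excel column naming rule described above, here is a minimal Java sketch. It is not MetaModel's actual implementation: the class ColumnNamingSketch and the method resolveColumnNames are hypothetical names, and falling back to the first row's values as column names when that row is not blank is an assumption made for the example.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical illustration only; none of these names exist in MetaModel.
final class ColumnNamingSketch {

    // Returns generated names "[column 1]", "[column 2]", ... when every
    // cell in the first row is blank (null, empty or whitespace only).
    // Otherwise the first row's values are returned unchanged (assumption).
    static List<String> resolveColumnNames(List<String> firstRow) {
        boolean allBlank = true;
        for (String cell : firstRow) {
            if (cell != null && !cell.trim().isEmpty()) {
                allBlank = false;
                break;
            }
        }
        if (!allBlank) {
            return new ArrayList<>(firstRow);
        }
        List<String> names = new ArrayList<>();
        for (int i = 1; i <= firstRow.size(); i++) {
            names.add("[column " + i + "]");
        }
        return names;
    }

    public static void main(String[] args) {
        // A first row with only blank cells gets generated names.
        System.out.println(resolveColumnNames(Arrays.asList("", " ", null)));
        // A first row with real values is kept as-is in this sketch.
        System.out.println(resolveColumnNames(Arrays.asList("id", "name")));
    }
}
```

Note that the generated names are 1-based, matching the "[column 1]" example in the announcement.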
We hope you enjoy the new release of MetaModel and that you keep providing the valuable feedback that drives its development.
Posted 3 months ago by kasper
We're happy to announce the release of DataCleaner version 1.5.1. This is a minor release, but it nevertheless contains a few nice features - especially for users who are enjoying the exporting features that were introduced in 1.5:
An additional HTML export format has been added to the built-in export formats (usable when exporting Profiler results in the desktop app and when executing the runjob command-line tool).
The export format can now be selected directly in the desktop app.
Four new measures were added to the String Analysis profile: avg. chars and max/min/avg white spaces.
The new version of DataCleaner is (as always) available for free on the downloads page. Feedback from users is also greatly appreciated, for example:
Fill out our online user survey, or
Post your comments and questions at our discussion forum.
We hope that you all enjoy DataCleaner 1.5.1.
Posted 4 months ago by kasper
"Finally!" one might say. And this is definitely what is going through my head as I write this news item. Finally, DataCleaner 1.5 has been released! Once again the effort to bring about the best open source data quality solution is bearing fruit.
The new release is definitely one of the most significant in the history of DataCleaner. The overall goal of the release has been to step out of the shadows of the "small tools" pool and mark DataCleaner as an enterprise-ready application for profiling and validating datastores of all kinds - in scheduled mode, on servers and in an intuitive desktop environment.
For those of you with an interest in every little detail about this release, please feel free to review the complete list of changes - for everyone else, here's the recap:
Change of license to LGPL.
Multi-threaded execution of Profiler and Validator.
Command line (batch) execution of DataCleaner tasks.
More elaborate status information during profiler and validator execution.
New profile: Date mask matcher.
New profile: Regex matcher.
Load regex from the online RegexSwap repository.
Automatic download and install of popular database drivers.
More file types supported (.dat, .txt).
XML file support improved (.xml).
Memory improvements in Time analysis profile.
Improved logging when running profiling and validation.
Information schema provided for file-based datastores.
Lazy-loading of columns in datastore-tree.
We hope you enjoy the new DataCleaner 1.5! Now go over and download it right away.
Posted 5 months ago by kasper
Things are starting to shape up for the big release of DataCleaner 1.5. We are starting off with a bit of excitement in the data quality community.
Probably the most dedicated online magazine about data quality, Data Quality Pro, has launched a series of articles about profiling, validating and comparing data with DataCleaner. So far an introductory tutorial (including a complete and realistic example data set) and a background article/interview have been published:
Learn how to profile and validate data (for free) using DataCleaner
Interview with Kasper Sørensen, creator of DataCleaner
We hope that you will enjoy the articles, and we thank Data Quality Pro for their great interest in our community.