DataCleaner is a data quality analysis application and a solution platform for DQ solutions. Its core is a strong data profiling engine, which is extensible and thereby adds data cleansing, transformations, enrichment, deduplication, matching and merging. Website:



Related Projects

MetaModel - a common domain model, query-engine and optimizer for different kinds of datastores.

The MetaModel is a project created for maximum reuse of a SQL 99 compliant domain model of the database domain. It contains classes that represent the structure of a database (schemas, tables, columns, relationships) and interaction with the database (queries, datasets, rows). In short, it is a model for modelling data in databases and other datastores (hence the word "metamodel"). MetaModel is being used in projects such as DataCleaner, ChronicleDroid and Tab

Deducorrect - An R package for deductive correction

An R package for solving common (data-entry) mistakes in numerical data. This package checks numerical records against linear equality constraints (balance restrictions). It can find and solve typing errors (interchanged digits) with correctTypos, rounding errors with correctRounding, and sign errors and variable swaps, taking account of possible masking by rounding errors, with correctSigns. To install deducorrect in R, run install.packages("deducorrect"). For more information see the deducorrect vignette.
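The idea of checking a record against a balance restriction and deducing a repair can be sketched in a few lines. The Python below is only an illustration of the concept, not the deducorrect implementation or its API: the function names, the record layout, and the rule turnover - cost - profit = 0 are all assumptions made for this example, and it only tries single-field sign flips and adjacent digit swaps on integer values.

```python
def violates(record, coeffs, tol=0):
    """Return True if the record breaks the balance rule sum(coeff * value) == 0."""
    return abs(sum(c * record[k] for k, c in coeffs.items())) > tol


def digit_swaps(value):
    """Yield every integer obtained by swapping two adjacent digits of value."""
    sign = -1 if value < 0 else 1
    digits = list(str(abs(value)))
    for i in range(len(digits) - 1):
        swapped = digits.copy()
        swapped[i], swapped[i + 1] = swapped[i + 1], swapped[i]
        yield sign * int("".join(swapped))


def suggest_fixes(record, coeffs):
    """List single-field changes (sign flip or digit swap) that restore the balance."""
    fixes = []
    for key in coeffs:
        # Candidate repairs for this field: flipped sign, or two adjacent digits swapped.
        candidates = {-record[key]} | set(digit_swaps(record[key]))
        candidates.discard(record[key])
        for cand in sorted(candidates):
            trial = dict(record, **{key: cand})
            if not violates(trial, coeffs):
                fixes.append((key, cand))
    return fixes


# Hypothetical balance restriction: turnover - cost - profit == 0
RULE = {"turnover": 1, "cost": -1, "profit": -1}

typo_record = {"turnover": 1000, "cost": 370, "profit": 270}   # 730 entered as 370
sign_record = {"turnover": 1000, "cost": 730, "profit": -270}  # sign entered wrongly

typo_fixes = suggest_fixes(typo_record, RULE)
sign_fixes = suggest_fixes(sign_record, RULE)
```

A record is only a good candidate for automatic repair when exactly one fix restores the balance. If several fixes are returned, the error cannot be deduced uniquely from this rule alone.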

data_cleaning - Miscellaneous scripts to clean data.

DataCleaner - Converts data to data frames for later inspection

A library that can find corrupt rows in data tables using data mining algorithms.