DataCleaner is a data quality analysis application and a platform for data quality (DQ) solutions. Its core is a strong, extensible data profiling engine, to which data cleansing, transformations, enrichment, deduplication, matching and merging can be added.

Features

  • Profile and analyze your database within minutes
  • Access almost any datastore: Oracle, MySQL, PostgreSQL, MS SQL Server, MongoDB, CUBRID, CSV files, Excel spreadsheets, dBase and more
  • Discover patterns in your textual data with the Pattern Finder
  • Find out which values occur most often with the Value Distribution profile
  • Cleanse your contact details with name and address validations
  • Detect duplicates using fuzzy logic and configurable weights and thresholds
  • Merge your duplicates and create a single version of the truth
  • Write data back to relational databases, CSV files, Excel spreadsheets or MongoDB databases
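
To make three of the ideas in the list above concrete (value distribution, pattern discovery, and fuzzy duplicate detection with weights and a threshold), here is a minimal Python sketch. It is not DataCleaner's API or its actual algorithms: the function names, the character-masking scheme, the use of difflib for string similarity, and the default threshold of 0.85 are all assumptions made for illustration.

```python
from collections import Counter
from difflib import SequenceMatcher


def value_distribution(values):
    """Count how often each value occurs, most frequent first."""
    return Counter(values).most_common()


def pattern_of(text):
    """Mask uppercase letters as 'A', other letters as 'a', digits as '9'.

    Any other character (spaces, punctuation) is kept as-is, so values
    with the same shape map to the same pattern string.
    """
    masked = []
    for ch in text:
        if ch.isdigit():
            masked.append("9")
        elif ch.isalpha():
            masked.append("A" if ch.isupper() else "a")
        else:
            masked.append(ch)
    return "".join(masked)


def weighted_similarity(rec_a, rec_b, weights):
    """Weighted average of per-field string similarity, in [0, 1].

    `weights` maps field names to relative importance (hypothetical
    configuration, not a DataCleaner setting).
    """
    total = sum(weights.values())
    score = 0.0
    for field, weight in weights.items():
        a = rec_a[field].lower()
        b = rec_b[field].lower()
        score += weight * SequenceMatcher(None, a, b).ratio()
    return score / total


def is_duplicate(rec_a, rec_b, weights, threshold=0.85):
    """Flag two records as likely duplicates when the score meets the threshold."""
    return weighted_similarity(rec_a, rec_b, weights) >= threshold


if __name__ == "__main__":
    # Illustrative data only.
    print(value_distribution(["NL", "DK", "NL", "US", "NL"]))
    print(Counter(pattern_of(v) for v in ["AB-1234", "CD-5678", "ef 90"]).most_common())
    a = {"name": "Jon Smith", "city": "Oslo"}
    b = {"name": "John Smith", "city": "Oslo"}
    print(is_duplicate(a, b, weights={"name": 2, "city": 1}))
```

Raising the weight of a field makes disagreements in that field count more, and raising the threshold trades fewer false matches for more missed duplicates.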

Platforms

  • Windows
  • Linux
  • macOS

License

GNU Library or Lesser General Public License version 3.0 (LGPLv3)

Resources & Downloads

DataCleaner
Download DataCleaner for free. Data quality analysis, profiling, cleansing, duplicate detection and more.
The premier open source Data Quality solution | DataCleaner
Open source software for data quality, data profiling, data warehousing, data wrangling, master data management, business intelligence and governance.