The data quality automation plugin for data teams. Experience data quality observability in your ELT/ETL pipeline that would usually take a year to build, in just a few hours.
Swiple is an automated data monitoring platform that provides a comprehensive solution for analytics and data engineering teams to seamlessly monitor and manage the quality of their data. The platform offers a range of features that are designed to ensure that data quality is maintained at all times, thus allowing teams to focus on other critical tasks. One of the key features that sets Swiple apart from other data monitoring platforms is its ability to provide real-time alerts in case of any data inconsistencies or anomalies.
This ensures that teams can quickly identify and address any issues, reducing the risk of data errors and improving overall data quality. Additionally, Swiple offers a user-friendly interface that allows teams to easily manage and monitor data across multiple sources, providing a centralized view of data quality across the organization.
With Swiple, teams can streamline their data monitoring processes, reduce manual effort, and ultimately improve the accuracy and reliability of their data.
Features
Easy setup
Zero config
Measure the data quality of a SQL query, table, or view.
Generate data expectations using Automated Data Profiling.
Schedule validations to run on any recurrence interval.
Automated Data Docs
Add Objectives / SLA's for your data.
Get notified when the quality of your data changes.
Notifications using Slack, Email, Microsoft Teams, OpsGenie, and PagerDuty
With Swiple, analytics and data engineering teams can resolve data quality issues before they impact mission-critical resources. The platform offers automated data analysis and profiling, scheduling and alerting to ensure data quality is maintained.
Supported Databases
MySQL
MariaDB
PostgreSQL
Redshift
BigQuery
Snowflake
Trino
Athena
Platforms
Windows
Linux
macOS
License
Copyright 2022 Swiple, Ltd.
The Source code in this Swiple repo is covered by the Elastic License 2.0.
OpenMetaData is a comprehensive platform that offers a range of functionalities, including data discovery, data lineage, data quality, observability, governance, and team collaboration. It is an open-source project that has gained immense popularity among companies across various industry verticals, thanks to its vibrant community and adoption.
OpenMetaData is built on
DQO is a powerful DataOps friendly data quality monitoring tool that is designed to help you monitor and maintain the quality of your data. With DQO, you get access to a wide range of customizable data quality checks and data quality dashboards that make it easy to keep an eye
Panel is an open-source Python library that lets you easily build powerful tools, dashboards, and complex applications entirely in Python. It has a batteries-included philosophy, putting the PyData ecosystem, powerful data tables and much more at your fingertips.
Features
High-level reactive APIs and lower-level callback based APIs ensure you can
Business Intelligence, commonly known as BI, is the process of collecting, analyzing, and presenting data to make informed business decisions. BI helps organizations to transform their raw data into meaningful insights that can drive their business strategies. BI provides a range of advantages to organizations, including improved decision-making, increased efficiency,
Voici is an exceptional tool that offers a unique way of generating static dashboards from Jupyter Notebooks. Notably, it can replace Voilà with ease. This is because it provides and supports most of Voilà's configuration options and commands, while also offering some unique features of its own.
One of the
Fityk [fi:tik] is an open-source program for data processing and nonlinear curve fitting, that works for Windows, Linux and macOS.
Usage
It is primarily used:
* by scientists who analyse data from powder diffraction, chromatography, photoluminescence and photoelectron spectroscopy, infrared and Raman spectroscopy, and other experimental techniques,
* to fit peaks
Magda is a data catalog system that will provide a single place where all of an organization's data can be cataloged, enriched, searched, tracked and prioritized - whether big or small, internally or externally sourced, available as files, databases or APIs. Magda is designed specifically around the concept of federation
What is DuckDB?
DuckDB is a relational (table-oriented) DBMS that supports the Structured Query Language (SQL).
DuckDB is designed to support analytical query workloads, also known as Online analytical processing (OLAP). These workloads are characterized by complex, relatively long-running queries that process significant portions of the stored dataset, for example