A Unified AI System For Data Quality Control and DataOps Management in Regulated Environments
By: Devender Saini , Bhavika Jain , Nitish Ujjwal and more
In regulated domains such as finance, the integrity and governance of data pipelines are critical - yet existing systems treat data quality control (QC) as an isolated preprocessing step rather than a first-class system component. We present a unified AI-driven Data QC and DataOps Management framework that embeds rule-based, statistical, and AI-based QC methods into a continuous, governed layer spanning ingestion, model pipelines, and downstream applications. Our architecture integrates open-source tools with custom modules for profiling, audit logging, breach handling, configuration-driven policies, and dynamic remediation. We demonstrate deployment in a production-grade financial setup: handling streaming and tabular data across multiple asset classes and transaction streams, with configurable thresholds, cloud-native storage interfaces, and automated alerts. We show empirical gains in anomaly detection recall, reduction of manual remediation effort, and improved auditability and traceability in high-throughput data workflows. By treating QC as a system concern rather than an afterthought, our framework provides a foundation for trustworthy, scalable, and compliant AI pipelines in regulated environments.
Similar Papers
DataOps-driven CI/CD for analytics repositories
Systems and Control
Makes data analysis more reliable and easier to check.
Design and Evaluation of a Scalable Data Pipeline for AI-Driven Air Quality Monitoring in Low-Resource Settings
Software Engineering
Cleans and shares air pollution data from many places.
Bench-2-CoP: Can We Trust Benchmarking for EU AI Compliance?
Artificial Intelligence
Tests AI for dangers like losing control.