
DeltaMax V2.0 is an advanced data quality and anomaly detection framework developed by KatalystStreet. It enables organizations to perform multi-period data validation, statistical drift detection, and machine learning–based anomaly identification.
This Azure-integrated version of DeltaMax is designed to work seamlessly with the Microsoft Azure ecosystem, replacing Google Cloud components with Azure-native services such as:
DeltaMax V2.0 provides a structured workflow that includes:
Generates Historical (H), Previous (A), and Current (B) datasets to enable drift and stability analysis.
Combines anomaly signals, drift metrics, and data validation checks into a normalized, interpretable score.
Performs statistical tests, KNN-based anomaly detection, PSI analysis, and structural validations.
Automatically uploads processed datasets to Azure Blob Storage and loads them into Synapse tables for scalable analytics.
Enables real-time dashboards and insights using Power BI connected to Synapse Analytics.
The DeltaMax Azure workflow ensures:
All outputs generated by DeltaMax scripts are:
All DeltaMax V2 scripts are already integrated with:
Once the scripts are executed, outputs are:
Multi-Period Data Generation (H-A-B) Framework
This step generates synthetic datasets from January to August, where January–June act as Historical (H), July is the Previous period (A), and August is the Current period (B). The August dataset includes controlled variations to simulate drift and anomalies for H–A–B risk evaluation.
This step scans all generated monthly datasets (January–August) and produces a consolidated structural summary, including row counts, column counts, storage size, and month-to-month entity churn.
This step combines all monthly datasets into a single structured H–A–B master file to streamline downstream drift analysis, anomaly detection, and Trust Score computation.
Executes anomaly detection, data integrity validation, and multi-period drift analysis (H–A–B) to quantify dataset stability and risk exposure.
It aggregates anomaly health, drift health, and business rule compliance into a weighted Trust Score (0–100), providing a single interpretable risk metric for the current month.
Runs Isolation Forest and IQR-based outlier detection across H–A–B datasets to identify global and statistical anomalies in entity behavior.
Measures variance shifts across H–A–B periods to detect distribution instability and structural data changes.
Performs Welch's T-test across periods to detect statistically significant mean shifts between Historical, Previous, and Current datasets
Identifies abnormal missingness patterns in the current dataset compared to historical benchmarks.
Calculates PSI scores to quantify distribution drift between H–A–B datasets and measure population stability.
Detects numeric precision and decimal formatting inconsistencies between historical and current datasets.
Validates string field consistency by detecting abnormal length deviations across structured text attributes.
Ensures entity uniqueness and detects duplicate or conflicting business identifiers across H–A–B datasets.
Note: This visualization is an example of how the data can be perceived using our generated datasets on Azure Analytic Studio. It is intended to illustrate potential insights and patterns rather than represent finalized outputs.
For customized dashboards and tailored reporting, please reach out to KatalystStreet to help visualize your outputs effectively.
