Skip to main content

Anofox Tabular

What is Anofox Tabular?

Anofox Tabular is the data validation engine for enterprise workflows. Validate business data, detect PII, enforce quality rules, and detect anomalies directly in DuckDB - before it reaches your forecasting and analytics pipelines.

Version 0.2.0

Functions now use the anofox_tab_* prefix (e.g., anofox_tab_email_validate). Short aliases without the prefix are also available (e.g., email_validate).

Key Features

  • 78 SQL Functions across 9 modules
  • Validation - Email, phone, address, VAT, and financial validation
  • PII Detection - Detect and mask 17 types of personal data (SSN, credit cards, IBAN, names, etc.)
  • Data Quality Metrics - Nullness, distinctness, freshness, and consistency
  • Anomaly Detection - Z-Score, IQR, Isolation Forest, DBSCAN, and OutlierTree (explainable)
  • Financial Operations - Currency conversion, amount validation, and arithmetic
  • Dataset Comparison - Hash-based and join-based table diffing
DocumentationDescription
InstallationSetup and prerequisites
Function FinderFind the right function for your task
ValidationEmail, phone, address, VAT validation
FinancialMoney and currency functions
QualityData profiling and metrics
AnomalyOutlier detection methods
PII DetectionDetect and mask personal data
OperationsDataset comparison and diffing

Module Summary

ModuleFunctionsDescription
Email Validation3RFC 5322, DNS, SMTP verification
Address Parsing4International address normalization (libpostal)
Phone Numbers9Google libphonenumber integration
Money & Currency17Multi-currency operations, 10 currencies
VAT Validation10European VAT compliance, 29 countries
PII Detection20Detect & mask 17 PII types
Quality Metrics8Volume, nulls, freshness, schema checks
Anomaly Detection5Isolation Forest, DBSCAN, OutlierTree
Data Diffing2Table comparison, migration validation

Basic Usage

-- Load the extension
LOAD anofox_tabular;

-- Validate emails, VAT IDs, and monetary amounts
SELECT
customer_id,
email,
vat_id,
amount,
anofox_tab_email_validate(email, 'dns') as email_valid,
anofox_tab_vat_is_valid(vat_id) as vat_valid,
anofox_tab_money_is_positive(amount) as amount_valid
FROM customers
WHERE anofox_tab_email_validate(email, 'dns') IS TRUE
AND anofox_tab_vat_is_valid(vat_id) IS TRUE;

-- Detect PII in customer notes
SELECT
customer_id,
anofox_tab_pii_detect(notes) as pii_found,
anofox_tab_pii_mask(notes, 'redact') as safe_notes
FROM customers
WHERE anofox_tab_pii_contains(notes);

Only valid records pass all validation gates. Invalid data is flagged.

Why Anofox Tabular?

In the AnoFox unified workflow, Tabular is the "Audit & Guard" stage. Before you forecast demand or analyze trends, your data must be trustworthy.

Poor data upstream breaks everything downstream:

  • Invalid emails = Failed notifications
  • Incorrect VAT IDs = Legal compliance failures
  • Exposed PII = Privacy violations and GDPR fines
  • Anomalous values = Biased forecasts and wrong coefficients

Tabular stops bad data at the gate.

🍪 Cookie Settings