Data Governance · 100% In-house

Data governance, fully in-house.

A white-label, self-hosted catalog with an AI assistant that explores your metadata and queries your live data — only what you expose, with the model you choose. Built for teams that take privacy seriously.

Book a demo See how it works

Self-hosted White-label Bring your own LLM Zero data egress

app.yourcompany.com / datagov

Total Reports

1,284

Total Measures

3,792

Cleanup

186

Lineage

What's inside

Everything your data team already wished they had.

A full governance toolkit, in one private deployment. No SaaS. No data leaving your perimeter.

Unified Catalog

One searchable inventory of every table, column, measure and report across your entire stack.

Data Dictionary

Definitions, owners, stewards, tags and SLAs on every asset — finally in one place.

Lineage Graph

Cross-tool dependency map. Click any node to see what breaks downstream if it changes.

Cleanup & Impact Insights

Surface unused columns and measures, plus the highest-impact assets your team should care about most.

AI Assistant

BYO LLM Live queries

Chat over your catalog, lineage and SQL — and now go one step further: query live PowerBI measures and BigQuery tables directly. Read-only, tool-grounded, zero hallucination.

Pipeline Alerts

Slack notifications on every sync — run metrics, durations and failure traces, so engineers catch issues fast.

Privacy & security

Your data never
leaves your perimeter.

datagov is delivered as a fully white-label, in-house deployment. There is no shared SaaS tenant, no third-party data plane, no telemetry phoning home.

You own the infrastructure. You own the database. You own the model. We just build the software that runs on top.

Dockerized · Postgres · Nginx · runs anywhere

Deployed in your VPC / on-prem

Your cloud account, your network, your auth. Zero external dependencies for the core product.

Bring your own model

Point the assistant at OpenAI, Azure OpenAI, Anthropic, or a fully local LLM. The model only ever sees the metadata you whitelist.

Read-only by default

Live query tools are SELECT-only, dry-run first, byte-capped, and row-capped.

Per-org feature flags

Toggle live tools per source. Your admins decide exactly what the assistant is allowed to touch.

AI assistant

An assistant that knows and queries your data.

Grounded in your catalog. Wired to your live BI & warehouse — read-only, byte-capped. Speaks your data definitions, then fetches the actual numbers.

Governance Assistant

Connected to your catalog

Online

Where is revenue_per_booking defined?

Found 1 measure in semantic model finance_core:

Name	Owner	Used by
revenue_per_booking	finance-team	14 reports

Defined in mart_finance.bookings. Want me to show the SQL?

And what's its live value last week?

LIVE · PowerBI read-only DAX

revenue_per_booking — last week

Period	Value	vs prev
2026-04-13 – 04-19	€84.20	+1.97%

Live data queried from PowerBI · dataset finance_core

Built-in guardrails

Zero hallucination

Always grounded in catalog tools. If the answer isn't in your metadata, the assistant says so.
Read-only by default

Live SQL is restricted to SELECT/WITH, dry-run, byte-capped.
Asks before guessing

Ambiguous prompts trigger a clarifying question — never a fabricated answer.
Per-org feature flags

Admins toggle which live tools the assistant can use, per source.

How it works

From zero to governed in four steps.

Connect

Add your sources with credentials you control. Your data never leaves your network.

Sync

A scheduled background pipeline extracts and transforms metadata automatically.

Govern

Use the dictionary, lineage and insights to clean up, document and track impact.

Discover

Ask the assistant anything about your data — it answers from your catalog, not the internet.