RegreSQL

Because 'It Works on My Database' Won't Fly in the Postmortem

Know when your query results change. Catch regressions before they reach production.

Version 2.0 available now

How It Works

# Initialize in your project
regresql init postgres://localhost/mydb

# Build reproducible test database snapshot
regresql snapshot build

# Add a query
regresql add my/first/query.sql

# Edit generated plan

# Generate performance baseline(s)
regresql baseline

# Generate expected results for your queries
regresql update

# Run regression tests
regresql test

Features

Cross-version planner A/B

Run one corpus against two PostgreSQL builds and get a scoreboard of plan, buffer, and result differences. A trust filter injects one build's statistics into the other and drops coin-flip plans, so ANALYZE sampling noise is never reported as a regression.

Test against production's reality

Inject production statistics, or use the pg_regresql extension, so EXPLAIN on a laptop against a tiny table sees production's row counts and picks production's plan. No production data copied.

Gate on measured actuals

EXPLAIN ANALYZE checks that gate on what actually happened: q-error cardinality, buffers, temp/spill to disk, and tuple-flow. Improvement detection flags queries that got faster so baselines stay current.

Plan-quality policy kits

Plan drift reported as a diff (index scan to seq scan, join strategy changed) with info/warning/error severity, critical_tables enforcement, and shareable policy packs pulled in with extends:.

Correctness oracles, no reference database

metamorphic flips a result-preserving optimization and confirms the rows don't move. admit keeps only queries whose result is stable across plans. Both catch wrong-results bugs on a single database, with no second engine to diff against.

Query Result Testing

Compare query outputs against known baselines. Structured, typed diffs highlight exactly what changed, with column/order/tolerance controls.

Database Snapshots

Reproducible test environments with schema-hash validation, tagging, and cross-version diff. Captures server settings, planner GUCs, and statistics.

Fixtures via fixturize

Declarative data generation through fixturize, with foreign-key detection and 15+ generators, built straight into snapshot build.

CI/CD Integration

Output formats for JUnit XML, pgTAP, GitHub Actions, and JSON. DATABASE_URL overrides config, non-zero exit on failures.

Migration Testing

Run queries before and after a migration and report the diff. Local SQL migrations or third-party tools like goose.

Installation

Homebrew

brew tap boringsql/regresql && brew install regresql

go install github.com/boringsql/regresql@latest

Why RegreSQL?

SQL queries are the number one cause of database problems. Yet most teams treat SQL as a second-class citizen when it comes to testing. Unit tests mock the database. Integration tests check application logic. Nobody tests the queries themselves.

And the problem is getting worse. Developers who learned just enough SQL, ORMs generating queries you have never seen, and now LLMs writing SQL at scale. More SQL written by things that never see production, at a faster rate of change. The guardrail is not better prompts. It is regression testing.

"It works on my database" is the new "it works on my machine." We solved environment drift for code with Docker and feature flags. But the same query hitting different data produces different results. Your laptop has 100 rows with uniform distribution. Production has 10 million rows with heavily skewed data. The PostgreSQL planner makes different choices in each environment, and code review cannot catch that.

What RegreSQL tests

RegreSQL tests two things:

Logical correctness. Does the query return the right data? RegreSQL compares query outputs against known baselines and generates diffs that show exactly what changed.
Performance correctness. Does it return them efficiently? Not by measuring timing, which varies wildly across machines. RegreSQL tracks buffers (pages accessed during execution), which are deterministic regardless of hardware. Same query, same data, same buffers, whether you run it on a Hetzner ARM box or an M4 Pro laptop. If your query suddenly reads 10% more buffers, that is a regression.

Production query plans without production data

PostgreSQL's planner picks execution strategies based on table statistics, available indexes, memory settings, and data patterns. Your test database with 100 rows will never produce the same plan as production with millions.

RegreSQL solves this with portable statistics. Using PostgreSQL 17's pg_restore_relation_stats and pg_restore_attribute_stats, you can give your test database production-like statistics without copying any actual data. The planner sees the same row counts, distributions, and correlations it would in production, and makes the same choices.