23 tables + alembic_version covering the v1 data model:
- Auth/RBAC (8): users, groups, permissions, user_groups, group_permissions,
invitations, invitation_groups, refresh_tokens.
- MITRE (4): mitre_tactics, mitre_techniques, mitre_subtechniques + the
technique↔tactic many-to-many.
- Templates (4): test_templates, test_template_mitre_tags (3 nullable FKs +
CHECK exactly_one_mitre_fk), scenario_templates, scenario_template_tests
(UUID PK + UNIQUE(scenario_id, position) so a test can appear at multiple
positions).
- Missions (6): missions, mission_members, mission_scenarios, mission_tests,
mission_test_mitre_tags (deliberately denormalised — copies external_id +
name + url, no FK to mitre_* — so a re-sync of the catalogue can't purge
historical tags), mission_categories.
- Evidence/settings/notifications (5): evidence_files, settings (JSONB
value), detection_levels, notifications.
SQLAlchemy 2.x with Mapped[]/mapped_column(), pk_/fk_/ck_/uq_/ix_ naming
convention. Reusable mixins (UuidPkMixin, TimestampMixin, SoftDeleteMixin —
no auto __table_args__ since classes silently clobber the mixin's).
Soft delete: deleted_at + partial indexes ix_<table>_active WHERE deleted_at
IS NULL on 9 tables (users, groups, test_templates, scenario_templates,
missions, mission_scenarios, mission_tests, mission_categories,
evidence_files). Notifications gets ix_..._unread WHERE read_at IS NULL.
CHECK constraints for status / state / opsec_level / mitre_kind enums.
New API endpoint GET /api/v1/diag/db: returns alembic_revision (short hash)
and the public-schema table_count. 503 with {"reachable": false} on a DB
outage. Database card on the SPA home consumes it.
Test stage in backend/Dockerfile (--target test): runtime + dev extras +
tests/. New make test-api spins an ephemeral pytest container against the
live DB on the compose network. backend/tests/test_schema.py: 8 integration
tests (tables, FK pairs, CHECK constraints, partial indexes, alembic-at-head,
negative INSERT proving the exactly_one_mitre_fk CHECK fires).
e2e/tests/m1-db.spec.ts: 4 Playwright tests covering the diag endpoint
contract + the Database card + footer/roadmap labels.
DoD: make clean && make up && make migrate → 23 tables, 32 FKs, 9 CHECKs,
make test-api → 9 passed, make e2e → 12 passed.
Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
94 lines
3.4 KiB
Python
94 lines
3.4 KiB
Python
"""Operational diagnostics. No auth in v1 (M0/M1 only expose non-sensitive
|
|
counts and the current Alembic revision).
|
|
|
|
The `/diag/reset` endpoint is **test-only** — it requires `APP_ENV=test` and
|
|
is the bedrock of the e2e suite (clean DB + freshly minted install token).
|
|
"""
|
|
|
|
from __future__ import annotations
|
|
|
|
import logging
|
|
|
|
from flask import Blueprint, abort, jsonify
|
|
from sqlalchemy import text
|
|
from sqlalchemy.exc import SQLAlchemyError
|
|
|
|
from app.core.config import settings
|
|
from app.core.install_token import regenerate_install_token
|
|
from app.db.session import get_engine
|
|
|
|
bp = Blueprint("diag", __name__, url_prefix="/diag")
|
|
log = logging.getLogger("metamorph.diag")
|
|
|
|
|
|
@bp.get("/db")
|
|
def db_diag():
|
|
"""Return the Alembic revision and the count of public-schema tables."""
|
|
try:
|
|
with get_engine().connect() as conn:
|
|
revision = conn.execute(
|
|
text("SELECT version_num FROM alembic_version")
|
|
).scalar()
|
|
table_count = conn.execute(
|
|
text(
|
|
"SELECT count(*) FROM information_schema.tables "
|
|
"WHERE table_schema='public' AND table_type='BASE TABLE'"
|
|
)
|
|
).scalar_one()
|
|
except SQLAlchemyError as e:
|
|
log.warning("metamorph.diag.db_unreachable", extra={"error": str(e)})
|
|
return jsonify({"reachable": False, "error": "database_unreachable"}), 503
|
|
|
|
return jsonify(
|
|
{
|
|
"reachable": True,
|
|
"alembic_revision": revision,
|
|
"table_count": int(table_count),
|
|
}
|
|
)
|
|
|
|
|
|
@bp.post("/reset")
|
|
def reset_test_state():
|
|
"""TEST-ONLY: wipe users/auth tables and mint a fresh install token.
|
|
|
|
Refuses unless `APP_ENV=test`. Used by the Playwright suite to start each
|
|
auth scenario from a deterministic state.
|
|
"""
|
|
# NOTE: this endpoint is the test-suite reset hook. Allowed in `dev` too so
|
|
# the e2e suite can run against a normal `make up` stack, but in dev it is
|
|
# destructive — equivalent to `make clean` for the auth tables. Production
|
|
# (APP_ENV=prod/staging) is locked out.
|
|
if settings.APP_ENV not in ("dev", "test"):
|
|
abort(403, description="diag/reset is only available in dev/test")
|
|
if settings.APP_ENV == "dev":
|
|
log.warning("metamorph.diag.reset_in_dev_environment")
|
|
|
|
try:
|
|
with get_engine().begin() as conn:
|
|
conn.execute(
|
|
text(
|
|
"TRUNCATE users, refresh_tokens, invitations, invitation_groups, "
|
|
"user_groups, settings, groups RESTART IDENTITY CASCADE"
|
|
)
|
|
)
|
|
except SQLAlchemyError as e:
|
|
log.error("metamorph.diag.reset_failed", extra={"error": str(e)})
|
|
return jsonify({"reset": False, "error": "database_error"}), 500
|
|
|
|
token = regenerate_install_token()
|
|
|
|
# Clear the in-memory rate-limit counters so the e2e suite that follows can
|
|
# log in repeatedly without hitting `/auth/login`/`/auth/refresh` limits.
|
|
# The limiter uses `memory://` in dev (cf. `app/core/rate_limit.py`).
|
|
try:
|
|
from app.core.rate_limit import limiter # noqa: PLC0415 — avoid import cycle
|
|
|
|
if limiter.enabled:
|
|
limiter.reset()
|
|
except Exception as e: # noqa: BLE001
|
|
log.warning("metamorph.diag.rate_limit_reset_failed", extra={"error": str(e)})
|
|
|
|
log.warning("metamorph.diag.reset_completed")
|
|
return jsonify({"reset": True, "install_token": token})
|