Andreas/mtg_python_deckbuilder

Fork 0

mirror of https://github.com/mwisnowski/mtg_python_deckbuilder.git synced 2025-12-16 15:40:12 +01:00

matt 84749da214 docs: refresh docker and readme guides

2025-10-02 16:28:19 -07:00

8.2 KiB

Raw Blame History

Theme Catalog Advanced Guide

Additional details for developers and power users working with the theme catalog, editorial tooling, and diagnostics.

HTMX API endpoints
Caching, diagnostics, and metrics
Governance principles
Operational tooling
Editorial pipeline
Validation and schema tooling

HTMX API endpoints

The upcoming theme picker UI is powered by two FastAPI endpoints.

`GET /themes/api/themes`

Parameters:

q: substring search across theme names and synergies.
archetype: filter by deck_archetype.
bucket: popularity bucket (Very Common, Common, Uncommon, Niche, Rare).
colors: comma-separated color initials (e.g. G,W).
limit / offset: pagination (limit defaults to 50, max 200).
diagnostics=1: surfaces has_fallback_description and editorial_quality (requires WEB_THEME_PICKER_DIAGNOSTICS=1).

The response includes count, the filtered items, and next_offset for subsequent requests. Diagnostic mode adds extra telemetry fields.

`GET /themes/api/theme/{id}`

Parameters:

uncapped=1: (diagnostics) returns uncapped_synergies, combining curated, enforced, and inferred sets.
diagnostics=1: exposes editorial metadata such as editorial_quality and has_fallback_description.

The payload merges curated data with editorial artifacts (example_cards, example_commanders, etc.) and respects the same diagnostic feature flag.

Caching, diagnostics, and metrics

Responses include an ETag header derived from catalog metadata so consumers can perform conditional GETs.
/themes/status reports freshness and stale indicators; /themes/refresh (POST) triggers a background rebuild.
When WEB_THEME_PICKER_DIAGNOSTICS=1 is set, the app records:
- Filter cache hits/misses and duration (X-ThemeCatalog-Filter-Duration-ms).
- Preview cache metrics (/themes/metrics exposes counts, hit rates, TTL, and average build time).
Skeleton loaders ship with the HTMX fragments to keep perceived latency low.

Governance principles

To keep the catalog healthy, the project follows a lightweight governance checklist:

Minimum examples – target at least two example cards and one commander per established theme.
Deterministic preview assembly – curated examples first, then role-based samples (payoff/enabler/support/wildcard), then placeholders if needed.
Splash relax policy – four- and five-color commanders may include a single off-color enabler with a small penalty, preventing over-pruning.
Popularity buckets are advisory – they guide filters and UI hints but never directly influence scoring.
Taxonomy expansion bar – new high-level archetypes require a distinct pattern, at least eight representative cards, and no overlap with existing themes.
Editorial quality tiers – optional editorial_quality: draft|reviewed|final helps prioritize review passes.
Deterministic sampling – seeds derive from theme|commander hashes; scoring code should emit reasons[] to explain decisions and remain regression-test friendly.

See docs/theme_taxonomy_rationale.md for the underlying rationale and roadmap.

Operational tooling

Refreshing catalogs

Primary builder: python code/scripts/build_theme_catalog.py
Options:
- --limit N: preview a subset without overwriting canonical outputs (unless --allow-limit-write).
- --output path: write to an alternate path; suppresses YAML backfill to avoid mutating tracked files.
- --backfill-yaml or EDITORIAL_BACKFILL_YAML=1: fill missing descriptions and popularity buckets in YAML files.
- --force-backfill-yaml: overwrite existing description/popularity fields.
- EDITORIAL_SEED=<int>: force a deterministic ordering when heuristics use randomness.
- EDITORIAL_AGGRESSIVE_FILL=1: pad sparse themes with inferred synergies.
- EDITORIAL_POP_BOUNDARIES="a,b,c,d": tune popularity thresholds.
- EDITORIAL_POP_EXPORT=1: emit theme_popularity_metrics.json summaries.

Snapshotting taxonomy

python -m code.scripts.snapshot_taxonomy writes logs/taxonomy_snapshots/taxonomy_<timestamp>.json with a SHA-256 hash. Identical content is skipped unless you supply --force. Use snapshots before experimenting with taxonomy-aware sampling.

Adaptive splash penalty experiments

Set SPLASH_ADAPTIVE=1 to scale off-color enabler penalties based on commander color count. Tune with SPLASH_ADAPTIVE_SCALE (e.g. 1:1.0,2:1.0,3:1.0,4:0.6,5:0.35). Analytics aggregate both static and adaptive reasons for comparison.

Editorial pipeline

Script summary

code/scripts/generate_theme_editorial_suggestions.py
- Proposes example_cards, example_commanders, and synergy_commanders using card CSVs and tagging heuristics.
- --augment-synergies can pad sparse synergies arrays prior to suggestion.
- --apply writes results; dry runs print suggestions for review.
code/scripts/lint_theme_editorial.py
- Validates annotation formats, min/max counts, and deduplication. Combine with environment toggles (EDITORIAL_REQUIRE_DESCRIPTION, EDITORIAL_REQUIRE_POPULARITY) for stricter gating.

Example configuration

# Dry run on the first 25 themes
python code/scripts/generate_theme_editorial_suggestions.py

# Apply across the catalog with augmentation and min example commanders set to 5
python code/scripts/generate_theme_editorial_suggestions.py --apply --augment-synergies --min-examples 5

# Lint results
python code/scripts/lint_theme_editorial.py

Editorial output depends on current CSV data. Expect ordering or composition changes after upstream dataset refreshes—treat full-catalog regeneration as an operational task and review diffs carefully.

Duplicate suppression controls

code/scripts/synergy_promote_fill.py can rebalance example cards:

python code/scripts/synergy_promote_fill.py --fill-example-cards --common-card-threshold 0.18 --print-dup-metrics

--common-card-threshold: filters cards appearing in more than the specified fraction of themes (default 0.18).
Use metrics output to tune thresholds so staple utility cards stay in check without removing legitimate thematic cards.

Coverage metrics and KPIs

EDITORIAL_INCLUDE_FALLBACK_SUMMARY=1 embeds a description_fallback_summary block in the generated catalog (generic_total, generic_plain, generic_pct, etc.).
Regression tests use these metrics to ratchet down generic descriptions over time.
Historical trends are appended to config/themes/description_fallback_history.jsonl for analysis.

Description mapping overrides

Customize automatic descriptions without editing code:

Add config/themes/description_mapping.yml with entries:

- triggers: ["sacrifice", "aristocrat"]
  description: "Leans on sacrifice loops and {SYNERGIES}."

The first matching trigger wins (case-insensitive substring search).
{SYNERGIES} expands to a short clause listing the top synergies when available, and disappears gracefully if not.
Internal defaults remain as fallbacks when the mapping file is absent.

Validation and schema tooling

Run validators to maintain catalog quality:

python code/scripts/validate_theme_catalog.py
python code/scripts/validate_theme_catalog.py --rebuild-pass
python code/scripts/validate_theme_catalog.py --schema
python code/scripts/validate_theme_catalog.py --yaml-schema
python code/scripts/validate_theme_catalog.py --strict-alias

Per-theme YAML files (under config/themes/catalog/) are tracked in source control. Keys such as metadata_info replace the legacy provenance; the validator treats missing migrations as warnings until the deprecation completes.

8.2 KiB Raw Blame History Unescape Escape