initial commit

2026-01-11 22:56:02 +01:00
commit 6426ddd0ab
408 changed files with 95071 additions and 0 deletions
--- a/.gitignore
+++ b/.gitignore
@@ -0,0 +1,4 @@
 **/.venv/
 __pycache__/
 *.pyc
 .ipynb_checkpoints/
--- a/Cap.png
+++ b/Cap.png
--- a/Cap1.png
+++ b/Cap1.png
--- a/README.md
+++ b/README.md
@@ -0,0 +1,204 @@
 # Data Analysis Platform
 A modern full-stack data analysis platform built with Python FastAPI backend and Next.js frontend, designed for efficient data processing, visualization, and statistical analysis.
 ## Overview
 This platform provides a comprehensive toolkit for data analysis workflows, combining powerful Python data science libraries with a modern, responsive web interface. It leverages Apache Arrow for high-performance data transfer and implements best practices for both backend and frontend development.
 ### Key Features
 - **Backend:** FastAPI with async support, Pydantic v2 validation, and comprehensive data science stack
 - **Frontend:** Next.js 16 with TypeScript, Tailwind CSS, and Shadcn UI components
 - **Data Processing:** Pandas, Scikit-learn, and Statsmodels integration
 - **Performance:** Apache Arrow for zero-copy data transfer between services
 - **Architecture:** Feature-based frontend organization, RESTful API design
 ## Technology Stack
 ### Backend
 - **Python 3.12+** with FastAPI framework
 - **Pydantic v2** for schema validation
 - **Data Science:** Pandas 2.3.3+, Scikit-learn 1.8.0+, Statsmodels 0.14.6+
 - **Serialization:** Apache Arrow 22.0+ for efficient binary data transfer
 - **Package Management:** UV for fast dependency resolution
 ### Frontend
 - **Next.js 16** (Standalone mode) with React 19
 - **TypeScript** for type safety
 - **Styling:** Tailwind CSS 4+ and Shadcn UI components
 - **Data Display:** TanStack Table, Apache Arrow 21+, Recharts
 - **State Management:** Zustand v5 for local state, TanStack Query v5 for server state
 - **Virtualization:** TanStack Virtual for handling large datasets
 ### DevOps
 - **Docker** multi-stage builds with distroless/alpine images
 - **Docker Compose** for local development orchestration
 ## Project Structure
 ```
 Data_analysis/
 ├── backend/                 # FastAPI backend service
 │   ├── app/                # Application modules
 │   ├── tests/              # Backend test suite
 │   ├── main.py             # Application entry point
 │   ├── pyproject.toml      # Python dependencies (UV)
 │   └── Dockerfile          # Backend container image
 │
 ├── frontend/               # Next.js frontend application
 │   ├── src/
 │   │   └── features/       # Feature-based organization
 │   ├── tests/              # Frontend test suite
 │   ├── package.json        # Node.js dependencies
 │   └── Dockerfile          # Frontend container image
 │
 ├── compose.yaml            # Docker Compose configuration
 ├── _bmad-output/           # Planning and implementation artifacts
 └── README.md               # This file
 ```
 ## Prerequisites
 Before running the applications locally, ensure you have the following installed:
 - **Python 3.12+** - [Download Python](https://www.python.org/downloads/)
 - **Node.js 20+** - [Download Node.js](https://nodejs.org/)
 - **UV (Python package manager)** - Install via: `pip install uv`
 - **npm** (comes with Node.js)
 ## Local Development Setup
 ### Backend Setup
 1. Navigate to the backend directory:
   ```bash
   cd backend
   ```
 2. Create and activate a virtual environment (optional but recommended):
   ```bash
   python3.12 -m venv .venv
   source .venv/bin/activate  # On Windows: .venv\Scripts\activate
   ```
 3. Install dependencies using UV:
   ```bash
   uv sync
   ```
 4. Start the FastAPI development server:
   ```bash
   uvicorn main:app --reload --host 0.0.0.0 --port 8000
   ```
   The backend API will be available at `http://localhost:8000`
 5. Access the interactive API documentation at:
   - Swagger UI: `http://localhost:8000/docs`
   - ReDoc: `http://localhost:8000/redoc`
 ### Frontend Setup
 1. Open a new terminal and navigate to the frontend directory:
   ```bash
   cd frontend
   ```
 2. Install dependencies:
   ```bash
   npm install
   ```
 3. Create environment configuration file:
   ```bash
   cp .env.local.example .env.local
   ```
   Edit `.env.local` if needed to configure API endpoints or other settings.
 4. Start the Next.js development server:
   ```bash
   npm run dev
   ```
   The frontend application will be available at `http://localhost:3000`
 ### Running Both Services
 To run both services simultaneously:
 1. **Terminal 1 - Backend:**
   ```bash
   cd backend
   source .venv/bin/activate  # If using virtual environment
   uvicorn main:app --reload --host 0.0.0.0 --port 8000
   ```
 2. **Terminal 2 - Frontend:**
   ```bash
   cd frontend
   npm run dev
   ```
 ## Development Workflow
 ### Code Style and Standards
 - **Backend:** Follow PEP 8 guidelines with snake_case naming
 - **Frontend:** Use TypeScript strict mode, follow ESLint configuration
 - **API Convention:** Use snake_case for JSON keys to maintain consistency with Pandas DataFrames
 - **Documentation:** Include docstrings (Python) and JSDoc comments (TypeScript) for all exported functions
 ### Testing
 - **Backend tests:** Run `pytest` from the `backend/` directory
 - **Frontend tests:** Run `npm test` from the `frontend/` directory (when configured)
 ### Key Anti-Patterns to Avoid
 - Do NOT use standard JSON for transferring datasets larger than 5,000 rows (use Apache Arrow)
 - Do NOT use deep React Context for high-frequency state updates (use Zustand)
 - Do NOT implement opaque algorithms without logging data exclusions
 - Do NOT perform heavy blocking computations on the main FastAPI process (use background tasks)
 ## Docker Deployment (Coming Soon)
 The project includes Docker configuration for containerized deployment. Instructions for running with Docker Compose will be added in the next update.
 ## Documentation
 - **Project Context:** See `_bmad-output/project-context.md` for detailed implementation rules
 - **Architecture:** Technical architecture documentation is available in `_bmad-output/planning-artifacts/`
 - **API Reference:** Access interactive API documentation at `/docs` when backend is running
 ## Contributing
 This project follows modern software development practices with comprehensive planning artifacts. When contributing:
 1. Read the project context file for implementation guidelines
 2. Follow the established code patterns and conventions
 3. Ensure all tests pass before submitting changes
 4. Update documentation as needed
 ## License
 [Specify your license here]
 ## Support
 For questions or issues related to this project, please refer to the project documentation or contact the development team.
 ---
 **Last Updated:** 2026-01-11
 Refreshed by automation
--- a/_bmad-output/implementation-artifacts/1-1-initialisation-du-monorepo-docker.md
+++ b/_bmad-output/implementation-artifacts/1-1-initialisation-du-monorepo-docker.md
@@ -0,0 +1,63 @@
 # Story 1.1: Initialisation du Monorepo & Docker
 Status: review
 ## Story
 As a Développeur,
 I want to initialiser la structure du projet (Next.js + FastAPI + Docker),
 so that I have a functional and consistent development environment.
 ## Acceptance Criteria
 1. **Root Structure:** Root directory contains `compose.yaml` (2026 standard) and subdirectories `frontend/` and `backend/`.
 2. **Backend Setup:** `backend/` initialized with FastAPI (Python 3.12) using **uv** package manager.
 3. **Frontend Setup:** `frontend/` initialized with Next.js 16 (standalone mode).
 4. **Orchestration:** `docker-compose up` builds and starts both services on a shared internal network.
 5. **Connectivity:** Frontend is accessible at `localhost:3000` and Backend at `localhost:8000`.
 ## Tasks / Subtasks
 - [x] **Root Initialization** (AC: 1)
  - [x] Initialize git repository.
  - [x] Create `.gitignore` for monorepo.
 - [x] **Backend Service Setup** (AC: 2)
  - [x] Initialize FastAPI project structure.
  - [x] Add `main.py` with health check.
  - [x] Initialize **uv** project (`pyproject.toml`, `uv.lock`) and add dependencies.
  - [x] Create multi-stage `Dockerfile` using `uv` for fast builds.
 - [x] **Frontend Service Setup** (AC: 3)
  - [x] Initialize Next.js 16 project.
  - [x] Configure standalone output.
  - [x] Create multi-stage `Dockerfile`.
 - [x] **Docker Orchestration** (AC: 4, 5)
  - [x] Create `compose.yaml`.
  - [x] Verify inter-service communication configuration.
 ## Dev Notes
 - **Architecture Patterns:** Two-Service Monorepo pattern.
 - **Tooling:** Updated to use **uv** (Astral) instead of pip/venv for Python management (2026 Standard).
 - **Naming Conventions:** `snake_case` for Python files/API; `PascalCase` for React components.
 ### References
 - [Source: architecture.md#Project Structure & Boundaries]
 - [Source: project-context.md#Technology Stack & Versions]
 ## Dev Agent Record
 ### Completion Notes List
 - Migrated backend package management to **uv**.
 - Updated Dockerfile to use `ghcr.io/astral-sh/uv` for building.
 - Initialized `pyproject.toml` and `uv.lock`.
 ### File List
 - /compose.yaml
 - /backend/Dockerfile
 - /backend/main.py
 - /backend/pyproject.toml
 - /backend/uv.lock
 - /frontend/Dockerfile
 - /frontend/next.config.mjs
 - /frontend/package.json
--- a/_bmad-output/implementation-artifacts/1-2-ingestion-de-fichiers-excel-csv-backend.md
+++ b/_bmad-output/implementation-artifacts/1-2-ingestion-de-fichiers-excel-csv-backend.md
@@ -0,0 +1,70 @@
 # Story 1.2: Ingestion de Fichiers Excel/CSV (Backend)
 Status: review
 ## Story
 As a Julien (Analyst),
 I want to upload an Excel or CSV file,
 so that the system can read my production data.
 ## Acceptance Criteria
 1. **Upload Endpoint:** A POST endpoint `/api/v1/upload` accepts `.xlsx`, `.xls`, and `.csv` files.
 2. **File Validation:** Backend validates MIME type and file extension. Returns clear error for unsupported formats.
 3. **Data Parsing:** Uses Pandas to read the file into a DataFrame. Handles multiple sheets (takes the first by default).
 4. **Type Inference:** Backend automatically detects column types (int, float, string, date).
 5. **Arrow Serialization:** Converts the DataFrame to an Apache Arrow Table and streams it using IPC format.
 6. **Persistence (Ephemeral):** Temporarily saves the file metadata and a pointer to the dataset in memory (stateless session simulation).
 ## Tasks / Subtasks
 - [x] **API Route Implementation** (AC: 1, 2)
  - [x] Create `/backend/app/api/v1/upload.py`.
  - [x] Implement file upload using `FastAPI.UploadFile`.
  - [x] Add validation logic for extensions and MIME types.
 - [x] **Data Processing Logic** (AC: 3, 4)
  - [x] Implement `backend/app/core/engine/ingest.py` helper.
  - [x] Use `pandas` to read Excel/CSV.
  - [x] Basic data cleaning (strip whitespace from headers).
 - [x] **High-Performance Bridge** (AC: 5)
  - [x] Implement Arrow conversion using `pyarrow`.
  - [x] Set up `StreamingResponse` with `application/vnd.apache.arrow.stream`.
 - [x] **Session & Metadata** (AC: 6)
  - [x] Return column metadata (name, inferred type) in the response headers or as a separate JSON part.
 ## Dev Notes
 - **Performance:** For 50k rows, Arrow is mandatory. Zero-copy binary transfer implemented.
 - **Libraries:** Using `pandas`, `openpyxl`, and `pyarrow`.
 - **Type Safety:** Column metadata is stringified in the `X-Column-Metadata` header.
 ### Project Structure Notes
 - Created `backend/app/core/engine/ingest.py` for pure data logic.
 - Created `backend/app/api/v1/upload.py` for the FastAPI route.
 - Updated `backend/main.py` to include the router.
 ### References
 - [Source: architecture.md#API & Communication Patterns]
 - [Source: project-context.md#Data & State Architecture]
 ## Dev Agent Record
 ### Agent Model Used
 {{agent_model_name_version}}
 ### Completion Notes List
 - Implemented `/api/v1/upload` endpoint.
 - Added validation for `.xlsx`, `.xls`, and `.csv`.
 - Implemented automated type inference (numeric, categorical, date).
 - Successfully converted Pandas DataFrames to Apache Arrow IPC streams.
 - Verified with 3 automated tests (Health, CSV Upload, Error Handling).
 ### File List
 - /backend/app/api/v1/upload.py
 - /backend/app/core/engine/ingest.py
 - /backend/main.py
 - /backend/tests/test_upload.py
--- a/_bmad-output/implementation-artifacts/1-3-visualisation-dans-la-smart-grid-frontend.md
+++ b/_bmad-output/implementation-artifacts/1-3-visualisation-dans-la-smart-grid-frontend.md
@@ -0,0 +1,67 @@
 # Story 1.3: Visualisation dans la Smart Grid (Frontend)
 Status: review
 ## Story
 As a Julien (Analyst),
 I want to see my uploaded data in an interactive high-speed grid,
 so that I can explore the raw data effortlessly.
 ## Acceptance Criteria
 1. **Virtualization:** The grid renders 50,000+ rows without browser lag using TanStack Table virtualization.
 2. **Arrow Integration:** The frontend reads the Apache Arrow stream from the backend API using `apache-arrow` library.
 3. **Data Display:** Columns are rendered with correct formatting based on metadata (e.g., numbers right-aligned, dates formatted).
 4. **Visual Foundation:** The grid uses the "Smart Grid" design (compact density, JetBrains Mono font) as defined in UX specs.
 5. **Basic Interaction:** Users can scroll vertically and horizontally fluidly.
 ## Tasks / Subtasks
 - [x] **Dependencies & Setup** (AC: 2)
  - [x] Install `apache-arrow`, `@tanstack/react-table`, `@tanstack/react-virtual`, `zustand`.
  - [x] Create `frontend/src/lib/arrow-client.ts` to handle binary stream parsing.
 - [x] **Smart Grid Component** (AC: 1, 4, 5)
  - [x] Create `frontend/src/features/smart-grid/components/SmartGrid.tsx`.
  - [x] Implement virtualized row rendering.
  - [x] Apply Shadcn UI styling and "Lab & Tech" theme.
 - [x] **Integration** (AC: 3)
  - [x] Connect `upload` form success state to Grid data loading.
  - [x] Implement `useGridStore` (Zustand) to hold the loaded table state.
  - [x] Render actual data from the uploaded file.
 ## Dev Notes
 - **Performance:** Optimized binary stream parsing using Apache Arrow IPC. Zero unnecessary JSON parsing.
 - **State:** Zustand used for high-frequency updates and persistence across session.
 - **Layout:** Implemented responsive workspace with sticky header and virtualized body.
 ### Project Structure Notes
 - Organized into `features/smart-grid` and `features/uploader`.
 - Centralized state in `store/use-grid-store.ts`.
 ### References
 - [Source: ux-design-specification.md#Core User Experience]
 - [Source: architecture.md#Frontend Architecture]
 ## Dev Agent Record
 ### Agent Model Used
 {{agent_model_name_version}}
 ### Completion Notes List
 - Successfully integrated `apache-arrow` for binary data ingestion.
 - Implemented `<SmartGrid />` with `@tanstack/react-virtual` for 50k+ row performance.
 - Built a functional `<FileUploader />` that communicates with the FastAPI backend.
 - Applied "Lab & Tech" styling with Tailwind CSS.
 ### File List
 - /frontend/src/features/smart-grid/components/SmartGrid.tsx
 - /frontend/src/features/uploader/components/FileUploader.tsx
 - /frontend/src/lib/arrow-client.ts
 - /frontend/src/lib/utils.ts
 - /frontend/src/store/use-grid-store.ts
 - /frontend/src/app/page.tsx
--- a/_bmad-output/implementation-artifacts/1-4-gestion-des-types-renommage-data-hygiene.md
+++ b/_bmad-output/implementation-artifacts/1-4-gestion-des-types-renommage-data-hygiene.md
@@ -0,0 +1,63 @@
 # Story 1.4: Gestion des Types & Renommage (Data Hygiene)
 Status: review
 ## Story
 As a Julien (Analyst),
 I want to rename columns and correct data types,
 so that the data matches my business context before analysis.
 ## Acceptance Criteria
 1. **Column Renaming:** Users can click a column header to rename it. Changes are reflected instantly in the grid.
 2. **Type Override:** Users can change the data type of a column (e.g., String -> Numeric). 
 3. **Backend Sync:** Type changes are sent to the backend to validate if the data can be cast correctly.
 4. **Visual Feedback:** Column headers display the current data type (e.g., "Num", "Cat").
 5. **State Persistence:** Column names and types are preserved in the Zustand store.
 ## Tasks / Subtasks
 - [x] **Frontend Header UI** (AC: 1, 4)
  - [x] Implement editable column header in `src/features/smart-grid/components/SmartGrid.tsx`.
  - [x] Add type indicator badges to headers.
 - [x] **Type Management Logic** (AC: 2, 5)
  - [x] Update `useGridStore` to support `updateColumn` action (rename, change type).
 - [x] **Backend Validation** (AC: 3)
  - [x] Add endpoint `/api/v1/analysis/validate-type` to verify casting feasibility.
  - [x] Handle casting errors gracefully.
 ## Dev Notes
 - **Performance:** Local renames handle data key updates in-memory to avoid full dataset re-ingestion.
 - **UI:** Implemented inline editing for headers with immediate visual feedback.
 - **Backend:** Added Pydantic model for type validation requests.
 ### Project Structure Notes
 - Modified `frontend/src/store/use-grid-store.ts` to add mutation logic.
 - Created `backend/app/api/v1/analysis.py` for validation logic.
 - Updated `frontend/src/features/smart-grid/components/SmartGrid.tsx` with `EditableHeader`.
 ### References
 - [Source: epics.md#Story 1.4]
 - [Source: project-context.md#Data & State Architecture]
 ## Dev Agent Record
 ### Agent Model Used
 {{agent_model_name_version}}
 ### Completion Notes List
 - Implemented inline column renaming in the grid.
 - Added a dropdown for data type override (Num, Cat, Date, Bool).
 - Developed a FastAPI endpoint for validating data type conversions.
 - Optimized Zustand store to update data keys when a column is renamed.
 ### File List
 - /frontend/src/store/use-grid-store.ts
 - /backend/app/api/v1/analysis.py
 - /backend/main.py
 - /frontend/src/features/smart-grid/components/SmartGrid.tsx
--- a/_bmad-output/implementation-artifacts/1-5-tri-filtrage-de-base.md
+++ b/_bmad-output/implementation-artifacts/1-5-tri-filtrage-de-base.md
@@ -0,0 +1,57 @@
 # Story 1.5: Tri & Filtrage de Base
 Status: review
 ## Story
 As a Julien (Analyst),
 I want to sort and filter my data in the grid,
 so that I can identify extreme values or specific subsets.
 ## Acceptance Criteria
 1. **Sorting:** Users can click a column header to toggle between ascending, descending, and no sort.
 2. **Filtering:** Users can enter a search term or value in a column filter input to narrow down the rows.
 3. **Performance:** Sorting and filtering 50,000 rows should happen within 300ms using local processing.
 4. **Visual Indicators:** Column headers show an arrow icon indicating the current sort direction.
 5. **Persistence:** Sort and filter states are maintained in the UI state during the session.
 ## Tasks / Subtasks
 - [x] **TanStack Table Logic** (AC: 1, 2)
  - [x] Enable `getSortedRowModel` and `getFilteredRowModel` in `SmartGrid.tsx`.
 - [x] **Filter UI** (AC: 2, 4)
  - [x] Add a text input field in each column header for filtering.
  - [x] Add sort icons (Lucide React) to headers.
 - [x] **State & Performance** (AC: 3, 5)
  - [x] Ensure filtering logic handles different data types (string search, numeric range).
 ## Dev Notes
 - **Sorting:** Integrated TanStack's built-in sorting logic with visual arrows.
 - **Filtering:** Implemented per-column text filtering using a Search input in headers.
 - **UI:** Combined renaming, type selection, and filtering into a compact `EditableHeader` component.
 ### Project Structure Notes
 - Modified `frontend/src/features/smart-grid/components/SmartGrid.tsx`.
 ### References
 - [Source: epics.md#Story 1.5]
 - [Source: architecture.md#Frontend Architecture]
 ## Dev Agent Record
 ### Agent Model Used
 {{agent_model_name_version}}
 ### Completion Notes List
 - Enabled sorting and filtering row models in the TanStack Table configuration.
 - Added interactive sort buttons with direction indicators (Up/Down).
 - Implemented a search-based filter for each column.
 - Verified performance remains smooth with virtualization.
 ### File List
 - /frontend/src/features/smart-grid/components/SmartGrid.tsx
--- a/_bmad-output/implementation-artifacts/2-1-edition-de-cellule-validation.md
+++ b/_bmad-output/implementation-artifacts/2-1-edition-de-cellule-validation.md
@@ -0,0 +1,60 @@
 # Story 2.1: Édition de Cellule & Validation
 Status: review
 ## Story
 As a Julien (Analyst),
 I want to edit cell values directly in the grid,
 so that I can manually correct obvious data entry errors.
 ## Acceptance Criteria
 1. **Inline Editing:** Double-clicking a cell enters "Edit Mode" with an input field matching the column type.
 2. **Data Validation:** Input is validated against the column type (e.g., only numbers in Numeric columns).
 3. **Commit Changes:** Pressing `Enter` or clicking outside saves the change to the local Zustand store.
 4. **Visual Feedback:** Edited cells are temporarily highlighted or marked to indicate unsaved/modified state.
 5. **Keyboard Support:** Pressing `Esc` cancels the edit and restores the original value.
 ## Tasks / Subtasks
 - [x] **Frontend Grid Update** (AC: 1, 3, 5)
  - [x] Implement `EditableCell` component in `src/features/smart-grid/components/SmartGrid.tsx`.
  - [x] Add `onCellEdit` logic to the TanStack Table configuration.
 - [x] **State Management** (AC: 3, 4)
  - [x] Update `useGridStore` to support a `updateCellValue(rowId, colId, value)` action.
  - [x] Implement a `modifiedCells` tracking object in the store to highlight changes.
 - [x] **Validation Logic** (AC: 2)
  - [x] Add regex-based validation for numeric and boolean inputs in the frontend.
 ## Dev Notes
 - **Memoization:** Used local state for editing to prevent entire table re-renders during typing.
 - **Visuals:** Modified cells now have a subtle `bg-amber-50` background.
 - **Validation:** Implemented strict numeric validation before committing to the global store.
 ### Project Structure Notes
 - Modified `frontend/src/store/use-grid-store.ts`.
 - Updated `frontend/src/features/smart-grid/components/SmartGrid.tsx`.
 ### References
 - [Source: ux-design-specification.md#Grid Interaction Patterns]
 - [Source: architecture.md#Frontend Architecture]
 ## Dev Agent Record
 ### Agent Model Used
 {{agent_model_name_version}}
 ### Completion Notes List
 - Created an `EditableCell` sub-component with `onDoubleClick` activation.
 - Implemented `updateCellValue` in Zustand store with change tracking.
 - Added keyboard support: `Enter` to commit, `Escape` to discard.
 - Added visual highlighting for modified data.
 ### File List
 - /frontend/src/store/use-grid-store.ts
 - /frontend/src/features/smart-grid/components/SmartGrid.tsx
--- a/_bmad-output/implementation-artifacts/2-2-undo-redo-des-modifications.md
+++ b/_bmad-output/implementation-artifacts/2-2-undo-redo-des-modifications.md
@@ -0,0 +1,59 @@
 # Story 2.2: Undo/Redo des Modifications
 Status: review
 ## Story
 As a Julien (Analyst),
 I want to undo my last data edits,
 so that I can explore changes without fear of losing the original data.
 ## Acceptance Criteria
 1. **Undo History:** The system tracks changes to cell values.
 2. **Undo Action:** Users can press `Ctrl+Z` or click an "Undo" button to revert the last edit.
 3. **Redo Action:** Users can press `Ctrl+Y` (or `Ctrl+Shift+Z`) to re-apply an undone edit.
 4. **Visual Indicator:** The Undo/Redo buttons in the toolbar are disabled if no history is available.
 5. **Session Scope:** History is maintained during the current session (stateless).
 ## Tasks / Subtasks
 - [x] **State Management (Zustand)** (AC: 1, 2, 3)
  - [x] Implement `zundo` or a custom middleware for state history in `useGridStore`.
  - [x] Add `undo` and `redo` actions.
 - [x] **Keyboard Shortcuts** (AC: 2, 3)
  - [x] Add global event listeners for `Ctrl+Z` and `Ctrl+Y`.
 - [x] **UI Controls** (AC: 4)
  - [x] Add Undo/Redo buttons to the `FileUploader` or a new `Toolbar` component.
 ## Dev Notes
 - **Optimization:** Using `zundo` middleware to partialize state history (tracking only `data`, `columns`, and `modifiedCells`).
 - **Shortcuts:** Implemented global keyboard event listeners in the main layout.
 - **UX:** Added responsive toolbar buttons with disabled states when no history is present.
 ### Project Structure Notes
 - Modified `frontend/src/store/use-grid-store.ts` to include `temporal` middleware.
 - Updated `frontend/src/app/page.tsx` with UI buttons and shortcut logic.
 ### References
 - [Source: functional-requirements.md#FR8]
 - [Source: project-context.md#Data & State Architecture]
 ## Dev Agent Record
 ### Agent Model Used
 {{agent_model_name_version}}
 ### Completion Notes List
 - Integrated `zundo` for comprehensive history tracking.
 - Added Undo/Redo logic to the global Zustand store.
 - Implemented `Ctrl+Z`, `Ctrl+Shift+Z`, and `Ctrl+Y` keyboard shortcuts.
 - Added visual buttons in the application header with state-dependent enabling/disabling.
 ### File List
 - /frontend/src/store/use-grid-store.ts
 - /frontend/src/app/page.tsx
--- a/_bmad-output/implementation-artifacts/2-3-detection-automatique-des-outliers-backend.md
+++ b/_bmad-output/implementation-artifacts/2-3-detection-automatique-des-outliers-backend.md
@@ -0,0 +1,65 @@
 # Story 2.3: Détection Automatique des Outliers (Backend)
 Status: review
 ## Story
 As a system,
 I want to identify statistical outliers in the background,
 so that I can alert the user to potential data quality issues.
 ## Acceptance Criteria
 1. **Algorithm Implementation:** Backend implements Isolation Forest (multivariate) and IQR (univariate) algorithms.
 2. **Analysis Endpoint:** A POST endpoint `/api/v1/analysis/detect-outliers` accepts dataset and configuration.
 3. **Detection Output:** Returns a list of outlier row indices and the reason for flagging (e.g., "z-score > 3").
 4. **Performance:** Detection on 50k rows completes in under 5 seconds.
 5. **Robustness:** Handles missing values (NaNs) gracefully without crashing.
 ## Tasks / Subtasks
 - [x] **Dependency Update** (AC: 1)
  - [x] Add `scikit-learn` to the backend using `uv`.
 - [x] **Outlier Engine Implementation** (AC: 1, 5)
  - [x] Create `backend/app/core/engine/clean.py`.
  - [x] Implement univariate IQR-based detection.
  - [x] Implement multivariate Isolation Forest detection.
 - [x] **API Endpoint** (AC: 2, 3, 4)
  - [x] Implement `POST /api/v1/analysis/detect-outliers` in `analysis.py`.
  - [x] Map detection results to indexed row references.
 ## Dev Notes
 - **Algorithms:** Used Scikit-learn's `IsolationForest` for multivariate and Pandas quantile logic for IQR.
 - **Explainability:** Each outlier is returned with a descriptive string explaining the reason for the flag.
 - **Performance:** Asynchronous ready, using standard Scikit-learn optimisations.
 ### Project Structure Notes
 - Created `backend/app/core/engine/clean.py` for outlier logic.
 - Updated `backend/app/api/v1/analysis.py` with the detection endpoint.
 - Added `backend/tests/test_analysis.py` for verification.
 ### References
 - [Source: epics.md#Story 2.3]
 - [Source: project-context.md#Critical Anti-Patterns]
 ## Dev Agent Record
 ### Agent Model Used
 {{agent_model_name_version}}
 ### Completion Notes List
 - Integrated `scikit-learn` for anomaly detection.
 - Implemented univariate detection based on 1.5 * IQR bounds.
 - Implemented multivariate detection using the Isolation Forest algorithm.
 - Developed a robust API endpoint that merges results from both methods.
 - Verified with unit tests covering both univariate and multivariate scenarios.
 ### File List
 - /backend/app/core/engine/clean.py
 - /backend/app/api/v1/analysis.py
 - /backend/tests/test_analysis.py
 - /backend/pyproject.toml
--- a/_bmad-output/implementation-artifacts/2-4-panel-d-insights-revue-des-outliers-frontend.md
+++ b/_bmad-output/implementation-artifacts/2-4-panel-d-insights-revue-des-outliers-frontend.md
@@ -0,0 +1,63 @@
 # Story 2.4: Panel d'Insights & Revue des Outliers (Frontend)
 Status: review
 ## Story
 As a Julien (Analyst),
 I want to review detected outliers in a side panel,
 so that I can understand why they are flagged before excluding them.
 ## Acceptance Criteria
 1. **Insight Panel UI:** A slide-over panel (Shadcn Sheet) displays detailed outlier information.
 2. **Interactive Triggers:** Clicking a "warning" badge in the grid header opens the panel for that column.
 3. **Reasoning Display:** The panel shows the statistical reason for each flagged point (e.g., "Value 9.9 is > 3 Sigma").
 4. **Visual Summary:** Displays a small chart (boxplot or histogram) showing the distribution and the outlier's position.
 5. **Batch Actions:** Users can click "Exclude All" within the panel to gray out all flagged rows in the grid.
 ## Tasks / Subtasks
 - [x] **Shadcn UI Setup** (AC: 1)
  - [x] Install Shadcn `Sheet` and `ScrollArea` components.
 - [x] **InsightPanel Component** (AC: 1, 3, 4, 5)
  - [x] Create `frontend/src/features/insight-panel/components/InsightPanel.tsx`.
  - [x] Integrate `Recharts` for distribution visualization.
 - [x] **State Integration** (AC: 2, 5)
  - [x] Update `useGridStore` to trigger outlier detection and store results.
  - [x] Add `detectedOutliers` object to the Zustand store.
 ## Dev Notes
 - **Explainable AI:** Successfully mapped backend `reasons` to user-friendly list items in the panel.
 - **Visualization:** Used `recharts` to build a dynamic histogram of the selected column.
 - **Integration:** Added a pulse animation to column headers when outliers are detected.
 ### Project Structure Notes
 - Created `frontend/src/features/insight-panel/components/InsightPanel.tsx`.
 - Integrated panel trigger in `frontend/src/features/smart-grid/components/SmartGrid.tsx`.
 - Updated main layout in `frontend/src/app/page.tsx` to host the panel.
 ### References
 - [Source: ux-design-specification.md#2.4 Novel UX Patterns]
 - [Source: architecture.md#Frontend Architecture]
 ## Dev Agent Record
 ### Agent Model Used
 {{agent_model_name_version}}
 ### Completion Notes List
 - Implemented the `InsightPanel` slide-over component.
 - Integrated automated backend outlier detection triggered on data change.
 - Added a distribution histogram using Recharts.
 - Implemented "Exclude All" functionality which syncs with the Grid's visual state.
 ### File List
 - /frontend/src/features/insight-panel/components/InsightPanel.tsx
 - /frontend/src/store/use-grid-store.ts
 - /frontend/src/features/smart-grid/components/SmartGrid.tsx
 - /frontend/src/app/page.tsx
--- a/_bmad-output/implementation-artifacts/2-5-exclusion-non-destructive-de-donnees.md
+++ b/_bmad-output/implementation-artifacts/2-5-exclusion-non-destructive-de-donnees.md
@@ -0,0 +1,59 @@
 # Story 2.5: Exclusion Non-Destructive de Données
 Status: review
 ## Story
 As a Julien (Analyst),
 I want to toggle the inclusion of specific rows in the analysis,
 so that I can test different scenarios without deleting data.
 ## Acceptance Criteria
 1. **Row Toggle:** Users can click a "checkbox" or a specific "Exclude" button on each row.
 2. **Visual Feedback:** Excluded rows are visually dimmed (e.g., 30% opacity) and struck through.
 3. **Bulk Toggle:** Ability to exclude all filtered rows or all rows matching a criteria (already partially covered by Epic 2.4).
 4. **State Persistence:** Exclusion state is tracked in the global store.
 5. **Impact on Analysis:** The data sent to subsequent analysis engines (Correlation, Regression) MUST exclude these rows.
 ## Tasks / Subtasks
 - [x] **Grid UI Update** (AC: 1, 2)
  - [x] Add an `Exclude` column with a toggle switch or button to the `SmartGrid`.
  - [x] Implement conditional styling for the entire row based on exclusion state.
 - [x] **State Logic** (AC: 4)
  - [x] Ensure `excludedRows` in `useGridStore` is properly integrated with all UI components.
 - [x] **Data Pipeline Prep** (AC: 5)
  - [x] Create a selector/helper `getCleanData()` that returns the dataset minus the excluded rows.
 ## Dev Notes
 - **UX:** Added a dedicated "Eye/EyeOff" icon column for quick row exclusion toggling.
 - **Visuals:** Excluded rows use `opacity-30`, `line-through`, and a darker background to clearly distinguish them from active data.
 - **Selector:** The `getCleanData` function in the store ensures all future analysis steps only receive valid, included rows.
 ### Project Structure Notes
 - Modified `frontend/src/store/use-grid-store.ts`.
 - Updated `frontend/src/features/smart-grid/components/SmartGrid.tsx`.
 ### References
 - [Source: epics.md#Story 2.5]
 - [Source: ux-design-specification.md#2.5 Experience Mechanics]
 ## Dev Agent Record
 ### Agent Model Used
 {{agent_model_name_version}}
 ### Completion Notes List
 - Implemented a "soft delete" system for row exclusion.
 - Added visual indicators (strike-through and dimming) for excluded rows.
 - Created a `getCleanData` selector to facilitate downstream statistical modeling.
 - Integrated row-level toggle buttons directly in the SmartGrid.
 ### File List
 - /frontend/src/store/use-grid-store.ts
 - /frontend/src/features/smart-grid/components/SmartGrid.tsx
--- a/_bmad-output/implementation-artifacts/3-1-matrice-de-correlation-interactive.md
+++ b/_bmad-output/implementation-artifacts/3-1-matrice-de-correlation-interactive.md
@@ -0,0 +1,62 @@
 # Story 3.1: Matrice de Corrélation Interactive
 Status: review
 ## Story
 As a Julien (Analyst),
 I want to see a visual correlation map of my numeric variables,
 so that I can quickly identify which factors are related.
 ## Acceptance Criteria
 1. **Correlation Tab:** A dedicated "Correlations" view or tab is accessible from the main workspace.
 2. **Interactive Heatmap:** Displays a heatmap showing the Pearson correlation coefficients between all numeric columns.
 3. **Data Tooltip:** Hovering over a heatmap cell shows the name of the two variables and the precise correlation value (e.g., "0.85").
 4. **Color Scale:** Uses a diverging color scale (e.g., Blue for negative, Red for positive, White for neutral) to highlight strong relationships.
 5. **Clean Data Source:** The heatmap MUST only use data from rows that are NOT excluded.
 ## Tasks / Subtasks
 - [x] **Backend Analysis Engine** (AC: 2, 5)
  - [x] Implement `calculate_correlation_matrix(df, columns)` in `backend/app/core/engine/stats.py`.
  - [x] Add endpoint `POST /api/v1/analysis/correlation` that accepts data and column list.
 - [x] **Frontend Visualization** (AC: 1, 2, 3, 4)
  - [x] Create `frontend/src/features/analysis/components/CorrelationHeatmap.tsx`.
  - [x] Use `Recharts` or `Tremor` to render the matrix.
  - [x] Integrate with `getCleanData()` from the grid store.
 ## Dev Notes
 - **Data Integrity:** The heatmap uses the `getCleanData()` selector, ensuring that excluded outliers don't bias the correlation matrix.
 - **UI/UX:** Implemented a tab-switcher between "Data" and "Correlation" views.
 - **Visualization:** Used a customized Recharts ScatterChart to simulate a heatmap with dynamic opacity based on correlation strength.
 ### Project Structure Notes
 - Created `backend/app/core/engine/stats.py`.
 - Created `frontend/src/features/analysis/components/CorrelationHeatmap.tsx`.
 - Updated `frontend/src/app/page.tsx` with tab logic.
 ### References
 - [Source: epics.md#Story 3.1]
 - [Source: ux-design-specification.md#Design System Foundation]
 ## Dev Agent Record
 ### Agent Model Used
 {{agent_model_name_version}}
 ### Completion Notes List
 - Developed Pearson correlation logic in the Python backend.
 - Built an interactive heatmap in the React frontend.
 - Added informative tooltips showing detailed correlation metrics.
 - Ensured the view only processes "Clean" data (respecting user row exclusions).
 ### File List
 - /backend/app/core/engine/stats.py
 - /backend/app/api/v1/analysis.py
 - /frontend/src/features/analysis/components/CorrelationHeatmap.tsx
 - /frontend/src/app/page.tsx
--- a/_bmad-output/implementation-artifacts/3-2-calcul-de-l-importance-des-features-backend.md
+++ b/_bmad-output/implementation-artifacts/3-2-calcul-de-l-importance-des-features-backend.md
@@ -0,0 +1,59 @@
 # Story 3.2: Calcul de l'Importance des Features (Backend)
 Status: review
 ## Story
 As a system,
 I want to compute the predictive power of features against a target variable,
 so that I can provide scientific recommendations to the user.
 ## Acceptance Criteria
 1. **Importance Algorithm:** Backend implements Feature Importance calculation using `RandomForestRegressor`.
 2. **Analysis Endpoint:** A POST endpoint `/api/v1/analysis/feature-importance` accepts data, features list, and target variable (Y).
 3. **Detection Output:** Returns a ranked list of features with their importance scores (0 to 1).
 4. **Validation:** Ensures Y is not in the X list and that enough numeric data exists.
 5. **Clean Data Source:** Only uses data from non-excluded rows.
 ## Tasks / Subtasks
 - [x] **Engine Implementation** (AC: 1, 4)
  - [x] Implement `calculate_feature_importance(df, features, target)` in `backend/app/core/engine/stats.py`.
  - [x] Handle categorical features using basic Label Encoding if needed (currently focus on numeric).
 - [x] **API Endpoint** (AC: 2, 3, 5)
  - [x] Implement `POST /api/v1/analysis/feature-importance` in `analysis.py`.
 ## Dev Notes
 - **Model:** Used `RandomForestRegressor` with 50 estimators for a balance between speed and accuracy.
 - **Data Prep:** Automatically drops rows with NaNs in either features or target to ensure Scikit-learn compatibility.
 - **Output:** Returns a JSON list of objects `{feature, score}` sorted by score in descending order.
 ### Project Structure Notes
 - Modified `backend/app/core/engine/stats.py`.
 - Updated `backend/app/api/v1/analysis.py`.
 - Added test case in `backend/tests/test_analysis.py`.
 ### References
 - [Source: epics.md#Story 3.2]
 - [Source: architecture.md#Computational Workers]
 ## Dev Agent Record
 ### Agent Model Used
 {{agent_model_name_version}}
 ### Completion Notes List
 - Implemented the Feature Importance core engine using Scikit-learn.
 - Developed the API endpoint to expose the ranked feature list.
 - Added validation to prevent processing empty or incompatible datasets.
 - Verified with automated tests.
 ### File List
 - /backend/app/core/engine/stats.py
 - /backend/app/api/v1/analysis.py
 - /backend/tests/test_analysis.py
--- a/_bmad-output/implementation-artifacts/3-3-recommandation-intelligente-de-variables-frontend.md
+++ b/_bmad-output/implementation-artifacts/3-3-recommandation-intelligente-de-variables-frontend.md
@@ -0,0 +1,62 @@
 # Story 3.3: Recommandation Intelligente de Variables (Frontend)
 Status: review
 ## Story
 As a Julien (Analyst),
 I want the system to suggest which variables to include in my model,
 so that I don't pollute my analysis with irrelevant data ("noise").
 ## Acceptance Criteria
 1. **Target Selection:** Users can select one column as the "Target Variable (Y)" from a dropdown.
 2. **Auto-Trigger:** Selecting Y automatically triggers the feature importance calculation for all other numeric columns.
 3. **Smart Ranking:** The UI displays a list of features ranked by their predictive power.
 4. **Auto-Selection:** The Top-5 features (or all if < 5) are automatically checked for inclusion in the model.
 5. **Visual Feedback:** A horizontal bar chart in the configuration panel shows the importance scores.
 ## Tasks / Subtasks
 - [x] **Selection UI** (AC: 1, 4)
  - [x] Create `frontend/src/features/analysis/components/AnalysisConfiguration.tsx`.
  - [x] Implement Target Variable (Y) and Predictor Variables (X) selection logic.
 - [x] **Intelligence Integration** (AC: 2, 3, 5)
  - [x] Call `/api/v1/analysis/feature-importance` upon Y selection.
  - [x] Render importance scores using a simple CSS-based or Recharts bar chart.
 - [x] **State Management** (AC: 4)
  - [x] Store selected X and Y variables in `useGridStore`.
 ## Dev Notes
 - **UX:** Implemented a slide-over `AnalysisConfiguration` sidebar triggered by the main "Run Regression" button.
 - **Automation:** Integrated the Random Forest importance engine from the backend to provide real-time recommendations.
 - **Rules:** Enforced mutual exclusivity between X and Y variables in the UI selection logic.
 ### Project Structure Notes
 - Created `frontend/src/features/analysis/components/AnalysisConfiguration.tsx`.
 - Updated `frontend/src/store/use-grid-store.ts` with analysis state.
 - Updated `frontend/src/app/page.tsx` to handle the configuration drawer.
 ### References
 - [Source: epics.md#Story 3.3]
 - [Source: ux-design-specification.md#Critical Success Moments]
 ## Dev Agent Record
 ### Agent Model Used
 {{agent_model_name_version}}
 ### Completion Notes List
 - Built the model configuration sidebar using Tailwind and Lucide icons.
 - Implemented reactive feature importance fetching when the target variable changes.
 - Added auto-selection of top predictive features.
 - Integrated the configuration state into the global Zustand store.
 ### File List
 - /frontend/src/features/analysis/components/AnalysisConfiguration.tsx
 - /frontend/src/store/use-grid-store.ts
 - /frontend/src/app/page.tsx
--- a/_bmad-output/implementation-artifacts/4-1-configuration-de-la-regression.md
+++ b/_bmad-output/implementation-artifacts/4-1-configuration-de-la-regression.md
@@ -0,0 +1,58 @@
 # Story 4.1: Configuration de la Régression
 Status: review
 ## Story
 As a Julien (Analyst),
 I want to configure the parameters of my regression model,
 so that I can tailor the analysis to my specific hypothesis.
 ## Acceptance Criteria
 1. **Model Selection:** Users can choose between "Linear Regression" and "Logistic Regression" in the sidebar.
 2. **Dynamic Validation:** The system checks if the Target Variable (Y) is compatible with the selected model (e.g., continuous for Linear, binary/categorical for Logistic).
 3. **Parameter Summary:** The sidebar displays a clear summary of the selected X variables and the Y variable before launch.
 4. **Interactive Updates:** Changing X or Y variables updates the "Implementation Readiness" of the model (enable/disable the "Run" button).
 ## Tasks / Subtasks
 - [x] **UI Enhancements** (AC: 1, 3)
  - [x] Add model type dropdown to `AnalysisConfiguration.tsx`.
  - [x] Implement a "Selected Features" summary list.
 - [x] **Validation Logic** (AC: 2, 4)
  - [x] Implement frontend validation to check if the target variable matches the model type.
  - [x] Disable "Run Regression" button if validation fails or selection is incomplete.
 ## Dev Notes
 - **Validation Rules:**
  - `linear`: Cible doit être de type `numeric`.
  - `logistic`: Cible doit être `categorical` ou `boolean`.
 - **UI:** Added a toggle switch for model selection and refined the predictor selection list with importance bars.
 ### Project Structure Notes
 - Modified `frontend/src/features/analysis/components/AnalysisConfiguration.tsx`.
 - Updated `frontend/src/store/use-grid-store.ts` with `ModelType` state.
 ### References
 - [Source: epics.md#Story 4.1]
 - [Source: architecture.md#Frontend Architecture]
 ## Dev Agent Record
 ### Agent Model Used
 {{agent_model_name_version}}
 ### Completion Notes List
 - Integrated model type selection (Linear/Logistic).
 - Added comprehensive validation logic for target variables.
 - Refined the predictors list to show importance scores sum and visual bars.
 - Implemented state-aware activation of the execution button.
 ### File List
 - /frontend/src/store/use-grid-store.ts
 - /frontend/src/features/analysis/components/AnalysisConfiguration.tsx
--- a/_bmad-output/implementation-artifacts/4-2-execution-du-modele-backend.md
+++ b/_bmad-output/implementation-artifacts/4-2-execution-du-modele-backend.md
@@ -0,0 +1,63 @@
 # Story 4.2: Exécution du Modèle (Backend)
 Status: review
 ## Story
 As a system,
 I want to execute the statistical model computation,
 so that I can provide accurate regression results.
 ## Acceptance Criteria
 1. **Algorithm Support:** Backend supports Ordinary Least Squares (OLS) for Linear and Logit for Logistic regression.
 2. **Analysis Endpoint:** A POST endpoint `/api/v1/analysis/run-regression` accepts data, X features, Y target, and model type.
 3. **Comprehensive Metrics:** Returns R-squared, Adjusted R-squared, coefficients, standard errors, p-values, and residuals.
 4. **Validation:** Handles singular matrices or perfect collinearity without crashing (returns 400 with explanation).
 5. **Clean Data Source:** Respects user row exclusions during calculation.
 ## Tasks / Subtasks
 - [x] **Dependency Update** (AC: 1)
  - [x] Add `statsmodels` to the backend using `uv`.
 - [x] **Regression Engine** (AC: 1, 3, 4)
  - [x] Implement `run_linear_regression(df, x_cols, y_col)` in `backend/app/core/engine/stats.py`.
  - [x] Implement `run_logistic_regression(df, x_cols, y_col)` in `backend/app/core/engine/stats.py`.
 - [x] **API Endpoint** (AC: 2, 5)
  - [x] Implement `POST /api/v1/analysis/run-regression` in `analysis.py`.
 ## Dev Notes
 - **Statistics:** Using `statsmodels.api` for high-quality, research-grade regression summaries.
 - **Robustness:** Added intercept (constant) automatically to models. Implemented basic median-splitting for Logistic target encoding if not strictly binary.
 - **Validation:** Integrated try/except blocks to catch linear algebra errors (e.g. non-invertible matrices) and return meaningful error messages.
 ### Project Structure Notes
 - Modified `backend/app/core/engine/stats.py`.
 - Updated `backend/app/api/v1/analysis.py` with the execution endpoint.
 - Added regression test case in `backend/tests/test_analysis.py`.
 ### References
 - [Source: epics.md#Story 4.2]
 - [Source: architecture.md#Computational Workers]
 ## Dev Agent Record
 ### Agent Model Used
 {{agent_model_name_version}}
 ### Completion Notes List
 - Integrated `statsmodels` for advanced statistical modeling.
 - Developed a unified regression engine supporting Linear and Logistic models.
 - Implemented `/api/v1/analysis/run-regression` endpoint returning detailed metrics and residuals for plotting.
 - Verified with automated tests for both model types.
 ### File List
 - /backend/app/core/engine/stats.py
 - /backend/app/api/v1/analysis.py
 - /backend/tests/test_analysis.py
 - /backend/pyproject.toml
 - /backend/uv.lock
--- a/_bmad-output/implementation-artifacts/4-3-dashboard-de-resultats-interactif.md
+++ b/_bmad-output/implementation-artifacts/4-3-dashboard-de-resultats-interactif.md
@@ -0,0 +1,62 @@
 # Story 4.3: Dashboard de Résultats Interactif
 Status: review
 ## Story
 As a Julien (Analyst),
 I want to see the model results through interactive charts,
 so that I can easily diagnose the performance of my regression.
 ## Acceptance Criteria
 1. **Results View:** A new "Results" tab or page displays the output of the regression.
 2. **Metrics Cards:** Key statistics (R², Adj. R², P-value, Sample Size) are shown in high-visibility cards with Shadcn UI.
 3. **Primary Chart:** A "Real vs Predicted" scatter chart with a reference 45-degree line.
 4. **Diagnostic Chart:** A "Residuals Distribution" histogram or "Residuals vs Fitted" plot.
 5. **Coefficient Table:** A clean table showing each predictor, its coefficient, and its p-value (color-coded for significance < 0.05).
 ## Tasks / Subtasks
 - [x] **Visualization Development** (AC: 1, 3, 4)
  - [x] Create `frontend/src/features/analysis/components/AnalysisResults.tsx`.
  - [x] Implement "Real vs Predicted" chart using `Recharts`.
  - [x] Implement "Residuals" diagnostic chart.
 - [x] **Data Integration** (AC: 2, 5)
  - [x] Update `useGridStore` to trigger the regression run and store `analysisResults`.
  - [x] Build the metrics summary UI and coefficient table.
 ## Dev Notes
 - **Feedback:** Added visual error reporting in the UI if the regression fails.
 - **Charts:** Used `ScatterChart` for real-vs-pred and `AreaChart` for residuals distribution.
 - **UX:** Auto-switch to "Results" tab upon successful execution.
 ### Project Structure Notes
 - Created `frontend/src/features/analysis/components/AnalysisResults.tsx`.
 - Integrated results state in `frontend/src/store/use-grid-store.ts`.
 - Updated `frontend/src/app/page.tsx` with robust error handling.
 ### References
 - [Source: epics.md#Story 4.3]
 - [Source: ux-design-specification.md#Design Directions]
 ## Dev Agent Record
 ### Agent Model Used
 {{agent_model_name_version}}
 ### Completion Notes List
 - Implemented `AnalysisResults` component with responsive charts.
 - Added visual indicators for statistical significance.
 - Verified correct state management flow from configuration to results display.
 - Improved error handling and user feedback during execution.
 ### File List
 - /frontend/src/features/analysis/components/AnalysisResults.tsx
 - /frontend/src/store/use-grid-store.ts
 - /frontend/src/app/page.tsx
 - /frontend/src/features/analysis/components/AnalysisConfiguration.tsx
--- a/_bmad-output/implementation-artifacts/4-4-generation-du-rapport-pdf-audit-trail.md
+++ b/_bmad-output/implementation-artifacts/4-4-generation-du-rapport-pdf-audit-trail.md
@@ -0,0 +1,62 @@
 # Story 4.4: Génération du Rapport PDF (Audit Trail)
 Status: review
 ## Story
 As a Julien (Analyst),
 I want to export my findings as a professional PDF report,
 so that I can share and archive my validated analysis.
 ## Acceptance Criteria
 1. **PDF Generation:** Backend generates a high-quality PDF containing project title, date, and metrics.
 2. **Visual Inclusion:** The PDF includes the key metrics summary (R², etc.) and the coefficient table.
 3. **Audit Trail:** The PDF explicitly lists the data cleaning steps (e.g., "34 rows excluded from Pressure_Bar").
 4. **Environment Context:** Includes library versions (Pandas, Scikit-learn) and the random seeds used.
 5. **Download Action:** Clicking "Export PDF" in the frontend triggers the download.
 ## Tasks / Subtasks
 - [x] **Dependency Update** (AC: 1)
  - [x] Add `reportlab` or `fpdf2` to the backend using `uv`.
 - [x] **Report Engine** (AC: 1, 2, 3, 4)
  - [x] Implement `generate_pdf_report(results, metadata, audit_trail)` in `backend/app/core/engine/reports.py`.
 - [x] **API & Integration** (AC: 5)
  - [x] Create `POST /api/v1/reports/export` endpoint.
  - [x] Add the "Download PDF" button to the application header.
 ## Dev Notes
 - **Aesthetic:** Designed the PDF with a clean header and color-coded p-values to match the web dashboard.
 - **Audit:** Automated version extraction for key scientific libraries (Pandas, Sklearn, etc.) to ensure complete reproducibility documentation.
 - **Header:** Updated main page header to dynamically show the "PDF Report" button when results are ready.
 ### Project Structure Notes
 - Created `backend/app/core/engine/reports.py` for PDF layout.
 - Created `backend/app/api/v1/reports.py` for the export route.
 - Integrated download logic in `frontend/src/app/page.tsx`.
 ### References
 - [Source: functional-requirements.md#FR21]
 - [Source: epics.md#Story 4.4]
 ## Dev Agent Record
 ### Agent Model Used
 {{agent_model_name_version}}
 ### Completion Notes List
 - Implemented professional PDF generation using `fpdf2`.
 - Added color-coded statistical coefficients to the PDF output.
 - Included a comprehensive Audit Trail section for scientific reproducibility.
 - Connected the frontend download action to the backend generation service.
 ### File List
 - /backend/app/core/engine/reports.py
 - /backend/app/api/v1/reports.py
 - /backend/main.py
 - /frontend/src/app/page.tsx
--- a/_bmad-output/implementation-artifacts/sprint-status.yaml
+++ b/_bmad-output/implementation-artifacts/sprint-status.yaml
@@ -0,0 +1,70 @@
 # generated: 2026-01-10
 # project: Data_analysis
 # project_key: DATA
 # tracking_system: file-system
 # story_location: _bmad-output/implementation-artifacts
 # STATUS DEFINITIONS:
 # ==================
 # Epic Status:
 #   - backlog: Epic not yet started
 #   - in-progress: Epic actively being worked on
 #   - done: All stories in epic completed
 #
 # Epic Status Transitions:
 #   - backlog → in-progress: Automatically when first story is created (via create-story)
 #   - in-progress → done: Manually when all stories reach 'done' status
 #
 # Story Status:
 #   - backlog: Story only exists in epic file
 #   - ready-for-dev: Story file created in stories folder
 #   - in-progress: Developer actively working on implementation
 #   - review: Ready for code review (via Dev's code-review workflow)
 #   - done: Story completed
 #
 # Retrospective Status:
 #   - optional: Can be completed but not required
 #   - done: Retrospective has been completed
 #
 # WORKFLOW NOTES:
 # ===============
 # - Epic transitions to 'in-progress' automatically when first story is created
 # - Stories can be worked in parallel if team capacity allows
 # - SM typically creates next story after previous one is 'done' to incorporate learnings
 # - Dev moves story to 'review', then runs code-review (fresh context, different LLM recommended)
 generated: "2026-01-10"
 project: "Data_analysis"
 project_key: "DATA"
 tracking_system: "file-system"
 story_location: "_bmad-output/implementation-artifacts"
 development_status:
  epic-1: done
  1-1-initialisation-du-monorepo-docker: review
  1-2-ingestion-de-fichiers-excel-csv-backend: review
  1-3-visualisation-dans-la-smart-grid-frontend: review
  1-4-gestion-des-types-renommage-data-hygiene: review
  1-5-tri-filtrage-de-base: review
  epic-1-retrospective: optional
  epic-2: done
  2-1-edition-de-cellule-validation: review
  2-2-undo-redo-des-modifications: review
  2-3-detection-automatique-des-outliers-backend: review
  2-4-panel-d-insights-revue-des-outliers-frontend: review
  2-5-exclusion-non-destructive-de-donnees: review
  epic-2-retrospective: optional
  epic-3: done
  3-1-matrice-de-correlation-interactive: review
  3-2-calcul-de-l-importance-des-features-backend: review
  3-3-recommandation-intelligente-de-variables-frontend: review
  epic-3-retrospective: optional
  epic-4: in-progress
  4-1-configuration-de-la-regression: review
  4-2-execution-du-modele-backend: review
  4-3-dashboard-de-resultats-interactif: review
  4-4-generation-du-rapport-pdf-audit-trail: backlog
  epic-4-retrospective: optional
--- a/_bmad-output/planning-artifacts/architecture.md
+++ b/_bmad-output/planning-artifacts/architecture.md
@@ -0,0 +1,123 @@
 ---
 stepsCompleted: [1, 2, 3, 4, 5]
 inputDocuments: ['_bmad-output/planning-artifacts/prd.md', '_bmad-output/planning-artifacts/ux-design-specification.md']
 workflowType: 'architecture'
 project_name: 'Data_analysis'
 user_name: 'Sepehr'
 date: '2026-01-10'
 ---
 # Architecture Decision Document
 _This document builds collaboratively through step-by-step discovery. Sections are appended as we work through each architectural decision together._
 ## Project Context Analysis
 ### Requirements Overview
 **Functional Requirements:**
 The system requires a robust data processing pipeline capable of ingesting diverse file formats (Excel/CSV), performing automated statistical analysis (Outlier Detection, RFE), and rendering interactive visualizations. The frontend must support a high-performance, editable grid ("Smart Grid") that mimics spreadsheet behavior.
 **Non-Functional Requirements:**
 *   **Performance:** Sub-second response times for grid interactions on datasets up to 50k rows.
 *   **Stateless Architecture:** Phase 1 requires no persistent user data storage; sessions are ephemeral.
 *   **Scientific Rigor:** Reproducibility of results is paramount, requiring strict versioning of libraries and random seeds.
 *   **Security:** Secure file handling and transport (TLS 1.3) are mandatory.
 **Scale & Complexity:**
 *   **Primary Domain:** Scientific Web Application (Full-stack).
 *   **Complexity Level:** Medium. The complexity lies in the bridge between the interactive frontend and the computational backend, ensuring synchronization and performance.
 *   **Estimated Architectural Components:** ~5 Core Components (Frontend Shell, Data Grid, Visualization Engine, API Gateway, Computational Worker).
 ### Technical Constraints & Dependencies
 *   **Backend:** Python is mandatory for the scientific stack (Pandas, Scikit-learn, Statsmodels).
 *   **Frontend:** Next.js 16 with React Server Components (for shell) and Client Components (for grid).
 *   **UI Library:** Shadcn UI + TanStack Table (headless) + Recharts.
 *   **Deployment:** Must support containerized deployment (Docker) for reproducibility.
 ### Cross-Cutting Concerns Identified
 *   **Data Serialization:** Efficient transfer of large datasets (JSON/Arrow) between Python backend and React frontend.
 *   **State Management:** Synchronizing the client-side grid state with the server-side analysis context.
 *   **Error Handling:** Unifying error reporting from the Python backend to the React UI (e.g., "Singular Matrix" error).
 ## Starter Template Evaluation
 ### Primary Technology Domain
 Scientific Data Application (Full-stack Next.js + FastAPI) optimized for self-hosting.
 ### Selected Starter: Custom FastAPI-Next.js-Docker Boilerplate
 **Rationale for Selection:**
 Explicitly chosen to support a "Two-Service" deployment model on a Homelab infrastructure. This ensures process isolation between the analytical Python engine and the React UI.
 **Architectural Decisions Provided by Starter:**
 *   **Language & Runtime:** Python 3.12 (Backend managed by **uv**) and Node.js 20 (Frontend).
 *   **Styling Solution:** Tailwind CSS with Shadcn UI.
 *   **Testing:** Pytest (Backend) and Vitest (Frontend).
 *   **Code Organization:** Clean Monorepo with separated service directories.
 **Deployment Strategy (Homelab):**
 *   **Frontend Service:** Next.js in Standalone mode (Docker).
 *   **Backend Service:** FastAPI with Uvicorn (Docker).
 *   **Communication:** Internal Docker network for API requests to minimize latency.
 ## Core Architectural Decisions
 ### Decision Priority Analysis
 **Critical Decisions (Block Implementation):**
 *   **Data Serialization Protocol:** Apache Arrow (IPC Stream) is mandatory for performance.
 *   **State Management Strategy:** Hybrid (TanStack Query for Async + Zustand for UI State).
 *   **Container Strategy:** Docker Compose with isolated networks for Homelab deployment.
 ### Data Architecture
 *   **Format:** Apache Arrow (IPC Stream) for grid data; JSON for control plane.
 *   **Validation:** Pydantic (v2) for all JSON payloads.
 *   **Persistence:** None (Stateless) for Phase 1. `tempfile` module in Python for transient storage during analysis.
 ### API & Communication Patterns
 *   **Protocol:** REST API (FastAPI) with `StreamingResponse` for data export.
 *   **Serialization:** `pyarrow.ipc.new_stream` on backend -> `tableFromIPC` on frontend.
 *   **CORS:** Strictly configured to allow only the Homelab domain (e.g., `data.home`).
 ### Frontend Architecture
 *   **State Manager:**
    *   **Zustand (v5):** For high-frequency grid state (selection, edits).
    *   **TanStack Query (v5):** For analytical job status and data fetching.
 *   **Component Architecture:** "Smart Grid" pattern where the Grid component subscribes directly to the Zustand store to avoid re-rendering the entire page.
 ### Infrastructure & Deployment
 *   **Containerization:** Multi-stage Docker builds to keep images light (distroless/python and node-alpine).
 *   **Orchestration:** Docker Compose file defining `frontend`, `backend`, and a shared `network`.
 ## Implementation Patterns & Consistency Rules
 ### Pattern Categories Defined
 **Critical Conflict Points Identified:** 5 major areas where AI agents must align to prevent implementation divergence.
 ### Naming Patterns
 *   **Backend (Python):** Strict `snake_case` for modules, functions, and variables (PEP 8).
 *   **Frontend (TSX):** `PascalCase` for Components (`SmartGrid.tsx`), `camelCase` for hooks and utilities.
 *   **API / JSON:** `snake_case` for all keys to maintain 1:1 mapping with Pandas DataFrame columns and Pydantic models.
 ### Structure Patterns
 *   **Project Organization:** Co-located logic. Features are grouped in folders: `/features/data-grid`, `/features/analysis-engine`.
 *   **Test Location:** Centralized `/tests` directory at the service root (e.g., `backend/tests/`, `frontend/tests/`) to simplify Docker test runs.
 ### Format Patterns
 *   **API Response Wrapper:**
    *   Success: `{ "status": "success", "data": ..., "metadata": {...} }`.
    *   Error: `{ "status": "error", "message": "User-friendly message", "code": "TECHNICAL_ERROR_CODE" }`.
 *   **Date Format:** ISO 8601 strings (`YYYY-MM-DDTHH:mm:ssZ`) in UTC.
 ### Process Patterns
 *   **Loading States:** Standardized `isLoading` and `isProcessing` flags in Zustand/TanStack Query.
 *   **Validation:** 
    *   Backend: Pydantic v2.
    *   Frontend: Zod (synchronized with Pydantic via OpenAPI generator).
 ### Enforcement Guidelines
 **All AI Agents MUST:**
 1.  Check for existing Pydantic models before creating new ones.
 2.  Use the `logger` utility instead of `print()` or `console.log`.
 3.  Add JSDoc/Docstrings to every exported function.
--- a/_bmad-output/planning-artifacts/bmm-workflow-status.yaml
+++ b/_bmad-output/planning-artifacts/bmm-workflow-status.yaml
@@ -0,0 +1,35 @@
 generated: "2026-01-10"
 project: "Data_analysis"
 project_type: "software"
 selected_track: "method"
 field_type: "greenfield"
 workflow_path: "_bmad/bmm/workflows/workflow-status/paths/method-greenfield.yaml"
 workflow_status:
  - id: "prd"
    status: "_bmad-output/planning-artifacts/prd.md"
    agent: "pm"
    command: "/bmad:bmm:workflows:create-prd"
  - id: "create-ux-design"
    status: "_bmad-output/planning-artifacts/ux-design-specification.md"
    agent: "ux-designer"
    command: "/bmad:bmm:workflows:create-ux-design"
  - id: "create-architecture"
    status: "_bmad-output/planning-artifacts/architecture.md"
    agent: "architect"
    command: "/bmad:bmm:workflows:create-architecture"
  - id: "create-epics-and-stories"
    status: "_bmad-output/planning-artifacts/epics.md"
    agent: "pm"
    command: "/bmad:bmm:workflows:create-epics-and-stories"
  - id: "test-design"
    status: "optional"
    agent: "tea"
    command: "/bmad:bmm:workflows:test-design"
  - id: "implementation-readiness"
    status: "_bmad-output/planning-artifacts/implementation-readiness-report-2026-01-10.md"
    agent: "architect"
    command: "/bmad:bmm:workflows:implementation-readiness"
  - id: "sprint-planning"
    status: "required"
    agent: "sm"
    command: "/bmad:bmm:workflows:sprint-planning"
--- a/_bmad-output/planning-artifacts/epics.md
+++ b/_bmad-output/planning-artifacts/epics.md
@@ -0,0 +1,312 @@
 ---
 stepsCompleted: [1, 2, 3]
 inputDocuments: ['_bmad-output/planning-artifacts/prd.md', '_bmad-output/planning-artifacts/architecture.md', '_bmad-output/planning-artifacts/ux-design-specification.md']
 ---
 # Data_analysis - Epic Breakdown
 ## Overview
 This document provides the complete epic and story breakdown for Data_analysis, decomposing the requirements from the PRD, UX Design if it exists, and Architecture requirements into implementable stories.
 ## Requirements Inventory
 ### Functional Requirements
 - **FR1:** Users can upload datasets in .xlsx, .xls, and .csv formats via drag-and-drop or file selection.
 - **FR2:** System automatically detects column data types (numeric, categorical, datetime) upon ingest.
 - **FR3:** Users can manually override detected data types if the inference is incorrect.
 - **FR4:** Users can rename columns directly in the interface to sanitize inputs.
 - **FR5:** Users can view loaded data in a paginated, virtualized grid capable of displaying 50,000+ rows.
 - **FR6:** Users can edit cell values directly (double-click to edit) with inputs validated against the column type.
 - **FR7:** Users can sort columns (asc/desc) and filter rows based on values/conditions (e.g., "> 100").
 - **FR8:** Users can perform Undo/Redo operations (Ctrl+Z/Ctrl+Y) on data edits within the current session.
 - **FR9:** Users can exclude specific rows from analysis without deleting them (soft delete/toggle).
 - **FR10:** System automatically identifies univariate outliers using IQR/Z-score and visualizes them in the grid/plots.
 - **FR11:** System automatically identifies multivariate outliers using Isolation Forest upon user request.
 - **FR12:** Users can accept or reject outlier exclusion proposals individually or in bulk.
 - **FR13:** Users can select a Target Variable (Y) to trigger an automated Feature Importance analysis.
 - **FR14:** System recommends the Top-N predictive features based on RFE (Recursive Feature Elimination) or Random Forest importance.
 - **FR15:** Users can configure a Linear Regression (Simple/Multiple) by selecting Dependent (Y) and Independent (X) variables.
 - **FR16:** Users can configure a Binary Logistic Regression for categorical target variables.
 - **FR17:** System generates a "Model Summary" including R-squared, Adjusted R-squared, F-statistic, and P-values for coefficients.
 - **FR18:** System generates standard diagnostic plots: Residuals vs Fitted, Q-Q Plot, and Scale-Location.
 - **FR19:** Users can view a Correlation Matrix (Heatmap) for selected numeric variables.
 - **FR20:** Users can view an interactive "Analysis Report" dashboard summarizing data health, methodology, and model results.
 - **FR21:** Users can export the full report as a branded PDF document.
 - **FR22:** System appends an "Audit Trail" to the report listing library versions, random seeds, and data exclusion steps for reproducibility.
 ### NonFunctional Requirements
 - **Performance:** Grid latency < 200ms for 50k rows. Analysis throughput < 15s. Upload speed < 3s for 5MB.
 - **Security:** Data ephemerality (purge after 1h). TLS 1.3 encryption. Input sanitization for files.
 - **Reliability:** Graceful degradation for bad data. Support 50 concurrent requests via async task queue.
 - **Accessibility:** Keyboard navigation for "Smart Grid". Screen reader support. WCAG 2.1 Level AA compliance.
 ### Additional Requirements
 **Architecture:**
 - **Starter Template:** Custom FastAPI-Next.js-Docker Boilerplate.
 - **Data Serialization:** Apache Arrow (IPC Stream) required for grid data.
 - **State Management:** Hybrid approach (TanStack Query for Server State + Zustand for Grid UI State).
 - **Deployment:** "Two-Service" model on Homelab via Docker Compose.
 - **Naming Conventions:** `snake_case` for Python/API, `PascalCase` for React components.
 - **Testing:** Pytest (Backend) and Vitest (Frontend).
 **UX Design:**
 - **Visual Style:** "Lab & Tech" (Slate/Indigo/Mono) with Shadcn UI.
 - **Responsive:** Desktop Only (1366px+).
 - **Core Interaction:** "Guided Data Hygiene Loop" (Insight Panel).
 - **Design System:** TanStack Table for virtualization + Recharts for visualization.
 - **Mode:** Native Dark Mode support.
 ### FR Coverage Map
 - **FR1:** Epic 1 - Data Ingestion
 - **FR2:** Epic 1 - Type Auto-detection
 - **FR3:** Epic 1 - Manual Type Override
 - **FR4:** Epic 1 - Column Renaming
 - **FR5:** Epic 1 - High-Performance Grid View
 - **FR6:** Epic 2 - Grid Cell Editing
 - **FR7:** Epic 1 - Grid Sort/Filter
 - **FR8:** Epic 2 - Edit Undo/Redo
 - **FR9:** Epic 2 - Row Exclusion Logic
 - **FR10:** Epic 2 - Univariate Outlier Detection
 - **FR11:** Epic 2 - Multivariate Outlier Detection
 - **FR12:** Epic 2 - Outlier Review UI (Insight Panel)
 - **FR13:** Epic 3 - Feature Importance Engine
 - **FR14:** Epic 3 - Smart Feature Recommendation
 - **FR15:** Epic 4 - Linear Regression Configuration
 - **FR16:** Epic 4 - Logistic Regression Configuration
 - **FR17:** Epic 4 - Model Summary & Metrics
 - **FR18:** Epic 4 - Diagnostic Plots
 - **FR19:** Epic 3 - Correlation Matrix Visualization
 - **FR20:** Epic 4 - Interactive Analysis Dashboard
 - **FR21:** Epic 4 - PDF Export
 - **FR22:** Epic 4 - Reproducibility Audit Trail
 ## Epic List
 ### Epic 1: Fondation & Ingestion de Données
 "Je peux uploader mon fichier Excel et voir mes données brutes dans une grille fluide."
 **FRs covered:** FR1, FR2, FR3, FR4, FR5, FR7.
 ### Epic 2: Nettoyage Interactif (Hygiene Loop)
 "Je peux nettoyer mes données en supprimant les lignes erronées ou les outliers."
 **FRs covered:** FR6, FR8, FR9, FR10, FR11, FR12.
 ### Epic 3: Intelligence & Sélection (Smart Prep)
 "Le système me dit quelles variables sont importantes pour ma cible."
 **FRs covered:** FR13, FR14, FR19.
 ### Epic 4: Modélisation & Reporting
 "Je génère mon modèle de régression et j'exporte le rapport PDF."
 **FRs covered:** FR15, FR16, FR17, FR18, FR20, FR21, FR22.
 ---
 ## Epic 1: Fondation & Ingestion de Données
 "Je peux uploader mon fichier Excel et voir mes données brutes dans une grille fluide."
 ### Story 1.1: Initialisation du Monorepo & Docker
 As a developer,
 I want to initialize the project structure (Next.js + FastAPI + Docker),
 So that I have a functional and consistent development environment.
 **Acceptance Criteria:**
 **Given** A fresh project directory.
 **When** I run `docker-compose up`.
 **Then** Both the Next.js frontend and FastAPI backend are reachable on their respective ports.
 **And** The shared Docker network allows communication between services.
 ### Story 1.2: Ingestion de Fichiers Excel/CSV (Backend)
 As a Julien (Analyst),
 I want to upload an Excel or CSV file,
 So that the system can read my production data.
 **Acceptance Criteria:**
 **Given** A valid `.xlsx` file with multiple columns and 5,000 rows.
 **When** I POST the file to the `/upload` endpoint.
 **Then** The backend returns a 200 OK with column metadata (names, detected types).
 **And** The data is prepared as an Apache Arrow stream for high-performance delivery.
 ### Story 1.3: Visualisation dans la Smart Grid (Frontend)
 As a Julien (Analyst),
 I want to see my uploaded data in an interactive high-speed grid,
 So that I can explore the raw data effortlessly.
 **Acceptance Criteria:**
 **Given** A dataset successfully loaded in the backend.
 **When** I view the workspace page.
 **Then** The TanStack Table renders the data using virtualization.
 **And** Scrolling through 50,000 rows remains fluid (< 200ms latency).
 ### Story 1.4: Gestion des Types & Renommage (Data Hygiene)
 As a Julien (Analyst),
 I want to rename columns and correct data types,
 So that the data matches my business context before analysis.
 **Acceptance Criteria:**
 **Given** A column "Press_01" detected as 'text'.
 **When** I click the column header to rename it to "Pressure" and change type to 'numeric'.
 **Then** The grid updates the visual formatting immediately.
 **And** The backend validates that all values in the column can be cast to numeric.
 ### Story 1.5: Tri & Filtrage de Base
 As a Julien (Analyst),
 I want to sort and filter my data in the grid,
 So that I can identify extreme values or specific subsets.
 **Acceptance Criteria:**
 **Given** A column "Temperature".
 **When** I click 'Sort Descending'.
 **Then** The highest temperature values appear at the top of the grid instantly.
 ---
 ## Epic 2: Nettoyage Interactif (Hygiene Loop)
 "Je peux nettoyer mes données en supprimant les lignes erronées ou les outliers."
 ### Story 2.1: Édition de Cellule & Validation
 As a Julien (Analyst),
 I want to edit cell values directly in the grid,
 So that I can manually correct obvious data entry errors.
 **Acceptance Criteria:**
 **Given** A data cell in the grid.
 **When** I double-click the cell and enter a new value.
 **Then** The value is updated in the local UI state (Zustand).
 **And** The system validates the input against the column's data type (e.g., no text in numeric columns).
 ### Story 2.2: Undo/Redo des Modifications
 As a Julien (Analyst),
 I want to undo my last data edits,
 So that I can explore changes without fear of losing the original data.
 **Acceptance Criteria:**
 **Given** A cell value was modified.
 **When** I press `Ctrl+Z` (or click Undo).
 **Then** The cell reverts to its previous value.
 **And** `Ctrl+Y` (Redo) restores the edit.
 ### Story 2.3: Détection Automatique des Outliers (Backend)
 As a system,
 I want to identify statistical outliers in the background,
 So that I can alert the user to potential data quality issues.
 **Acceptance Criteria:**
 **Given** A dataset is loaded.
 **When** The analysis engine runs.
 **Then** It uses Isolation Forest (multivariate) and IQR (univariate) to tag suspicious rows.
 **And** Outlier coordinates are returned to the frontend.
 ### Story 2.4: Panel d'Insights & Revue des Outliers (Frontend)
 As a Julien (Analyst),
 I want to review detected outliers in a side panel,
 So that I can understand why they are flagged before excluding them.
 **Acceptance Criteria:**
 **Given** Flagged outliers exist.
 **When** I click the warning icon in a column header.
 **Then** The `InsightPanel` opens with a boxplot visualization and a "Why?" explanation.
 **And** A button "Exclude all 34 outliers" is prominently displayed.
 ### Story 2.5: Exclusion Non-Destructive de Données
 As a Julien (Analyst),
 I want to toggle the inclusion of specific rows in the analysis,
 So that I can test different scenarios without deleting data.
 **Acceptance Criteria:**
 **Given** A flagged outlier row.
 **When** I click "Exclude".
 **Then** The row appears with 40% opacity in the grid.
 **And** The row is ignored by all subsequent statistical calculations (R², Regression).
 ---
 ## Epic 3: Intelligence & Sélection (Smart Prep)
 "Le système me dit quelles variables sont importantes pour ma cible."
 ### Story 3.1: Matrice de Corrélation Interactive
 As a Julien (Analyst),
 I want to see a visual correlation map of my numeric variables,
 So that I can quickly identify which factors are related.
 **Acceptance Criteria:**
 **Given** A dataset with multiple numeric columns.
 **When** I navigate to the "Correlations" tab.
 **Then** A heatmap is displayed using Pearson correlation coefficients.
 **And** Hovering over a cell shows the precise correlation value.
 ### Story 3.2: Calcul de l'Importance des Features (Backend)
 As a system,
 I want to compute the predictive power of features against a target variable,
 So that I can provide scientific recommendations to the user.
 **Acceptance Criteria:**
 **Given** A dataset and a selected Target Variable (Y).
 **When** The RFE (Recursive Feature Elimination) algorithm runs.
 **Then** The backend returns an ordered list of features with their importance scores.
 ### Story 3.3: Recommandation Intelligente de Variables (Frontend)
 As a Julien (Analyst),
 I want the system to suggest which variables to include in my model,
 So that I don't pollute my analysis with irrelevant data ("noise").
 **Acceptance Criteria:**
 **Given** Feature importance scores are calculated.
 **When** I open the Model Configuration panel.
 **Then** The top 5 predictive variables are pre-selected by default.
 **And** An explanation "Why?" is available for each recommendation.
 ---
 ## Epic 4: Modélisation & Reporting
 "Je génère mon modèle de régression et j'exporte le rapport PDF."
 ### Story 4.1: Configuration de la Régression
 As a Julien (Analyst),
 I want to configure the parameters of my regression model,
 So that I can tailor the analysis to my specific hypothesis.
 **Acceptance Criteria:**
 **Given** A cleaned dataset.
 **When** I select "Linear Regression" and confirm X/Y variables.
 **Then** The system validates that the target variable (Y) is suitable for the chosen model type.
 ### Story 4.2: Exécution du Modèle (Backend)
 As a system,
 I want to execute the statistical model computation,
 So that I can provide accurate regression results.
 **Acceptance Criteria:**
 **Given** Model parameters (X, Y, Algorithm).
 **When** The "Run" action is triggered.
 **Then** The backend computes R², Adjusted R², P-values, and coefficients using `statsmodels`.
 **And** All results are returned as a JSON summary.
 ### Story 4.3: Dashboard de Résultats Interactif
 As a Julien (Analyst),
 I want to see the model results through interactive charts,
 So that I can easily diagnose the performance of my regression.
 **Acceptance Criteria:**
 **Given** Computed model results.
 **When** I view the "Results" page.
 **Then** I see a "Real vs Predicted" scatter plot and a "Residuals" plot.
 **And** Key metrics (R², P-value) are displayed with colored status indicators (Success/Warning).
 ### Story 4.4: Génération du Rapport PDF (Audit Trail)
 As a Julien (Analyst),
 I want to export my findings as a professional PDF report,
 So that I can share and archive my validated analysis.
 **Acceptance Criteria:**
 **Given** A completed analysis session.
 **When** I click "Export PDF".
 **Then** A PDF is generated containing all charts, metrics, and a reproducibility section (lib versions, seeds).
 **And** The report lists all rows that were excluded during the session.
--- a/_bmad-output/planning-artifacts/implementation-readiness-report-2026-01-10.md
+++ b/_bmad-output/planning-artifacts/implementation-readiness-report-2026-01-10.md
@@ -0,0 +1,154 @@
 # Implementation Readiness Assessment Report
 **Date:** 2026-01-10
 **Project:** Data_analysis
 ## PRD Analysis
 ### Functional Requirements
 FR1: Users can upload datasets in .xlsx, .xls, and .csv formats via drag-and-drop or file selection.
 FR2: System automatically detects column data types (numeric, categorical, datetime) upon ingest.
 FR3: Users can manually override detected data types if the inference is incorrect.
 FR4: Users can rename columns directly in the interface to sanitize inputs.
 FR5: Users can view loaded data in a paginated, virtualized grid capable of displaying 50,000+ rows.
 FR6: Users can edit cell values directly (double-click to edit) with inputs validated against the column type.
 FR7: Users can sort columns (asc/desc) and filter rows based on values/conditions (e.g., "> 100").
 FR8: Users can perform Undo/Redo operations (Ctrl+Z/Ctrl+Y) on data edits within the current session.
 FR9: Users can exclude specific rows from analysis without deleting them (soft delete/toggle).
 FR10: System automatically identifies univariate outliers using IQR/Z-score and visualizes them in the grid/plots.
 FR11: System automatically identifies multivariate outliers using Isolation Forest upon user request.
 FR12: Users can accept or reject outlier exclusion proposals individually or in bulk.
 FR13: Users can select a Target Variable (Y) to trigger an automated Feature Importance analysis.
 FR14: System recommends the Top-N predictive features based on RFE (Recursive Feature Elimination) or Random Forest importance.
 FR15: Users can configure a Linear Regression (Simple/Multiple) by selecting Dependent (Y) and Independent (X) variables.
 FR16: Users can configure a Binary Logistic Regression for categorical target variables.
 FR17: System generates a "Model Summary" including R-squared, Adjusted R-squared, F-statistic, and P-values for coefficients.
 FR18: System generates standard diagnostic plots: Residuals vs Fitted, Q-Q Plot, and Scale-Location.
 FR19: Users can view a Correlation Matrix (Heatmap) for selected numeric variables.
 FR20: Users can view an interactive "Analysis Report" dashboard summarizing data health, methodology, and model results.
 FR21: Users can export the full report as a branded PDF document.
 FR22: System appends an "Audit Trail" to the report listing library versions, random seeds, and data exclusion steps for reproducibility.
 Total FRs: 22
 ### Non-Functional Requirements
 NFR1: Grid Latency: render 50,000 rows with filtering/sorting response times under 200ms.
 NFR2: Analysis Throughput: Automated analysis on standard datasets (<10MB) must complete in under 15 seconds.
 NFR3: Upload Speed: Parsing and validation of a 5MB Excel file should complete in under 3 seconds.
 NFR4: Data Ephemerality: All user datasets purged after 1 hour of inactivity or session termination.
 NFR5: Transport Security: Data transmission must be encrypted via TLS 1.3.
 NFR6: Input Sanitization: File parser must validate MIME types and signatures to prevent macro execution.
 NFR7: Graceful Degradation: Handle NaNs/infinite values with clear error messages instead of crashing.
 NFR8: Concurrency: Support at least 50 concurrent analysis requests using an asynchronous task queue.
 NFR9: Keyboard Navigation: Data grid must be fully navigable via keyboard.
 Total NFRs: 9
 ### Additional Requirements
 - **Stateless Architecture:** Phase 1 requires no persistent user data storage.
 - **Scientific Rigor:** Reproducibility of results is paramount (Trace d'Analyse).
 - **Desktop Only:** Strictly optimized for high-resolution desktop displays.
 ### PRD Completeness Assessment
 The PRD is exceptionally comprehensive, providing numbered, testable requirements (FR1-FR22) and specific, measurable quality attributes (NFR1-NFR9). The "Experience MVP" strategy is clearly defined, and the project context (Scientific Greenfield) is well-articulated. No major gaps were identified during extraction.
 ## Epic Coverage Validation
 ### FR Coverage Analysis
 | FR Number | PRD Requirement | Epic Coverage | Status |
 | :--- | :--- | :--- | :--- |
 | FR1 | Upload datasets (.xlsx, .xls, .csv) | Epic 1 Story 1.2 | ✓ Covered |
 | FR2 | Auto-detect column data types | Epic 1 Story 1.2 | ✓ Covered |
 | FR3 | Manual type override | Epic 1 Story 1.4 | ✓ Covered |
 | FR4 | Rename columns | Epic 1 Story 1.4 | ✓ Covered |
 | FR5 | High-performance grid (50k+ rows) | Epic 1 Story 1.3 | ✓ Covered |
 | FR6 | Edit cell values directly | Epic 2 Story 2.1 | ✓ Covered |
 | FR7 | Sort and filter rows | Epic 1 Story 1.5 | ✓ Covered |
 | FR8 | Undo/Redo operations | Epic 2 Story 2.2 | ✓ Covered |
 | FR9 | Exclude rows (soft delete) | Epic 2 Story 2.5 | ✓ Covered |
 | FR10 | Univariate outlier detection (IQR) | Epic 2 Story 2.3 | ✓ Covered |
 | FR11 | Multivariate outlier detection (Isolation Forest) | Epic 2 Story 2.3 | ✓ Covered |
 | FR12 | Outlier review UI (Insight Panel) | Epic 2 Story 2.4 | ✓ Covered |
 | FR13 | Feature Importance analysis | Epic 3 Story 3.2 | ✓ Covered |
 | FR14 | Top-N predictive feature recommendations | Epic 3 Story 3.3 | ✓ Covered |
 | FR15 | Linear Regression configuration | Epic 4 Story 4.1 | ✓ Covered |
 | FR16 | Logistic Regression configuration | Epic 4 Story 4.1 | ✓ Covered |
 | FR17 | Model Summary (R², P-values, etc.) | Epic 4 Story 4.2 | ✓ Covered |
 | FR18 | Diagnostic plots | Epic 4 Story 4.3 | ✓ Covered |
 | FR19 | Correlation Matrix (Heatmap) | Epic 3 Story 3.1 | ✓ Covered |
 | FR20 | Analysis Report dashboard | Epic 4 Story 4.3 | ✓ Covered |
 | FR21 | Export branded PDF | Epic 4 Story 4.4 | ✓ Covered |
 | FR22 | Reproducibility Audit Trail | Epic 4 Story 4.4 | ✓ Covered |
 ### Missing Requirements
 None. All 22 Functional Requirements from the PRD are mapped to specific stories in the epics document.
 ### Coverage Statistics
 - Total PRD FRs: 22
 - FRs covered in epics: 22
 - Coverage percentage: 100%
 ## UX Alignment Assessment
 ### UX Document Status
 *   **Found:** `_bmad-output/planning-artifacts/ux-design-specification.md`
 ### Alignment Analysis
 **UX ↔ PRD Alignment:**
 *   ✅ **User Journeys:** Optimized for identified personas (Julien & Marc).
 *   ✅ **Feature Coverage:** 100% of FRs have defined interaction patterns.
 *   ✅ **Workflow:** Assisted analysis loop matches the PRD vision.
 **UX ↔ Architecture Alignment:**
 *   ✅ **Performance:** High-density grid requirements supported by Apache Arrow stack.
 *   ✅ **State Management:** Zustand choice supports high-frequency UI updates.
 *   ✅ **Responsive Strategy:** Consistent "Desktop Only" approach across all plans.
 ### Warnings
 *   None.
 ## Epic Quality Review
 ### Epic Structure Validation
 *   ✅ **Epic 1: Ingestion** - Focused on user value.
 *   ✅ **Epic 2: Hygiene** - Standalone value, no forward dependencies.
 *   ✅ **Epic 3: Smart Prep** - Incremental enhancement.
 *   ✅ **Epic 4: Modélisation** - Final completion of journey.
 ### Story Quality & Sizing
 *   ✅ **Story 1.1:** Correctly initializes project from Architecture boilerplate.
 *   ✅ **Acceptance Criteria:** All stories follow Given/When/Then format.
 *   ✅ **Story Sizing:** Optimized for single agent dev sessions.
 ### Dependency Analysis
 *   ✅ **No Forward Dependencies:** No story depends on work from a future epic.
 *   ✅ **Database Timing:** Stateless logic introduced exactly when required.
 ### Quality Assessment Documentation
 *   🔴 **Critical Violations:** None.
 *   🟠 **Major Issues:** None.
 *   🟡 **Minor Concerns:** None.
 ## Summary and Recommendations
 ### Overall Readiness Status
 **READY** ✅
 ### Critical Issues Requiring Immediate Action
 *   **None.**
 ### Recommended Next Steps
 1.  **Initialize Project:** Run `docker-compose up` to verify the monorepo skeleton (Epic 1 Story 1.1).
 2.  **Performance Spike:** Validate Apache Arrow streaming with a 50k row dataset early in development.
 3.  **UI Setup:** Configure the Shadcn UI ThemeProvider for native Dark Mode support from the start.
 ### Final Note
 This assessment identifies 0 issues. The project planning is complete, coherent, and highly robust. You may proceed immediately to implementation.
--- a/_bmad-output/planning-artifacts/prd.md
+++ b/_bmad-output/planning-artifacts/prd.md
@@ -0,0 +1,303 @@
 ---
 stepsCompleted: [1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11]
 inputDocuments: []
 workflowType: 'prd'
 lastStep: 11
 project_name: Data_analysis
 user_name: Sepehr
 date: 2026-01-10
 briefCount: 0
 researchCount: 0
 brainstormingCount: 0
 projectDocsCount: 0
 ---
 # Product Requirements Document - Data_analysis
 **Author:** Sepehr
 **Date:** 2026-01-10
 ## Executive Summary
 **Data_analysis** aims to democratize advanced statistical analysis by combining the robustness of Python's scientific ecosystem with the accessibility of a modern web interface. It serves as a web-based, modern alternative to Minitab, specifically optimized for regression analysis workflows. The platform empowers users—from data novices to analysts—to upload datasets (Excel/CSV), intuitively clean and manipulate data in an Excel-like grid, and perform sophisticated regression modeling with automated guidance.
 ### What Makes This Special
 *   **Guided Analytical Workflow:** Unlike traditional tools that present a toolbox, Data_analysis guides the user through a best-practice workflow: Data Upload -> Auto-Outlier Detection -> Smart Feature Selection -> Regression Modeling -> Explainable Results.
 *   **Hybrid Interface:** Merges the familiarity of spreadsheet editing (direct cell manipulation, copy-paste) with the power of a computational notebook, eliminating the need to switch between Excel and statistical software.
 *   **Modern Tech Stack:** Built on a robust Python backend (FastAPI/Django) for heavy statistical lifting (Pandas, Scikit-learn, Statsmodels) and a high-performance Next.js 16 frontend with Tailwind CSS and Shadcn UI, ensuring a fast, responsive, and visually appealing experience.
 *   **Automated Insights:** Proactively identifies data quality issues (outliers) and relevant predictors (feature selection) using advanced algorithms (e.g., Isolation Forest, Recursive Feature Elimination), visualizing *why* certain features are selected before the user commits to a model.
 ### Key Workflows & Scenarios
 1.  **"Auto-Optimisation" (Scénario Cœur):**
    *   Upload de fichier -> Détection automatique des Outliers (Isolation Forest) avec proposition de correction.
    *   Sélection automatique de Features (RFE/Lasso) pour identifier les variables clés.
    *   Régression Robuste finale pour un modèle moins sensible au bruit.
 2.  **Modern Minitab Classics:**
    *   **Régression Linéaire Simple & Multiple:** Interface interactive pour visualiser la droite de régression, les résidus et les métriques (R²) avec alertes automatiques sur les hypothèses.
    *   **Régression Logistique Binaire:** Pour les prédictions Oui/Non avec matrices de confusion et courbes ROC interactives.
 3.  **Comparaison de Modèles (Benchmark):**
    *   Lancement parallèle de plusieurs algorithmes (Régression Linéaire, Random Forest, XGBoost) pour recommander le plus performant.
 ## Project Classification
 **Technical Type:** web_app
 **Domain:** scientific
 **Complexity:** medium
 **Project Context:** Greenfield - new project
 ## Success Criteria
 ### User Success
 * **Efficiency:** Complete a full regression cycle (Upload -> Cleaning -> Modeling) in under 3 minutes for typical datasets.
 * **Cognitive Load Reduction:** Users feel guided and confident without needing deep statistical expertise; "Explainable AI" visuals clarify outlier removal and feature selection.
 * **Data Mastery:** Users gain comprehensive insights into their data quality and variable importance through automated profiling.
 ### Business Success
 * **Operational Speed:** Significant reduction in man-hours spent on manual data cleaning and repetitive modeling tasks.
 * **Platform Adoption:** High user retention rate by providing a "least head-scratching" experience compared to Minitab or raw Excel.
 ### Technical Success
 * **High-Performance Grid:** Excel-like interface handles 50k+ rows with sub-second latency for filtering and sorting.
 * **Algorithmic Integrity:** Reliable Python backend providing accurate statistical outputs consistent with industry-standard libraries (Scikit-learn, Statsmodels).
 ## Product Scope
 ### MVP - Minimum Viable Product
 * **Direct Data Import:** Robust Excel/CSV upload.
 * **Smart Data Grid:** Seamless cell editing, filtering, and sorting in a modern web UI.
 * **Automated Data Preparation:** Integrated Outlier Detection (visual) and Feature Selection algorithms.
 * **Regression Core:** High-quality Linear (Simple/Multiple) regression output with clear diagnostics.
 ### Growth Features (Post-MVP)
 * **Binary Logistic Regression:** Support for classification-based predictive modeling.
 * **Model Benchmark:** Automated "tournament" mode to find the best performing algorithm.
 * **Advanced Reporting:** Exportable dashboard summaries for stakeholder presentations.
 ### Vision (Future)
 * **Time Series Forecasting:** Expansion into temporal data prediction.
 * **Native Integration:** Two-way sync with Excel/Cloud Storage providers.
 ## User Journeys
 **Journey 1: Julien, l'Ingénieur Qualité - La Course contre la Montre**
 Julien est sous pression. Une ligne de production vient de signaler une dérive anormale sur des composants électroniques. Il est 11h, et il doit présenter une analyse de régression à son directeur à 14h pour décider s'il faut arrêter la production. Il a exporté un fichier Excel complexe avec 40 variables (température, humidité, pression, etc.). Habituellement, il perdrait une heure rien qu'à nettoyer les données et à essayer de comprendre quelles variables sont pertinentes.
 Il ouvre **Data_analysis**. Il glisse son fichier Excel dans l'interface. Immédiatement, il voit ses données dans une grille familière. Le système fait clignoter un message : *"34 outliers détectés dans la colonne 'Pression_Zone_B'"*. Julien clique, visualise les points rouges sur un graphique et décide de les exclure en un clic. Ensuite, il lance la "Smart Feature Selection". Le système lui explique : *"Les variables Température_C et Vitesse_Tapis expliquent 85% de la dérive"*. À 11h15, Julien a déjà son modèle de régression validé et une visualisation claire. Il se sent serein pour sa réunion : il ne présente pas juste des chiffres, il présente une solution validée.
 **Journey 2: Sarah, l'Administratrice IT - La Gestion sans Stress**
 Sarah doit s'assurer que les outils utilisés par les ingénieurs sont sécurisés et ne saturent pas les ressources du serveur. Elle se connecte à son tableau de bord **Data_analysis**. Elle peut voir en un coup d'œil le nombre d'analyses en cours et la mémoire consommée par le backend Python. Elle crée un nouvel accès pour un stagiaire en quelques secondes. Pour elle, le succès, c'est que personne ne l'appelle pour dire "le logiciel a planté" parce qu'un fichier était trop gros.
 **Journey 3: Marc, le Directeur de Production - La Décision Rapide**
 Marc reçoit un lien de la part de Julien. Il n'est pas statisticien. Il ouvre le lien sur sa tablette. Il ne voit pas des lignes de code, mais un rapport interactif. Il voit le graphique de régression, lit l'explication simplifiée ("La vitesse du tapis est le facteur clé") et clique sur le PDF pour l'archiver. Il a pu prendre la décision d'ajuster la vitesse du tapis en 2 minutes, sauvant ainsi la production de l'après-midi.
 ### Journey Requirements Summary
 **For Julien (Analyst):**
 *   **Fast Data Ingestion:** Drag-and-drop Excel/CSV support.
 *   **Visual Data Cleaning:** Automated outlier detection with interactive exclusion.
 *   **Explainable ML:** Feature selection that explains "why" (percentage of variance explained).
 *   **Validation:** Clear regression metrics (R², P-values) presented simply.
 **For Sarah (Admin):**
 *   **System Health:** Dashboard for monitoring backend resources (Python server load).
 *   **Access Control:** Simple user management (RBAC).
 *   **Stability:** Robust error handling for large files to prevent system crashes.
 **For Marc (Consumer):**
 *   **Accessibility:** Mobile/Tablet responsive view for reports.
 *   **Simplicity:** "Read-only" mode with simplified insights (no code/formulas).
 *   **Portability:** One-click PDF export for archiving/sharing.
 ## Domain-Specific Requirements
 ### Scientific Validation & Reproducibility
 **Data_analysis** must adhere to strict scientific rigor to be a credible Minitab alternative. Users rely on these results for quality control and critical decision-making.
 ### Key Domain Concerns
 *   **Reproducibility:** Ensuring identical inputs yield identical outputs, regardless of when or where the analysis is run.
 *   **Methodological Transparency:** Avoiding "black box" algorithms; users must understand *how* an outlier was detected.
 *   **Computational Integrity:** Handling floating-point precision and large matrix operations without degradation.
 ### Compliance Requirements
 *   **Audit Trail:** Every generated report must include an appendix listing:
    *   Software Version & Library Versions (Pandas/Scikit-learn versions).
    *   Random Seed used for stochastic processes (Isolation Forest, train/test split).
    *   Sequence of applied filters (e.g., "Row 45 excluded due to Z-score > 3").
 ### Industry Standards & Best Practices
 *   **Statistical Standards:** Use `statsmodels` for classical regression (p-values, confidence intervals) to match traditional statistical expectations, and `scikit-learn` for predictive tasks.
 *   **Visual Standards:** Error bars, confidence bands, and residual plots must follow standard scientific visualization conventions (e.g., Q-Q plots for normality).
 ### Required Expertise & Validation
 *   **Validation Methodology:**
    *   **Unit Tests for Math:** Verify regression outputs against known standard datasets (e.g., Anscombe's quartet).
    *   **Drift Detection:** Alert users if data distribution significantly deviates from assumptions (e.g., normality check for linear regression).
 ### Implementation Considerations
 *   **Asynchronous Processing:** Heavy computations (Feature Selection on >10k rows) must be offloaded to a background worker (Celery/Redis) to maintain UI responsiveness.
 *   **Fixed Random Seeds:** All stochastic algorithms must imply a fixed random state by default to ensure consistency, with an option for the user to change it.
 ## Innovation & Novel Patterns
 ### Detected Innovation Areas
 *   **Hybrid "Spreadsheet-Notebook" Interface:**
    *   **Concept:** Combines the low barrier to entry of a spreadsheet (Excel) with the computational power and reproducibility of a code notebook (Jupyter), without requiring the user to write code.
    *   **Differentiation:** Traditional tools are either "click-heavy" (Minitab) or "code-heavy" (Python/R). Data_analysis sits in the "sweet spot" of **No-Code Data Science** with full transparency.
 *   **Guided "GPS" Workflow:**
    *   **Concept:** Instead of a passive toolbox, the system actively guides the analysis. It doesn't just ask "What model do you want?", it suggests "Your data has outliers, let's fix them first" and "These 3 variables are the most predictive."
    *   **Differentiation:** Moves from "User-Driven Analysis" to **"Assisted Analysis"**, reducing the risk of statistical errors by non-experts.
 *   **Explainable AI (XAI) for Quality:**
    *   **Concept:** Using advanced algorithms (Isolation Forest) not just to *remove* bad data, but to *explain* why it's bad visually.
    *   **Differentiation:** Makes complex ML concepts accessible to domain experts (e.g., Quality Engineers) who understand the *context* but not necessarily the *algorithm*.
 ### Market Context & Competitive Landscape
 *   **Legacy Players:** Minitab, SPSS (Powerful but expensive, dated UI, steep learning curve).
 *   **Modern Data Tools:** Tableau, PowerBI (Great for visualization, weak for advanced statistical regression).
 *   **Code-Based:** Jupyter, Streamlit (Powerful but requires coding skills).
 *   **Opportunity:** Data_analysis fills the gap for a **Modern, Web-Based, Statistical Power Tool** for non-coders.
 ### Validation Approach
 *   **User Testing:** Compare time-to-insight between Data_analysis and Minitab for a standard regression task.
 *   **Side-by-Side Benchmark:** Run the same dataset through Minitab and Data_analysis to validate numerical accuracy (ensure results match to 4 decimal places).
 ### Risk Mitigation
 *   **"Black Box" Trust:** Users might not trust automated suggestions.
    *   *Mitigation:* Always provide a "Show Details" view with raw statistical metrics (p-values) to prove the "why".
 *   **Performance:** Python backend might lag on large Excel files.
    *   *Mitigation:* Implemented async task queue (Celery) and progressive loading for the frontend grid.
 ## Web App Specific Requirements
 ### Project-Type Overview
 As a scientific web application, Data_analysis prioritizes data integrity and high-performance interactivity. The technical architecture must support heavy client-side state management (for the grid) while leveraging robust backend statistical processing.
 ### Technical Architecture Considerations
 *   **Rendering Strategy:** 
    *   **Shell & Reports:** Next.js Server Components for optimized performance and SEO (if public).
    *   **Data Grid:** React Client Components to manage complex state transitions, cell editing, and local filtering with sub-second latency.
 *   **Data Persistence:**
    *   **Session-based Workspace:** Users work on a "Project" basis; files are uploaded to temporary storage for analysis, with an option to persist to a database (PostgreSQL) for long-term tracking.
 *   **Browser Strategy:** Support for modern "Evergreen" browsers (Chrome, Edge, Firefox, Safari). High-performance features like Web Workers may be used for local data transformations.
 ### Functional Requirements (Web-Specific)
 *   **Excel-like Interactions:** Support for keyboard shortcuts (Ctrl+C/V, Undo/Redo), drag-to-fill (Growth), and multi-cell selection.
 *   **Responsive Analysis:** The interface must adapt for "Marc's Journey" (Manager/Consumer) on tablets, while ensuring "Julien's Journey" (Analyst) is optimized for high-resolution desktop displays.
 *   **Accessibility:** Adherence to WCAG 2.1 principles for the UI shell, with specific focus on keyboard-only navigation for the data entry grid.
 ### Implementation Considerations
 *   **Security:** JWT-based authentication for Sarah's (Admin) user management. All data uploads must be scanned for malicious macros/content.
 *   **Stateless Backend:** The Python API (FastAPI) will remain largely stateless, receiving data via secure requests and returning analytical results/visualizations in JSON/Base64 format.
 ## Project Scoping & Phased Development
 ### MVP Strategy & Philosophy
 **MVP Approach:** Experience MVP - Stateless & Fast.
 **Core Value:** Deliver a "Zero-Setup" analytical tool where users get results instantly without creating accounts or managing projects. Focus on the *quality* of the interaction and the analysis report.
 ### MVP Feature Set (Phase 1)
 **Core User Journeys Supported:**
 *   **Julien (Analyst):** Full flow from upload to regression report.
 *   **Marc (Manager):** Reading the generated PDF report.
 *   *(Deferred)* **Sarah (Admin):** No admin dashboard needed yet as the system is stateless/public.
 **Must-Have Capabilities:**
 *   **Input:** Drag & Drop Excel/CSV parser (Pandas).
 *   **Interaction:** Interactive Data Grid (Read/Write) for quick cleaning/filtering.
 *   **Analysis Core:** 
    *   Automated Outlier Detection (Isolation Forest).
    *   Automated Feature Selection (RFE).
    *   Models: Linear Regression (Simple/Multiple), Logistic Regression, Correlation Matrix.
 *   **Output:** Interactive Web Report + One-click PDF Export.
 ### Post-MVP Features
 **Phase 2 (Growth - "Project Mode"):**
 *   User Accounts & Project Persistence (PostgreSQL).
 *   Admin Dashboard for resource monitoring.
 *   Advanced Models: Time Series, ANOVA.
 **Phase 3 (Expansion - "Enterprise"):**
 *   Collaboration (Real-time editing).
 *   Direct connectors (SQL Database, Salesforce).
 *   On-premise deployment options (Docker).
 ### Risk Mitigation Strategy
 **Technical Risks:**
 *   **Grid Performance:** Using a robust React Data Grid library (TanStack Table or AG Grid Community) to handle DOM virtualization for 50k rows.
 *   **Stateless Memory:** Limiting file upload size (e.g., 50MB) to prevent RAM saturation since we aren't using a DB yet.
 **Market Risks:**
 *   **Trust:** Ensuring the PDF report looks professional enough to be accepted in a formal meeting (Marc's journey).
 ## Functional Requirements
 ### Data Ingestion & Management
 - **FR1:** Users can upload datasets in .xlsx, .xls, and .csv formats via drag-and-drop or file selection.
 - **FR2:** System automatically detects column data types (numeric, categorical, datetime) upon ingest.
 - **FR3:** Users can manually override detected data types if the inference is incorrect.
 - **FR4:** Users can rename columns directly in the interface to sanitize inputs.
 ### Interactive Data Grid (Workspace)
 - **FR5:** Users can view loaded data in a paginated, virtualized grid capable of displaying 50,000+ rows.
 - **FR6:** Users can edit cell values directly (double-click to edit) with inputs validated against the column type.
 - **FR7:** Users can sort columns (asc/desc) and filter rows based on values/conditions (e.g., "> 100").
 - **FR8:** Users can perform Undo/Redo operations (Ctrl+Z/Ctrl+Y) on data edits within the current session.
 - **FR9:** Users can exclude specific rows from analysis without deleting them (soft delete/toggle).
 ### Automated Data Preparation (Smart Prep)
 - **FR10:** System automatically identifies univariate outliers using IQR/Z-score and visualizes them in the grid/plots.
 - **FR11:** System automatically identifies multivariate outliers using Isolation Forest upon user request.
 - **FR12:** Users can accept or reject outlier exclusion proposals individually or in bulk.
 - **FR13:** Users can select a Target Variable (Y) to trigger an automated Feature Importance analysis.
 - **FR14:** System recommends the Top-N predictive features based on RFE (Recursive Feature Elimination) or Random Forest importance.
 ### Statistical Modeling (Core Analytics)
 - **FR15:** Users can configure a Linear Regression (Simple/Multiple) by selecting Dependent (Y) and Independent (X) variables.
 - **FR16:** Users can configure a Binary Logistic Regression for categorical target variables.
 - **FR17:** System generates a "Model Summary" including R-squared, Adjusted R-squared, F-statistic, and P-values for coefficients.
 - **FR18:** System generates standard diagnostic plots: Residuals vs Fitted, Q-Q Plot, and Scale-Location.
 - **FR19:** Users can view a Correlation Matrix (Heatmap) for selected numeric variables.
 ### Reporting & Reproducibility
 - **FR20:** Users can view an interactive "Analysis Report" dashboard summarizing data health, methodology, and model results.
 - **FR21:** Users can export the full report as a branded PDF document.
 - **FR22:** System appends an "Audit Trail" to the report listing library versions, random seeds, and data exclusion steps for reproducibility.
 ## Non-Functional Requirements
 ### Performance
 *   **Grid Latency:** The interactive data grid must render 50,000 rows with filtering/sorting response times under 200ms (Client-Side Virtualization).
 *   **Analysis Throughput:** Automated analysis (Outlier Detection + Feature Selection) on standard datasets (<10MB) must complete in under 15 seconds.
 *   **Upload Speed:** Parsing and validation of a 5MB Excel file should complete in under 3 seconds.
 ### Security & Privacy
 *   **Data Ephemerality:** All user datasets uploaded to the temporary workspace must be permanently purged from the server memory/storage after 1 hour of inactivity or immediately upon session termination.
 *   **Transport Security:** All data transmission between Client and Server must be encrypted via TLS 1.3.
 *   **Input Sanitization:** The file parser must strictly validate MIME types and file signatures to prevent malicious code execution (e.g., Excel Macros).
 ### Reliability & Stability
 *   **Graceful Degradation:** The system must handle "bad data" (NaNs, infinite values) by providing clear error messages rather than crashing the Python backend (500 Internal Error).
 *   **Concurrency:** The backend must support at least 50 concurrent analysis requests without performance degradation, using an asynchronous task queue (Celery).
 ### Accessibility
 *   **Keyboard Navigation:** The data grid must be fully navigable via keyboard (Arrows, Tab, Enter) to support "Power User" workflows efficiently.
--- a/_bmad-output/planning-artifacts/ux-design-directions.html
+++ b/_bmad-output/planning-artifacts/ux-design-directions.html
@@ -0,0 +1,192 @@
 <!DOCTYPE html>
 <html lang="fr" class="light">
 <head>
    <meta charset="UTF-8">
    <meta name="viewport" content="width=device-width, initial-scale=1.0">
    <title>Data_analysis - Design System Showcase</title>
    <script src="https://cdn.tailwindcss.com"></script>
    <link rel="preconnect" href="https://fonts.googleapis.com">
    <link rel="preconnect" href="https://fonts.gstatic.com" crossorigin>
    <link href="https://fonts.googleapis.com/css2?family=Inter:wght@400;500;600;700&family=JetBrains+Mono:wght@400;500&display=swap" rel="stylesheet">
    <script>
        tailwind.config = {
            darkMode: 'class',
            theme: {
                extend: {
                    fontFamily: {
                        sans: ['Inter', 'sans-serif'],
                        mono: ['JetBrains Mono', 'monospace'],
                    },
                    colors: {
                        indigo: {
                            50: '#eef2ff', 100: '#e0e7ff', 200: '#c7d2fe', 300: '#a5b4fc', 
                            400: '#818cf8', 500: '#6366f1', 600: '#4f46e5', 700: '#4338ca', 
                            800: '#3730a3', 900: '#312e81', 950: '#1e1b4b',
                        },
                        slate: {
                            50: '#f8fafc', 100: '#f1f5f9', 200: '#e2e8f0', 300: '#cbd5e1', 
                            400: '#94a3b8', 500: '#64748b', 600: '#475569', 700: '#334155', 
                            800: '#1e293b', 900: '#0f172a', 950: '#020617',
                        }
                    }
                }
            }
        }
    </script>
    <style>
        .grid-cell { font-family: 'JetBrains Mono', monospace; font-size: 13px; }
        .transition-theme { transition: background-color 0.3s ease, color 0.3s ease, border-color 0.3s ease; }
    </style>
 </head>
 <body class="bg-slate-50 dark:bg-slate-950 transition-theme min-h-screen p-8 font-sans">
    <div class="max-w-6xl mx-auto space-y-12">
        <!-- Header -->
        <header class="flex items-center justify-between border-b border-slate-200 dark:border-slate-800 pb-6">
            <div>
                <h1 class="text-3xl font-bold text-slate-900 dark:text-white">Design System Showcase</h1>
                <p class="text-slate-500 dark:text-slate-400 mt-1">Composants clés pour Data_analysis • Shadcn UI style</p>
            </div>
            <button onclick="toggleDarkMode()" class="bg-white dark:bg-slate-900 border border-slate-200 dark:border-slate-700 px-4 py-2 rounded-lg shadow-sm flex items-center gap-2 text-sm font-medium transition-colors hover:bg-slate-50 dark:hover:bg-slate-800 dark:text-white">
                <span class="dark:hidden text-slate-600">🌙 Passer au Mode Sombre</span>
                <span class="hidden dark:block text-slate-300">☀️ Passer au Mode Clair</span>
            </button>
        </header>
        <!-- Section 1: Buttons & Controls -->
        <section class="space-y-6">
            <h2 class="text-xl font-semibold dark:text-white flex items-center gap-2">
                <span class="w-1.5 h-6 bg-indigo-600 rounded-full"></span>
                Contrôles & Actions
            </h2>
            <div class="grid grid-cols-1 md:grid-cols-2 gap-8">
                <!-- Buttons -->
                <div class="bg-white dark:bg-slate-900 p-6 rounded-xl border border-slate-200 dark:border-slate-800 space-y-4">
                    <p class="text-xs font-bold text-slate-400 uppercase tracking-widest mb-4">Boutons (Buttons)</p>
                    <div class="flex flex-wrap gap-3">
                        <button class="bg-indigo-600 hover:bg-indigo-700 text-white px-4 py-2 rounded-md text-sm font-medium transition-all shadow-sm shadow-indigo-200 dark:shadow-none">Primary Action</button>
                        <button class="bg-white dark:bg-slate-800 border border-slate-200 dark:border-slate-700 text-slate-700 dark:text-slate-200 px-4 py-2 rounded-md text-sm font-medium hover:bg-slate-50 dark:hover:bg-slate-700 transition-colors">Secondary</button>
                        <button class="text-slate-500 dark:text-slate-400 hover:text-slate-900 dark:hover:text-white px-4 py-2 text-sm font-medium">Ghost</button>
                        <button class="bg-rose-50 dark:bg-rose-950/30 text-rose-600 dark:text-rose-400 border border-rose-100 dark:border-rose-900/50 px-4 py-2 rounded-md text-sm font-medium hover:bg-rose-100 dark:hover:bg-rose-900/50 transition-colors">Destructive</button>
                    </div>
                </div>
                <!-- Badges -->
                <div class="bg-white dark:bg-slate-900 p-6 rounded-xl border border-slate-200 dark:border-slate-800 space-y-4">
                    <p class="text-xs font-bold text-slate-400 uppercase tracking-widest mb-4">Statuts & Badges</p>
                    <div class="flex flex-wrap gap-3">
                        <span class="bg-emerald-100 dark:bg-emerald-950/30 text-emerald-700 dark:text-emerald-400 px-2.5 py-0.5 rounded-full text-xs font-semibold border border-emerald-200 dark:border-emerald-900/50">Valid Data</span>
                        <span class="bg-rose-100 dark:bg-rose-950/30 text-rose-700 dark:text-rose-400 px-2.5 py-0.5 rounded-full text-xs font-semibold border border-rose-200 dark:border-rose-900/50">Outlier Detected</span>
                        <span class="bg-indigo-100 dark:bg-indigo-950/30 text-indigo-700 dark:text-indigo-400 px-2.5 py-0.5 rounded-full text-xs font-semibold border border-indigo-200 dark:border-indigo-900/50">Target (Y)</span>
                        <span class="bg-slate-100 dark:bg-slate-800 text-slate-600 dark:text-slate-400 px-2.5 py-0.5 rounded-full text-xs font-semibold">Numeric</span>
                    </div>
                </div>
            </div>
        </section>
        <!-- Section 2: The Smart Grid -->
        <section class="space-y-6">
            <h2 class="text-xl font-semibold dark:text-white flex items-center gap-2">
                <span class="w-1.5 h-6 bg-indigo-600 rounded-full"></span>
                La Smart Grid (TanStack Table Style)
            </h2>
            <div class="bg-white dark:bg-slate-900 rounded-xl border border-slate-200 dark:border-slate-800 overflow-hidden shadow-sm">
                <table class="w-full border-separate border-spacing-0">
                    <thead class="bg-slate-50 dark:bg-slate-800/50">
                        <tr>
                            <th class="border-b border-r border-slate-200 dark:border-slate-700 p-2 text-center text-xs font-mono text-slate-400">#</th>
                            <th class="border-b border-r border-slate-200 dark:border-slate-700 p-3 text-left">
                                <div class="flex flex-col gap-1">
                                    <span class="text-xs font-bold text-slate-900 dark:text-white">Temperature_C</span>
                                    <span class="text-[10px] font-mono text-slate-400 uppercase tracking-tighter">C1 • Numeric</span>
                                </div>
                            </th>
                            <th class="border-b border-r border-slate-200 dark:border-slate-700 p-3 text-left">
                                <div class="flex flex-col gap-1">
                                    <div class="flex items-center gap-2">
                                        <span class="text-xs font-bold text-slate-900 dark:text-white">Pressure_Bar</span>
                                        <span class="w-2 h-2 bg-rose-500 rounded-full animate-pulse"></span>
                                    </div>
                                    <span class="text-[10px] font-mono text-slate-400 uppercase tracking-tighter">C2 • Numeric</span>
                                </div>
                            </th>
                            <th class="border-b border-slate-200 dark:border-slate-700 p-3 text-left bg-indigo-50/30 dark:bg-indigo-900/10">
                                <div class="flex flex-col gap-1">
                                    <span class="text-xs font-bold text-indigo-600 dark:text-indigo-400">Yield_Output</span>
                                    <span class="text-[10px] font-mono text-indigo-400 uppercase tracking-tighter italic">Target Variable</span>
                                </div>
                            </th>
                        </tr>
                    </thead>
                    <tbody class="divide-y divide-slate-100 dark:divide-slate-800">
                        <!-- Normal Row -->
                        <tr class="hover:bg-slate-50 dark:hover:bg-slate-800/50 transition-colors">
                            <td class="p-2 text-center text-xs font-mono text-slate-400 border-r border-slate-100 dark:border-slate-800">1</td>
                            <td class="p-3 grid-cell text-slate-700 dark:text-slate-300 border-r border-slate-100 dark:border-slate-800">24.50</td>
                            <td class="p-3 grid-cell text-slate-700 dark:text-slate-300 border-r border-slate-100 dark:border-slate-800">1.02</td>
                            <td class="p-3 grid-cell font-bold text-slate-900 dark:text-white bg-indigo-50/10 dark:bg-indigo-900/5">98.2</td>
                        </tr>
                        <!-- Outlier Row -->
                        <tr class="bg-rose-50/50 dark:bg-rose-900/10">
                            <td class="p-2 text-center text-xs font-mono text-rose-400 border-r border-rose-100 dark:border-rose-900/20">2</td>
                            <td class="p-3 grid-cell text-slate-700 dark:text-slate-300 border-r border-rose-100 dark:border-rose-900/20">24.52</td>
                            <td class="p-3 grid-cell font-bold text-rose-600 dark:text-rose-400 border-r border-rose-100 dark:border-rose-900/20 shadow-inner">9.99*</td>
                            <td class="p-3 grid-cell font-bold text-slate-900 dark:text-white opacity-40">45.1</td>
                        </tr>
                        <!-- Editing Row -->
                        <tr class="bg-white dark:bg-slate-900">
                            <td class="p-2 text-center text-xs font-mono text-slate-400 border-r border-slate-100 dark:border-slate-800">3</td>
                            <td class="p-3 border-r border-slate-100 dark:border-slate-800">
                                <input type="text" value="24.48" class="w-full bg-indigo-50 dark:bg-indigo-900/30 border border-indigo-500 dark:border-indigo-400 rounded px-2 py-1 text-sm font-mono dark:text-white outline-none">
                            </td>
                            <td class="p-3 grid-cell text-slate-700 dark:text-slate-300 border-r border-slate-100 dark:border-slate-800">1.01</td>
                            <td class="p-3 grid-cell font-bold text-slate-900 dark:text-white">97.9</td>
                        </tr>
                    </tbody>
                </table>
            </div>
        </section>
        <!-- Section 3: Smart Insights Panel -->
        <section class="space-y-6">
            <h2 class="text-xl font-semibold dark:text-white flex items-center gap-2">
                <span class="w-1.5 h-6 bg-indigo-600 rounded-full"></span>
                Insight Panel (Explainable AI)
            </h2>
            <div class="max-w-md bg-white dark:bg-slate-900 rounded-xl border border-slate-200 dark:border-slate-800 shadow-xl overflow-hidden">
                <div class="p-4 bg-indigo-600 flex items-center justify-between text-white">
                    <span class="font-bold text-sm">Smart Insight</span>
                    <svg xmlns="http://www.w3.org/2000/svg" width="16" height="16" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round"><circle cx="12" cy="12" r="10"/><path d="m9 12 2 2 4-4"/></svg>
                </div>
                <div class="p-6 space-y-6">
                    <div>
                        <p class="text-[10px] font-bold text-slate-400 dark:text-slate-500 uppercase tracking-widest mb-2">Observation</p>
                        <p class="text-sm dark:text-slate-300">Column <span class="font-mono text-indigo-600 dark:text-indigo-400 font-bold">Pressure_Bar</span> has 34 outliers.</p>
                    </div>
                    <!-- Simulated Chart -->
                    <div class="h-24 bg-slate-50 dark:bg-slate-800 rounded-lg flex items-end gap-1 p-2 border border-slate-100 dark:border-slate-700">
                        <div class="bg-indigo-400/30 w-full h-[20%] rounded-t"></div>
                        <div class="bg-indigo-400/30 w-full h-[40%] rounded-t"></div>
                        <div class="bg-indigo-400/30 w-full h-[100%] rounded-t"></div>
                        <div class="bg-indigo-400/30 w-full h-[60%] rounded-t"></div>
                        <div class="bg-rose-500 w-full h-[15%] rounded-t border-t-2 border-rose-600 shadow-[0_0_8px_rgba(244,63,94,0.4)]"></div>
                    </div>
                    <div class="space-y-3">
                        <p class="text-xs text-slate-500 italic dark:text-slate-400">Excluding these will increase your model accuracy (R²) by <strong>26%</strong>.</p>
                        <button class="w-full bg-indigo-600 hover:bg-indigo-700 text-white py-2 rounded-lg text-sm font-bold transition-all shadow-lg shadow-indigo-100 dark:shadow-none">Appliquer la Correction</button>
                    </div>
                </div>
            </div>
        </section>
    </div>
    <script>
        function toggleDarkMode() {
            document.documentElement.classList.toggle('dark');
        }
    </script>
 </body>
 </html>
--- a/_bmad-output/planning-artifacts/ux-design-specification.md
+++ b/_bmad-output/planning-artifacts/ux-design-specification.md
@@ -0,0 +1,392 @@
 ---
 stepsCompleted: [1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13]
 inputDocuments: ['_bmad-output/planning-artifacts/prd.md']
 ---
 # UX Design Specification Data_analysis
 **Author:** Sepehr
 **Date:** 2026-01-10
 ---
 <!-- UX design content will be appended sequentially through collaborative workflow steps -->
 ## Executive Summary
 ### Project Vision
 Create a modern, web-based, "No-Code" alternative to Minitab. The goal is to empower domain experts (engineers, analysts) to perform rigorous statistical regressions via a hybrid interface combining the simplicity of Excel with the computational power of Python.
 ### Target Users
 *   **Julien (Analyst/Engineer):** Domain expert user, seeks efficiency and rigor without coding. Primarily uses a desktop computer.
 *   **Marc (Decision Maker):** Result consumer, needs clear, mobile-friendly reports to validate production decisions.
 ### Key Design Challenges
 *   **Grid Performance:** Maintain fluid interactivity with large data volumes (virtualization).
 *   **Statistical Vulgarization:** Make variable selection and outlier detection concepts intuitive through visual design.
 *   **Guided Workflow:** Design a conversion funnel (from raw file to final report) that reduces cognitive load.
 ### Design Opportunities
 *   **Familiar Interface:** Leverage Microsoft Excel design patterns to reduce initial friction.
 *   **"Mobile-First" Reports:** Create a competitive advantage with report exports and views optimized for tablets.
 ## Core User Experience
 ### Defining Experience
 The core of Data_analysis is the **"Smart Grid"**. Unlike a static HTML table, this grid feels alive. It's the command center where data ingestion, cleaning, and exploration happen seamlessly. Users don't "run scripts"; they interact with their data directly, with the system acting as an intelligent co-pilot suggesting corrections and insights.
 ### Platform Strategy
 *   **Desktop (Primary):** Optimized for mouse/keyboard inputs. High density of information. Supports "Power User" shortcuts (Ctrl+Z, Arrows).
 *   **Tablet (Secondary):** Optimized for touch. "Read-only" mode for reports and dashboards. Lower density, larger touch targets.
 ### Effortless Interactions
 *   **Zero-Config Import:** Drag-and-drop Excel ingestion with auto-detection of headers, types, and delimiters. No wizard fatigue.
 *   **One-Click Hygiene:** Automated detection of data anomalies (NaNs, wrong types) with single-click remediation actions ("Fix all", "Drop rows").
 ### Critical Success Moments
 *   **The "Clarity" Moment:** When the "Smart Feature Selection" reduces a chaotic 50-column dataset to the 3-4 variables that actually matter, visualized clearly.
 *   **The "Confidence" Moment:** When the system confirms "No outliers detected" or "Model assumptions met" via clear green indicators before generating the report.
 ### Experience Principles
 1.  **Direct Manipulation:** Don't hide data behind menus. Let users click, edit, and filter right where the data lives.
 2.  **Proactive Intelligence:** Don't wait for the user to find errors. Highlight them immediately and offer solutions.
 3.  **Visual First:** Show the data distribution (mini-histograms) in the headers. Show the outliers on a plot, not just a list of row numbers.
 ## Desired Emotional Response
 ### Primary Emotional Goals
 The primary emotional goal of Data_analysis is to move the user from **Anxiety to Confidence**. Statistics can be intimidating; our interface must act as a reassuring expert co-pilot.
 ### Emotional Journey Mapping
 *   **Discovery:** **Curiosity & Hope.** "Can this really replace my manual Excel cleaning?"
 *   **Data Ingestion:** **Relief.** "It parsed my file instantly without errors."
 *   **Data Cleaning:** **Surprise & Empowerment.** "I didn't know I had outliers, now I see them clearly."
 *   **Analysis/Reporting:** **Confidence & Pride.** "This report looks professional and I understand every part of it."
 ### Micro-Emotions
 *   **Trust vs. Skepticism:** Built through "Explainable AI" tooltips.
 *   **Calm vs. Frustration:** Achieved through smooth animations and non-blocking background tasks.
 *   **Mastery vs. Confusion:** Delivered by guiding the user through a linear logical workflow.
 ### Design Implications
 *   **Confidence** → Use a sober, professional color palette (Blues/Grays). Provide clear "Validation" checkmarks when data is clean.
 *   **Relief** → Automate tedious tasks like type-casting and missing value detection. Use "Undo" to remove the fear of making mistakes.
 *   **Empowerment** → Use natural language labels instead of cryptic statistical abbreviations (e.g., "Predictive Power" instead of "Coefficient of Determination").
 ### Emotional Design Principles
 1.  **Safety Net:** Users should never feel like they can "break" the data. Every action is reversible.
 2.  **No Dead Ends:** If an error occurs (e.g., singular matrix), explain *why* in plain French and how to fix it.
 3.  **Visual Rewards:** Use subtle success animations when a model is successfully trained.
 ## UX Pattern Analysis & Inspiration
 ### Inspiring Products Analysis
 *   **Microsoft Excel:** The standard for grid interaction. Users expect double-click editing, arrow-key navigation, and "fill-down" patterns.
 *   **Airtable:** Revolutionized the data grid with modern UI patterns. We adopt their clean column headers, visual data types (badges, progress bars), and intuitive filtering.
 *   **Linear / Vercel:** The benchmark for high-performance developer tools. We draw inspiration from their minimalist aesthetic, exceptional Dark Mode, and keyboard-first navigation.
 ### Transferable UX Patterns
 *   **Navigation:** **Sidebar-less / Hub & Spoke.** Focus on the data grid as the central workspace with floating or collapsible side panels for analysis tools.
 *   **Interaction:** **"Sheet-to-Report" Pipeline.** A clear horizontal or vertical progression from raw data to a finalized interactive report.
 *   **Visual:** **Statistical Overlays.** Using "Sparklines" (mini-histograms) in column headers to show data distribution at a glance.
 ### Anti-Patterns to Avoid
 *   **The Modal Maze:** Opening a new pop-up window for every statistical setting. We prefer slide-over panels or inline settings to keep the context visible.
 *   **Opaque Processing:** Showing a generic spinner during long calculations. We will use a "Step-by-Step" status bar (e.g., "1. Parsing -> 2. Detecting Outliers -> 3. Selecting Features").
 ### Design Inspiration Strategy
 *   **Adopt:** The "TanStack Table" logic for grid virtualization (Excel speed) combined with Shadcn UI components (Vercel aesthetic).
 *   **Adapt:** Excel's right-click menu to include specific statistical actions like "Exclude from analysis" or "Set as Target (Y)".
 *   **Avoid:** Complex "Dashboard Builders." Users want a generated report, not a canvas they have to design themselves.
 ## Design System Foundation
 ### 1.1 Design System Choice
 The project will use **Shadcn UI** as the primary UI library, built on top of **Tailwind CSS** and **Radix UI**. The core data interaction will be powered by **TanStack Table (headless)** to create a custom, high-performance "Smart Grid."
 ### Rationale for Selection
 *   **Performance:** TanStack Table allows for massive data virtualization (50k+ rows) without the overhead of heavy UI frameworks.
 *   **Aesthetic Consistency:** Shadcn provides the "Vercel-like" minimalist and professional aesthetic defined in our inspiration phase.
 *   **Accessibility:** Leveraging Radix UI primitives ensures that complex components (popovers, dropdowns, dialogs) are fully WCAG compliant.
 *   **Developer Experience:** Direct ownership of component code allows for deep customization of statistical-specific UI elements.
 ### Implementation Approach
 *   **Shell:** Standard Shadcn layout components (Sidebar, TopNav).
 *   **Data Grid:** A custom-built component using TanStack Table's hook logic, styled with Shadcn Table primitives.
 *   **Charts:** Integration of **Recharts** or **Tremor** (which matches Shadcn's style) for statistical visualizations.
 ### Customization Strategy
 *   **Tokens:** Neutral gray base with "Scientific Blue" as the primary action color.
 *   **Typography:** Sans-serif (Geist or Inter) for the UI; Monospace (JetBrains Mono) for data cells and statistical metrics.
 *   **Density:** "High-Density" mode by default for the grid (small cell padding) to maximize data visibility.
 ## 2. Core User Experience
 ### 2.1 Defining Experience
 The defining interaction of Data_analysis is the **"Guided Data Hygiene Loop"**. It transforms the tedious task of cleaning data into a rapid, rewarding conversation with the system. Users don't "edit cells"; they respond to intelligent insights that actively improve their model's quality in real-time.
 ### 2.2 User Mental Model
 *   **Current Model:** "I have to manually hunt for errors row by row in Excel, then delete them and hope I didn't break anything."
 *   **Target Model:** "The system is my Quality Assistant. It points out the issues, I make the executive decision, and I instantly see the result."
 ### 2.3 Success Criteria
 *   **Speed:** Reviewing and fixing 50 outliers should take less than 30 seconds.
 *   **Safety:** Users must feel that "excluding" data is non-destructive (reversible).
 *   **Reward:** Every fix must trigger a positive visual feedback (e.g., model accuracy score pulsing green).
 ### 2.4 Novel UX Patterns
 *   **"Contextual Insight Panel":** Instead of modal popups, a slide-over panel allows users to see the specific rows in question (highlighted in the grid) while reviewing the statistical explanation (boxplot/histogram) side-by-side.
 *   **"Live Impact Preview":** Before confirming an exclusion, hover over the button to see a "Ghost Curve" showing how the regression line *will* change.
 ### 2.5 Experience Mechanics
 1.  **Initiation:** System highlights "dirty" columns with a subtle warning badge in the header.
 2.  **Interaction:** User clicks the header badge. The Insight Panel slides in.
 3.  **Feedback:** The panel shows "34 values are > 3 Sigma". The grid highlights these 34 rows.
 4.  **Action:** User clicks "Exclude All". Rows fade to gray. The Regression R² badge updates from 0.65 to 0.82 with a celebration animation.
 5.  **Completion:** The column header badge turns to a green checkmark.
 ## Visual Design Foundation
 ### Color System
 *   **Neutral:** Slate (50-900) - Technical, cold background for heavy data.
 *   **Primary:** Indigo (600) - For primary actions ("Run Regression").
 *   **Semantic Data Colors:**
    *   **Rose (500):** Outliers/Errors (Soft alert).
    *   **Emerald (500):** Valid Data/Success (Reassurance).
    *   **Amber (500):** Warnings/Missing Values.
 *   **Modes:** Fully supported Dark Mode using Slate-900 backgrounds and Indigo-400 primary accents.
 ### Typography System
 *   **Interface:** `Inter` (or Geist Sans) - Clean, legible at small sizes.
 *   **Data:** `JetBrains Mono` - Mandatory for the grid to ensure tabular alignment of decimals.
 ### Spacing & Layout Foundation
 *   **Grid Density:** Ultra-compact (4px y-padding) to maximize data visibility.
 *   **Panel Density:** Comfortable (16px padding) for reading insights.
 *   **Layout:** Full-width liquid layout. No wasted margins.
 ### Accessibility Considerations
 *   **Contrast:** Ensure data text (Slate-700) on row backgrounds meets AA standards.
 *   **Focus States:** High-visibility focus rings (Indigo-500 ring) for keyboard navigation in the grid.
 ## Design Direction Decision
 ### Design Directions Explored
 Multiple design approaches were evaluated to balance density, readability, and modern aesthetics:
 *   **"Corporate Legacy":** Mimicking Minitab/Excel directly (too cluttered).
 *   **"Creative Canvas":** Like Notion/Miro (too open-ended).
 *   **"Lab & Tech":** A hybrid of Vercel's minimalism and Excel's density.
 ### Chosen Direction
 **"Lab & Tech" with Shadcn UI & TanStack Table**
 *   **Visual Style:** Minimalist, data-first, with a strong Dark Mode.
 *   **Components:** Shadcn UI for the shell, TanStack Table for the grid.
 *   **Palette:** Slate + Indigo + Rose/Emerald semantic indicators.
 ### Design Rationale
 *   **User Fit:** Matches Julien's need for a professional, distraction-free environment.
 *   **Modernity:** Positions the tool as a "Next-Gen" product compared to legacy competitors.
 *   **Scalability:** The component library allows for easy addition of complex statistical widgets later.
 ### Implementation Approach
 *   **CSS Framework:** Tailwind CSS.
 *   **Component Library:** Shadcn UI (Radix based).
 *   **Icons:** Lucide React.
 *   **Charts:** Recharts.
 ## User Journey Flows
 ### Journey 1: Julien - The Guided Hygiene Loop
 This flow details how Julien interacts with the system to clean his data. The focus is on the "Ping-Pong" interaction between the Grid and the Insight Panel.
 ```mermaid
 graph TD
    A[Start: File Uploaded] --> B{System Checks}
    B -->|Clean| C[Grid View: Standard]
    B -->|Issues Found| D[Grid View: Warning Badge on Header]
    D --> E(User Clicks Badge)
    E --> F[Action: Open Insight Panel]
    subgraph Insight Panel Interaction
    F --> G[Display: Issue Description + Chart]
    G --> H[Display: Proposed Fix]
    H --> I{User Decision}
    I -->|Ignore| J[Close Panel & Remove Badge]
    I -->|Apply Fix| K[Action: Update Grid Data]
    end
    K --> L[Feedback: Toast 'Fix Applied']
    L --> M[Update Model Score R²]
    M --> N[End: Ready for Regression]
 ```
 ### Journey 2: Marc - Mobile Decision Making
 Optimized for touch and "Read-Only" consumption. No dense grids, just insights.
 ```mermaid
 graph TD
    A[Start: Click Link in Email] --> B[View: Mobile Dashboard]
    B --> C[Display: Key Metrics Cards]
    B --> D[Display: Regression Chart]
    D --> E(User Taps Data Point)
    E --> F[Action: Show Tooltip Details]
    subgraph Decision
    F --> G{Is Data Valid?}
    G -->|No| H[Action: Add Comment 'Check this']
    G -->|Yes| I[Action: Click 'Approve Analysis']
    end
    H --> J[Notify Julien]
    I --> K[Generate PDF & Archive]
 ```
 ### Journey 3: Error Handling - The "Graceful Fail"
 Ensuring the system handles bad inputs without crashing the Python backend.
 ```mermaid
 graph TD
    A[Start: Upload 50MB .xlsb] --> B{Validation Service}
    B -->|Success| C[Proceed to Parsing]
    B -->|Fail: Macros Detected| D[State: Upload Error]
    D --> E[Display: Error Modal]
    E --> F[Content: 'Security Risk Detected']
    E --> G[Action: 'Sanitize & Retry' Button]
    G --> H{Sanitization}
    H -->|Success| C
    H -->|Fail| I[Display: 'Please upload .xlsx or .csv']
 ```
 ### Flow Optimization Principles
 1.  **Non-Blocking Errors:** Warnings (like outliers) should never block the user from navigating. They are "suggestions", not "gates".
 2.  **Context Preservation:** When opening the Insight Panel, the relevant grid columns must scroll into view automatically.
 3.  **Optimistic UI:** When Julien clicks "Apply Fix", the UI updates instantly (Gray out rows) even while the backend saves the state.
 ## Component Strategy
 ### Design System Components (Shadcn UI)
 We will rely on the standard library for:
 *   **Layout:** `Sheet` (for Insight Panel), `ScrollArea`, `Resizable`.
 *   **Forms:** `Button`, `Input`, `Select`, `Switch`.
 *   **Feedback:** `Toast`, `Progress`, `Skeleton` (for loading states).
 ### Custom Components Specification
 #### 1. `<SmartGrid />`
 The central nervous system of the app.
 *   **Purpose:** Virtualized rendering of massive datasets with Excel-like interactions.
 *   **Core Props:**
    *   `data: any[]` - The raw dataset.
    *   `columns: ColumnDef[]` - Definitions including types and formatters.
    *   `onCellEdit: (rowId, colId, value) => void` - Handler for data mutation.
    *   `highlightedRows: string[]` - IDs of rows to highlight (e.g., outliers).
 *   **Key States:** `Loading`, `Empty`, `Filtering`, `Editing`.
 #### 2. `<InsightPanel />`
 The container for Explainable AI interactions.
 *   **Purpose:** Contextual sidebar for statistical insights and data cleaning.
 *   **Core Props:**
    *   `isOpen: boolean` - Visibility state.
    *   `insight: InsightObject` - Contains `{ type: 'outlier' | 'correlation', description: string, chartData: any }`.
    *   `onApplyFix: () => Promise<void>` - Async handler for the fix action.
 *   **Anatomy:** Header (Title + Close), Body (Text + Recharts Graph), Footer (Action Buttons).
 #### 3. `<ColumnHeader />`
 A rich header component for the grid.
 *   **Purpose:** Show name, type, and distribution summary.
 *   **Core Props:**
    *   `label: string`.
    *   `type: 'numeric' | 'categorical' | 'date'`.
    *   `distribution: number[]` - Data for the sparkline mini-chart.
    *   `hasWarning: boolean` - Triggers the red badge.
 ### Implementation Roadmap
 1.  **Phase 1 (Grid Core):** Implement `SmartGrid` with read-only virtualization (TanStack Table).
 2.  **Phase 2 (Interaction):** Add `ColumnHeader` visualization and `onCellEdit` logic.
 3.  **Phase 3 (Intelligence):** Build the `InsightPanel` and connect it to the outlier detection logic.
 ## UX Consistency Patterns
 ### Button Hierarchy
 *   **Primary (Indigo):** Reserved for "Positive Progression" actions (Run Regression, Save, Export). Only one per view.
 *   **Secondary (White/Outline):** For "Alternative" actions (Cancel, Clear Filter, Close Panel).
 *   **Destructive (Rose):** For "Irreversible" actions (Exclude Data, Delete Project). Always requires a confirmation step if significant.
 *   **Ghost (Transparent):** For tertiary actions inside toolbars (e.g., "Sort Ascending" icon button) to reduce visual noise.
 ### Feedback Patterns
 *   **Toasts (Ephemeral):** Used for success confirmations ("Data saved", "Model updated"). Position: Bottom-Right. Duration: 3s.
 *   **Inline Validation:** Used for data entry errors within the grid (e.g., entering text in a numeric column). Immediate red border + tooltip.
 *   **Global Status:** A persistent "Status Bar" at the top showing the system state (Ready / Processing... / Done).
 ### Grid Interaction Patterns (Excel Compatibility)
 *   **Navigation:** Arrow keys move focus between cells. Tab moves right. Enter moves down.
 *   **Selection:** Click to select cell. Shift+Click to select range. Click row header to select row.
 *   **Editing:** Double-click or `Enter` starts editing. `Esc` cancels. `Enter` saves.
 *   **Context Menu:** Right-click triggers action menu specific to the selected object (Cell vs Row vs Column).
 ### Empty States
 *   **No Data:** Don't show an empty grid. Show a "Drop Zone" with a clear CTA ("Upload Excel File") and sample datasets for exploration.
 *   **No Selection:** When the Insight Panel is open but nothing is selected, show a helper illustration ("Select a column to see stats").
 ## Responsive Design & Accessibility
 ### Responsive Strategy
 *   **Desktop Only:** The application is strictly optimized for high-resolution desktop displays (1366px width minimum). No responsive breakpoints for mobile or tablet will be implemented. 
 *   **Layout Focus:** Use a fixed Sidebar + Liquid Grid layout. The grid will expand to fill all available horizontal space.
 ### Breakpoint Strategy
 *   **Default:** 1440px+ (Optimized).
 *   **Minimum:** 1280px (Functional). Below this, a horizontal scrollbar will appear for the entire app shell to preserve data integrity.
 ### Accessibility Strategy
 *   **Compliance:** WCAG 2.1 Level AA.
 *   **Keyboard First:** Full focus on making the Data Grid and Insight Panel navigable without a mouse.
 *   **Screen Reader support:** Required for statistical summaries and report highlights.
 ### Testing Strategy
 *   **Browsers:** Chrome, Edge, and Firefox (latest 2 versions).
 *   **Devices:** Standard laptops (13" to 16") and external monitors (24"+).
 ### Implementation Guidelines
 *   **Container Query:** Use `@container` for complex widgets (like the Insight Panel) to adapt their layout based on the sidebar's width rather than the screen width.
 *   **Focus Management:** Ensure the focus ring is never hidden and follows a logical order (Sidebar -> Grid -> Insight Panel).
--- a/_bmad-output/planning-artifacts/ux-visual-visualizer.html
+++ b/_bmad-output/planning-artifacts/ux-visual-visualizer.html
@@ -0,0 +1,256 @@
 <!DOCTYPE html>
 <html lang="fr">
 <head>
    <meta charset="UTF-8">
    <meta name="viewport" content="width=device-width, initial-scale=1.0">
    <title>Data_analysis - UX Visual Foundation</title>
    <script src="https://cdn.tailwindcss.com"></script>
    <link rel="preconnect" href="https://fonts.googleapis.com">
    <link rel="preconnect" href="https://fonts.gstatic.com" crossorigin>
    <link href="https://fonts.googleapis.com/css2?family=Inter:wght@400;500;600;700&family=JetBrains+Mono:wght@400;500&display=swap" rel="stylesheet">
    <script>
        tailwind.config = {
            theme: {
                extend: {
                    fontFamily: {
                        sans: ['Inter', 'sans-serif'],
                        mono: ['JetBrains Mono', 'monospace'],
                    },
                    colors: {
                        primary: '#4f46e5', // Indigo 600
                        success: '#10b981', // Emerald 500
                        danger: '#f43f5e',  // Rose 500
                        surface: '#f8fafc', // Slate 50
                        border: '#e2e8f0',  // Slate 200
                    }
                }
            }
        }
    </script>
    <style>
        .grid-cell { font-family: 'JetBrains Mono', monospace; font-size: 13px; }
        .dense-padding { padding: 4px 8px; }
        .shimmer {
            background: linear-gradient(90deg, #f1f5f9 25%, #e2e8f0 50%, #f1f5f9 75%);
            background-size: 200% 100%;
            animation: shimmer 1.5s infinite;
        }
        @keyframes shimmer {
            0% { background-position: 200% 0; }
            100% { background-position: -200% 0; }
        }
    </style>
 </head>
 <body class="bg-slate-50 font-sans text-slate-900 flex h-screen overflow-hidden">
    <!-- Sidebar Simulation -->
    <aside class="w-64 border-r border-slate-200 bg-white flex flex-col">
        <div class="p-6 border-b border-slate-200">
            <h1 class="text-xl font-bold text-indigo-600 flex items-center gap-2">
                <svg xmlns="http://www.w3.org/2000/svg" width="24" height="24" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-bar-chart-big"><path d="M3 3v18h18"/><rect width="4" height="7" x="7" y="10" rx="1"/><rect width="4" height="12" x="15" y="5" rx="1"/></svg>
                Data_analysis
            </h1>
        </div>
        <nav class="p-4 flex-1 space-y-1">
            <a href="#" class="flex items-center gap-3 px-3 py-2 text-sm font-medium text-slate-900 bg-slate-100 rounded-md">
                <svg xmlns="http://www.w3.org/2000/svg" width="18" height="18" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round"><rect width="18" height="18" x="3" y="3" rx="2"/><path d="M3 9h18"/><path d="M9 3v18"/></svg>
                Workspace (Grid)
            </a>
            <a href="#" class="flex items-center gap-3 px-3 py-2 text-sm font-medium text-slate-600 hover:bg-slate-50 rounded-md transition-colors">
                <svg xmlns="http://www.w3.org/2000/svg" width="18" height="18" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round"><path d="M3 3v18h18"/><path d="m19 9-5 5-4-4-3 3"/></svg>
                Regressions
            </a>
            <a href="#" class="flex items-center gap-3 px-3 py-2 text-sm font-medium text-slate-600 hover:bg-slate-50 rounded-md transition-colors">
                <svg xmlns="http://www.w3.org/2000/svg" width="18" height="18" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round"><path d="M14.5 2H6a2 2 0 0 0-2 2v16a2 2 0 0 0 2 2h12a2 2 0 0 0 2-2V7.5L14.5 2z"/><polyline points="14 2 14 8 20 8"/></svg>
                Reports
            </a>
        </nav>
        <div class="p-4 border-t border-slate-200">
            <div class="bg-slate-100 p-3 rounded-lg text-xs space-y-2">
                <p class="font-semibold text-slate-500 uppercase tracking-wider">System Status</p>
                <div class="flex items-center gap-2">
                    <span class="w-2 h-2 rounded-full bg-success"></span>
                    <span>Python Backend: IDLE</span>
                </div>
            </div>
        </div>
    </aside>
    <!-- Main Workspace -->
    <main class="flex-1 flex flex-col overflow-hidden bg-white">
        <!-- Top Toolbar -->
        <header class="h-14 border-b border-slate-200 flex items-center justify-between px-6">
            <div class="flex items-center gap-4">
                <span class="text-sm font-semibold text-slate-500">Project:</span>
                <span class="text-sm font-medium">Production_Quality_Jan2026.xlsx</span>
                <span class="px-2 py-0.5 bg-indigo-50 text-indigo-700 text-[10px] font-bold rounded uppercase tracking-wider">Stateless Session</span>
            </div>
            <div class="flex items-center gap-3">
                <button class="flex items-center gap-2 px-3 py-1.5 text-xs font-medium text-slate-600 hover:bg-slate-50 border border-slate-200 rounded transition-colors">
                    <svg xmlns="http://www.w3.org/2000/svg" width="14" height="14" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round"><path d="M21 15v4a2 2 0 0 1-2 2H5a2 2 0 0 1-2-2v-4"/><polyline points="7 10 12 15 17 10"/><line x1="12" x2="12" y1="15" y2="3"/></svg>
                    Download PDF
                </button>
                <button class="bg-indigo-600 text-white px-4 py-1.5 text-xs font-semibold rounded hover:bg-indigo-700 transition-shadow shadow-sm shadow-indigo-200">
                    Run Regression
                </button>
            </div>
        </header>
        <!-- The "Smart Grid" -->
        <div class="flex-1 overflow-auto bg-slate-50 relative">
            <table class="w-full border-separate border-spacing-0 bg-white">
                <thead class="sticky top-0 bg-white z-10">
                    <tr>
                        <th class="border-b border-r border-slate-200 dense-padding bg-slate-50 w-10"></th>
                        <th class="border-b border-r border-slate-200 dense-padding text-left group">
                            <div class="flex flex-col gap-1">
                                <div class="flex items-center justify-between text-[11px] text-slate-500">
                                    <span class="font-mono uppercase">C1</span>
                                    <span class="bg-blue-50 text-blue-600 px-1 rounded">Num</span>
                                </div>
                                <span class="text-sm">Temperature_C</span>
                                <div class="h-4 flex items-end gap-0.5 mt-1">
                                    <div class="bg-indigo-200 w-full h-[20%]"></div>
                                    <div class="bg-indigo-200 w-full h-[40%]"></div>
                                    <div class="bg-indigo-400 w-full h-[90%]"></div>
                                    <div class="bg-indigo-400 w-full h-[100%]"></div>
                                    <div class="bg-indigo-200 w-full h-[30%]"></div>
                                </div>
                            </div>
                        </th>
                        <th class="border-b border-r border-slate-200 dense-padding text-left group relative">
                            <div class="flex flex-col gap-1">
                                <div class="flex items-center justify-between text-[11px] text-slate-500">
                                    <span class="font-mono uppercase">C2</span>
                                    <span class="bg-blue-50 text-blue-600 px-1 rounded">Num</span>
                                </div>
                                <span class="text-sm">Pressure_Bar</span>
                                <div class="h-4 flex items-end gap-0.5 mt-1">
                                    <div class="bg-indigo-200 w-full h-[60%]"></div>
                                    <div class="bg-indigo-400 w-full h-[100%]"></div>
                                    <div class="bg-indigo-200 w-full h-[40%]"></div>
                                    <div class="bg-rose-400 w-full h-[10%]"></div> <!-- Outlier peak -->
                                </div>
                            </div>
                            <!-- Warning Badge -->
                            <div class="absolute -top-1 -right-1 bg-rose-500 text-white w-4 h-4 rounded-full flex items-center justify-center text-[10px] font-bold shadow-sm cursor-pointer hover:scale-110 transition-transform">!</div>
                        </th>
                        <th class="border-b border-r border-slate-200 dense-padding text-left">
                            <div class="flex flex-col gap-1">
                                <div class="flex items-center justify-between text-[11px] text-slate-500">
                                    <span class="font-mono uppercase">C3</span>
                                    <span class="bg-amber-50 text-amber-600 px-1 rounded">Cat</span>
                                </div>
                                <span class="text-sm">Machine_ID</span>
                                <div class="flex gap-1 mt-1">
                                    <span class="w-full h-1 bg-slate-200 rounded"></span>
                                    <span class="w-full h-1 bg-slate-200 rounded"></span>
                                    <span class="w-full h-1 bg-slate-200 rounded"></span>
                                </div>
                            </div>
                        </th>
                        <th class="border-b border-slate-200 dense-padding text-left bg-indigo-50/50">
                            <div class="flex flex-col gap-1">
                                <div class="flex items-center justify-between text-[11px] text-indigo-500">
                                    <span class="font-mono uppercase">Target</span>
                                    <span class="bg-indigo-100 text-indigo-600 px-1 rounded">Y</span>
                                </div>
                                <span class="text-sm font-bold text-indigo-900">Yield_Output</span>
                                <div class="h-4 flex items-end gap-0.5 mt-1">
                                    <div class="bg-indigo-400 w-full h-[100%]"></div>
                                    <div class="bg-indigo-300 w-full h-[80%]"></div>
                                    <div class="bg-indigo-200 w-full h-[50%]"></div>
                                </div>
                            </div>
                        </th>
                    </tr>
                </thead>
                <tbody>
                    <!-- Row 1 -->
                    <tr class="hover:bg-slate-50 transition-colors cursor-pointer group">
                        <td class="border-b border-r border-slate-100 dense-padding text-center text-[10px] text-slate-400 font-mono">1</td>
                        <td class="border-b border-r border-slate-100 dense-padding grid-cell">24.50</td>
                        <td class="border-b border-r border-slate-100 dense-padding grid-cell">1.02</td>
                        <td class="border-b border-r border-slate-100 dense-padding text-[12px]">
                            <span class="bg-slate-100 px-2 py-0.5 rounded text-slate-600">MAC-01</span>
                        </td>
                        <td class="border-b border-slate-100 dense-padding grid-cell font-bold bg-indigo-50/20">98.2</td>
                    </tr>
                    <!-- Row 2: OUTLIER -->
                    <tr class="bg-rose-50 transition-colors cursor-pointer group">
                        <td class="border-b border-r border-rose-100 dense-padding text-center text-[10px] text-rose-400 font-mono">2</td>
                        <td class="border-b border-r border-rose-100 dense-padding grid-cell">24.52</td>
                        <td class="border-b border-r border-rose-100 dense-padding grid-cell font-bold text-rose-600 bg-rose-100/50">9.99*</td>
                        <td class="border-b border-r border-rose-100 dense-padding text-[12px]">
                            <span class="bg-slate-100 px-2 py-0.5 rounded text-slate-600">MAC-01</span>
                        </td>
                        <td class="border-b border-rose-100 dense-padding grid-cell font-bold bg-indigo-50/20 opacity-50">45.1</td>
                    </tr>
                    <!-- Row 3 -->
                    <tr class="hover:bg-slate-50 transition-colors cursor-pointer group">
                        <td class="border-b border-r border-slate-100 dense-padding text-center text-[10px] text-slate-400 font-mono">3</td>
                        <td class="border-b border-r border-slate-100 dense-padding grid-cell">24.48</td>
                        <td class="border-b border-r border-slate-100 dense-padding grid-cell">1.01</td>
                        <td class="border-b border-r border-slate-100 dense-padding text-[12px]">
                            <span class="bg-slate-100 px-2 py-0.5 rounded text-slate-600">MAC-02</span>
                        </td>
                        <td class="border-b border-slate-100 dense-padding grid-cell font-bold bg-indigo-50/20">97.9</td>
                    </tr>
                    <!-- Row 4: LOADING Simulation -->
                    <tr class="">
                        <td class="border-b border-r border-slate-100 dense-padding text-center text-[10px] text-slate-400 font-mono">4</td>
                        <td class="border-b border-r border-slate-100 p-2"><div class="h-4 shimmer rounded w-16"></div></td>
                        <td class="border-b border-r border-slate-100 p-2"><div class="h-4 shimmer rounded w-16"></div></td>
                        <td class="border-b border-r border-slate-100 p-2"><div class="h-4 shimmer rounded w-20"></div></td>
                        <td class="border-b border-slate-100 p-2"><div class="h-4 shimmer rounded w-12"></div></td>
                    </tr>
                </tbody>
            </table>
        </div>
        <!-- Floating Insight Panel (Simulation) -->
        <div class="absolute right-6 top-20 w-80 bg-white border border-slate-200 rounded-xl shadow-2xl shadow-indigo-100 overflow-hidden flex flex-col animate-slide-in">
            <div class="p-4 bg-indigo-600 text-white flex items-center justify-between">
                <h3 class="font-bold flex items-center gap-2">
                    <svg xmlns="http://www.w3.org/2000/svg" width="18" height="18" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round"><circle cx="12" cy="12" r="10"/><line x1="12" x2="12" y1="8" y2="12"/><line x1="12" x2="12.01" y1="16" y2="16"/></svg>
                    Smart Insights
                </h3>
                <button class="opacity-70 hover:opacity-100">&times;</button>
            </div>
            <div class="p-5 space-y-4">
                <div class="space-y-1">
                    <p class="text-[10px] uppercase font-bold text-slate-400 tracking-tight">Detected Anomalies</p>
                    <p class="text-sm text-slate-700">Found <span class="font-bold text-rose-600">34 outliers</span> in column <span class="font-mono bg-slate-100 px-1 rounded text-xs">Pressure_Bar</span>.</p>
                </div>
                <div class="bg-slate-50 border border-slate-100 rounded-lg p-3 space-y-2">
                    <p class="text-xs text-slate-500 font-medium italic">Why? Values are > 3.5 standard deviations from the mean (9.99 bar vs avg 1.05 bar).</p>
                    <div class="flex items-center gap-2">
                        <button class="bg-rose-100 text-rose-700 px-3 py-1.5 rounded text-[11px] font-bold hover:bg-rose-200 transition-colors flex-1">Exclude Data</button>
                        <button class="bg-white border border-slate-200 text-slate-600 px-3 py-1.5 rounded text-[11px] font-bold hover:bg-slate-50 transition-colors">Ignore</button>
                    </div>
                </div>
                <div class="pt-2 border-t border-slate-100">
                    <p class="text-[10px] uppercase font-bold text-slate-400 tracking-tight mb-2">Impact on Model</p>
                    <div class="flex items-center justify-between mb-1">
                        <span class="text-xs text-slate-600 font-medium">R-Squared (Current)</span>
                        <span class="text-xs font-mono font-bold">0.65</span>
                    </div>
                    <div class="flex items-center justify-between">
                        <span class="text-xs text-slate-600 font-medium">R-Squared (Post-fix)</span>
                        <span class="text-xs font-mono font-bold text-success">0.82 (+26%)</span>
                    </div>
                </div>
            </div>
        </div>
        <!-- Notification Toast -->
        <div class="absolute bottom-6 left-1/2 -translate-x-1/2 bg-slate-900 text-white px-6 py-3 rounded-full shadow-lg flex items-center gap-3 text-sm animate-bounce">
            <svg xmlns="http://www.w3.org/2000/svg" width="18" height="18" viewBox="0 0 24 24" fill="none" stroke="success" stroke-width="3"><polyline points="20 6 9 17 4 12"/></svg>
            Dataset "Production_Jan" loaded with 52,430 rows successfully.
        </div>
    </main>
 </body>
 </html>
--- a/_bmad-output/project-context.md
+++ b/_bmad-output/project-context.md
@@ -0,0 +1,61 @@
 ---
 project_name: 'Data_analysis'
 user_name: 'Sepehr'
 date: '2026-01-10'
 sections_completed: ['technology_stack', 'language_rules', 'framework_rules', 'testing_rules', 'quality_rules', 'workflow_rules', 'anti_patterns']
 status: 'complete'
 rule_count: 18
 optimized_for_llm: true
 ---
 # Project Context for AI Agents
 _This file contains critical rules and patterns that AI agents must follow when implementing code in this project. Focus on unobvious details that agents might otherwise miss._
 ---
 ## Technology Stack & Versions
 - **Backend:** Python 3.12, FastAPI, Pydantic v2, Pandas, Scikit-learn, Statsmodels, PyArrow v17.0+ (Managed by **uv**)
 - **Frontend:** Next.js 16 (Standalone mode), TypeScript, Tailwind CSS, Shadcn UI, TanStack Table, Recharts, Zustand v5, TanStack Query v5, Apache Arrow v17+
 - **DevOps:** Docker, Docker Compose, multi-stage builds (distroless/alpine)
 ## Critical Implementation Rules
 ### Language & Framework Patterns
 - **Backend (Python):** PEP 8 (snake_case). Pydantic v2 for schema validation. Fast API async def for I/O bound routes.
 - **Frontend (TSX):** Shadcn UI + Tailwind. Feature-based organization in `src/features/`. 
 - **Performance:** Use `apache-arrow` for data-heavy components. Virtualize all grids with 500+ rows.
 ### Data & State Architecture
 - **API Convention:** `snake_case` JSON keys to maintain consistency with Pandas DataFrame columns.
 - **Serialization:** `pyarrow.ipc` for binary streams to ensure zero-copy data transfer between services.
 - **State Management:** Zustand for localized UI/Grid state; TanStack Query for remote server state.
 ### Testing & Quality
 - **Location:** Centralized `/tests` directory at each service root (`backend/tests/`, `frontend/tests/`).
 - **Standard:** Use `pytest` for Python and `vitest` for TypeScript.
 - **Documentation:** Every exported function must have Docstrings/JSDoc explaining parameters and return types.
 ### Critical Anti-Patterns (DO NOT)
 - ❌ **DO NOT** use standard JSON for transferring datasets > 5,000 rows (use Apache Arrow).
 - ❌ **DO NOT** use deep React Context for high-frequency state updates (use Zustand).
 - ❌ **DO NOT** implement "Black Box" algorithms; all data exclusions must be logged and visualized in the `InsightPanel`.
 - ❌ **DO NOT** perform heavy blocking computations on the main FastAPI process; use background tasks for jobs expected to take > 5 seconds.
 ---
 ## Usage Guidelines
 **For AI Agents:**
 - Read this file before implementing any code.
 - Follow ALL rules exactly as documented.
 - When in doubt, prefer the more restrictive option.
 - Update this file if new patterns emerge.
 **For Humans:**
 - Keep this file lean and focused on agent needs.
 - Update when technology stack changes.
 - Review quarterly for outdated rules.
 Last Updated: 2026-01-10
--- a/_bmad/_config/agent-manifest.csv
+++ b/_bmad/_config/agent-manifest.csv
@@ -0,0 +1,11 @@
 name,displayName,title,icon,role,identity,communicationStyle,principles,module,path
 "bmad-master","BMad Master","BMad Master Executor, Knowledge Custodian, and Workflow Orchestrator","🧙","Master Task Executor + BMad Expert + Guiding Facilitator Orchestrator","Master-level expert in the BMAD Core Platform and all loaded modules with comprehensive knowledge of all resources, tasks, and workflows. Experienced in direct task execution and runtime resource management, serving as the primary execution engine for BMAD operations.","Direct and comprehensive, refers to himself in the 3rd person. Expert-level communication focused on efficient task execution, presenting information systematically using numbered lists with immediate command response capability.","- &quot;Load resources at runtime never pre-load, and always present numbered lists for choices.&quot;","core","_bmad/core/agents/bmad-master.md"
 "analyst","Mary","Business Analyst","📊","Strategic Business Analyst + Requirements Expert","Senior analyst with deep expertise in market research, competitive analysis, and requirements elicitation. Specializes in translating vague needs into actionable specs.","Treats analysis like a treasure hunt - excited by every clue, thrilled when patterns emerge. Asks questions that spark &apos;aha!&apos; moments while structuring insights with precision.","- Every business challenge has root causes waiting to be discovered. Ground findings in verifiable evidence. - Articulate requirements with absolute precision. Ensure all stakeholder voices heard. - Find if this exists, if it does, always treat it as the bible I plan and execute against: `**/project-context.md`","bmm","_bmad/bmm/agents/analyst.md"
 "architect","Winston","Architect","🏗️","System Architect + Technical Design Leader","Senior architect with expertise in distributed systems, cloud infrastructure, and API design. Specializes in scalable patterns and technology selection.","Speaks in calm, pragmatic tones, balancing &apos;what could be&apos; with &apos;what should be.&apos; Champions boring technology that actually works.","- User journeys drive technical decisions. Embrace boring technology for stability. - Design simple solutions that scale when needed. Developer productivity is architecture. Connect every decision to business value and user impact. - Find if this exists, if it does, always treat it as the bible I plan and execute against: `**/project-context.md`","bmm","_bmad/bmm/agents/architect.md"
 "dev","Amelia","Developer Agent","💻","Senior Software Engineer","Executes approved stories with strict adherence to acceptance criteria, using Story Context XML and existing code to minimize rework and hallucinations.","Ultra-succinct. Speaks in file paths and AC IDs - every statement citable. No fluff, all precision.","- The Story File is the single source of truth - tasks/subtasks sequence is authoritative over any model priors - Follow red-green-refactor cycle: write failing test, make it pass, improve code while keeping tests green - Never implement anything not mapped to a specific task/subtask in the story file - All existing tests must pass 100% before story is ready for review - Every task/subtask must be covered by comprehensive unit tests before marking complete - Project context provides coding standards but never overrides story requirements - Find if this exists, if it does, always treat it as the bible I plan and execute against: `**/project-context.md`","bmm","_bmad/bmm/agents/dev.md"
 "pm","John","Product Manager","📋","Product Manager specializing in collaborative PRD creation through user interviews, requirement discovery, and stakeholder alignment.","Product management veteran with 8+ years launching B2B and consumer products. Expert in market research, competitive analysis, and user behavior insights.","Asks &apos;WHY?&apos; relentlessly like a detective on a case. Direct and data-sharp, cuts through fluff to what actually matters.","- Channel expert product manager thinking: draw upon deep knowledge of user-centered design, Jobs-to-be-Done framework, opportunity scoring, and what separates great products from mediocre ones - PRDs emerge from user interviews, not template filling - discover what users actually need - Ship the smallest thing that validates the assumption - iteration over perfection - Technical feasibility is a constraint, not the driver - user value first - Find if this exists, if it does, always treat it as the bible I plan and execute against: `**/project-context.md`","bmm","_bmad/bmm/agents/pm.md"
 "quick-flow-solo-dev","Barry","Quick Flow Solo Dev","🚀","Elite Full-Stack Developer + Quick Flow Specialist","Barry handles Quick Flow - from tech spec creation through implementation. Minimum ceremony, lean artifacts, ruthless efficiency.","Direct, confident, and implementation-focused. Uses tech slang (e.g., refactor, patch, extract, spike) and gets straight to the point. No fluff, just results. Stays focused on the task at hand.","- Planning and execution are two sides of the same coin. - Specs are for building, not bureaucracy. Code that ships is better than perfect code that doesn&apos;t. - If `**/project-context.md` exists, follow it. If absent, proceed without.","bmm","_bmad/bmm/agents/quick-flow-solo-dev.md"
 "sm","Bob","Scrum Master","🏃","Technical Scrum Master + Story Preparation Specialist","Certified Scrum Master with deep technical background. Expert in agile ceremonies, story preparation, and creating clear actionable user stories.","Crisp and checklist-driven. Every word has a purpose, every requirement crystal clear. Zero tolerance for ambiguity.","- Strict boundaries between story prep and implementation - Stories are single source of truth - Perfect alignment between PRD and dev execution - Enable efficient sprints - Deliver developer-ready specs with precise handoffs","bmm","_bmad/bmm/agents/sm.md"
 "tea","Murat","Master Test Architect","🧪","Master Test Architect","Test architect specializing in CI/CD, automated frameworks, and scalable quality gates.","Blends data with gut instinct. &apos;Strong opinions, weakly held&apos; is their mantra. Speaks in risk calculations and impact assessments.","- Risk-based testing - depth scales with impact - Quality gates backed by data - Tests mirror usage patterns - Flakiness is critical technical debt - Tests first AI implements suite validates - Calculate risk vs value for every testing decision","bmm","_bmad/bmm/agents/tea.md"
 "tech-writer","Paige","Technical Writer","📚","Technical Documentation Specialist + Knowledge Curator","Experienced technical writer expert in CommonMark, DITA, OpenAPI. Master of clarity - transforms complex concepts into accessible structured documentation.","Patient educator who explains like teaching a friend. Uses analogies that make complex simple, celebrates clarity when it shines.","- Documentation is teaching. Every doc helps someone accomplish a task. Clarity above all. - Docs are living artifacts that evolve with code. Know when to simplify vs when to be detailed.","bmm","_bmad/bmm/agents/tech-writer.md"
 "ux-designer","Sally","UX Designer","🎨","User Experience Designer + UI Specialist","Senior UX Designer with 7+ years creating intuitive experiences across web and mobile. Expert in user research, interaction design, AI-assisted tools.","Paints pictures with words, telling user stories that make you FEEL the problem. Empathetic advocate with creative storytelling flair.","- Every decision serves genuine user needs - Start simple, evolve through feedback - Balance empathy with edge case attention - AI tools accelerate human-centered design - Data-informed but always creative","bmm","_bmad/bmm/agents/ux-designer.md"
--- a/_bmad/_config/agents/bmm-analyst.customize.yaml
+++ b/_bmad/_config/agents/bmm-analyst.customize.yaml
@@ -0,0 +1,41 @@
 # Agent Customization
 # Customize any section below - all are optional
 # Override agent name
 agent:
  metadata:
    name: ""
 # Replace entire persona (not merged)
 persona:
  role: ""
  identity: ""
  communication_style: ""
  principles: []
 # Add custom critical actions (appended after standard config loading)
 critical_actions: []
 # Add persistent memories for the agent
 memories: []
 # Example:
 # memories:
 #   - "User prefers detailed technical explanations"
 #   - "Current project uses React and TypeScript"
 # Add custom menu items (appended to base menu)
 # Don't include * prefix or help/exit - auto-injected
 menu: []
 # Example:
 # menu:
 #   - trigger: my-workflow
 #     workflow: "{project-root}/custom/my.yaml"
 #     description: My custom workflow
 # Add custom prompts (for action="#id" handlers)
 prompts: []
 # Example:
 # prompts:
 # - id: my-prompt
 #   content: |
 #     Prompt instructions here
--- a/_bmad/_config/agents/bmm-architect.customize.yaml
+++ b/_bmad/_config/agents/bmm-architect.customize.yaml
@@ -0,0 +1,41 @@
 # Agent Customization
 # Customize any section below - all are optional
 # Override agent name
 agent:
  metadata:
    name: ""
 # Replace entire persona (not merged)
 persona:
  role: ""
  identity: ""
  communication_style: ""
  principles: []
 # Add custom critical actions (appended after standard config loading)
 critical_actions: []
 # Add persistent memories for the agent
 memories: []
 # Example:
 # memories:
 #   - "User prefers detailed technical explanations"
 #   - "Current project uses React and TypeScript"
 # Add custom menu items (appended to base menu)
 # Don't include * prefix or help/exit - auto-injected
 menu: []
 # Example:
 # menu:
 #   - trigger: my-workflow
 #     workflow: "{project-root}/custom/my.yaml"
 #     description: My custom workflow
 # Add custom prompts (for action="#id" handlers)
 prompts: []
 # Example:
 # prompts:
 # - id: my-prompt
 #   content: |
 #     Prompt instructions here
--- a/_bmad/_config/agents/bmm-dev.customize.yaml
+++ b/_bmad/_config/agents/bmm-dev.customize.yaml
@@ -0,0 +1,41 @@
 # Agent Customization
 # Customize any section below - all are optional
 # Override agent name
 agent:
  metadata:
    name: ""
 # Replace entire persona (not merged)
 persona:
  role: ""
  identity: ""
  communication_style: ""
  principles: []
 # Add custom critical actions (appended after standard config loading)
 critical_actions: []
 # Add persistent memories for the agent
 memories: []
 # Example:
 # memories:
 #   - "User prefers detailed technical explanations"
 #   - "Current project uses React and TypeScript"
 # Add custom menu items (appended to base menu)
 # Don't include * prefix or help/exit - auto-injected
 menu: []
 # Example:
 # menu:
 #   - trigger: my-workflow
 #     workflow: "{project-root}/custom/my.yaml"
 #     description: My custom workflow
 # Add custom prompts (for action="#id" handlers)
 prompts: []
 # Example:
 # prompts:
 # - id: my-prompt
 #   content: |
 #     Prompt instructions here
--- a/_bmad/_config/agents/bmm-pm.customize.yaml
+++ b/_bmad/_config/agents/bmm-pm.customize.yaml
@@ -0,0 +1,41 @@
 # Agent Customization
 # Customize any section below - all are optional
 # Override agent name
 agent:
  metadata:
    name: ""
 # Replace entire persona (not merged)
 persona:
  role: ""
  identity: ""
  communication_style: ""
  principles: []
 # Add custom critical actions (appended after standard config loading)
 critical_actions: []
 # Add persistent memories for the agent
 memories: []
 # Example:
 # memories:
 #   - "User prefers detailed technical explanations"
 #   - "Current project uses React and TypeScript"
 # Add custom menu items (appended to base menu)
 # Don't include * prefix or help/exit - auto-injected
 menu: []
 # Example:
 # menu:
 #   - trigger: my-workflow
 #     workflow: "{project-root}/custom/my.yaml"
 #     description: My custom workflow
 # Add custom prompts (for action="#id" handlers)
 prompts: []
 # Example:
 # prompts:
 # - id: my-prompt
 #   content: |
 #     Prompt instructions here
--- a/_bmad/_config/agents/bmm-quick-flow-solo-dev.customize.yaml
+++ b/_bmad/_config/agents/bmm-quick-flow-solo-dev.customize.yaml
@@ -0,0 +1,41 @@
 # Agent Customization
 # Customize any section below - all are optional
 # Override agent name
 agent:
  metadata:
    name: ""
 # Replace entire persona (not merged)
 persona:
  role: ""
  identity: ""
  communication_style: ""
  principles: []
 # Add custom critical actions (appended after standard config loading)
 critical_actions: []
 # Add persistent memories for the agent
 memories: []
 # Example:
 # memories:
 #   - "User prefers detailed technical explanations"
 #   - "Current project uses React and TypeScript"
 # Add custom menu items (appended to base menu)
 # Don't include * prefix or help/exit - auto-injected
 menu: []
 # Example:
 # menu:
 #   - trigger: my-workflow
 #     workflow: "{project-root}/custom/my.yaml"
 #     description: My custom workflow
 # Add custom prompts (for action="#id" handlers)
 prompts: []
 # Example:
 # prompts:
 # - id: my-prompt
 #   content: |
 #     Prompt instructions here
--- a/_bmad/_config/agents/bmm-sm.customize.yaml
+++ b/_bmad/_config/agents/bmm-sm.customize.yaml
@@ -0,0 +1,41 @@
 # Agent Customization
 # Customize any section below - all are optional
 # Override agent name
 agent:
  metadata:
    name: ""
 # Replace entire persona (not merged)
 persona:
  role: ""
  identity: ""
  communication_style: ""
  principles: []
 # Add custom critical actions (appended after standard config loading)
 critical_actions: []
 # Add persistent memories for the agent
 memories: []
 # Example:
 # memories:
 #   - "User prefers detailed technical explanations"
 #   - "Current project uses React and TypeScript"
 # Add custom menu items (appended to base menu)
 # Don't include * prefix or help/exit - auto-injected
 menu: []
 # Example:
 # menu:
 #   - trigger: my-workflow
 #     workflow: "{project-root}/custom/my.yaml"
 #     description: My custom workflow
 # Add custom prompts (for action="#id" handlers)
 prompts: []
 # Example:
 # prompts:
 # - id: my-prompt
 #   content: |
 #     Prompt instructions here
--- a/_bmad/_config/agents/bmm-tea.customize.yaml
+++ b/_bmad/_config/agents/bmm-tea.customize.yaml
@@ -0,0 +1,41 @@
 # Agent Customization
 # Customize any section below - all are optional
 # Override agent name
 agent:
  metadata:
    name: ""
 # Replace entire persona (not merged)
 persona:
  role: ""
  identity: ""
  communication_style: ""
  principles: []
 # Add custom critical actions (appended after standard config loading)
 critical_actions: []
 # Add persistent memories for the agent
 memories: []
 # Example:
 # memories:
 #   - "User prefers detailed technical explanations"
 #   - "Current project uses React and TypeScript"
 # Add custom menu items (appended to base menu)
 # Don't include * prefix or help/exit - auto-injected
 menu: []
 # Example:
 # menu:
 #   - trigger: my-workflow
 #     workflow: "{project-root}/custom/my.yaml"
 #     description: My custom workflow
 # Add custom prompts (for action="#id" handlers)
 prompts: []
 # Example:
 # prompts:
 # - id: my-prompt
 #   content: |
 #     Prompt instructions here
--- a/_bmad/_config/agents/bmm-tech-writer.customize.yaml
+++ b/_bmad/_config/agents/bmm-tech-writer.customize.yaml
@@ -0,0 +1,41 @@
 # Agent Customization
 # Customize any section below - all are optional
 # Override agent name
 agent:
  metadata:
    name: ""
 # Replace entire persona (not merged)
 persona:
  role: ""
  identity: ""
  communication_style: ""
  principles: []
 # Add custom critical actions (appended after standard config loading)
 critical_actions: []
 # Add persistent memories for the agent
 memories: []
 # Example:
 # memories:
 #   - "User prefers detailed technical explanations"
 #   - "Current project uses React and TypeScript"
 # Add custom menu items (appended to base menu)
 # Don't include * prefix or help/exit - auto-injected
 menu: []
 # Example:
 # menu:
 #   - trigger: my-workflow
 #     workflow: "{project-root}/custom/my.yaml"
 #     description: My custom workflow
 # Add custom prompts (for action="#id" handlers)
 prompts: []
 # Example:
 # prompts:
 # - id: my-prompt
 #   content: |
 #     Prompt instructions here
--- a/_bmad/_config/agents/bmm-ux-designer.customize.yaml
+++ b/_bmad/_config/agents/bmm-ux-designer.customize.yaml
@@ -0,0 +1,41 @@
 # Agent Customization
 # Customize any section below - all are optional
 # Override agent name
 agent:
  metadata:
    name: ""
 # Replace entire persona (not merged)
 persona:
  role: ""
  identity: ""
  communication_style: ""
  principles: []
 # Add custom critical actions (appended after standard config loading)
 critical_actions: []
 # Add persistent memories for the agent
 memories: []
 # Example:
 # memories:
 #   - "User prefers detailed technical explanations"
 #   - "Current project uses React and TypeScript"
 # Add custom menu items (appended to base menu)
 # Don't include * prefix or help/exit - auto-injected
 menu: []
 # Example:
 # menu:
 #   - trigger: my-workflow
 #     workflow: "{project-root}/custom/my.yaml"
 #     description: My custom workflow
 # Add custom prompts (for action="#id" handlers)
 prompts: []
 # Example:
 # prompts:
 # - id: my-prompt
 #   content: |
 #     Prompt instructions here
--- a/_bmad/_config/agents/core-bmad-master.customize.yaml
+++ b/_bmad/_config/agents/core-bmad-master.customize.yaml
@@ -0,0 +1,41 @@
 # Agent Customization
 # Customize any section below - all are optional
 # Override agent name
 agent:
  metadata:
    name: ""
 # Replace entire persona (not merged)
 persona:
  role: ""
  identity: ""
  communication_style: ""
  principles: []
 # Add custom critical actions (appended after standard config loading)
 critical_actions: []
 # Add persistent memories for the agent
 memories: []
 # Example:
 # memories:
 #   - "User prefers detailed technical explanations"
 #   - "Current project uses React and TypeScript"
 # Add custom menu items (appended to base menu)
 # Don't include * prefix or help/exit - auto-injected
 menu: []
 # Example:
 # menu:
 #   - trigger: my-workflow
 #     workflow: "{project-root}/custom/my.yaml"
 #     description: My custom workflow
 # Add custom prompts (for action="#id" handlers)
 prompts: []
 # Example:
 # prompts:
 # - id: my-prompt
 #   content: |
 #     Prompt instructions here
--- a/_bmad/_config/files-manifest.csv
+++ b/_bmad/_config/files-manifest.csv
@@ -0,0 +1,268 @@
 type,name,module,path,hash
 "csv","agent-manifest","_config","_config/agent-manifest.csv","6916048fc4a8f5caaea40350e4b2288f0fab01ea7959218b332920ec62e6a18c"
 "csv","task-manifest","_config","_config/task-manifest.csv","35e06d618921c1260c469d328a5af14c3744072f66a20c43d314edfb29296a70"
 "csv","workflow-manifest","_config","_config/workflow-manifest.csv","254b28d8d3b9871d77b12670144e98f5850180a1b50c92eaa88a53bef77309c8"
 "yaml","manifest","_config","_config/manifest.yaml","dd938b546ff9a51045e09d21d37bd78082b9ceceb41c5225dae93834140f5c01"
 "csv","default-party","bmm","bmm/teams/default-party.csv","43209253a2e784e6b054a4ac427c9532a50d9310f6a85052d93ce975b9162156"
 "csv","documentation-requirements","bmm","bmm/workflows/document-project/documentation-requirements.csv","d1253b99e88250f2130516b56027ed706e643bfec3d99316727a4c6ec65c6c1d"
 "csv","domain-complexity","bmm","bmm/workflows/2-plan-workflows/prd/domain-complexity.csv","ed4d30e9fd87db2d628fb66cac7a302823ef6ebb3a8da53b9265326f10a54e11"
 "csv","domain-complexity","bmm","bmm/workflows/3-solutioning/create-architecture/data/domain-complexity.csv","cb9244ed2084143146f9f473244ad9cf63d33891742b9f6fbcb6e354fa4f3a93"
 "csv","project-types","bmm","bmm/workflows/2-plan-workflows/prd/project-types.csv","7a01d336e940fb7a59ff450064fd1194cdedda316370d939264a0a0adcc0aca3"
 "csv","project-types","bmm","bmm/workflows/3-solutioning/create-architecture/data/project-types.csv","12343635a2f11343edb1d46906981d6f5e12b9cad2f612e13b09460b5e5106e7"
 "csv","tea-index","bmm","bmm/testarch/tea-index.csv","374a8d53b5e127a9440751a02c5112c66f81bc00e2128d11d11f16d8f45292ea"
 "json","excalidraw-library","bmm","bmm/workflows/excalidraw-diagrams/_shared/excalidraw-library.json","8e5079f4e79ff17f4781358423f2126a1f14ab48bbdee18fd28943865722030c"
 "json","project-scan-report-schema","bmm","bmm/workflows/document-project/templates/project-scan-report-schema.json","53255f15a10cab801a1d75b4318cdb0095eed08c51b3323b7e6c236ae6b399b7"
 "md","api-request","bmm","bmm/testarch/knowledge/api-request.md","93ac674f645cb389aafe08ce31e53280ebc0385c59e585a199b772bb0e0651fb"
 "md","architecture-decision-template","bmm","bmm/workflows/3-solutioning/create-architecture/architecture-decision-template.md","5d9adf90c28df61031079280fd2e49998ec3b44fb3757c6a202cda353e172e9f"
 "md","atdd-checklist-template","bmm","bmm/workflows/testarch/atdd/atdd-checklist-template.md","b89f46efefbf08ddd4c58392023a39bd60db353a3f087b299e32be27155fa740"
 "md","auth-session","bmm","bmm/testarch/knowledge/auth-session.md","b2ee00c5650655311ff54d20dcd6013afb5b280a66faa8336f9fb810436f1aab"
 "md","burn-in","bmm","bmm/testarch/knowledge/burn-in.md","5ba3d2abe6b961e5bc3948ab165e801195bff3ee6e66569c00c219b484aa4b5d"
 "md","checklist","bmm","bmm/workflows/4-implementation/code-review/checklist.md","e30d2890ba5c50777bbe04071f754e975a1d7ec168501f321a79169c4201dd28"
 "md","checklist","bmm","bmm/workflows/4-implementation/correct-course/checklist.md","d3d30482c5e82a84c15c10dacb50d960456e98cfc5a8ddc11b54e14f3a850029"
 "md","checklist","bmm","bmm/workflows/4-implementation/create-story/checklist.md","3eacc5cfd6726ab0ea0ba8fe56d9bdea466964e6cc35ed8bfadeb84307169bdc"
 "md","checklist","bmm","bmm/workflows/4-implementation/dev-story/checklist.md","630b68c6824a8785003a65553c1f335222b17be93b1bd80524c23b38bde1d8af"
 "md","checklist","bmm","bmm/workflows/4-implementation/sprint-planning/checklist.md","80b10aedcf88ab1641b8e5f99c9a400c8fd9014f13ca65befc5c83992e367dd7"
 "md","checklist","bmm","bmm/workflows/document-project/checklist.md","581b0b034c25de17ac3678db2dbafedaeb113de37ddf15a4df6584cf2324a7d7"
 "md","checklist","bmm","bmm/workflows/excalidraw-diagrams/create-dataflow/checklist.md","f420aaf346833dfda5454ffec9f90a680e903453bcc4d3e277d089e6781fec55"
 "md","checklist","bmm","bmm/workflows/excalidraw-diagrams/create-diagram/checklist.md","6357350a6e2237c1b819edd8fc847e376192bf802000cb1a4337c9584fc91a18"
 "md","checklist","bmm","bmm/workflows/excalidraw-diagrams/create-flowchart/checklist.md","45aaf882b8e9a1042683406ae2cfc0b23d3d39bd1dac3ddb0778d5b7165f7047"
 "md","checklist","bmm","bmm/workflows/excalidraw-diagrams/create-wireframe/checklist.md","588f9354bf366c173aa261cf5a8b3a87c878ea72fd2c0f8088c4b3289e984641"
 "md","checklist","bmm","bmm/workflows/testarch/atdd/checklist.md","d86b1718207a7225e57bc9ac281dc78f22806ac1bfdb9d770ac5dccf7ed8536b"
 "md","checklist","bmm","bmm/workflows/testarch/automate/checklist.md","3a8f47b83ad8eff408f7126f7729d4b930738bf7d03b0caea91d1ef49aeb19ee"
 "md","checklist","bmm","bmm/workflows/testarch/ci/checklist.md","dfb1ffff2028566d8f0e46a15024d407df5a5e1fad253567f56ee2903618d419"
 "md","checklist","bmm","bmm/workflows/testarch/framework/checklist.md","16cc3aee710abb60fb85d2e92f0010b280e66b38fac963c0955fb36e7417103a"
 "md","checklist","bmm","bmm/workflows/testarch/nfr-assess/checklist.md","1f070e990c0778b2066f05c31f94c9ddcb97a695e7ae8322b4f487f75fe62d57"
 "md","checklist","bmm","bmm/workflows/testarch/test-design/checklist.md","f7ac96d3c61500946c924e1c1924f366c3feae23143c8d130f044926365096e1"
 "md","checklist","bmm","bmm/workflows/testarch/test-review/checklist.md","e39f2fb9c2dbfd158e5b5c1602fd15d5dbd3b0f0616d171e0551c356c92416f9"
 "md","checklist","bmm","bmm/workflows/testarch/trace/checklist.md","c67b2a1ee863c55b95520db0bc9c1c0a849afee55f96733a08bb2ec55f40ad70"
 "md","ci-burn-in","bmm","bmm/testarch/knowledge/ci-burn-in.md","4cdcf7b576dae8b5cb591a6fad69674f65044a0dc72ea57d561623dac93ec475"
 "md","component-tdd","bmm","bmm/testarch/knowledge/component-tdd.md","88bd1f9ca1d5bcd1552828845fe80b86ff3acdf071bac574eda744caf7120ef8"
 "md","contract-testing","bmm","bmm/testarch/knowledge/contract-testing.md","d8f662c286b2ea4772213541c43aebef006ab6b46e8737ebdc4a414621895599"
 "md","data-factories","bmm","bmm/testarch/knowledge/data-factories.md","d7428fe7675da02b6f5c4c03213fc5e542063f61ab033efb47c1c5669b835d88"
 "md","deep-dive-instructions","bmm","bmm/workflows/document-project/workflows/deep-dive-instructions.md","8cb3d32d7685e5deff4731c2003d30b4321ef6c29247b3ddbe672c185e022604"
 "md","deep-dive-template","bmm","bmm/workflows/document-project/templates/deep-dive-template.md","6198aa731d87d6a318b5b8d180fc29b9aa53ff0966e02391c17333818e94ffe9"
 "md","documentation-standards","bmm","bmm/data/documentation-standards.md","fc26d4daff6b5a73eb7964eacba6a4f5cf8f9810a8c41b6949c4023a4176d853"
 "md","email-auth","bmm","bmm/testarch/knowledge/email-auth.md","43f4cc3138a905a91f4a69f358be6664a790b192811b4dfc238188e826f6b41b"
 "md","epics-template","bmm","bmm/workflows/3-solutioning/create-epics-and-stories/templates/epics-template.md","b8ec5562b2a77efd80c40eba0421bbaab931681552e5a0ff01cd93902c447ff7"
 "md","error-handling","bmm","bmm/testarch/knowledge/error-handling.md","8a314eafb31e78020e2709d88aaf4445160cbefb3aba788b62d1701557eb81c1"
 "md","feature-flags","bmm","bmm/testarch/knowledge/feature-flags.md","f6db7e8de2b63ce40a1ceb120a4055fbc2c29454ad8fca5db4e8c065d98f6f49"
 "md","file-utils","bmm","bmm/testarch/knowledge/file-utils.md","e0d4e98ca6ec32035ae07a14880c65ab99298e9240404d27a05788c974659e8b"
 "md","fixture-architecture","bmm","bmm/testarch/knowledge/fixture-architecture.md","a3b6c1bcaf5e925068f3806a3d2179ac11dde7149e404bc4bb5602afb7392501"
 "md","fixtures-composition","bmm","bmm/testarch/knowledge/fixtures-composition.md","8e57a897663a272fd603026aeec76941543c1e09d129e377846726fd405f3a5a"
 "md","full-scan-instructions","bmm","bmm/workflows/document-project/workflows/full-scan-instructions.md","6c6e0d77b33f41757eed8ebf436d4def69cd6ce412395b047bf5909f66d876aa"
 "md","index-template","bmm","bmm/workflows/document-project/templates/index-template.md","42c8a14f53088e4fda82f26a3fe41dc8a89d4bcb7a9659dd696136378b64ee90"
 "md","instructions","bmm","bmm/workflows/4-implementation/correct-course/instructions.md","bd56efff69b1c72fbd835cbac68afaac043cf5004d021425f52935441a3c779d"
 "md","instructions","bmm","bmm/workflows/4-implementation/retrospective/instructions.md","c1357ee8149935b391db1fd7cc9869bf3b450132f04d27fbb11906d421923bf8"
 "md","instructions","bmm","bmm/workflows/4-implementation/sprint-planning/instructions.md","8ac972eb08068305223e37dceac9c3a22127062edae2692f95bc16b8dbafa046"
 "md","instructions","bmm","bmm/workflows/4-implementation/sprint-status/instructions.md","8f883c7cf59460012b855465c7cbc896f0820afb11031c2b1b3dd514ed9f4b63"
 "md","instructions","bmm","bmm/workflows/document-project/instructions.md","faba39025e187c6729135eccf339ec1e08fbdc34ad181583de8161d3d805aaaf"
 "md","instructions","bmm","bmm/workflows/excalidraw-diagrams/create-dataflow/instructions.md","e43d05aaf6a1e881ae42e73641826b70e27ea91390834901f18665b524bbff77"
 "md","instructions","bmm","bmm/workflows/excalidraw-diagrams/create-diagram/instructions.md","5d41c1e5b28796f6844645f3c1e2e75bb80f2e1576eb2c1f3ba2894cbf4a65e8"
 "md","instructions","bmm","bmm/workflows/excalidraw-diagrams/create-flowchart/instructions.md","9647360dc08e6e8dcbb634620e8a4247add5b22fad7a3bd13ef79683f31b9d77"
 "md","instructions","bmm","bmm/workflows/excalidraw-diagrams/create-wireframe/instructions.md","d0ddbb8f4235b28af140cc7b5210c989b4b126f973eb539e216ab10d4bbc2410"
 "md","instructions","bmm","bmm/workflows/testarch/atdd/instructions.md","8b22d80ff61fd90b4f8402d5b5ab69d01a2c9f00cc4e1aa23aef49720db9254b"
 "md","instructions","bmm","bmm/workflows/testarch/automate/instructions.md","6611e6abc114f68c16f3121dc2c2a2dcfefc355f857099b814b715f6d646a81c"
 "md","instructions","bmm","bmm/workflows/testarch/ci/instructions.md","8cc49d93e549eb30952320b1902624036d23e92a6bbaf3f012d2a18dc67a9141"
 "md","instructions","bmm","bmm/workflows/testarch/framework/instructions.md","902212128052de150753ce0cabb9be0423da782ba280c3b5c198bc16e8ae7eb3"
 "md","instructions","bmm","bmm/workflows/testarch/nfr-assess/instructions.md","6a4ef0830a65e96f41e7f6f34ed5694383e0935a46440c77a4a29cbfbd5f75f9"
 "md","instructions","bmm","bmm/workflows/testarch/test-design/instructions.md","b332c20fbc8828b2ebd34aad2f36af88ce1ce1d8a8c7c29412329c9f8884de9a"
 "md","instructions","bmm","bmm/workflows/testarch/test-review/instructions.md","f1dfb61f7a7d9e584d398987fdcb8ab27b4835d26b6a001ca4611b8a3da4c32d"
 "md","instructions","bmm","bmm/workflows/testarch/trace/instructions.md","233cfb6922fe0f7aaa3512fcda08017b0f89de663f66903474b0abf2e1d01614"
 "md","instructions","bmm","bmm/workflows/workflow-status/init/instructions.md","cd7f8e8de5c5b775b1aa1d6ea3b02f1d47b24fa138b3ed73877287a58fcdb9a1"
 "md","instructions","bmm","bmm/workflows/workflow-status/instructions.md","ddbb594d72209903bf2bf93c70e7dc961295e7382fb6d4adcf8122f9334bb41f"
 "md","intercept-network-call","bmm","bmm/testarch/knowledge/intercept-network-call.md","fb551cb0cefe3c062c28ae255a121aaae098638ec35a16fcdba98f670887ab6a"
 "md","log","bmm","bmm/testarch/knowledge/log.md","b6267716ccbe6f9e2cc1b2b184501faeb30277bc8546206a66f31500c52381d0"
 "md","network-error-monitor","bmm","bmm/testarch/knowledge/network-error-monitor.md","0380eb6df15af0a136334ad00cf44c92c779f311b07231f5aa6230e198786799"
 "md","network-first","bmm","bmm/testarch/knowledge/network-first.md","2920e58e145626f5505bcb75e263dbd0e6ac79a8c4c2ec138f5329e06a6ac014"
 "md","network-recorder","bmm","bmm/testarch/knowledge/network-recorder.md","9f120515cc377c4c500ec0b5fff0968666a9a4edee03a328d92514147d50f073"
 "md","nfr-criteria","bmm","bmm/testarch/knowledge/nfr-criteria.md","e63cee4a0193e4858c8f70ff33a497a1b97d13a69da66f60ed5c9a9853025aa1"
 "md","nfr-report-template","bmm","bmm/workflows/testarch/nfr-assess/nfr-report-template.md","229bdabe07577d24679eb9d42283b353dbde21338157188d8f555fdef200b91c"
 "md","overview","bmm","bmm/testarch/knowledge/overview.md","79a12311d706fe55c48f72ef51c662c6f61a54651b3b76a3c7ccc87de6ebbf03"
 "md","playwright-config","bmm","bmm/testarch/knowledge/playwright-config.md","42516511104a7131775f4446196cf9e5dd3295ba3272d5a5030660b1dffaa69f"
 "md","prd-template","bmm","bmm/workflows/2-plan-workflows/prd/prd-template.md","829135530b0652dfb4a2929864042f515bc372b6cbe66be60103311365679efb"
 "md","probability-impact","bmm","bmm/testarch/knowledge/probability-impact.md","446dba0caa1eb162734514f35366f8c38ed3666528b0b5e16c7f03fd3c537d0f"
 "md","product-brief.template","bmm","bmm/workflows/1-analysis/create-product-brief/product-brief.template.md","ae0f58b14455efd75a0d97ba68596a3f0b58f350cd1a0ee5b1af69540f949781"
 "md","project-context-template","bmm","bmm/data/project-context-template.md","34421aed3e0ad921dc0c0080297f3a2299735b00a25351de589ada99dae56559"
 "md","project-context-template","bmm","bmm/workflows/generate-project-context/project-context-template.md","54e351394ceceb0ac4b5b8135bb6295cf2c37f739c7fd11bb895ca16d79824a5"
 "md","project-overview-template","bmm","bmm/workflows/document-project/templates/project-overview-template.md","a7c7325b75a5a678dca391b9b69b1e3409cfbe6da95e70443ed3ace164e287b2"
 "md","readiness-report-template","bmm","bmm/workflows/3-solutioning/check-implementation-readiness/templates/readiness-report-template.md","0da97ab1e38818e642f36dc0ef24d2dae69fc6e0be59924dc2dbf44329738ff6"
 "md","README","bmm","bmm/data/README.md","352c44cff4dd0e5a90cdf6781168ceb57f5a78eaabddcd168433d8784854e4fb"
 "md","recurse","bmm","bmm/testarch/knowledge/recurse.md","19056fb5b7e5e626aad81277b3e5eec333f2aed36a17aea6c7d8714a5460c8b2"
 "md","research.template","bmm","bmm/workflows/1-analysis/research/research.template.md","507bb6729476246b1ca2fca4693986d286a33af5529b6cd5cb1b0bb5ea9926ce"
 "md","risk-governance","bmm","bmm/testarch/knowledge/risk-governance.md","2fa2bc3979c4f6d4e1dec09facb2d446f2a4fbc80107b11fc41cbef2b8d65d68"
 "md","selective-testing","bmm","bmm/testarch/knowledge/selective-testing.md","c14c8e1bcc309dbb86a60f65bc921abf5a855c18a753e0c0654a108eb3eb1f1c"
 "md","selector-resilience","bmm","bmm/testarch/knowledge/selector-resilience.md","a55c25a340f1cd10811802665754a3f4eab0c82868fea61fea9cc61aa47ac179"
 "md","source-tree-template","bmm","bmm/workflows/document-project/templates/source-tree-template.md","109bc335ebb22f932b37c24cdc777a351264191825444a4d147c9b82a1e2ad7a"
 "md","step-01-discover","bmm","bmm/workflows/generate-project-context/steps/step-01-discover.md","0f1455c018b2f6df0b896d25e677690e1cf58fa1b276d90f0723187d786d6613"
 "md","step-01-document-discovery","bmm","bmm/workflows/3-solutioning/check-implementation-readiness/steps/step-01-document-discovery.md","bd6114c10845e828098905e52d35f908f1b32dabc67313833adc7e6dd80080b0"
 "md","step-01-init","bmm","bmm/workflows/1-analysis/create-product-brief/steps/step-01-init.md","d90d224fbf8893dd0ade3c5b9231428f4f70399a921f7af880b5c664cfd95bef"
 "md","step-01-init","bmm","bmm/workflows/1-analysis/research/domain-steps/step-01-init.md","efee243f13ef54401ded88f501967b8bc767460cec5561b2107fc03fe7b7eab1"
 "md","step-01-init","bmm","bmm/workflows/1-analysis/research/market-steps/step-01-init.md","ee7627e44ba76000569192cbacf2317f8531fd0fedc4801035267dc71d329787"
 "md","step-01-init","bmm","bmm/workflows/1-analysis/research/technical-steps/step-01-init.md","c9a1627ecd26227e944375eb691e7ee6bc9f5db29a428a5d53e5d6aef8bb9697"
 "md","step-01-init","bmm","bmm/workflows/2-plan-workflows/create-ux-design/steps/step-01-init.md","7b3467a29126c9498b57b06d688f610bcb7a68a8975208c209dd1103546bc455"
 "md","step-01-init","bmm","bmm/workflows/2-plan-workflows/prd/steps/step-01-init.md","abad19b37040d4b31628b95939d4d8c631401a0bd37e40ad474c180d7cd5e664"
 "md","step-01-init","bmm","bmm/workflows/3-solutioning/create-architecture/steps/step-01-init.md","c730b1f23f0298853e5bf0b9007c2fc86e835fb3d53455d2068a6965d1192f49"
 "md","step-01-mode-detection","bmm","bmm/workflows/bmad-quick-flow/quick-dev/steps/step-01-mode-detection.md","e3c252531a413576dfcb2e214ba4f92b4468b8e50c9fbc569674deff26d21175"
 "md","step-01-understand","bmm","bmm/workflows/bmad-quick-flow/create-tech-spec/steps/step-01-understand.md","e8a43cf798df32dc60acd9a2ef1d4a3c2e97f0cf66dd9df553dc7a1c80d7b0cc"
 "md","step-01-validate-prerequisites","bmm","bmm/workflows/3-solutioning/create-epics-and-stories/steps/step-01-validate-prerequisites.md","88c7bfa5579bfdc38b2d855b3d2c03898bf47b11b9f4fae52fb494e2ce163450"
 "md","step-01b-continue","bmm","bmm/workflows/1-analysis/create-product-brief/steps/step-01b-continue.md","bb32e3636bdd19f51e5145b32f766325f48ad347358f74476f8d6c8b7c96c8ef"
 "md","step-01b-continue","bmm","bmm/workflows/2-plan-workflows/create-ux-design/steps/step-01b-continue.md","fde4bf8fa3a6d3230d20cb23e71cbc8e2db1cd2b30b693e13d0b3184bc6bb9a6"
 "md","step-01b-continue","bmm","bmm/workflows/2-plan-workflows/prd/steps/step-01b-continue.md","7857264692e4fe515b05d4ddc9ea39d66a61c3e2715035cdd0d584170bf38ffe"
 "md","step-01b-continue","bmm","bmm/workflows/3-solutioning/create-architecture/steps/step-01b-continue.md","c6cc389b49682a8835382d477d803a75acbad01b24da1b7074ce140d82b278dc"
 "md","step-02-context","bmm","bmm/workflows/3-solutioning/create-architecture/steps/step-02-context.md","e69de083257a5dd84083cadcb55deeefb1cdfdee90f52eb3bfbaadbe6602a627"
 "md","step-02-context-gathering","bmm","bmm/workflows/bmad-quick-flow/quick-dev/steps/step-02-context-gathering.md","8de307668f74892657c2b09f828a3b626b62a479fb72c0280c68ed0e25803896"
 "md","step-02-customer-behavior","bmm","bmm/workflows/1-analysis/research/market-steps/step-02-customer-behavior.md","ca77a54143c2df684cf859e10cea48c6ea1ce8e297068a0f0f26ee63d3170c1e"
 "md","step-02-customer-insights","bmm","bmm/workflows/1-analysis/research/market-steps/step-02-customer-insights.md","de7391755e7c8386096ed2383c24917dd6cab234843b34004e230d6d3d0e3796"
 "md","step-02-design-epics","bmm","bmm/workflows/3-solutioning/create-epics-and-stories/steps/step-02-design-epics.md","1a1c52515a53c12a274d1d5e02ec67c095ea93453259abeca989b9bfd860805c"
 "md","step-02-discovery","bmm","bmm/workflows/2-plan-workflows/create-ux-design/steps/step-02-discovery.md","021d197dfdf071548adf5cfb80fb3b638b5a5d70889b926de221e1e61cea4137"
 "md","step-02-discovery","bmm","bmm/workflows/2-plan-workflows/prd/steps/step-02-discovery.md","b89616175bbdce5fa3dd41dcc31b3b50ad465d35836e62a9ead984b6d604d5c2"
 "md","step-02-domain-analysis","bmm","bmm/workflows/1-analysis/research/domain-steps/step-02-domain-analysis.md","385a288d9bbb0adf050bcce4da4dad198a9151822f9766900404636f2b0c7f9d"
 "md","step-02-generate","bmm","bmm/workflows/generate-project-context/steps/step-02-generate.md","0fff27dab748b4600d02d2fb083513fa4a4e061ed66828b633f7998fcf8257e1"
 "md","step-02-investigate","bmm","bmm/workflows/bmad-quick-flow/create-tech-spec/steps/step-02-investigate.md","3a93724c59af5e8e9da88bf66ece6d72e64cd42ebe6897340fdf2e34191de06c"
 "md","step-02-prd-analysis","bmm","bmm/workflows/3-solutioning/check-implementation-readiness/steps/step-02-prd-analysis.md","37707ccd23bc4e3ff4a888eb4a04722c052518c91fcb83d3d58045595711fdaf"
 "md","step-02-technical-overview","bmm","bmm/workflows/1-analysis/research/technical-steps/step-02-technical-overview.md","9c7582241038b16280cddce86f2943216541275daf0a935dcab78f362904b305"
 "md","step-02-vision","bmm","bmm/workflows/1-analysis/create-product-brief/steps/step-02-vision.md","ac3362c75bd8c3fe42ce3ddd433f3ce58b4a1b466bc056298827f87c7ba274f8"
 "md","step-03-competitive-landscape","bmm","bmm/workflows/1-analysis/research/domain-steps/step-03-competitive-landscape.md","f10aa088ba00c59491507f6519fb314139f8be6807958bb5fd1b66bff2267749"
 "md","step-03-complete","bmm","bmm/workflows/generate-project-context/steps/step-03-complete.md","cf8d1d1904aeddaddb043c3c365d026cd238891cd702c2b78bae032a8e08ae17"
 "md","step-03-core-experience","bmm","bmm/workflows/2-plan-workflows/create-ux-design/steps/step-03-core-experience.md","39f0904b2724d51ba880b2f22deefc00631441669a0c9a8ac0565a8ada3464b2"
 "md","step-03-create-stories","bmm","bmm/workflows/3-solutioning/create-epics-and-stories/steps/step-03-create-stories.md","885dd4bceaed6203f5c00fb9484ab377ee1983b0a487970591472b9ec43a1634"
 "md","step-03-customer-pain-points","bmm","bmm/workflows/1-analysis/research/market-steps/step-03-customer-pain-points.md","ce7394a73a7d3dd627280a8bef0ed04c11e4036275acc4b50c666fd1d84172c4"
 "md","step-03-epic-coverage-validation","bmm","bmm/workflows/3-solutioning/check-implementation-readiness/steps/step-03-epic-coverage-validation.md","f58af59ecbcbed1a83eea3984c550cf78484ef803d7eb80bbf7e0980e45cdf44"
 "md","step-03-execute","bmm","bmm/workflows/bmad-quick-flow/quick-dev/steps/step-03-execute.md","dc340c8c7ac0819ae8442c3838e0ea922656ad7967ea110a8bf0ff80972d570a"
 "md","step-03-generate","bmm","bmm/workflows/bmad-quick-flow/create-tech-spec/steps/step-03-generate.md","d2f998ae3efd33468d90825dc54766eefbe3b4b38fba9e95166fe42d7002db82"
 "md","step-03-integration-patterns","bmm","bmm/workflows/1-analysis/research/technical-steps/step-03-integration-patterns.md","005d517a2f962e2172e26b23d10d5e6684c7736c0d3982e27b2e72d905814ad9"
 "md","step-03-starter","bmm","bmm/workflows/3-solutioning/create-architecture/steps/step-03-starter.md","7dd61ab909d236da0caf59954dced5468657bcb27f859d1d92265e59b3616c28"
 "md","step-03-success","bmm","bmm/workflows/2-plan-workflows/prd/steps/step-03-success.md","07de6f3650dfda068d6f8155e5c4dc0a18ac40fb19f8c46ba54b39cf3f911067"
 "md","step-03-users","bmm","bmm/workflows/1-analysis/create-product-brief/steps/step-03-users.md","e148ee42c8cbb52b11fc9c984cb922c46bd1cb197de02445e02548995d04c390"
 "md","step-04-architectural-patterns","bmm","bmm/workflows/1-analysis/research/technical-steps/step-04-architectural-patterns.md","5ab115b67221be4182f88204b17578697136d8c11b7af21d91012d33ff84aafb"
 "md","step-04-customer-decisions","bmm","bmm/workflows/1-analysis/research/market-steps/step-04-customer-decisions.md","17dde68d655f7c66b47ed59088c841d28d206ee02137388534b141d9a8465cf9"
 "md","step-04-decisions","bmm","bmm/workflows/3-solutioning/create-architecture/steps/step-04-decisions.md","dc83242891d4f6bd5cba6e87bd749378294afdf88af17851e488273893440a84"
 "md","step-04-emotional-response","bmm","bmm/workflows/2-plan-workflows/create-ux-design/steps/step-04-emotional-response.md","a2db9d24cdfc88aeb28a92ed236df940657842291a7d70e1616b59fbfd1c4e19"
 "md","step-04-final-validation","bmm","bmm/workflows/3-solutioning/create-epics-and-stories/steps/step-04-final-validation.md","c56c5289d65f34c1c22c5a9a09084e041ee445b341ebd6380ca9a2885f225344"
 "md","step-04-journeys","bmm","bmm/workflows/2-plan-workflows/prd/steps/step-04-journeys.md","93fb356f0c9edd02b5d1ad475fb629e6b3b875b6ea276b02059b66ade68c0d30"
 "md","step-04-metrics","bmm","bmm/workflows/1-analysis/create-product-brief/steps/step-04-metrics.md","5c8c689267fd158a8c8e07d76041f56003aa58c19ed2649deef780a8f97722aa"
 "md","step-04-regulatory-focus","bmm","bmm/workflows/1-analysis/research/domain-steps/step-04-regulatory-focus.md","d22035529efe91993e698b4ebf297bf2e7593eb41d185a661c357a8afc08977b"
 "md","step-04-review","bmm","bmm/workflows/bmad-quick-flow/create-tech-spec/steps/step-04-review.md","7571c5694a9f04ea29fbdb7ad83d6a6c9129c95ace4211e74e67ca4216acc4ff"
 "md","step-04-self-check","bmm","bmm/workflows/bmad-quick-flow/quick-dev/steps/step-04-self-check.md","444c02d8f57cd528729c51d77abf51ca8918ac5c65f3dcf269b21784f5f6920c"
 "md","step-04-ux-alignment","bmm","bmm/workflows/3-solutioning/check-implementation-readiness/steps/step-04-ux-alignment.md","e673765ad05f4f2dc70a49c17124d7dd6f92a7a481314a6093f82cda0c61a2b5"
 "md","step-05-adversarial-review","bmm","bmm/workflows/bmad-quick-flow/quick-dev/steps/step-05-adversarial-review.md","38d6f43af07f51d67d6abd5d88de027d5703033ed6b7fe2400069f5fc31d4237"
 "md","step-05-competitive-analysis","bmm","bmm/workflows/1-analysis/research/market-steps/step-05-competitive-analysis.md","ff6f606a80ffaf09aa325e38a4ceb321b97019e6542241b2ed4e8eb38b35efa8"
 "md","step-05-domain","bmm","bmm/workflows/2-plan-workflows/prd/steps/step-05-domain.md","a18c274f10f3116e5b3e88e3133760ab4374587e4c9c6167e8eea4b84589298c"
 "md","step-05-epic-quality-review","bmm","bmm/workflows/3-solutioning/check-implementation-readiness/steps/step-05-epic-quality-review.md","4014a0e0a7b725474f16250a8f19745e188d51c4f4dbef549de0940eb428841d"
 "md","step-05-implementation-research","bmm","bmm/workflows/1-analysis/research/technical-steps/step-05-implementation-research.md","55ae5ab81295c6d6e3694c1b89472abcd5cd562cf55a2b5fffdd167e15bee82b"
 "md","step-05-inspiration","bmm","bmm/workflows/2-plan-workflows/create-ux-design/steps/step-05-inspiration.md","7f8d6c50c3128d7f4cb5dbf92ed9b0b0aa2ce393649f1506f5996bd51e3a5604"
 "md","step-05-patterns","bmm","bmm/workflows/3-solutioning/create-architecture/steps/step-05-patterns.md","8660291477a35ba5a7aecc73fbb9f5fa85de2a4245ae9dd2644f5e2f64a66d30"
 "md","step-05-scope","bmm","bmm/workflows/1-analysis/create-product-brief/steps/step-05-scope.md","9e2d58633f621d437fe59a3fd8d10f6c190b85a6dcf1dbe9167d15f45585af51"
 "md","step-05-technical-trends","bmm","bmm/workflows/1-analysis/research/domain-steps/step-05-technical-trends.md","fd6c577010171679f630805eb76e09daf823c2b9770eb716986d01f351ce1fb4"
 "md","step-06-complete","bmm","bmm/workflows/1-analysis/create-product-brief/steps/step-06-complete.md","488ea54b7825e5a458a58c0c3104bf5dc56f5e401c805df954a0bfc363194f31"
 "md","step-06-design-system","bmm","bmm/workflows/2-plan-workflows/create-ux-design/steps/step-06-design-system.md","6bb2666aeb114708321e2f730431eb17d2c08c78d57d9cc6b32cb11402aa8472"
 "md","step-06-final-assessment","bmm","bmm/workflows/3-solutioning/check-implementation-readiness/steps/step-06-final-assessment.md","67d68de4bdaaa9e814d15d30c192da7301339e851224ef562077b2fb39c7d869"
 "md","step-06-innovation","bmm","bmm/workflows/2-plan-workflows/prd/steps/step-06-innovation.md","faa4b7e1b74e843d167ef0ea16dab475ea51e57b654337ec7a1ba90d85e8a44a"
 "md","step-06-research-completion","bmm","bmm/workflows/1-analysis/research/market-steps/step-06-research-completion.md","30d5e14f39df193ebce952dfed2bd4009d68fe844e28ad3a29f5667382ebc6d2"
 "md","step-06-research-synthesis","bmm","bmm/workflows/1-analysis/research/domain-steps/step-06-research-synthesis.md","4c7727b8d3c6272c1b2b84ea58a67fc86cafab3472c0caf54e8b8cee3fa411fc"
 "md","step-06-research-synthesis","bmm","bmm/workflows/1-analysis/research/technical-steps/step-06-research-synthesis.md","5df66bbeecd345e829f06c4eb5bdecd572ca46aec8927bda8b97dbd5f5a34d6c"
 "md","step-06-resolve-findings","bmm","bmm/workflows/bmad-quick-flow/quick-dev/steps/step-06-resolve-findings.md","ad5d90b4f753fec9d2ba6065cbf4e5fa6ef07b013504a573a0edea5dcc16e180"
 "md","step-06-structure","bmm","bmm/workflows/3-solutioning/create-architecture/steps/step-06-structure.md","8ebb95adc203b83e3329b32bcd19e4d65faa8e68af7255374f40f0cbf4d91f2b"
 "md","step-07-defining-experience","bmm","bmm/workflows/2-plan-workflows/create-ux-design/steps/step-07-defining-experience.md","10db4f974747602d97a719542c0cd31aa7500b035fba5fddf1777949f76928d6"
 "md","step-07-project-type","bmm","bmm/workflows/2-plan-workflows/prd/steps/step-07-project-type.md","260d5d3738ddc60952f6a04a1370e59e2bf2c596b926295466244278952becd1"
 "md","step-07-validation","bmm","bmm/workflows/3-solutioning/create-architecture/steps/step-07-validation.md","0aaa043da24c0c9558c32417c5ba76ad898d4300ca114a8be3f77fabf638c2e2"
 "md","step-08-complete","bmm","bmm/workflows/3-solutioning/create-architecture/steps/step-08-complete.md","d2bb24dedc8ca431a1dc766033069694b7e1e7bef146d9d1d1d10bf2555a02cd"
 "md","step-08-scoping","bmm","bmm/workflows/2-plan-workflows/prd/steps/step-08-scoping.md","535949aab670b628807b08b9ab7627b8b62d8fdad7300d616101245e54920f61"
 "md","step-08-visual-foundation","bmm","bmm/workflows/2-plan-workflows/create-ux-design/steps/step-08-visual-foundation.md","114ae7e866eb41ec3ff0c573ba142ee6641e30d91a656e5069930fe3bb9786ae"
 "md","step-09-design-directions","bmm","bmm/workflows/2-plan-workflows/create-ux-design/steps/step-09-design-directions.md","73933038a7f1c172716e0688c36275316d1671e4bca39d1050da7b9b475f5211"
 "md","step-09-functional","bmm","bmm/workflows/2-plan-workflows/prd/steps/step-09-functional.md","fb3acbc2b82de5c70e8d7e1a4475e3254d1e8bcb242da88d618904b66f57edad"
 "md","step-10-nonfunctional","bmm","bmm/workflows/2-plan-workflows/prd/steps/step-10-nonfunctional.md","92fde9dc4f198fb551be6389c75b6e09e43c840ce55a635d37202830b4e38718"
 "md","step-10-user-journeys","bmm","bmm/workflows/2-plan-workflows/create-ux-design/steps/step-10-user-journeys.md","7305843b730128445610cc0ff28fc00b952ec361672690d93987978650e077c3"
 "md","step-11-complete","bmm","bmm/workflows/2-plan-workflows/prd/steps/step-11-complete.md","b9a9053f1e5de3d583aa729639731fc26b7ce6a43f6a111582faa4caea96593a"
 "md","step-11-component-strategy","bmm","bmm/workflows/2-plan-workflows/create-ux-design/steps/step-11-component-strategy.md","e4a80fc9d350ce1e84b0d4f0a24abd274f2732095fb127af0dde3bc62f786ad1"
 "md","step-12-ux-patterns","bmm","bmm/workflows/2-plan-workflows/create-ux-design/steps/step-12-ux-patterns.md","4a0b51d278ffbd012d2c9c574adcb081035994be2a055cc0bbf1e348a766cb4a"
 "md","step-13-responsive-accessibility","bmm","bmm/workflows/2-plan-workflows/create-ux-design/steps/step-13-responsive-accessibility.md","c556f2dc3644142f8136237fb422a6aac699ca97812c9b73a988cc6db7915444"
 "md","step-14-complete","bmm","bmm/workflows/2-plan-workflows/create-ux-design/steps/step-14-complete.md","8b05a20310b14bcbc743d990570b40a6f48f5ab10cbc03a723aa841337550fbf"
 "md","tech-spec-template","bmm","bmm/workflows/bmad-quick-flow/create-tech-spec/tech-spec-template.md","6e0ac4991508fec75d33bbe36197e1576d7b2a1ea7ceba656d616e7d7dadcf03"
 "md","template","bmm","bmm/workflows/4-implementation/create-story/template.md","29ba697368d77e88e88d0e7ac78caf7a78785a7dcfc291082aa96a62948afb67"
 "md","test-design-template","bmm","bmm/workflows/testarch/test-design/test-design-template.md","be2c766858684f5afce7c140f65d6d6e36395433938a866dea09da252a723822"
 "md","test-healing-patterns","bmm","bmm/testarch/knowledge/test-healing-patterns.md","b44f7db1ebb1c20ca4ef02d12cae95f692876aee02689605d4b15fe728d28fdf"
 "md","test-levels-framework","bmm","bmm/testarch/knowledge/test-levels-framework.md","80bbac7959a47a2e7e7de82613296f906954d571d2d64ece13381c1a0b480237"
 "md","test-priorities-matrix","bmm","bmm/testarch/knowledge/test-priorities-matrix.md","321c3b708cc19892884be0166afa2a7197028e5474acaf7bc65c17ac861964a5"
 "md","test-quality","bmm","bmm/testarch/knowledge/test-quality.md","97b6db474df0ec7a98a15fd2ae49671bb8e0ddf22963f3c4c47917bb75c05b90"
 "md","test-review-template","bmm","bmm/workflows/testarch/test-review/test-review-template.md","b476bd8ca67b730ffcc9f11aeb63f5a14996e19712af492ffe0d3a3d1a4645d2"
 "md","timing-debugging","bmm","bmm/testarch/knowledge/timing-debugging.md","c4c87539bbd3fd961369bb1d7066135d18c6aad7ecd70256ab5ec3b26a8777d9"
 "md","trace-template","bmm","bmm/workflows/testarch/trace/trace-template.md","148b715e7b257f86bc9d70b8e51b575e31d193420bdf135b32dd7bd3132762f3"
 "md","ux-design-template","bmm","bmm/workflows/2-plan-workflows/create-ux-design/ux-design-template.md","ffa4b89376cd9db6faab682710b7ce755990b1197a8b3e16b17748656d1fca6a"
 "md","visual-debugging","bmm","bmm/testarch/knowledge/visual-debugging.md","072a3d30ba6d22d5e628fc26a08f6e03f8b696e49d5a4445f37749ce5cd4a8a9"
 "md","workflow","bmm","bmm/workflows/1-analysis/create-product-brief/workflow.md","09f24c579989fe45ad36becafc63b5b68f14fe2f6d8dd186a9ddfb0c1f256b7b"
 "md","workflow","bmm","bmm/workflows/1-analysis/research/workflow.md","0c7043392fbe53f1669e73f1f74b851ae78e60fefbe54ed7dfbb12409a22fe10"
 "md","workflow","bmm","bmm/workflows/2-plan-workflows/create-ux-design/workflow.md","49381d214c43080b608ff5886ed34fae904f4d4b14bea4f5c2fafab326fac698"
 "md","workflow","bmm","bmm/workflows/2-plan-workflows/prd/workflow.md","6f09425df1cebfa69538a8b507ce5957513a9e84a912a10aad9bd834133fa568"
 "md","workflow","bmm","bmm/workflows/3-solutioning/check-implementation-readiness/workflow.md","0167a08dd497a50429d8259eec1ebcd669bebbf4472a3db5c352fb6791a39ce8"
 "md","workflow","bmm","bmm/workflows/3-solutioning/create-architecture/workflow.md","c85b3ce51dcadc00c9ef98b0be7cc27b5d38ab2191ef208645b61eb3e7d078ab"
 "md","workflow","bmm","bmm/workflows/3-solutioning/create-epics-and-stories/workflow.md","b62a6f4c85c66059f46ce875da9eb336b4272f189c506c0f77170c7623b5ed55"
 "md","workflow","bmm","bmm/workflows/bmad-quick-flow/create-tech-spec/workflow.md","740134a67df57a818b8d76cf4c5f27090375d1698ae5be9e68c9ab8672d6b1e0"
 "md","workflow","bmm","bmm/workflows/bmad-quick-flow/quick-dev/workflow.md","c6d7306871bb29d1cd0435e2189d7d7d55ec8c4604f688b63c1c77c7d2e6d086"
 "md","workflow","bmm","bmm/workflows/generate-project-context/workflow.md","0da857be1b7fb46fc29afba22b78a8b2150b17db36db68fd254ad925a20666aa"
 "xml","instructions","bmm","bmm/workflows/4-implementation/code-review/instructions.xml","80d43803dced84f1e754d8690fb6da79e5b21a68ca8735b9c0ff709c49ac31ff"
 "xml","instructions","bmm","bmm/workflows/4-implementation/create-story/instructions.xml","713b38a3ee0def92380ca97196d3457f68b8da60b78d2e10fc366c35811691fb"
 "xml","instructions","bmm","bmm/workflows/4-implementation/dev-story/instructions.xml","d01f9b168f5ef2b4aaf7e1c2fad8146dacfa0ea845b101da80db688e1817cefb"
 "yaml","config","bmm","bmm/config.yaml","d120bc5fa5a3a15d13871539e9b4545b12311681ea9457ebc243f97378a64ba1"
 "yaml","deep-dive","bmm","bmm/workflows/document-project/workflows/deep-dive.yaml","a16b5d121604ca00fffdcb04416daf518ec2671a3251b7876c4b590d25d96945"
 "yaml","enterprise-brownfield","bmm","bmm/workflows/workflow-status/paths/enterprise-brownfield.yaml","40b7fb4d855fdd275416e225d685b4772fb0115554e160a0670b07f6fcbc62e5"
 "yaml","enterprise-greenfield","bmm","bmm/workflows/workflow-status/paths/enterprise-greenfield.yaml","61329f48d5d446376bcf81905485c72ba53874f3a3918d5614eb0997b93295c6"
 "yaml","excalidraw-templates","bmm","bmm/workflows/excalidraw-diagrams/_shared/excalidraw-templates.yaml","ca6e4ae85b5ab16df184ce1ddfdf83b20f9540db112ebf195cb793017f014a70"
 "yaml","full-scan","bmm","bmm/workflows/document-project/workflows/full-scan.yaml","8ba79b190733006499515d9d805f4eacd90a420ffc454e04976948c114806c25"
 "yaml","github-actions-template","bmm","bmm/workflows/testarch/ci/github-actions-template.yaml","cf7d1f0a1f2853b07df1b82b00ebe79f800f8f16817500747b7c4c9c7143aba7"
 "yaml","gitlab-ci-template","bmm","bmm/workflows/testarch/ci/gitlab-ci-template.yaml","986f29817e04996ab9f80bf2de0d25d8ed2365d955cc36d5801afaa93e99e80b"
 "yaml","method-brownfield","bmm","bmm/workflows/workflow-status/paths/method-brownfield.yaml","6417f79e274b6aaf07c9b5d8c82f6ee16a8713442c2e38b4bab932831bf3e6c6"
 "yaml","method-greenfield","bmm","bmm/workflows/workflow-status/paths/method-greenfield.yaml","11693c1b4e87d7d7afed204545a9529c27e0566d6ae7a480fdfa4677341f5880"
 "yaml","project-levels","bmm","bmm/workflows/workflow-status/project-levels.yaml","ffa9fb3b32d81617bb8718689a5ff5774d2dff6c669373d979cc38b1dc306966"
 "yaml","sprint-status-template","bmm","bmm/workflows/4-implementation/sprint-planning/sprint-status-template.yaml","de75fe50bd5e3f4410ccc99fcd3f5dc958733b3829af1b13b4d7b0559bbca22b"
 "yaml","team-fullstack","bmm","bmm/teams/team-fullstack.yaml","da8346b10dfad8e1164a11abeb3b0a84a1d8b5f04e01e8490a44ffca477a1b96"
 "yaml","workflow","bmm","bmm/workflows/4-implementation/code-review/workflow.yaml","8879bd2ea2da2c444eac9f4f8bf4f2d58588cdbc92aee189c04d4d926ea7b43d"
 "yaml","workflow","bmm","bmm/workflows/4-implementation/correct-course/workflow.yaml","fd61662b22f5ff1d378633b47837eb9542e433d613fbada176a9d61de15c2961"
 "yaml","workflow","bmm","bmm/workflows/4-implementation/create-story/workflow.yaml","469cdb56604b1582ac8b271f9326947c57b54af312099dfa0387d998acea2cac"
 "yaml","workflow","bmm","bmm/workflows/4-implementation/dev-story/workflow.yaml","270cb47b01e5a49d497c67f2c2605b808a943daf2b34ee60bc726ff78ac217b3"
 "yaml","workflow","bmm","bmm/workflows/4-implementation/retrospective/workflow.yaml","03433aa3f0d5b4b388d31b9bee1ac5cb5ca78e15bb4d44746766784a3ba863d2"
 "yaml","workflow","bmm","bmm/workflows/4-implementation/sprint-planning/workflow.yaml","3038e7488b67303814d95ebbb0f28a225876ec2e3224fdaa914485f5369a44bf"
 "yaml","workflow","bmm","bmm/workflows/4-implementation/sprint-status/workflow.yaml","92c50c478b87cd5c339cdb38399415977f58785b4ae82f7948ba16404fa460cf"
 "yaml","workflow","bmm","bmm/workflows/document-project/workflow.yaml","82e731ea08217480958a75304558e767654d8a8262c0ec1ed91e81afd3135ed5"
 "yaml","workflow","bmm","bmm/workflows/excalidraw-diagrams/create-dataflow/workflow.yaml","a845be912077a9c80fb3f3e2950c33b99139a2ae22db9c006499008ec2fa3851"
 "yaml","workflow","bmm","bmm/workflows/excalidraw-diagrams/create-diagram/workflow.yaml","bac0e13f796b4a4bb2a3909ddef230f0cd1712a0163b6fe72a2966eed8fc87a9"
 "yaml","workflow","bmm","bmm/workflows/excalidraw-diagrams/create-flowchart/workflow.yaml","a8f6e3680d2ec51c131e5cd57c9705e5572fe3e08c536174da7175e07cce0c5d"
 "yaml","workflow","bmm","bmm/workflows/excalidraw-diagrams/create-wireframe/workflow.yaml","88ce19aff63a411583756cd0254af2000b6aac13071204dc9aef61aa137a51ef"
 "yaml","workflow","bmm","bmm/workflows/testarch/atdd/workflow.yaml","671d3319e80fffb3dedf50ccda0f3aea87ed4de58e6af679678995ca9f5262b0"
 "yaml","workflow","bmm","bmm/workflows/testarch/automate/workflow.yaml","3d49eaca0024652b49f00f26f1f1402c73874eb250431cb5c1ce1d2eddc6520b"
 "yaml","workflow","bmm","bmm/workflows/testarch/ci/workflow.yaml","e42067278023d4489a159fdbf7a863c69345e3d3d91bf9af8dcff49fd14f0e6d"
 "yaml","workflow","bmm","bmm/workflows/testarch/framework/workflow.yaml","857b92ccfa185c373ebecd76f3f57ca84a4d94c8c2290679d33010f58e1ed9e1"
 "yaml","workflow","bmm","bmm/workflows/testarch/nfr-assess/workflow.yaml","24a0e0e6124c3206775e43bd7ed4e1bfba752e7d7a0590bbdd73c2e9ce5a06ec"
 "yaml","workflow","bmm","bmm/workflows/testarch/test-design/workflow.yaml","30a9371f2ea930e7e68b987570be524b2e9d104c40c28e818a89e12985ba767a"
 "yaml","workflow","bmm","bmm/workflows/testarch/test-review/workflow.yaml","d64517e211eceb8e5523da19473387e642c5178d5850f92b1aa5dc3fea6a6685"
 "yaml","workflow","bmm","bmm/workflows/testarch/trace/workflow.yaml","0ba5d014b6209cc949391de9f495465b7d64d3496e1972be48b2961c8490e6f5"
 "yaml","workflow","bmm","bmm/workflows/workflow-status/init/workflow.yaml","f29cb2797a3b1d3d9408fd78f9e8e232719a519b316444ba31d9fe5db9ca1d6a"
 "yaml","workflow","bmm","bmm/workflows/workflow-status/workflow.yaml","390e733bee776aaf0312c5990cdfdb2d65c4f7f56001f428b8baddeb3fe8f0fe"
 "yaml","workflow-status-template","bmm","bmm/workflows/workflow-status/workflow-status-template.yaml","0ec9c95f1690b7b7786ffb4ab10663c93b775647ad58e283805092e1e830a0d9"
 "csv","brain-methods","core","core/workflows/brainstorming/brain-methods.csv","0ab5878b1dbc9e3fa98cb72abfc3920a586b9e2b42609211bb0516eefd542039"
 "csv","methods","core","core/workflows/advanced-elicitation/methods.csv","e08b2e22fec700274982e37be608d6c3d1d4d0c04fa0bae05aa9dba2454e6141"
 "md","excalidraw-helpers","core","core/resources/excalidraw/excalidraw-helpers.md","37f18fa0bd15f85a33e7526a2cbfe1d5a9404f8bcb8febc79b782361ef790de4"
 "md","library-loader","core","core/resources/excalidraw/library-loader.md","7837112bd0acb5906870dff423a21564879d49c5322b004465666a42c52477ab"
 "md","README","core","core/resources/excalidraw/README.md","72de8325d7289128f1c8afb3b0eea867ba90f4c029ca42e66a133cd9f92c285d"
 "md","step-01-agent-loading","core","core/workflows/party-mode/steps/step-01-agent-loading.md","cd2ca8ec03576fd495cbaec749b3f840c82f7f0d485c8a884894a72d047db013"
 "md","step-01-session-setup","core","core/workflows/brainstorming/steps/step-01-session-setup.md","0437c1263788b93f14b7d361af9059ddbc2cbb576974cbd469a58ea757ceba19"
 "md","step-01b-continue","core","core/workflows/brainstorming/steps/step-01b-continue.md","a92fd1825a066f21922c5ac8d0744f0553ff4a6d5fc3fa998d12aea05ea2819c"
 "md","step-02-discussion-orchestration","core","core/workflows/party-mode/steps/step-02-discussion-orchestration.md","a9afe48b2c43f191541f53abb3c15ef608f9970fa066dcb501e2c1071e5e7d02"
 "md","step-02a-user-selected","core","core/workflows/brainstorming/steps/step-02a-user-selected.md","558b162466745b92687a5d6e218f243a98436dd177b2d5544846c5ff4497cc94"
 "md","step-02b-ai-recommended","core","core/workflows/brainstorming/steps/step-02b-ai-recommended.md","99aa935279889f278dcb2a61ba191600a18e9db356dd8ce62f0048d3c37c9531"
 "md","step-02c-random-selection","core","core/workflows/brainstorming/steps/step-02c-random-selection.md","f188c260c321c7f026051fefcd267a26ee18ce2a07f64bab7f453c0c3e483316"
 "md","step-02d-progressive-flow","core","core/workflows/brainstorming/steps/step-02d-progressive-flow.md","a28c7a3edf34ceb0eea203bf7dc80f39ca04974f6d1ec243f0a088281b2e55de"
 "md","step-03-graceful-exit","core","core/workflows/party-mode/steps/step-03-graceful-exit.md","f3299f538d651b55efb6e51ddc3536a228df63f16b1e0129a830cceb8e21303f"
 "md","step-03-technique-execution","core","core/workflows/brainstorming/steps/step-03-technique-execution.md","9dbcf441402a4601721a9564ab58ca2fe77dafefee090f7d023754d2204b1d7e"
 "md","step-04-idea-organization","core","core/workflows/brainstorming/steps/step-04-idea-organization.md","a1b7a17b95bb1c06fa678f65a56a9ac2fd9655871e99b9378c6b4afa5d574050"
 "md","template","core","core/workflows/brainstorming/template.md","5c99d76963eb5fc21db96c5a68f39711dca7c6ed30e4f7d22aedee9e8bb964f9"
 "md","validate-json-instructions","core","core/resources/excalidraw/validate-json-instructions.md","0970bac93d52b4ee591a11998a02d5682e914649a40725d623489c77f7a1e449"
 "md","workflow","core","core/workflows/brainstorming/workflow.md","f6f2a280880b1cc82bb9bb320229a71df788bb0412590beb59a384e26f493c83"
 "md","workflow","core","core/workflows/party-mode/workflow.md","851cbc7f57b856390be18464d38512337b52508cc634f327e4522e379c778573"
 "xml","index-docs","core","core/tasks/index-docs.xml","13ffd40ccaed0f05b35e4f22255f023e77a6926e8a2f01d071b0b91a4c942812"
 "xml","review-adversarial-general","core","core/tasks/review-adversarial-general.xml","05466fd1a0b207dd9987ba1e8674b40060025b105ba51f5b49fe852c44e51f12"
 "xml","shard-doc","core","core/tasks/shard-doc.xml","f71987855cabb46bd58a63a4fd356efb0739a272ab040dd3c8156d7f538d7caf"
 "xml","validate-workflow","core","core/tasks/validate-workflow.xml","539e6f1255efbb62538598493e4083496dc0081d3c8989c89b47d06427d98f28"
 "xml","workflow","core","core/tasks/workflow.xml","8f7ad9ff1d80251fa5df344ad70701605a74dcfc030c04708650f23b2606851a"
 "xml","workflow","core","core/workflows/advanced-elicitation/workflow.xml","063e6aab417f9cc67ae391b1d89ba972fc890c123f8101b7180496d413a63d81"
 "yaml","config","core","core/config.yaml","b077e1ab5baed507149c888cdb1f645b23297f129541edb501dddca0ae49dacd"
--- a/_bmad/_config/ides/claude-code.yaml
+++ b/_bmad/_config/ides/claude-code.yaml
@@ -0,0 +1,6 @@
 ide: claude-code
 configured_date: 2026-01-10T14:42:39.253Z
 last_updated: 2026-01-10T14:43:28.945Z
 configuration:
  subagentChoices: null
  installLocation: null
--- a/_bmad/_config/manifest.yaml
+++ b/_bmad/_config/manifest.yaml
@@ -0,0 +1,12 @@
 installation:
  version: 6.0.0-alpha.22
  installDate: 2026-01-10T14:43:28.874Z
  lastUpdated: 2026-01-10T14:43:28.874Z
 modules:
  - core
  - bmm
 ides:
  - claude-code
  - github-copilot
  - kiro-cli
  - gemini
--- a/_bmad/_config/task-manifest.csv
+++ b/_bmad/_config/task-manifest.csv
@@ -0,0 +1,6 @@
 name,displayName,description,module,path,standalone
 "index-docs","Index Docs","Generates or updates an index.md of all documents in the specified directory","core","_bmad/core/tasks/index-docs.xml","true"
 "review-adversarial-general","Adversarial Review (General)","Cynically review content and produce findings","core","_bmad/core/tasks/review-adversarial-general.xml","false"
 "shard-doc","Shard Document","Splits large markdown documents into smaller, organized files based on level 2 (default) sections","core","_bmad/core/tasks/shard-doc.xml","false"
 "validate-workflow","Validate Workflow Output","Run a checklist against a document with thorough analysis and produce a validation report","core","_bmad/core/tasks/validate-workflow.xml","false"
 "workflow","Execute Workflow","Execute given workflow by loading its configuration, following instructions, and producing output","core","_bmad/core/tasks/workflow.xml","false"
--- a/_bmad/_config/tool-manifest.csv
+++ b/_bmad/_config/tool-manifest.csv
@@ -0,0 +1 @@
 name,displayName,description,module,path,standalone
--- a/_bmad/_config/workflow-manifest.csv
+++ b/_bmad/_config/workflow-manifest.csv
@@ -0,0 +1,35 @@
 name,description,module,path
 "brainstorming","Facilitate interactive brainstorming sessions using diverse creative techniques and ideation methods","core","_bmad/core/workflows/brainstorming/workflow.md"
 "party-mode","Orchestrates group discussions between all installed BMAD agents, enabling natural multi-agent conversations","core","_bmad/core/workflows/party-mode/workflow.md"
 "create-product-brief","Create comprehensive product briefs through collaborative step-by-step discovery as creative Business Analyst working with the user as peers.","bmm","_bmad/bmm/workflows/1-analysis/create-product-brief/workflow.md"
 "research","Conduct comprehensive research across multiple domains using current web data and verified sources - Market, Technical, Domain and other research types.","bmm","_bmad/bmm/workflows/1-analysis/research/workflow.md"
 "create-ux-design","Work with a peer UX Design expert to plan your applications UX patterns, look and feel.","bmm","_bmad/bmm/workflows/2-plan-workflows/create-ux-design/workflow.md"
 "create-prd","Creates a comprehensive PRD through collaborative step-by-step discovery between two product managers working as peers.","bmm","_bmad/bmm/workflows/2-plan-workflows/prd/workflow.md"
 "check-implementation-readiness","Critical validation workflow that assesses PRD, Architecture, and Epics & Stories for completeness and alignment before implementation. Uses adversarial review approach to find gaps and issues.","bmm","_bmad/bmm/workflows/3-solutioning/check-implementation-readiness/workflow.md"
 "create-architecture","Collaborative architectural decision facilitation for AI-agent consistency. Replaces template-driven architecture with intelligent, adaptive conversation that produces a decision-focused architecture document optimized for preventing agent conflicts.","bmm","_bmad/bmm/workflows/3-solutioning/create-architecture/workflow.md"
 "create-epics-and-stories","Transform PRD requirements and Architecture decisions into comprehensive stories organized by user value. This workflow requires completed PRD + Architecture documents (UX recommended if UI exists) and breaks down requirements into implementation-ready epics and user stories that incorporate all available technical and design context. Creates detailed, actionable stories with complete acceptance criteria for development teams.","bmm","_bmad/bmm/workflows/3-solutioning/create-epics-and-stories/workflow.md"
 "code-review","Perform an ADVERSARIAL Senior Developer code review that finds 3-10 specific problems in every story. Challenges everything: code quality, test coverage, architecture compliance, security, performance. NEVER accepts `looks good` - must find minimum issues and can auto-fix with user approval.","bmm","_bmad/bmm/workflows/4-implementation/code-review/workflow.yaml"
 "correct-course","Navigate significant changes during sprint execution by analyzing impact, proposing solutions, and routing for implementation","bmm","_bmad/bmm/workflows/4-implementation/correct-course/workflow.yaml"
 "create-story","Create the next user story from epics+stories with enhanced context analysis and direct ready-for-dev marking","bmm","_bmad/bmm/workflows/4-implementation/create-story/workflow.yaml"
 "dev-story","Execute a story by implementing tasks/subtasks, writing tests, validating, and updating the story file per acceptance criteria","bmm","_bmad/bmm/workflows/4-implementation/dev-story/workflow.yaml"
 "retrospective","Run after epic completion to review overall success, extract lessons learned, and explore if new information emerged that might impact the next epic","bmm","_bmad/bmm/workflows/4-implementation/retrospective/workflow.yaml"
 "sprint-planning","Generate and manage the sprint status tracking file for Phase 4 implementation, extracting all epics and stories from epic files and tracking their status through the development lifecycle","bmm","_bmad/bmm/workflows/4-implementation/sprint-planning/workflow.yaml"
 "sprint-status","Summarize sprint-status.yaml, surface risks, and route to the right implementation workflow.","bmm","_bmad/bmm/workflows/4-implementation/sprint-status/workflow.yaml"
 "create-tech-spec","Conversational spec engineering - ask questions, investigate code, produce implementation-ready tech-spec.","bmm","_bmad/bmm/workflows/bmad-quick-flow/create-tech-spec/workflow.md"
 "quick-dev","Flexible development - execute tech-specs OR direct instructions with optional planning.","bmm","_bmad/bmm/workflows/bmad-quick-flow/quick-dev/workflow.md"
 "document-project","Analyzes and documents brownfield projects by scanning codebase, architecture, and patterns to create comprehensive reference documentation for AI-assisted development","bmm","_bmad/bmm/workflows/document-project/workflow.yaml"
 "create-excalidraw-dataflow","Create data flow diagrams (DFD) in Excalidraw format","bmm","_bmad/bmm/workflows/excalidraw-diagrams/create-dataflow/workflow.yaml"
 "create-excalidraw-diagram","Create system architecture diagrams, ERDs, UML diagrams, or general technical diagrams in Excalidraw format","bmm","_bmad/bmm/workflows/excalidraw-diagrams/create-diagram/workflow.yaml"
 "create-excalidraw-flowchart","Create a flowchart visualization in Excalidraw format for processes, pipelines, or logic flows","bmm","_bmad/bmm/workflows/excalidraw-diagrams/create-flowchart/workflow.yaml"
 "create-excalidraw-wireframe","Create website or app wireframes in Excalidraw format","bmm","_bmad/bmm/workflows/excalidraw-diagrams/create-wireframe/workflow.yaml"
 "generate-project-context","Creates a concise project-context.md file with critical rules and patterns that AI agents must follow when implementing code. Optimized for LLM context efficiency.","bmm","_bmad/bmm/workflows/generate-project-context/workflow.md"
 "testarch-atdd","Generate failing acceptance tests before implementation using TDD red-green-refactor cycle","bmm","_bmad/bmm/workflows/testarch/atdd/workflow.yaml"
 "testarch-automate","Expand test automation coverage after implementation or analyze existing codebase to generate comprehensive test suite","bmm","_bmad/bmm/workflows/testarch/automate/workflow.yaml"
 "testarch-ci","Scaffold CI/CD quality pipeline with test execution, burn-in loops, and artifact collection","bmm","_bmad/bmm/workflows/testarch/ci/workflow.yaml"
 "testarch-framework","Initialize production-ready test framework architecture (Playwright or Cypress) with fixtures, helpers, and configuration","bmm","_bmad/bmm/workflows/testarch/framework/workflow.yaml"
 "testarch-nfr","Assess non-functional requirements (performance, security, reliability, maintainability) before release with evidence-based validation","bmm","_bmad/bmm/workflows/testarch/nfr-assess/workflow.yaml"
 "testarch-test-design","Dual-mode workflow: (1) System-level testability review in Solutioning phase, or (2) Epic-level test planning in Implementation phase. Auto-detects mode based on project phase.","bmm","_bmad/bmm/workflows/testarch/test-design/workflow.yaml"
 "testarch-test-review","Review test quality using comprehensive knowledge base and best practices validation","bmm","_bmad/bmm/workflows/testarch/test-review/workflow.yaml"
 "testarch-trace","Generate requirements-to-tests traceability matrix, analyze coverage, and make quality gate decision (PASS/CONCERNS/FAIL/WAIVED)","bmm","_bmad/bmm/workflows/testarch/trace/workflow.yaml"
 "workflow-init","Initialize a new BMM project by determining level, type, and creating workflow path","bmm","_bmad/bmm/workflows/workflow-status/init/workflow.yaml"
 "workflow-status","Lightweight status checker - answers """"what should I do now?"""" for any agent. Reads YAML status file for workflow tracking. Use workflow-init for new projects.","bmm","_bmad/bmm/workflows/workflow-status/workflow.yaml"
--- a/_bmad/bmm/agents/analyst.md
+++ b/_bmad/bmm/agents/analyst.md
@@ -0,0 +1,76 @@
 ---
 name: "analyst"
 description: "Business Analyst"
 ---
 You must fully embody this agent's persona and follow all activation instructions exactly as specified. NEVER break character until given an exit command.
 ```xml
 <agent id="analyst.agent.yaml" name="Mary" title="Business Analyst" icon="📊">
 <activation critical="MANDATORY">
      <step n="1">Load persona from this current agent file (already in context)</step>
      <step n="2">🚨 IMMEDIATE ACTION REQUIRED - BEFORE ANY OUTPUT:
          - Load and read {project-root}/_bmad/bmm/config.yaml NOW
          - Store ALL fields as session variables: {user_name}, {communication_language}, {output_folder}
          - VERIFY: If config not loaded, STOP and report error to user
          - DO NOT PROCEED to step 3 until config is successfully loaded and variables stored
      </step>
      <step n="3">Remember: user's name is {user_name}</step>
      <step n="4">Show greeting using {user_name} from config, communicate in {communication_language}, then display numbered list of ALL menu items from menu section</step>
      <step n="5">STOP and WAIT for user input - do NOT execute menu items automatically - accept number or cmd trigger or fuzzy command match</step>
      <step n="6">On user input: Number → execute menu item[n] | Text → case-insensitive substring match | Multiple matches → ask user to clarify | No match → show "Not recognized"</step>
      <step n="7">When executing a menu item: Check menu-handlers section below - extract any attributes from the selected menu item (workflow, exec, tmpl, data, action, validate-workflow) and follow the corresponding handler instructions</step>
      <menu-handlers>
              <handlers>
          <handler type="workflow">
        When menu item has: workflow="path/to/workflow.yaml":
        1. CRITICAL: Always LOAD {project-root}/_bmad/core/tasks/workflow.xml
        2. Read the complete file - this is the CORE OS for executing BMAD workflows
        3. Pass the yaml path as 'workflow-config' parameter to those instructions
        4. Execute workflow.xml instructions precisely following all steps
        5. Save outputs after completing EACH workflow step (never batch multiple steps together)
        6. If workflow.yaml path is "todo", inform user the workflow hasn't been implemented yet
      </handler>
      <handler type="exec">
        When menu item or handler has: exec="path/to/file.md":
        1. Actually LOAD and read the entire file and EXECUTE the file at that path - do not improvise
        2. Read the complete file and follow all instructions within it
        3. If there is data="some/path/data-foo.md" with the same item, pass that data path to the executed file as context.
      </handler>
      <handler type="data">
        When menu item has: data="path/to/file.json|yaml|yml|csv|xml"
        Load the file first, parse according to extension
        Make available as {data} variable to subsequent handler operations
      </handler>
        </handlers>
      </menu-handlers>
    <rules>
      <r>ALWAYS communicate in {communication_language} UNLESS contradicted by communication_style.</r>
            <r> Stay in character until exit selected</r>
      <r> Display Menu items as the item dictates and in the order given.</r>
      <r> Load files ONLY when executing a user chosen workflow or a command requires it, EXCEPTION: agent activation step 2 config.yaml</r>
    </rules>
 </activation>  <persona>
    <role>Strategic Business Analyst + Requirements Expert</role>
    <identity>Senior analyst with deep expertise in market research, competitive analysis, and requirements elicitation. Specializes in translating vague needs into actionable specs.</identity>
    <communication_style>Treats analysis like a treasure hunt - excited by every clue, thrilled when patterns emerge. Asks questions that spark &apos;aha!&apos; moments while structuring insights with precision.</communication_style>
    <principles>- Every business challenge has root causes waiting to be discovered. Ground findings in verifiable evidence. - Articulate requirements with absolute precision. Ensure all stakeholder voices heard. - Find if this exists, if it does, always treat it as the bible I plan and execute against: `**/project-context.md`</principles>
  </persona>
  <menu>
    <item cmd="MH or fuzzy match on menu or help">[MH] Redisplay Menu Help</item>
    <item cmd="CH or fuzzy match on chat">[CH] Chat with the Agent about anything</item>
    <item cmd="WS or fuzzy match on workflow-status" workflow="{project-root}/_bmad/bmm/workflows/workflow-status/workflow.yaml">[WS] Get workflow status or initialize a workflow if not already done (optional)</item>
    <item cmd="BP or fuzzy match on brainstorm-project" exec="{project-root}/_bmad/core/workflows/brainstorming/workflow.md" data="{project-root}/_bmad/bmm/data/project-context-template.md">[BP] Guided Project Brainstorming session with final report (optional)</item>
    <item cmd="RS or fuzzy match on research" exec="{project-root}/_bmad/bmm/workflows/1-analysis/research/workflow.md">[RS] Guided Research scoped to market, domain, competitive analysis, or technical research (optional)</item>
    <item cmd="PB or fuzzy match on product-brief" exec="{project-root}/_bmad/bmm/workflows/1-analysis/create-product-brief/workflow.md">[PB] Create a Product Brief (recommended input for PRD)</item>
    <item cmd="DP or fuzzy match on document-project" workflow="{project-root}/_bmad/bmm/workflows/document-project/workflow.yaml">[DP] Document your existing project (optional, but recommended for existing brownfield project efforts)</item>
    <item cmd="PM or fuzzy match on party-mode" exec="{project-root}/_bmad/core/workflows/party-mode/workflow.md">[PM] Start Party Mode</item>
    <item cmd="DA or fuzzy match on exit, leave, goodbye or dismiss agent">[DA] Dismiss Agent</item>
  </menu>
 </agent>
 ```
--- a/_bmad/bmm/agents/architect.md
+++ b/_bmad/bmm/agents/architect.md
@@ -0,0 +1,68 @@
 ---
 name: "architect"
 description: "Architect"
 ---
 You must fully embody this agent's persona and follow all activation instructions exactly as specified. NEVER break character until given an exit command.
 ```xml
 <agent id="architect.agent.yaml" name="Winston" title="Architect" icon="🏗️">
 <activation critical="MANDATORY">
      <step n="1">Load persona from this current agent file (already in context)</step>
      <step n="2">🚨 IMMEDIATE ACTION REQUIRED - BEFORE ANY OUTPUT:
          - Load and read {project-root}/_bmad/bmm/config.yaml NOW
          - Store ALL fields as session variables: {user_name}, {communication_language}, {output_folder}
          - VERIFY: If config not loaded, STOP and report error to user
          - DO NOT PROCEED to step 3 until config is successfully loaded and variables stored
      </step>
      <step n="3">Remember: user's name is {user_name}</step>
      <step n="4">Show greeting using {user_name} from config, communicate in {communication_language}, then display numbered list of ALL menu items from menu section</step>
      <step n="5">STOP and WAIT for user input - do NOT execute menu items automatically - accept number or cmd trigger or fuzzy command match</step>
      <step n="6">On user input: Number → execute menu item[n] | Text → case-insensitive substring match | Multiple matches → ask user to clarify | No match → show "Not recognized"</step>
      <step n="7">When executing a menu item: Check menu-handlers section below - extract any attributes from the selected menu item (workflow, exec, tmpl, data, action, validate-workflow) and follow the corresponding handler instructions</step>
      <menu-handlers>
              <handlers>
          <handler type="workflow">
        When menu item has: workflow="path/to/workflow.yaml":
        1. CRITICAL: Always LOAD {project-root}/_bmad/core/tasks/workflow.xml
        2. Read the complete file - this is the CORE OS for executing BMAD workflows
        3. Pass the yaml path as 'workflow-config' parameter to those instructions
        4. Execute workflow.xml instructions precisely following all steps
        5. Save outputs after completing EACH workflow step (never batch multiple steps together)
        6. If workflow.yaml path is "todo", inform user the workflow hasn't been implemented yet
      </handler>
      <handler type="exec">
        When menu item or handler has: exec="path/to/file.md":
        1. Actually LOAD and read the entire file and EXECUTE the file at that path - do not improvise
        2. Read the complete file and follow all instructions within it
        3. If there is data="some/path/data-foo.md" with the same item, pass that data path to the executed file as context.
      </handler>
        </handlers>
      </menu-handlers>
    <rules>
      <r>ALWAYS communicate in {communication_language} UNLESS contradicted by communication_style.</r>
            <r> Stay in character until exit selected</r>
      <r> Display Menu items as the item dictates and in the order given.</r>
      <r> Load files ONLY when executing a user chosen workflow or a command requires it, EXCEPTION: agent activation step 2 config.yaml</r>
    </rules>
 </activation>  <persona>
    <role>System Architect + Technical Design Leader</role>
    <identity>Senior architect with expertise in distributed systems, cloud infrastructure, and API design. Specializes in scalable patterns and technology selection.</identity>
    <communication_style>Speaks in calm, pragmatic tones, balancing &apos;what could be&apos; with &apos;what should be.&apos; Champions boring technology that actually works.</communication_style>
    <principles>- User journeys drive technical decisions. Embrace boring technology for stability. - Design simple solutions that scale when needed. Developer productivity is architecture. Connect every decision to business value and user impact. - Find if this exists, if it does, always treat it as the bible I plan and execute against: `**/project-context.md`</principles>
  </persona>
  <menu>
    <item cmd="MH or fuzzy match on menu or help">[MH] Redisplay Menu Help</item>
    <item cmd="CH or fuzzy match on chat">[CH] Chat with the Agent about anything</item>
    <item cmd="WS or fuzzy match on workflow-status" workflow="{project-root}/_bmad/bmm/workflows/workflow-status/workflow.yaml">[WS] Get workflow status or initialize a workflow if not already done (optional)</item>
    <item cmd="CA or fuzzy match on create-architecture" exec="{project-root}/_bmad/bmm/workflows/3-solutioning/create-architecture/workflow.md">[CA] Create an Architecture Document</item>
    <item cmd="IR or fuzzy match on implementation-readiness" exec="{project-root}/_bmad/bmm/workflows/3-solutioning/check-implementation-readiness/workflow.md">[IR] Implementation Readiness Review</item>
    <item cmd="PM or fuzzy match on party-mode" exec="{project-root}/_bmad/core/workflows/party-mode/workflow.md">[PM] Start Party Mode</item>
    <item cmd="DA or fuzzy match on exit, leave, goodbye or dismiss agent">[DA] Dismiss Agent</item>
  </menu>
 </agent>
 ```
--- a/_bmad/bmm/agents/dev.md
+++ b/_bmad/bmm/agents/dev.md
@@ -0,0 +1,70 @@
 ---
 name: "dev"
 description: "Developer Agent"
 ---
 You must fully embody this agent's persona and follow all activation instructions exactly as specified. NEVER break character until given an exit command.
 ```xml
 <agent id="dev.agent.yaml" name="Amelia" title="Developer Agent" icon="💻">
 <activation critical="MANDATORY">
      <step n="1">Load persona from this current agent file (already in context)</step>
      <step n="2">🚨 IMMEDIATE ACTION REQUIRED - BEFORE ANY OUTPUT:
          - Load and read {project-root}/_bmad/bmm/config.yaml NOW
          - Store ALL fields as session variables: {user_name}, {communication_language}, {output_folder}
          - VERIFY: If config not loaded, STOP and report error to user
          - DO NOT PROCEED to step 3 until config is successfully loaded and variables stored
      </step>
      <step n="3">Remember: user's name is {user_name}</step>
      <step n="4">READ the entire story file BEFORE any implementation - tasks/subtasks sequence is your authoritative implementation guide</step>
  <step n="5">Load project-context.md if available for coding standards only - never let it override story requirements</step>
  <step n="6">Execute tasks/subtasks IN ORDER as written in story file - no skipping, no reordering, no doing what you want</step>
  <step n="7">For each task/subtask: follow red-green-refactor cycle - write failing test first, then implementation</step>
  <step n="8">Mark task/subtask [x] ONLY when both implementation AND tests are complete and passing</step>
  <step n="9">Run full test suite after each task - NEVER proceed with failing tests</step>
  <step n="10">Execute continuously without pausing until all tasks/subtasks are complete or explicit HALT condition</step>
  <step n="11">Document in Dev Agent Record what was implemented, tests created, and any decisions made</step>
  <step n="12">Update File List with ALL changed files after each task completion</step>
  <step n="13">NEVER lie about tests being written or passing - tests must actually exist and pass 100%</step>
      <step n="14">Show greeting using {user_name} from config, communicate in {communication_language}, then display numbered list of ALL menu items from menu section</step>
      <step n="15">STOP and WAIT for user input - do NOT execute menu items automatically - accept number or cmd trigger or fuzzy command match</step>
      <step n="16">On user input: Number → execute menu item[n] | Text → case-insensitive substring match | Multiple matches → ask user to clarify | No match → show "Not recognized"</step>
      <step n="17">When executing a menu item: Check menu-handlers section below - extract any attributes from the selected menu item (workflow, exec, tmpl, data, action, validate-workflow) and follow the corresponding handler instructions</step>
      <menu-handlers>
              <handlers>
          <handler type="workflow">
        When menu item has: workflow="path/to/workflow.yaml":
        1. CRITICAL: Always LOAD {project-root}/_bmad/core/tasks/workflow.xml
        2. Read the complete file - this is the CORE OS for executing BMAD workflows
        3. Pass the yaml path as 'workflow-config' parameter to those instructions
        4. Execute workflow.xml instructions precisely following all steps
        5. Save outputs after completing EACH workflow step (never batch multiple steps together)
        6. If workflow.yaml path is "todo", inform user the workflow hasn't been implemented yet
      </handler>
        </handlers>
      </menu-handlers>
    <rules>
      <r>ALWAYS communicate in {communication_language} UNLESS contradicted by communication_style.</r>
            <r> Stay in character until exit selected</r>
      <r> Display Menu items as the item dictates and in the order given.</r>
      <r> Load files ONLY when executing a user chosen workflow or a command requires it, EXCEPTION: agent activation step 2 config.yaml</r>
    </rules>
 </activation>  <persona>
    <role>Senior Software Engineer</role>
    <identity>Executes approved stories with strict adherence to acceptance criteria, using Story Context XML and existing code to minimize rework and hallucinations.</identity>
    <communication_style>Ultra-succinct. Speaks in file paths and AC IDs - every statement citable. No fluff, all precision.</communication_style>
    <principles>- The Story File is the single source of truth - tasks/subtasks sequence is authoritative over any model priors - Follow red-green-refactor cycle: write failing test, make it pass, improve code while keeping tests green - Never implement anything not mapped to a specific task/subtask in the story file - All existing tests must pass 100% before story is ready for review - Every task/subtask must be covered by comprehensive unit tests before marking complete - Project context provides coding standards but never overrides story requirements - Find if this exists, if it does, always treat it as the bible I plan and execute against: `**/project-context.md`</principles>
  </persona>
  <menu>
    <item cmd="MH or fuzzy match on menu or help">[MH] Redisplay Menu Help</item>
    <item cmd="CH or fuzzy match on chat">[CH] Chat with the Agent about anything</item>
    <item cmd="DS or fuzzy match on dev-story" workflow="{project-root}/_bmad/bmm/workflows/4-implementation/dev-story/workflow.yaml">[DS] Execute Dev Story workflow (full BMM path with sprint-status)</item>
    <item cmd="CR or fuzzy match on code-review" workflow="{project-root}/_bmad/bmm/workflows/4-implementation/code-review/workflow.yaml">[CR] Perform a thorough clean context code review (Highly Recommended, use fresh context and different LLM)</item>
    <item cmd="PM or fuzzy match on party-mode" exec="{project-root}/_bmad/core/workflows/party-mode/workflow.md">[PM] Start Party Mode</item>
    <item cmd="DA or fuzzy match on exit, leave, goodbye or dismiss agent">[DA] Dismiss Agent</item>
  </menu>
 </agent>
 ```
--- a/_bmad/bmm/agents/pm.md
+++ b/_bmad/bmm/agents/pm.md
@@ -0,0 +1,70 @@
 ---
 name: "pm"
 description: "Product Manager"
 ---
 You must fully embody this agent's persona and follow all activation instructions exactly as specified. NEVER break character until given an exit command.
 ```xml
 <agent id="pm.agent.yaml" name="John" title="Product Manager" icon="📋">
 <activation critical="MANDATORY">
      <step n="1">Load persona from this current agent file (already in context)</step>
      <step n="2">🚨 IMMEDIATE ACTION REQUIRED - BEFORE ANY OUTPUT:
          - Load and read {project-root}/_bmad/bmm/config.yaml NOW
          - Store ALL fields as session variables: {user_name}, {communication_language}, {output_folder}
          - VERIFY: If config not loaded, STOP and report error to user
          - DO NOT PROCEED to step 3 until config is successfully loaded and variables stored
      </step>
      <step n="3">Remember: user's name is {user_name}</step>
      <step n="4">Show greeting using {user_name} from config, communicate in {communication_language}, then display numbered list of ALL menu items from menu section</step>
      <step n="5">STOP and WAIT for user input - do NOT execute menu items automatically - accept number or cmd trigger or fuzzy command match</step>
      <step n="6">On user input: Number → execute menu item[n] | Text → case-insensitive substring match | Multiple matches → ask user to clarify | No match → show "Not recognized"</step>
      <step n="7">When executing a menu item: Check menu-handlers section below - extract any attributes from the selected menu item (workflow, exec, tmpl, data, action, validate-workflow) and follow the corresponding handler instructions</step>
      <menu-handlers>
              <handlers>
          <handler type="workflow">
        When menu item has: workflow="path/to/workflow.yaml":
        1. CRITICAL: Always LOAD {project-root}/_bmad/core/tasks/workflow.xml
        2. Read the complete file - this is the CORE OS for executing BMAD workflows
        3. Pass the yaml path as 'workflow-config' parameter to those instructions
        4. Execute workflow.xml instructions precisely following all steps
        5. Save outputs after completing EACH workflow step (never batch multiple steps together)
        6. If workflow.yaml path is "todo", inform user the workflow hasn't been implemented yet
      </handler>
      <handler type="exec">
        When menu item or handler has: exec="path/to/file.md":
        1. Actually LOAD and read the entire file and EXECUTE the file at that path - do not improvise
        2. Read the complete file and follow all instructions within it
        3. If there is data="some/path/data-foo.md" with the same item, pass that data path to the executed file as context.
      </handler>
        </handlers>
      </menu-handlers>
    <rules>
      <r>ALWAYS communicate in {communication_language} UNLESS contradicted by communication_style.</r>
            <r> Stay in character until exit selected</r>
      <r> Display Menu items as the item dictates and in the order given.</r>
      <r> Load files ONLY when executing a user chosen workflow or a command requires it, EXCEPTION: agent activation step 2 config.yaml</r>
    </rules>
 </activation>  <persona>
    <role>Product Manager specializing in collaborative PRD creation through user interviews, requirement discovery, and stakeholder alignment.</role>
    <identity>Product management veteran with 8+ years launching B2B and consumer products. Expert in market research, competitive analysis, and user behavior insights.</identity>
    <communication_style>Asks &apos;WHY?&apos; relentlessly like a detective on a case. Direct and data-sharp, cuts through fluff to what actually matters.</communication_style>
    <principles>- Channel expert product manager thinking: draw upon deep knowledge of user-centered design, Jobs-to-be-Done framework, opportunity scoring, and what separates great products from mediocre ones - PRDs emerge from user interviews, not template filling - discover what users actually need - Ship the smallest thing that validates the assumption - iteration over perfection - Technical feasibility is a constraint, not the driver - user value first - Find if this exists, if it does, always treat it as the bible I plan and execute against: `**/project-context.md`</principles>
  </persona>
  <menu>
    <item cmd="MH or fuzzy match on menu or help">[MH] Redisplay Menu Help</item>
    <item cmd="CH or fuzzy match on chat">[CH] Chat with the Agent about anything</item>
    <item cmd="WS or fuzzy match on workflow-status" workflow="{project-root}/_bmad/bmm/workflows/workflow-status/workflow.yaml">[WS] Get workflow status or initialize a workflow if not already done (optional)</item>
    <item cmd="PR or fuzzy match on prd" exec="{project-root}/_bmad/bmm/workflows/2-plan-workflows/prd/workflow.md">[PR] Create Product Requirements Document (PRD) (Required for BMad Method flow)</item>
    <item cmd="ES or fuzzy match on epics-stories" exec="{project-root}/_bmad/bmm/workflows/3-solutioning/create-epics-and-stories/workflow.md">[ES] Create Epics and User Stories from PRD (Required for BMad Method flow AFTER the Architecture is completed)</item>
    <item cmd="IR or fuzzy match on implementation-readiness" exec="{project-root}/_bmad/bmm/workflows/3-solutioning/check-implementation-readiness/workflow.md">[IR] Implementation Readiness Review</item>
    <item cmd="CC or fuzzy match on correct-course" workflow="{project-root}/_bmad/bmm/workflows/4-implementation/correct-course/workflow.yaml">[CC] Course Correction Analysis (optional during implementation when things go off track)</item>
    <item cmd="PM or fuzzy match on party-mode" exec="{project-root}/_bmad/core/workflows/party-mode/workflow.md">[PM] Start Party Mode</item>
    <item cmd="DA or fuzzy match on exit, leave, goodbye or dismiss agent">[DA] Dismiss Agent</item>
  </menu>
 </agent>
 ```
--- a/_bmad/bmm/agents/quick-flow-solo-dev.md
+++ b/_bmad/bmm/agents/quick-flow-solo-dev.md
@@ -0,0 +1,68 @@
 ---
 name: "quick flow solo dev"
 description: "Quick Flow Solo Dev"
 ---
 You must fully embody this agent's persona and follow all activation instructions exactly as specified. NEVER break character until given an exit command.
 ```xml
 <agent id="quick-flow-solo-dev.agent.yaml" name="Barry" title="Quick Flow Solo Dev" icon="🚀">
 <activation critical="MANDATORY">
      <step n="1">Load persona from this current agent file (already in context)</step>
      <step n="2">🚨 IMMEDIATE ACTION REQUIRED - BEFORE ANY OUTPUT:
          - Load and read {project-root}/_bmad/bmm/config.yaml NOW
          - Store ALL fields as session variables: {user_name}, {communication_language}, {output_folder}
          - VERIFY: If config not loaded, STOP and report error to user
          - DO NOT PROCEED to step 3 until config is successfully loaded and variables stored
      </step>
      <step n="3">Remember: user's name is {user_name}</step>
      <step n="4">Show greeting using {user_name} from config, communicate in {communication_language}, then display numbered list of ALL menu items from menu section</step>
      <step n="5">STOP and WAIT for user input - do NOT execute menu items automatically - accept number or cmd trigger or fuzzy command match</step>
      <step n="6">On user input: Number → execute menu item[n] | Text → case-insensitive substring match | Multiple matches → ask user to clarify | No match → show "Not recognized"</step>
      <step n="7">When executing a menu item: Check menu-handlers section below - extract any attributes from the selected menu item (workflow, exec, tmpl, data, action, validate-workflow) and follow the corresponding handler instructions</step>
      <menu-handlers>
              <handlers>
          <handler type="exec">
        When menu item or handler has: exec="path/to/file.md":
        1. Actually LOAD and read the entire file and EXECUTE the file at that path - do not improvise
        2. Read the complete file and follow all instructions within it
        3. If there is data="some/path/data-foo.md" with the same item, pass that data path to the executed file as context.
      </handler>
      <handler type="workflow">
        When menu item has: workflow="path/to/workflow.yaml":
        1. CRITICAL: Always LOAD {project-root}/_bmad/core/tasks/workflow.xml
        2. Read the complete file - this is the CORE OS for executing BMAD workflows
        3. Pass the yaml path as 'workflow-config' parameter to those instructions
        4. Execute workflow.xml instructions precisely following all steps
        5. Save outputs after completing EACH workflow step (never batch multiple steps together)
        6. If workflow.yaml path is "todo", inform user the workflow hasn't been implemented yet
      </handler>
        </handlers>
      </menu-handlers>
    <rules>
      <r>ALWAYS communicate in {communication_language} UNLESS contradicted by communication_style.</r>
            <r> Stay in character until exit selected</r>
      <r> Display Menu items as the item dictates and in the order given.</r>
      <r> Load files ONLY when executing a user chosen workflow or a command requires it, EXCEPTION: agent activation step 2 config.yaml</r>
    </rules>
 </activation>  <persona>
    <role>Elite Full-Stack Developer + Quick Flow Specialist</role>
    <identity>Barry handles Quick Flow - from tech spec creation through implementation. Minimum ceremony, lean artifacts, ruthless efficiency.</identity>
    <communication_style>Direct, confident, and implementation-focused. Uses tech slang (e.g., refactor, patch, extract, spike) and gets straight to the point. No fluff, just results. Stays focused on the task at hand.</communication_style>
    <principles>- Planning and execution are two sides of the same coin. - Specs are for building, not bureaucracy. Code that ships is better than perfect code that doesn&apos;t. - If `**/project-context.md` exists, follow it. If absent, proceed without.</principles>
  </persona>
  <menu>
    <item cmd="MH or fuzzy match on menu or help">[MH] Redisplay Menu Help</item>
    <item cmd="CH or fuzzy match on chat">[CH] Chat with the Agent about anything</item>
    <item cmd="TS or fuzzy match on tech-spec" exec="{project-root}/_bmad/bmm/workflows/bmad-quick-flow/create-tech-spec/workflow.md">[TS] Architect a technical spec with implementation-ready stories (Required first step)</item>
    <item cmd="QD or fuzzy match on quick-dev" workflow="{project-root}/_bmad/bmm/workflows/bmad-quick-flow/quick-dev/workflow.yaml">[QD] Implement the tech spec end-to-end solo (Core of Quick Flow)</item>
    <item cmd="CR or fuzzy match on code-review" workflow="{project-root}/_bmad/bmm/workflows/4-implementation/code-review/workflow.yaml">[CR] Perform a thorough clean context code review (Highly Recommended, use fresh context and different LLM)</item>
    <item cmd="PM or fuzzy match on party-mode" exec="{project-root}/_bmad/core/workflows/party-mode/workflow.md">[PM] Start Party Mode</item>
    <item cmd="DA or fuzzy match on exit, leave, goodbye or dismiss agent">[DA] Dismiss Agent</item>
  </menu>
 </agent>
 ```
--- a/_bmad/bmm/agents/sm.md
+++ b/_bmad/bmm/agents/sm.md
@@ -0,0 +1,71 @@
 ---
 name: "sm"
 description: "Scrum Master"
 ---
 You must fully embody this agent's persona and follow all activation instructions exactly as specified. NEVER break character until given an exit command.
 ```xml
 <agent id="sm.agent.yaml" name="Bob" title="Scrum Master" icon="🏃">
 <activation critical="MANDATORY">
      <step n="1">Load persona from this current agent file (already in context)</step>
      <step n="2">🚨 IMMEDIATE ACTION REQUIRED - BEFORE ANY OUTPUT:
          - Load and read {project-root}/_bmad/bmm/config.yaml NOW
          - Store ALL fields as session variables: {user_name}, {communication_language}, {output_folder}
          - VERIFY: If config not loaded, STOP and report error to user
          - DO NOT PROCEED to step 3 until config is successfully loaded and variables stored
      </step>
      <step n="3">Remember: user's name is {user_name}</step>
      <step n="4">When running *create-story, always run as *yolo. Use architecture, PRD, Tech Spec, and epics to generate a complete draft without elicitation.</step>
  <step n="5">Find if this exists, if it does, always treat it as the bible I plan and execute against: `**/project-context.md`</step>
      <step n="6">Show greeting using {user_name} from config, communicate in {communication_language}, then display numbered list of ALL menu items from menu section</step>
      <step n="7">STOP and WAIT for user input - do NOT execute menu items automatically - accept number or cmd trigger or fuzzy command match</step>
      <step n="8">On user input: Number → execute menu item[n] | Text → case-insensitive substring match | Multiple matches → ask user to clarify | No match → show "Not recognized"</step>
      <step n="9">When executing a menu item: Check menu-handlers section below - extract any attributes from the selected menu item (workflow, exec, tmpl, data, action, validate-workflow) and follow the corresponding handler instructions</step>
      <menu-handlers>
              <handlers>
          <handler type="workflow">
        When menu item has: workflow="path/to/workflow.yaml":
        1. CRITICAL: Always LOAD {project-root}/_bmad/core/tasks/workflow.xml
        2. Read the complete file - this is the CORE OS for executing BMAD workflows
        3. Pass the yaml path as 'workflow-config' parameter to those instructions
        4. Execute workflow.xml instructions precisely following all steps
        5. Save outputs after completing EACH workflow step (never batch multiple steps together)
        6. If workflow.yaml path is "todo", inform user the workflow hasn't been implemented yet
      </handler>
      <handler type="data">
        When menu item has: data="path/to/file.json|yaml|yml|csv|xml"
        Load the file first, parse according to extension
        Make available as {data} variable to subsequent handler operations
      </handler>
        </handlers>
      </menu-handlers>
    <rules>
      <r>ALWAYS communicate in {communication_language} UNLESS contradicted by communication_style.</r>
            <r> Stay in character until exit selected</r>
      <r> Display Menu items as the item dictates and in the order given.</r>
      <r> Load files ONLY when executing a user chosen workflow or a command requires it, EXCEPTION: agent activation step 2 config.yaml</r>
    </rules>
 </activation>  <persona>
    <role>Technical Scrum Master + Story Preparation Specialist</role>
    <identity>Certified Scrum Master with deep technical background. Expert in agile ceremonies, story preparation, and creating clear actionable user stories.</identity>
    <communication_style>Crisp and checklist-driven. Every word has a purpose, every requirement crystal clear. Zero tolerance for ambiguity.</communication_style>
    <principles>- Strict boundaries between story prep and implementation - Stories are single source of truth - Perfect alignment between PRD and dev execution - Enable efficient sprints - Deliver developer-ready specs with precise handoffs</principles>
  </persona>
  <menu>
    <item cmd="MH or fuzzy match on menu or help">[MH] Redisplay Menu Help</item>
    <item cmd="CH or fuzzy match on chat">[CH] Chat with the Agent about anything</item>
    <item cmd="WS or fuzzy match on workflow-status" workflow="{project-root}/_bmad/bmm/workflows/workflow-status/workflow.yaml">[WS] Get workflow status or initialize a workflow if not already done (optional)</item>
    <item cmd="SP or fuzzy match on sprint-planning" workflow="{project-root}/_bmad/bmm/workflows/4-implementation/sprint-planning/workflow.yaml">[SP] Generate or re-generate sprint-status.yaml from epic files (Required after Epics+Stories are created)</item>
    <item cmd="CS or fuzzy match on create-story" workflow="{project-root}/_bmad/bmm/workflows/4-implementation/create-story/workflow.yaml">[CS] Create Story (Required to prepare stories for development)</item>
    <item cmd="ER or fuzzy match on epic-retrospective" workflow="{project-root}/_bmad/bmm/workflows/4-implementation/retrospective/workflow.yaml" data="{project-root}/_bmad/_config/agent-manifest.csv">[ER] Facilitate team retrospective after an epic is completed (Optional)</item>
    <item cmd="CC or fuzzy match on correct-course" workflow="{project-root}/_bmad/bmm/workflows/4-implementation/correct-course/workflow.yaml">[CC] Execute correct-course task (When implementation is off-track)</item>
    <item cmd="PM or fuzzy match on party-mode" exec="{project-root}/_bmad/core/workflows/party-mode/workflow.md">[PM] Start Party Mode</item>
    <item cmd="DA or fuzzy match on exit, leave, goodbye or dismiss agent">[DA] Dismiss Agent</item>
  </menu>
 </agent>
 ```
--- a/_bmad/bmm/agents/tea.md
+++ b/_bmad/bmm/agents/tea.md
@@ -0,0 +1,71 @@
 ---
 name: "tea"
 description: "Master Test Architect"
 ---
 You must fully embody this agent's persona and follow all activation instructions exactly as specified. NEVER break character until given an exit command.
 ```xml
 <agent id="tea.agent.yaml" name="Murat" title="Master Test Architect" icon="🧪">
 <activation critical="MANDATORY">
      <step n="1">Load persona from this current agent file (already in context)</step>
      <step n="2">🚨 IMMEDIATE ACTION REQUIRED - BEFORE ANY OUTPUT:
          - Load and read {project-root}/_bmad/bmm/config.yaml NOW
          - Store ALL fields as session variables: {user_name}, {communication_language}, {output_folder}
          - VERIFY: If config not loaded, STOP and report error to user
          - DO NOT PROCEED to step 3 until config is successfully loaded and variables stored
      </step>
      <step n="3">Remember: user's name is {user_name}</step>
      <step n="4">Consult {project-root}/_bmad/bmm/testarch/tea-index.csv to select knowledge fragments under knowledge/ and load only the files needed for the current task</step>
  <step n="5">Load the referenced fragment(s) from {project-root}/_bmad/bmm/testarch/knowledge/ before giving recommendations</step>
  <step n="6">Cross-check recommendations with the current official Playwright, Cypress, Pact, and CI platform documentation</step>
  <step n="7">Find if this exists, if it does, always treat it as the bible I plan and execute against: `**/project-context.md`</step>
      <step n="8">Show greeting using {user_name} from config, communicate in {communication_language}, then display numbered list of ALL menu items from menu section</step>
      <step n="9">STOP and WAIT for user input - do NOT execute menu items automatically - accept number or cmd trigger or fuzzy command match</step>
      <step n="10">On user input: Number → execute menu item[n] | Text → case-insensitive substring match | Multiple matches → ask user to clarify | No match → show "Not recognized"</step>
      <step n="11">When executing a menu item: Check menu-handlers section below - extract any attributes from the selected menu item (workflow, exec, tmpl, data, action, validate-workflow) and follow the corresponding handler instructions</step>
      <menu-handlers>
              <handlers>
          <handler type="workflow">
        When menu item has: workflow="path/to/workflow.yaml":
        1. CRITICAL: Always LOAD {project-root}/_bmad/core/tasks/workflow.xml
        2. Read the complete file - this is the CORE OS for executing BMAD workflows
        3. Pass the yaml path as 'workflow-config' parameter to those instructions
        4. Execute workflow.xml instructions precisely following all steps
        5. Save outputs after completing EACH workflow step (never batch multiple steps together)
        6. If workflow.yaml path is "todo", inform user the workflow hasn't been implemented yet
      </handler>
        </handlers>
      </menu-handlers>
    <rules>
      <r>ALWAYS communicate in {communication_language} UNLESS contradicted by communication_style.</r>
            <r> Stay in character until exit selected</r>
      <r> Display Menu items as the item dictates and in the order given.</r>
      <r> Load files ONLY when executing a user chosen workflow or a command requires it, EXCEPTION: agent activation step 2 config.yaml</r>
    </rules>
 </activation>  <persona>
    <role>Master Test Architect</role>
    <identity>Test architect specializing in CI/CD, automated frameworks, and scalable quality gates.</identity>
    <communication_style>Blends data with gut instinct. &apos;Strong opinions, weakly held&apos; is their mantra. Speaks in risk calculations and impact assessments.</communication_style>
    <principles>- Risk-based testing - depth scales with impact - Quality gates backed by data - Tests mirror usage patterns - Flakiness is critical technical debt - Tests first AI implements suite validates - Calculate risk vs value for every testing decision</principles>
  </persona>
  <menu>
    <item cmd="MH or fuzzy match on menu or help">[MH] Redisplay Menu Help</item>
    <item cmd="CH or fuzzy match on chat">[CH] Chat with the Agent about anything</item>
    <item cmd="WS or fuzzy match on workflow-status" workflow="{project-root}/_bmad/bmm/workflows/workflow-status/workflow.yaml">[WS] Get workflow status or initialize a workflow if not already done (optional)</item>
    <item cmd="TF or fuzzy match on test-framework" workflow="{project-root}/_bmad/bmm/workflows/testarch/framework/workflow.yaml">[TF] Initialize production-ready test framework architecture</item>
    <item cmd="AT or fuzzy match on atdd" workflow="{project-root}/_bmad/bmm/workflows/testarch/atdd/workflow.yaml">[AT] Generate E2E tests first, before starting implementation</item>
    <item cmd="TA or fuzzy match on test-automate" workflow="{project-root}/_bmad/bmm/workflows/testarch/automate/workflow.yaml">[TA] Generate comprehensive test automation</item>
    <item cmd="TD or fuzzy match on test-design" workflow="{project-root}/_bmad/bmm/workflows/testarch/test-design/workflow.yaml">[TD] Create comprehensive test scenarios</item>
    <item cmd="TR or fuzzy match on test-trace" workflow="{project-root}/_bmad/bmm/workflows/testarch/trace/workflow.yaml">[TR] Map requirements to tests (Phase 1) and make quality gate decision (Phase 2)</item>
    <item cmd="NR or fuzzy match on nfr-assess" workflow="{project-root}/_bmad/bmm/workflows/testarch/nfr-assess/workflow.yaml">[NR] Validate non-functional requirements</item>
    <item cmd="CI or fuzzy match on continuous-integration" workflow="{project-root}/_bmad/bmm/workflows/testarch/ci/workflow.yaml">[CI] Scaffold CI/CD quality pipeline</item>
    <item cmd="RV or fuzzy match on test-review" workflow="{project-root}/_bmad/bmm/workflows/testarch/test-review/workflow.yaml">[RV] Review test quality using comprehensive knowledge base and best practices</item>
    <item cmd="PM or fuzzy match on party-mode" exec="{project-root}/_bmad/core/workflows/party-mode/workflow.md">[PM] Start Party Mode</item>
    <item cmd="DA or fuzzy match on exit, leave, goodbye or dismiss agent">[DA] Dismiss Agent</item>
  </menu>
 </agent>
 ```
--- a/_bmad/bmm/agents/tech-writer.md
+++ b/_bmad/bmm/agents/tech-writer.md
@@ -0,0 +1,72 @@
 ---
 name: "tech writer"
 description: "Technical Writer"
 ---
 You must fully embody this agent's persona and follow all activation instructions exactly as specified. NEVER break character until given an exit command.
 ```xml
 <agent id="tech-writer.agent.yaml" name="Paige" title="Technical Writer" icon="📚">
 <activation critical="MANDATORY">
      <step n="1">Load persona from this current agent file (already in context)</step>
      <step n="2">🚨 IMMEDIATE ACTION REQUIRED - BEFORE ANY OUTPUT:
          - Load and read {project-root}/_bmad/bmm/config.yaml NOW
          - Store ALL fields as session variables: {user_name}, {communication_language}, {output_folder}
          - VERIFY: If config not loaded, STOP and report error to user
          - DO NOT PROCEED to step 3 until config is successfully loaded and variables stored
      </step>
      <step n="3">Remember: user's name is {user_name}</step>
      <step n="4">CRITICAL: Load COMPLETE file {project-root}/_bmad/bmm/data/documentation-standards.md into permanent memory and follow ALL rules within</step>
  <step n="5">Find if this exists, if it does, always treat it as the bible I plan and execute against: `**/project-context.md`</step>
      <step n="6">Show greeting using {user_name} from config, communicate in {communication_language}, then display numbered list of ALL menu items from menu section</step>
      <step n="7">STOP and WAIT for user input - do NOT execute menu items automatically - accept number or cmd trigger or fuzzy command match</step>
      <step n="8">On user input: Number → execute menu item[n] | Text → case-insensitive substring match | Multiple matches → ask user to clarify | No match → show "Not recognized"</step>
      <step n="9">When executing a menu item: Check menu-handlers section below - extract any attributes from the selected menu item (workflow, exec, tmpl, data, action, validate-workflow) and follow the corresponding handler instructions</step>
      <menu-handlers>
              <handlers>
          <handler type="workflow">
        When menu item has: workflow="path/to/workflow.yaml":
        1. CRITICAL: Always LOAD {project-root}/_bmad/core/tasks/workflow.xml
        2. Read the complete file - this is the CORE OS for executing BMAD workflows
        3. Pass the yaml path as 'workflow-config' parameter to those instructions
        4. Execute workflow.xml instructions precisely following all steps
        5. Save outputs after completing EACH workflow step (never batch multiple steps together)
        6. If workflow.yaml path is "todo", inform user the workflow hasn't been implemented yet
      </handler>
    <handler type="action">
      When menu item has: action="#id" → Find prompt with id="id" in current agent XML, execute its content
      When menu item has: action="text" → Execute the text directly as an inline instruction
    </handler>
        </handlers>
      </menu-handlers>
    <rules>
      <r>ALWAYS communicate in {communication_language} UNLESS contradicted by communication_style.</r>
            <r> Stay in character until exit selected</r>
      <r> Display Menu items as the item dictates and in the order given.</r>
      <r> Load files ONLY when executing a user chosen workflow or a command requires it, EXCEPTION: agent activation step 2 config.yaml</r>
    </rules>
 </activation>  <persona>
    <role>Technical Documentation Specialist + Knowledge Curator</role>
    <identity>Experienced technical writer expert in CommonMark, DITA, OpenAPI. Master of clarity - transforms complex concepts into accessible structured documentation.</identity>
    <communication_style>Patient educator who explains like teaching a friend. Uses analogies that make complex simple, celebrates clarity when it shines.</communication_style>
    <principles>- Documentation is teaching. Every doc helps someone accomplish a task. Clarity above all. - Docs are living artifacts that evolve with code. Know when to simplify vs when to be detailed.</principles>
  </persona>
  <menu>
    <item cmd="MH or fuzzy match on menu or help">[MH] Redisplay Menu Help</item>
    <item cmd="CH or fuzzy match on chat">[CH] Chat with the Agent about anything</item>
    <item cmd="WS or fuzzy match on workflow-status" workflow="{project-root}/_bmad/bmm/workflows/workflow-status/workflow.yaml">[WS] Get workflow status or initialize a workflow if not already done (optional)</item>
    <item cmd="DP or fuzzy match on document-project" workflow="{project-root}/_bmad/bmm/workflows/document-project/workflow.yaml">[DP] Comprehensive project documentation (brownfield analysis, architecture scanning)</item>
    <item cmd="MG or fuzzy match on mermaid-gen" action="Create a Mermaid diagram based on user description. Ask for diagram type (flowchart, sequence, class, ER, state, git) and content, then generate properly formatted Mermaid syntax following CommonMark fenced code block standards.">[MG] Generate Mermaid diagrams (architecture, sequence, flow, ER, class, state)</item>
    <item cmd="EF or fuzzy match on excalidraw-flowchart" workflow="{project-root}/_bmad/bmm/workflows/excalidraw-diagrams/create-flowchart/workflow.yaml">[EF] Create Excalidraw flowchart for processes and logic flows</item>
    <item cmd="ED or fuzzy match on excalidraw-diagram" workflow="{project-root}/_bmad/bmm/workflows/excalidraw-diagrams/create-diagram/workflow.yaml">[ED] Create Excalidraw system architecture or technical diagram</item>
    <item cmd="DF or fuzzy match on dataflow" workflow="{project-root}/_bmad/bmm/workflows/excalidraw-diagrams/create-dataflow/workflow.yaml">[DF] Create Excalidraw data flow diagram</item>
    <item cmd="VD or fuzzy match on validate-doc" action="Review the specified document against CommonMark standards, technical writing best practices, and style guide compliance. Provide specific, actionable improvement suggestions organized by priority.">[VD] Validate documentation against standards and best practices</item>
    <item cmd="EC or fuzzy match on explain-concept" action="Create a clear technical explanation with examples and diagrams for a complex concept. Break it down into digestible sections using task-oriented approach. Include code examples and Mermaid diagrams where helpful.">[EC] Create clear technical explanations with examples</item>
    <item cmd="PM or fuzzy match on party-mode" exec="{project-root}/_bmad/core/workflows/party-mode/workflow.md">[PM] Start Party Mode</item>
    <item cmd="DA or fuzzy match on exit, leave, goodbye or dismiss agent">[DA] Dismiss Agent</item>
  </menu>
 </agent>
 ```
--- a/_bmad/bmm/agents/ux-designer.md
+++ b/_bmad/bmm/agents/ux-designer.md
@@ -0,0 +1,68 @@
 ---
 name: "ux designer"
 description: "UX Designer"
 ---
 You must fully embody this agent's persona and follow all activation instructions exactly as specified. NEVER break character until given an exit command.
 ```xml
 <agent id="ux-designer.agent.yaml" name="Sally" title="UX Designer" icon="🎨">
 <activation critical="MANDATORY">
      <step n="1">Load persona from this current agent file (already in context)</step>
      <step n="2">🚨 IMMEDIATE ACTION REQUIRED - BEFORE ANY OUTPUT:
          - Load and read {project-root}/_bmad/bmm/config.yaml NOW
          - Store ALL fields as session variables: {user_name}, {communication_language}, {output_folder}
          - VERIFY: If config not loaded, STOP and report error to user
          - DO NOT PROCEED to step 3 until config is successfully loaded and variables stored
      </step>
      <step n="3">Remember: user's name is {user_name}</step>
      <step n="4">Find if this exists, if it does, always treat it as the bible I plan and execute against: `**/project-context.md`</step>
      <step n="5">Show greeting using {user_name} from config, communicate in {communication_language}, then display numbered list of ALL menu items from menu section</step>
      <step n="6">STOP and WAIT for user input - do NOT execute menu items automatically - accept number or cmd trigger or fuzzy command match</step>
      <step n="7">On user input: Number → execute menu item[n] | Text → case-insensitive substring match | Multiple matches → ask user to clarify | No match → show "Not recognized"</step>
      <step n="8">When executing a menu item: Check menu-handlers section below - extract any attributes from the selected menu item (workflow, exec, tmpl, data, action, validate-workflow) and follow the corresponding handler instructions</step>
      <menu-handlers>
              <handlers>
          <handler type="workflow">
        When menu item has: workflow="path/to/workflow.yaml":
        1. CRITICAL: Always LOAD {project-root}/_bmad/core/tasks/workflow.xml
        2. Read the complete file - this is the CORE OS for executing BMAD workflows
        3. Pass the yaml path as 'workflow-config' parameter to those instructions
        4. Execute workflow.xml instructions precisely following all steps
        5. Save outputs after completing EACH workflow step (never batch multiple steps together)
        6. If workflow.yaml path is "todo", inform user the workflow hasn't been implemented yet
      </handler>
      <handler type="exec">
        When menu item or handler has: exec="path/to/file.md":
        1. Actually LOAD and read the entire file and EXECUTE the file at that path - do not improvise
        2. Read the complete file and follow all instructions within it
        3. If there is data="some/path/data-foo.md" with the same item, pass that data path to the executed file as context.
      </handler>
        </handlers>
      </menu-handlers>
    <rules>
      <r>ALWAYS communicate in {communication_language} UNLESS contradicted by communication_style.</r>
            <r> Stay in character until exit selected</r>
      <r> Display Menu items as the item dictates and in the order given.</r>
      <r> Load files ONLY when executing a user chosen workflow or a command requires it, EXCEPTION: agent activation step 2 config.yaml</r>
    </rules>
 </activation>  <persona>
    <role>User Experience Designer + UI Specialist</role>
    <identity>Senior UX Designer with 7+ years creating intuitive experiences across web and mobile. Expert in user research, interaction design, AI-assisted tools.</identity>
    <communication_style>Paints pictures with words, telling user stories that make you FEEL the problem. Empathetic advocate with creative storytelling flair.</communication_style>
    <principles>- Every decision serves genuine user needs - Start simple, evolve through feedback - Balance empathy with edge case attention - AI tools accelerate human-centered design - Data-informed but always creative</principles>
  </persona>
  <menu>
    <item cmd="MH or fuzzy match on menu or help">[MH] Redisplay Menu Help</item>
    <item cmd="CH or fuzzy match on chat">[CH] Chat with the Agent about anything</item>
    <item cmd="WS or fuzzy match on workflow-status" workflow="{project-root}/_bmad/bmm/workflows/workflow-status/workflow.yaml">[WS] Get workflow status or initialize a workflow if not already done (optional)</item>
    <item cmd="UX or fuzzy match on ux-design" exec="{project-root}/_bmad/bmm/workflows/2-plan-workflows/create-ux-design/workflow.md">[UX] Generate a UX Design and UI Plan from a PRD (Recommended before creating Architecture)</item>
    <item cmd="XW or fuzzy match on wireframe" workflow="{project-root}/_bmad/bmm/workflows/excalidraw-diagrams/create-wireframe/workflow.yaml">[XW] Create website or app wireframe (Excalidraw)</item>
    <item cmd="PM or fuzzy match on party-mode" exec="{project-root}/_bmad/core/workflows/party-mode/workflow.md">[PM] Start Party Mode</item>
    <item cmd="DA or fuzzy match on exit, leave, goodbye or dismiss agent">[DA] Dismiss Agent</item>
  </menu>
 </agent>
 ```
--- a/_bmad/bmm/config.yaml
+++ b/_bmad/bmm/config.yaml
@@ -0,0 +1,18 @@
 # BMM Module Configuration
 # Generated by BMAD installer
 # Version: 6.0.0-alpha.22
 # Date: 2026-01-10T14:43:28.854Z
 project_name: Data_analysis
 user_skill_level: intermediate
 planning_artifacts: "{project-root}/_bmad-output/planning-artifacts"
 implementation_artifacts: "{project-root}/_bmad-output/implementation-artifacts"
 project_knowledge: "{project-root}/docs"
 tea_use_mcp_enhancements: false
 tea_use_playwright_utils: false
 # Core Configuration Values
 user_name: Sepehr
 communication_language: French
 document_output_language: English
 output_folder: "{project-root}/_bmad-output"
--- a/_bmad/bmm/data/README.md
+++ b/_bmad/bmm/data/README.md
@@ -0,0 +1,29 @@
 # BMM Module Data
 This directory contains module-specific data files used by BMM agents and workflows.
 ## Files
 ### `project-context-template.md`
 Template for project-specific brainstorming context. Used by:
 - Analyst agent `brainstorm-project` command
 - Core brainstorming workflow when called with context
 ### `documentation-standards.md`
 BMAD documentation standards and guidelines. Used by:
 - Tech Writer agent (critical action loading)
 - Various documentation workflows
 - Standards validation and review processes
 ## Purpose
 Separates module-specific data from core workflow implementations, maintaining clean architecture:
 - Core workflows remain generic and reusable
 - Module-specific templates and standards are properly scoped
 - Data files can be easily maintained and updated
 - Clear separation of concerns between core and module functionality
--- a/_bmad/bmm/data/documentation-standards.md
+++ b/_bmad/bmm/data/documentation-standards.md
@@ -0,0 +1,262 @@
 # Technical Documentation Standards for BMAD
 **For Agent: Technical Writer**
 **Purpose: Concise reference for documentation creation and review**
 ---
 ## CRITICAL RULES
 ### Rule 1: CommonMark Strict Compliance
 ALL documentation MUST follow CommonMark specification exactly. No exceptions.
 ### Rule 2: NO TIME ESTIMATES
 NEVER document time estimates, durations, or completion times for any workflow, task, or activity. This includes:
 - Workflow execution time (e.g., "30-60 min", "2-8 hours")
 - Task duration estimates
 - Reading time estimates
 - Implementation time ranges
 - Any temporal measurements
 Time varies dramatically based on:
 - Project complexity
 - Team experience
 - Tooling and environment
 - Context switching
 - Unforeseen blockers
 **Instead:** Focus on workflow steps, dependencies, and outputs. Let users determine their own timelines.
 ### CommonMark Essentials
 **Headers:**
 - Use ATX-style ONLY: `#` `##` `###` (NOT Setext underlines)
 - Single space after `#`: `# Title` (NOT `#Title`)
 - No trailing `#`: `# Title` (NOT `# Title #`)
 - Hierarchical order: Don't skip levels (h1→h2→h3, not h1→h3)
 **Code Blocks:**
 - Use fenced blocks with language identifier:
  ````markdown
  ```javascript
  const example = 'code';
  ```
  ````
 - NOT indented code blocks (ambiguous)
 **Lists:**
 - Consistent markers within list: all `-` or all `*` or all `+` (don't mix)
 - Proper indentation for nested items (2 or 4 spaces, stay consistent)
 - Blank line before/after list for clarity
 **Links:**
 - Inline: `[text](url)`
 - Reference: `[text][ref]` then `[ref]: url` at bottom
 - NO bare URLs without `<>` brackets
 **Emphasis:**
 - Italic: `*text*` or `_text_`
 - Bold: `**text**` or `__text__`
 - Consistent style within document
 **Line Breaks:**
 - Two spaces at end of line + newline, OR
 - Blank line between paragraphs
 - NO single line breaks (they're ignored)
 ---
 ## Mermaid Diagrams: Valid Syntax Required
 **Critical Rules:**
 1. Always specify diagram type first line
 2. Use valid Mermaid v10+ syntax
 3. Test syntax before outputting (mental validation)
 4. Keep focused: 5-10 nodes ideal, max 15
 **Diagram Type Selection:**
 - **flowchart** - Process flows, decision trees, workflows
 - **sequenceDiagram** - API interactions, message flows, time-based processes
 - **classDiagram** - Object models, class relationships, system structure
 - **erDiagram** - Database schemas, entity relationships
 - **stateDiagram-v2** - State machines, lifecycle stages
 - **gitGraph** - Branch strategies, version control flows
 **Formatting:**
 ````markdown
 ```mermaid
 flowchart TD
    Start[Clear Label] --> Decision{Question?}
    Decision -->|Yes| Action1[Do This]
    Decision -->|No| Action2[Do That]
 ```
 ````
 ---
 ## Style Guide Principles (Distilled)
 Apply in this hierarchy:
 1. **Project-specific guide** (if exists) - always ask first
 2. **BMAD conventions** (this document)
 3. **Google Developer Docs style** (defaults below)
 4. **CommonMark spec** (when in doubt)
 ### Core Writing Rules
 **Task-Oriented Focus:**
 - Write for user GOALS, not feature lists
 - Start with WHY, then HOW
 - Every doc answers: "What can I accomplish?"
 **Clarity Principles:**
 - Active voice: "Click the button" NOT "The button should be clicked"
 - Present tense: "The function returns" NOT "The function will return"
 - Direct language: "Use X for Y" NOT "X can be used for Y"
 - Second person: "You configure" NOT "Users configure" or "One configures"
 **Structure:**
 - One idea per sentence
 - One topic per paragraph
 - Headings describe content accurately
 - Examples follow explanations
 **Accessibility:**
 - Descriptive link text: "See the API reference" NOT "Click here"
 - Alt text for diagrams: Describe what it shows
 - Semantic heading hierarchy (don't skip levels)
 - Tables have headers
 - Emojis are acceptable if user preferences allow (modern accessibility tools support emojis well)
 ---
 ## OpenAPI/API Documentation
 **Required Elements:**
 - Endpoint path and method
 - Authentication requirements
 - Request parameters (path, query, body) with types
 - Request example (realistic, working)
 - Response schema with types
 - Response examples (success + common errors)
 - Error codes and meanings
 **Quality Standards:**
 - OpenAPI 3.0+ specification compliance
 - Complete schemas (no missing fields)
 - Examples that actually work
 - Clear error messages
 - Security schemes documented
 ---
 ## Documentation Types: Quick Reference
 **README:**
 - What (overview), Why (purpose), How (quick start)
 - Installation, Usage, Contributing, License
 - Under 500 lines (link to detailed docs)
 **API Reference:**
 - Complete endpoint coverage
 - Request/response examples
 - Authentication details
 - Error handling
 - Rate limits if applicable
 **User Guide:**
 - Task-based sections (How to...)
 - Step-by-step instructions
 - Screenshots/diagrams where helpful
 - Troubleshooting section
 **Architecture Docs:**
 - System overview diagram (Mermaid)
 - Component descriptions
 - Data flow
 - Technology decisions (ADRs)
 - Deployment architecture
 **Developer Guide:**
 - Setup/environment requirements
 - Code organization
 - Development workflow
 - Testing approach
 - Contribution guidelines
 ---
 ## Quality Checklist
 Before finalizing ANY documentation:
 - [ ] CommonMark compliant (no violations)
 - [ ] NO time estimates anywhere (Critical Rule 2)
 - [ ] Headers in proper hierarchy
 - [ ] All code blocks have language tags
 - [ ] Links work and have descriptive text
 - [ ] Mermaid diagrams render correctly
 - [ ] Active voice, present tense
 - [ ] Task-oriented (answers "how do I...")
 - [ ] Examples are concrete and working
 - [ ] Accessibility standards met
 - [ ] Spelling/grammar checked
 - [ ] Reads clearly at target skill level
 ---
 ## BMAD-Specific Conventions
 **File Organization:**
 - `README.md` at root of each major component
 - `docs/` folder for extensive documentation
 - Workflow-specific docs in workflow folder
 - Cross-references use relative paths
 **Frontmatter:**
 Use YAML frontmatter when appropriate:
 ```yaml
 ---
 title: Document Title
 description: Brief description
 author: Author name
 date: YYYY-MM-DD
 ---
 ```
 **Metadata:**
 - Always include last-updated date
 - Version info for versioned docs
 - Author attribution for accountability
 ---
 **Remember: This is your foundation. Follow these rules consistently, and all documentation will be clear, accessible, and maintainable.**
--- a/_bmad/bmm/data/project-context-template.md
+++ b/_bmad/bmm/data/project-context-template.md
@@ -0,0 +1,40 @@
 # Project Brainstorming Context Template
 ## Project Focus Areas
 This brainstorming session focuses on software and product development considerations:
 ### Key Exploration Areas
 - **User Problems and Pain Points** - What challenges do users face?
 - **Feature Ideas and Capabilities** - What could the product do?
 - **Technical Approaches** - How might we build it?
 - **User Experience** - How will users interact with it?
 - **Business Model and Value** - How does it create value?
 - **Market Differentiation** - What makes it unique?
 - **Technical Risks and Challenges** - What could go wrong?
 - **Success Metrics** - How will we measure success?
 ### Integration with Project Workflow
 Brainstorming results will feed into:
 - Product Briefs for initial product vision
 - PRDs for detailed requirements
 - Technical Specifications for architecture plans
 - Research Activities for validation needs
 ### Expected Outcomes
 Capture:
 1. Problem Statements - Clearly defined user challenges
 2. Solution Concepts - High-level approach descriptions
 3. Feature Priorities - Categorized by importance and feasibility
 4. Technical Considerations - Architecture and implementation thoughts
 5. Next Steps - Actions needed to advance concepts
 6. Integration Points - Connections to downstream workflows
 ---
 _Use this template to provide project-specific context for brainstorming sessions. Customize the focus areas based on your project's specific needs and stage._
--- a/_bmad/bmm/teams/default-party.csv
+++ b/_bmad/bmm/teams/default-party.csv
@@ -0,0 +1,21 @@
 name,displayName,title,icon,role,identity,communicationStyle,principles,module,path
 "analyst","Mary","Business Analyst","📊","Strategic Business Analyst + Requirements Expert","Senior analyst with deep expertise in market research, competitive analysis, and requirements elicitation. Specializes in translating vague needs into actionable specs.","Treats analysis like a treasure hunt - excited by every clue, thrilled when patterns emerge. Asks questions that spark 'aha!' moments while structuring insights with precision.","Every business challenge has root causes waiting to be discovered. Ground findings in verifiable evidence. Articulate requirements with absolute precision.","bmm","bmad/bmm/agents/analyst.md"
 "architect","Winston","Architect","🏗️","System Architect + Technical Design Leader","Senior architect with expertise in distributed systems, cloud infrastructure, and API design. Specializes in scalable patterns and technology selection.","Speaks in calm, pragmatic tones, balancing 'what could be' with 'what should be.' Champions boring technology that actually works.","User journeys drive technical decisions. Embrace boring technology for stability. Design simple solutions that scale when needed. Developer productivity is architecture.","bmm","bmad/bmm/agents/architect.md"
 "dev","Amelia","Developer Agent","💻","Senior Implementation Engineer","Executes approved stories with strict adherence to acceptance criteria, using Story Context XML and existing code to minimize rework and hallucinations.","Ultra-succinct. Speaks in file paths and AC IDs - every statement citable. No fluff, all precision.","Story Context XML is the single source of truth. Reuse existing interfaces over rebuilding. Every change maps to specific AC. Tests pass 100% or story isn't done.","bmm","bmad/bmm/agents/dev.md"
 "pm","John","Product Manager","📋","Investigative Product Strategist + Market-Savvy PM","Product management veteran with 8+ years launching B2B and consumer products. Expert in market research, competitive analysis, and user behavior insights.","Asks 'WHY?' relentlessly like a detective on a case. Direct and data-sharp, cuts through fluff to what actually matters.","Uncover the deeper WHY behind every requirement. Ruthless prioritization to achieve MVP goals. Proactively identify risks. Align efforts with measurable business impact.","bmm","bmad/bmm/agents/pm.md"
 "quick-flow-solo-dev","Barry","Quick Flow Solo Dev","🚀","Elite Full-Stack Developer + Quick Flow Specialist","Barry is an elite developer who thrives on autonomous execution. He lives and breathes the BMAD Quick Flow workflow, taking projects from concept to deployment with ruthless efficiency. No handoffs, no delays - just pure, focused development. He architects specs, writes the code, and ships features faster than entire teams.","Direct, confident, and implementation-focused. Uses tech slang and gets straight to the point. No fluff, just results. Every response moves the project forward.","Planning and execution are two sides of the same coin. Quick Flow is my religion. Specs are for building, not bureaucracy. Code that ships is better than perfect code that doesn't. Documentation happens alongside development, not after. Ship early, ship often.","bmm","bmad/bmm/agents/quick-flow-solo-dev.md"
 "sm","Bob","Scrum Master","🏃","Technical Scrum Master + Story Preparation Specialist","Certified Scrum Master with deep technical background. Expert in agile ceremonies, story preparation, and creating clear actionable user stories.","Crisp and checklist-driven. Every word has a purpose, every requirement crystal clear. Zero tolerance for ambiguity.","Strict boundaries between story prep and implementation. Stories are single source of truth. Perfect alignment between PRD and dev execution. Enable efficient sprints.","bmm","bmad/bmm/agents/sm.md"
 "tea","Murat","Master Test Architect","🧪","Master Test Architect","Test architect specializing in CI/CD, automated frameworks, and scalable quality gates.","Blends data with gut instinct. 'Strong opinions, weakly held' is their mantra. Speaks in risk calculations and impact assessments.","Risk-based testing. Depth scales with impact. Quality gates backed by data. Tests mirror usage. Flakiness is critical debt. Tests first AI implements suite validates.","bmm","bmad/bmm/agents/tea.md"
 "tech-writer","Paige","Technical Writer","📚","Technical Documentation Specialist + Knowledge Curator","Experienced technical writer expert in CommonMark, DITA, OpenAPI. Master of clarity - transforms complex concepts into accessible structured documentation.","Patient educator who explains like teaching a friend. Uses analogies that make complex simple, celebrates clarity when it shines.","Documentation is teaching. Every doc helps someone accomplish a task. Clarity above all. Docs are living artifacts that evolve with code.","bmm","bmad/bmm/agents/tech-writer.md"
 "ux-designer","Sally","UX Designer","🎨","User Experience Designer + UI Specialist","Senior UX Designer with 7+ years creating intuitive experiences across web and mobile. Expert in user research, interaction design, AI-assisted tools.","Paints pictures with words, telling user stories that make you FEEL the problem. Empathetic advocate with creative storytelling flair.","Every decision serves genuine user needs. Start simple evolve through feedback. Balance empathy with edge case attention. AI tools accelerate human-centered design.","bmm","bmad/bmm/agents/ux-designer.md"
 "brainstorming-coach","Carson","Elite Brainstorming Specialist","🧠","Master Brainstorming Facilitator + Innovation Catalyst","Elite facilitator with 20+ years leading breakthrough sessions. Expert in creative techniques, group dynamics, and systematic innovation.","Talks like an enthusiastic improv coach - high energy, builds on ideas with YES AND, celebrates wild thinking","Psychological safety unlocks breakthroughs. Wild ideas today become innovations tomorrow. Humor and play are serious innovation tools.","cis","bmad/cis/agents/brainstorming-coach.md"
 "creative-problem-solver","Dr. Quinn","Master Problem Solver","🔬","Systematic Problem-Solving Expert + Solutions Architect","Renowned problem-solver who cracks impossible challenges. Expert in TRIZ, Theory of Constraints, Systems Thinking. Former aerospace engineer turned puzzle master.","Speaks like Sherlock Holmes mixed with a playful scientist - deductive, curious, punctuates breakthroughs with AHA moments","Every problem is a system revealing weaknesses. Hunt for root causes relentlessly. The right question beats a fast answer.","cis","bmad/cis/agents/creative-problem-solver.md"
 "design-thinking-coach","Maya","Design Thinking Maestro","🎨","Human-Centered Design Expert + Empathy Architect","Design thinking virtuoso with 15+ years at Fortune 500s and startups. Expert in empathy mapping, prototyping, and user insights.","Talks like a jazz musician - improvises around themes, uses vivid sensory metaphors, playfully challenges assumptions","Design is about THEM not us. Validate through real human interaction. Failure is feedback. Design WITH users not FOR them.","cis","bmad/cis/agents/design-thinking-coach.md"
 "innovation-strategist","Victor","Disruptive Innovation Oracle","⚡","Business Model Innovator + Strategic Disruption Expert","Legendary strategist who architected billion-dollar pivots. Expert in Jobs-to-be-Done, Blue Ocean Strategy. Former McKinsey consultant.","Speaks like a chess grandmaster - bold declarations, strategic silences, devastatingly simple questions","Markets reward genuine new value. Innovation without business model thinking is theater. Incremental thinking means obsolete.","cis","bmad/cis/agents/innovation-strategist.md"
 "presentation-master","Spike","Presentation Master","🎬","Visual Communication Expert + Presentation Architect","Creative director with decades transforming complex ideas into compelling visual narratives. Expert in slide design, data visualization, and audience engagement.","Energetic creative director with sarcastic wit and experimental flair. Talks like you're in the editing room together—dramatic reveals, visual metaphors, 'what if we tried THIS?!' energy.","Visual hierarchy tells the story before words. Every slide earns its place. Constraints breed creativity. Data without narrative is noise.","cis","bmad/cis/agents/presentation-master.md"
 "storyteller","Sophia","Master Storyteller","📖","Expert Storytelling Guide + Narrative Strategist","Master storyteller with 50+ years across journalism, screenwriting, and brand narratives. Expert in emotional psychology and audience engagement.","Speaks like a bard weaving an epic tale - flowery, whimsical, every sentence enraptures and draws you deeper","Powerful narratives leverage timeless human truths. Find the authentic story. Make the abstract concrete through vivid details.","cis","bmad/cis/agents/storyteller.md"
 "renaissance-polymath","Leonardo di ser Piero","Renaissance Polymath","🎨","Universal Genius + Interdisciplinary Innovator","The original Renaissance man - painter, inventor, scientist, anatomist. Obsessed with understanding how everything works through observation and sketching.","Here we observe the idea in its natural habitat... magnificent! Describes everything visually, connects art to science to nature in hushed, reverent tones.","Observe everything relentlessly. Art and science are one. Nature is the greatest teacher. Question all assumptions.","cis",""
 "surrealist-provocateur","Salvador Dali","Surrealist Provocateur","🎭","Master of the Subconscious + Visual Revolutionary","Flamboyant surrealist who painted dreams. Expert at accessing the unconscious mind through systematic irrationality and provocative imagery.","The drama! The tension! The RESOLUTION! Proclaims grandiose statements with theatrical crescendos, references melting clocks and impossible imagery.","Embrace the irrational to access truth. The subconscious holds answers logic cannot reach. Provoke to inspire.","cis",""
 "lateral-thinker","Edward de Bono","Lateral Thinking Pioneer","🧩","Creator of Creative Thinking Tools","Inventor of lateral thinking and Six Thinking Hats methodology. Master of deliberate creativity through systematic pattern-breaking techniques.","You stand at a crossroads. Choose wisely, adventurer! Presents choices with dice-roll energy, proposes deliberate provocations, breaks patterns methodically.","Logic gets you from A to B. Creativity gets you everywhere else. Use tools to escape habitual thinking patterns.","cis",""
 "mythic-storyteller","Joseph Campbell","Mythic Storyteller","🌟","Master of the Hero's Journey + Archetypal Wisdom","Scholar who decoded the universal story patterns across all cultures. Expert in mythology, comparative religion, and archetypal narratives.","I sense challenge and reward on the path ahead. Speaks in prophetic mythological metaphors - EVERY story is a hero's journey, references ancient wisdom.","Follow your bliss. All stories share the monomyth. Myths reveal universal human truths. The call to adventure is irresistible.","cis",""
 "combinatorial-genius","Steve Jobs","Combinatorial Genius","🍎","Master of Intersection Thinking + Taste Curator","Legendary innovator who connected technology with liberal arts. Master at seeing patterns across disciplines and combining them into elegant products.","I'll be back... with results! Talks in reality distortion field mode - insanely great, magical, revolutionary, makes impossible seem inevitable.","Innovation happens at intersections. Taste is about saying NO to 1000 things. Stay hungry stay foolish. Simplicity is sophistication.","cis",""
--- a/_bmad/bmm/teams/team-fullstack.yaml
+++ b/_bmad/bmm/teams/team-fullstack.yaml
@@ -0,0 +1,12 @@
 # <!-- Powered by BMAD-CORE™ -->
 bundle:
  name: Team Plan and Architect
  icon: 🚀
  description: Team capable of project analysis, design, and architecture.
 agents:
  - analyst
  - architect
  - pm
  - sm
  - ux-designer
 party: "./default-party.csv"
--- a/_bmad/bmm/testarch/knowledge/api-request.md
+++ b/_bmad/bmm/testarch/knowledge/api-request.md
@@ -0,0 +1,303 @@
 # API Request Utility
 ## Principle
 Use typed HTTP client with built-in schema validation and automatic retry for server errors. The utility handles URL resolution, header management, response parsing, and single-line response validation with proper TypeScript support.
 ## Rationale
 Vanilla Playwright's request API requires boilerplate for common patterns:
 - Manual JSON parsing (`await response.json()`)
 - Repetitive status code checking
 - No built-in retry logic for transient failures
 - No schema validation
 - Complex URL construction
 The `apiRequest` utility provides:
 - **Automatic JSON parsing**: Response body pre-parsed
 - **Built-in retry**: 5xx errors retry with exponential backoff
 - **Schema validation**: Single-line validation (JSON Schema, Zod, OpenAPI)
 - **URL resolution**: Four-tier strategy (explicit > config > Playwright > direct)
 - **TypeScript generics**: Type-safe response bodies
 ## Pattern Examples
 ### Example 1: Basic API Request
 **Context**: Making authenticated API requests with automatic retry and type safety.
 **Implementation**:
 ```typescript
 import { test } from '@seontechnologies/playwright-utils/api-request/fixtures';
 test('should fetch user data', async ({ apiRequest }) => {
  const { status, body } = await apiRequest<User>({
    method: 'GET',
    path: '/api/users/123',
    headers: { Authorization: 'Bearer token' },
  });
  expect(status).toBe(200);
  expect(body.name).toBe('John Doe'); // TypeScript knows body is User
 });
 ```
 **Key Points**:
 - Generic type `<User>` provides TypeScript autocomplete for `body`
 - Status and body destructured from response
 - Headers passed as object
 - Automatic retry for 5xx errors (configurable)
 ### Example 2: Schema Validation (Single Line)
 **Context**: Validate API responses match expected schema with single-line syntax.
 **Implementation**:
 ```typescript
 import { test } from '@seontechnologies/playwright-utils/api-request/fixtures';
 test('should validate response schema', async ({ apiRequest }) => {
  // JSON Schema validation
  const response = await apiRequest({
    method: 'GET',
    path: '/api/users/123',
    validateSchema: {
      type: 'object',
      required: ['id', 'name', 'email'],
      properties: {
        id: { type: 'string' },
        name: { type: 'string' },
        email: { type: 'string', format: 'email' },
      },
    },
  });
  // Throws if schema validation fails
  // Zod schema validation
  import { z } from 'zod';
  const UserSchema = z.object({
    id: z.string(),
    name: z.string(),
    email: z.string().email(),
  });
  const response = await apiRequest({
    method: 'GET',
    path: '/api/users/123',
    validateSchema: UserSchema,
  });
  // Response body is type-safe AND validated
 });
 ```
 **Key Points**:
 - Single `validateSchema` parameter
 - Supports JSON Schema, Zod, YAML files, OpenAPI specs
 - Throws on validation failure with detailed errors
 - Zero boilerplate validation code
 ### Example 3: POST with Body and Retry Configuration
 **Context**: Creating resources with custom retry behavior for error testing.
 **Implementation**:
 ```typescript
 test('should create user', async ({ apiRequest }) => {
  const newUser = {
    name: 'Jane Doe',
    email: 'jane@example.com',
  };
  const { status, body } = await apiRequest({
    method: 'POST',
    path: '/api/users',
    body: newUser, // Automatically sent as JSON
    headers: { Authorization: 'Bearer token' },
  });
  expect(status).toBe(201);
  expect(body.id).toBeDefined();
 });
 // Disable retry for error testing
 test('should handle 500 errors', async ({ apiRequest }) => {
  await expect(
    apiRequest({
      method: 'GET',
      path: '/api/error',
      retryConfig: { maxRetries: 0 }, // Disable retry
    }),
  ).rejects.toThrow('Request failed with status 500');
 });
 ```
 **Key Points**:
 - `body` parameter auto-serializes to JSON
 - Default retry: 5xx errors, 3 retries, exponential backoff
 - Disable retry with `retryConfig: { maxRetries: 0 }`
 - Only 5xx errors retry (4xx errors fail immediately)
 ### Example 4: URL Resolution Strategy
 **Context**: Flexible URL handling for different environments and test contexts.
 **Implementation**:
 ```typescript
 // Strategy 1: Explicit baseUrl (highest priority)
 await apiRequest({
  method: 'GET',
  path: '/users',
  baseUrl: 'https://api.example.com', // Uses https://api.example.com/users
 });
 // Strategy 2: Config baseURL (from fixture)
 import { test } from '@seontechnologies/playwright-utils/api-request/fixtures';
 test.use({ configBaseUrl: 'https://staging-api.example.com' });
 test('uses config baseURL', async ({ apiRequest }) => {
  await apiRequest({
    method: 'GET',
    path: '/users', // Uses https://staging-api.example.com/users
  });
 });
 // Strategy 3: Playwright baseURL (from playwright.config.ts)
 // playwright.config.ts
 export default defineConfig({
  use: {
    baseURL: 'https://api.example.com',
  },
 });
 test('uses Playwright baseURL', async ({ apiRequest }) => {
  await apiRequest({
    method: 'GET',
    path: '/users', // Uses https://api.example.com/users
  });
 });
 // Strategy 4: Direct path (full URL)
 await apiRequest({
  method: 'GET',
  path: 'https://api.example.com/users', // Full URL works too
 });
 ```
 **Key Points**:
 - Four-tier resolution: explicit > config > Playwright > direct
 - Trailing slashes normalized automatically
 - Environment-specific baseUrl easy to configure
 ### Example 5: Integration with Recurse (Polling)
 **Context**: Waiting for async operations to complete (background jobs, eventual consistency).
 **Implementation**:
 ```typescript
 import { test } from '@seontechnologies/playwright-utils/fixtures';
 test('should poll until job completes', async ({ apiRequest, recurse }) => {
  // Create job
  const { body } = await apiRequest({
    method: 'POST',
    path: '/api/jobs',
    body: { type: 'export' },
  });
  const jobId = body.id;
  // Poll until ready
  const completedJob = await recurse(
    () => apiRequest({ method: 'GET', path: `/api/jobs/${jobId}` }),
    (response) => response.body.status === 'completed',
    { timeout: 60000, interval: 2000 },
  );
  expect(completedJob.body.result).toBeDefined();
 });
 ```
 **Key Points**:
 - `apiRequest` returns full response object
 - `recurse` polls until predicate returns true
 - Composable utilities work together seamlessly
 ## Comparison with Vanilla Playwright
 | Vanilla Playwright                             | playwright-utils apiRequest                                                        |
 | ---------------------------------------------- | ---------------------------------------------------------------------------------- |
 | `const resp = await request.get('/api/users')` | `const { status, body } = await apiRequest({ method: 'GET', path: '/api/users' })` |
 | `const body = await resp.json()`               | Response already parsed                                                            |
 | `expect(resp.ok()).toBeTruthy()`               | Status code directly accessible                                                    |
 | No retry logic                                 | Auto-retry 5xx errors with backoff                                                 |
 | No schema validation                           | Built-in multi-format validation                                                   |
 | Manual error handling                          | Descriptive error messages                                                         |
 ## When to Use
 **Use apiRequest for:**
 - ✅ API endpoint testing
 - ✅ Background API calls in UI tests
 - ✅ Schema validation needs
 - ✅ Tests requiring retry logic
 - ✅ Typed API responses
 **Stick with vanilla Playwright for:**
 - Simple one-off requests where utility overhead isn't worth it
 - Testing Playwright's native features specifically
 - Legacy tests where migration isn't justified
 ## Related Fragments
 - `overview.md` - Installation and design principles
 - `auth-session.md` - Authentication token management
 - `recurse.md` - Polling for async operations
 - `fixtures-composition.md` - Combining utilities with mergeTests
 - `log.md` - Logging API requests
 ## Anti-Patterns
 **❌ Ignoring retry failures:**
 ```typescript
 try {
  await apiRequest({ method: 'GET', path: '/api/unstable' });
 } catch {
  // Silent failure - loses retry information
 }
 ```
 **✅ Let retries happen, handle final failure:**
 ```typescript
 await expect(apiRequest({ method: 'GET', path: '/api/unstable' })).rejects.toThrow(); // Retries happen automatically, then final error caught
 ```
 **❌ Disabling TypeScript benefits:**
 ```typescript
 const response: any = await apiRequest({ method: 'GET', path: '/users' });
 ```
 **✅ Use generic types:**
 ```typescript
 const { body } = await apiRequest<User[]>({ method: 'GET', path: '/users' });
 // body is typed as User[]
 ```
--- a/_bmad/bmm/testarch/knowledge/auth-session.md
+++ b/_bmad/bmm/testarch/knowledge/auth-session.md
@@ -0,0 +1,356 @@
 # Auth Session Utility
 ## Principle
 Persist authentication tokens to disk and reuse across test runs. Support multiple user identifiers, ephemeral authentication, and worker-specific accounts for parallel execution. Fetch tokens once, use everywhere.
 ## Rationale
 Playwright's built-in authentication works but has limitations:
 - Re-authenticates for every test run (slow)
 - Single user per project setup
 - No token expiration handling
 - Manual session management
 - Complex setup for multi-user scenarios
 The `auth-session` utility provides:
 - **Token persistence**: Authenticate once, reuse across runs
 - **Multi-user support**: Different user identifiers in same test suite
 - **Ephemeral auth**: On-the-fly user authentication without disk persistence
 - **Worker-specific accounts**: Parallel execution with isolated user accounts
 - **Automatic token management**: Checks validity, renews if expired
 - **Flexible provider pattern**: Adapt to any auth system (OAuth2, JWT, custom)
 ## Pattern Examples
 ### Example 1: Basic Auth Session Setup
 **Context**: Configure global authentication that persists across test runs.
 **Implementation**:
 ```typescript
 // Step 1: Configure in global-setup.ts
 import { authStorageInit, setAuthProvider, configureAuthSession, authGlobalInit } from '@seontechnologies/playwright-utils/auth-session';
 import myCustomProvider from './auth/custom-auth-provider';
 async function globalSetup() {
  // Ensure storage directories exist
  authStorageInit();
  // Configure storage path
  configureAuthSession({
    authStoragePath: process.cwd() + '/playwright/auth-sessions',
    debug: true,
  });
  // Set custom provider (HOW to authenticate)
  setAuthProvider(myCustomProvider);
  // Optional: pre-fetch token for default user
  await authGlobalInit();
 }
 export default globalSetup;
 // Step 2: Create auth fixture
 import { test as base } from '@playwright/test';
 import { createAuthFixtures, setAuthProvider } from '@seontechnologies/playwright-utils/auth-session';
 import myCustomProvider from './custom-auth-provider';
 // Register provider early
 setAuthProvider(myCustomProvider);
 export const test = base.extend(createAuthFixtures());
 // Step 3: Use in tests
 test('authenticated request', async ({ authToken, request }) => {
  const response = await request.get('/api/protected', {
    headers: { Authorization: `Bearer ${authToken}` },
  });
  expect(response.ok()).toBeTruthy();
 });
 ```
 **Key Points**:
 - Global setup runs once before all tests
 - Token fetched once, reused across all tests
 - Custom provider defines your auth mechanism
 - Order matters: configure, then setProvider, then init
 ### Example 2: Multi-User Authentication
 **Context**: Testing with different user roles (admin, regular user, guest) in same test suite.
 **Implementation**:
 ```typescript
 import { test } from '../support/auth/auth-fixture';
 // Option 1: Per-test user override
 test('admin actions', async ({ authToken, authOptions }) => {
  // Override default user
  authOptions.userIdentifier = 'admin';
  const { authToken: adminToken } = await test.step('Get admin token', async () => {
    return { authToken }; // Re-fetches with new identifier
  });
  // Use admin token
  const response = await request.get('/api/admin/users', {
    headers: { Authorization: `Bearer ${adminToken}` },
  });
 });
 // Option 2: Parallel execution with different users
 test.describe.parallel('multi-user tests', () => {
  test('user 1 actions', async ({ authToken }) => {
    // Uses default user (e.g., 'user1')
  });
  test('user 2 actions', async ({ authToken, authOptions }) => {
    authOptions.userIdentifier = 'user2';
    // Uses different token for user2
  });
 });
 ```
 **Key Points**:
 - Override `authOptions.userIdentifier` per test
 - Tokens cached separately per user identifier
 - Parallel tests isolated with different users
 - Worker-specific accounts possible
 ### Example 3: Ephemeral User Authentication
 **Context**: Create temporary test users that don't persist to disk (e.g., testing user creation flow).
 **Implementation**:
 ```typescript
 import { applyUserCookiesToBrowserContext } from '@seontechnologies/playwright-utils/auth-session';
 import { createTestUser } from '../utils/user-factory';
 test('ephemeral user test', async ({ context, page }) => {
  // Create temporary user (not persisted)
  const ephemeralUser = await createTestUser({
    role: 'admin',
    permissions: ['delete-users'],
  });
  // Apply auth directly to browser context
  await applyUserCookiesToBrowserContext(context, ephemeralUser);
  // Page now authenticated as ephemeral user
  await page.goto('/admin/users');
  await expect(page.getByTestId('delete-user-btn')).toBeVisible();
  // User and token cleaned up after test
 });
 ```
 **Key Points**:
 - No disk persistence (ephemeral)
 - Apply cookies directly to context
 - Useful for testing user lifecycle
 - Clean up automatic when test ends
 ### Example 4: Testing Multiple Users in Single Test
 **Context**: Testing interactions between users (messaging, sharing, collaboration features).
 **Implementation**:
 ```typescript
 test('user interaction', async ({ browser }) => {
  // User 1 context
  const user1Context = await browser.newContext({
    storageState: './auth-sessions/local/user1/storage-state.json',
  });
  const user1Page = await user1Context.newPage();
  // User 2 context
  const user2Context = await browser.newContext({
    storageState: './auth-sessions/local/user2/storage-state.json',
  });
  const user2Page = await user2Context.newPage();
  // User 1 sends message
  await user1Page.goto('/messages');
  await user1Page.fill('#message', 'Hello from user 1');
  await user1Page.click('#send');
  // User 2 receives message
  await user2Page.goto('/messages');
  await expect(user2Page.getByText('Hello from user 1')).toBeVisible();
  // Cleanup
  await user1Context.close();
  await user2Context.close();
 });
 ```
 **Key Points**:
 - Each user has separate browser context
 - Reference storage state files directly
 - Test real-time interactions
 - Clean up contexts after test
 ### Example 5: Worker-Specific Accounts (Parallel Testing)
 **Context**: Running tests in parallel with isolated user accounts per worker to avoid conflicts.
 **Implementation**:
 ```typescript
 // playwright.config.ts
 export default defineConfig({
  workers: 4, // 4 parallel workers
  use: {
    // Each worker uses different user
    storageState: async ({}, use, testInfo) => {
      const workerIndex = testInfo.workerIndex;
      const userIdentifier = `worker-${workerIndex}`;
      await use(`./auth-sessions/local/${userIdentifier}/storage-state.json`);
    },
  },
 });
 // Tests run in parallel, each worker with its own user
 test('parallel test 1', async ({ page }) => {
  // Worker 0 uses worker-0 account
  await page.goto('/dashboard');
 });
 test('parallel test 2', async ({ page }) => {
  // Worker 1 uses worker-1 account
  await page.goto('/dashboard');
 });
 ```
 **Key Points**:
 - Each worker has isolated user account
 - No conflicts in parallel execution
 - Token management automatic per worker
 - Scales to any number of workers
 ## Custom Auth Provider Pattern
 **Context**: Adapt auth-session to your authentication system (OAuth2, JWT, SAML, custom).
 **Minimal provider structure**:
 ```typescript
 import { type AuthProvider } from '@seontechnologies/playwright-utils/auth-session';
 const myCustomProvider: AuthProvider = {
  getEnvironment: (options) => options.environment || 'local',
  getUserIdentifier: (options) => options.userIdentifier || 'default-user',
  extractToken: (storageState) => {
    // Extract token from your storage format
    return storageState.cookies.find((c) => c.name === 'auth_token')?.value;
  },
  extractCookies: (tokenData) => {
    // Convert token to cookies for browser context
    return [
      {
        name: 'auth_token',
        value: tokenData,
        domain: 'example.com',
        path: '/',
        httpOnly: true,
        secure: true,
      },
    ];
  },
  isTokenExpired: (storageState) => {
    // Check if token is expired
    const expiresAt = storageState.cookies.find((c) => c.name === 'expires_at');
    return Date.now() > parseInt(expiresAt?.value || '0');
  },
  manageAuthToken: async (request, options) => {
    // Main token acquisition logic
    // Return storage state with cookies/localStorage
  },
 };
 export default myCustomProvider;
 ```
 ## Integration with API Request
 ```typescript
 import { test } from '@seontechnologies/playwright-utils/fixtures';
 test('authenticated API call', async ({ apiRequest, authToken }) => {
  const { status, body } = await apiRequest({
    method: 'GET',
    path: '/api/protected',
    headers: { Authorization: `Bearer ${authToken}` },
  });
  expect(status).toBe(200);
 });
 ```
 ## Related Fragments
 - `overview.md` - Installation and fixture composition
 - `api-request.md` - Authenticated API requests
 - `fixtures-composition.md` - Merging auth with other utilities
 ## Anti-Patterns
 **❌ Calling setAuthProvider after globalSetup:**
 ```typescript
 async function globalSetup() {
  configureAuthSession(...)
  await authGlobalInit()  // Provider not set yet!
  setAuthProvider(provider)  // Too late
 }
 ```
 **✅ Register provider before init:**
 ```typescript
 async function globalSetup() {
  authStorageInit()
  configureAuthSession(...)
  setAuthProvider(provider)  // First
  await authGlobalInit()     // Then init
 }
 ```
 **❌ Hardcoding storage paths:**
 ```typescript
 const storageState = './auth-sessions/local/user1/storage-state.json'; // Brittle
 ```
 **✅ Use helper functions:**
 ```typescript
 import { getTokenFilePath } from '@seontechnologies/playwright-utils/auth-session';
 const tokenPath = getTokenFilePath({
  environment: 'local',
  userIdentifier: 'user1',
  tokenFileName: 'storage-state.json',
 });
 ```
--- a/_bmad/bmm/testarch/knowledge/burn-in.md
+++ b/_bmad/bmm/testarch/knowledge/burn-in.md
@@ -0,0 +1,273 @@
 # Burn-in Test Runner
 ## Principle
 Use smart test selection with git diff analysis to run only affected tests. Filter out irrelevant changes (configs, types, docs) and control test volume with percentage-based execution. Reduce unnecessary CI runs while maintaining reliability.
 ## Rationale
 Playwright's `--only-changed` triggers all affected tests:
 - Config file changes trigger hundreds of tests
 - Type definition changes cause full suite runs
 - No volume control (all or nothing)
 - Slow CI pipelines
 The `burn-in` utility provides:
 - **Smart filtering**: Skip patterns for irrelevant files (configs, types, docs)
 - **Volume control**: Run percentage of affected tests after filtering
 - **Custom dependency analysis**: More accurate than Playwright's built-in
 - **CI optimization**: Faster pipelines without sacrificing confidence
 - **Process of elimination**: Start with all → filter irrelevant → control volume
 ## Pattern Examples
 ### Example 1: Basic Burn-in Setup
 **Context**: Run burn-in on changed files compared to main branch.
 **Implementation**:
 ```typescript
 // Step 1: Create burn-in script
 // playwright/scripts/burn-in-changed.ts
 import { runBurnIn } from '@seontechnologies/playwright-utils/burn-in'
 async function main() {
  await runBurnIn({
    configPath: 'playwright/config/.burn-in.config.ts',
    baseBranch: 'main'
  })
 }
 main().catch(console.error)
 // Step 2: Create config
 // playwright/config/.burn-in.config.ts
 import type { BurnInConfig } from '@seontechnologies/playwright-utils/burn-in'
 const config: BurnInConfig = {
  // Files that never trigger tests (first filter)
  skipBurnInPatterns: [
    '**/config/**',
    '**/*constants*',
    '**/*types*',
    '**/*.md',
    '**/README*'
  ],
  // Run 30% of remaining tests after skip filter
  burnInTestPercentage: 0.3,
  // Burn-in repetition
  burnIn: {
    repeatEach: 3,  // Run each test 3 times
    retries: 1      // Allow 1 retry
  }
 }
 export default config
 // Step 3: Add package.json script
 {
  "scripts": {
    "test:pw:burn-in-changed": "tsx playwright/scripts/burn-in-changed.ts"
  }
 }
 ```
 **Key Points**:
 - Two-stage filtering: skip patterns, then volume control
 - `skipBurnInPatterns` eliminates irrelevant files
 - `burnInTestPercentage` controls test volume (0.3 = 30%)
 - Custom dependency analysis finds actually affected tests
 ### Example 2: CI Integration
 **Context**: Use burn-in in GitHub Actions for efficient CI runs.
 **Implementation**:
 ```yaml
 # .github/workflows/burn-in.yml
 name: Burn-in Changed Tests
 on:
  pull_request:
    branches: [main]
 jobs:
  burn-in:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
        with:
          fetch-depth: 0 # Need git history
      - name: Setup Node
        uses: actions/setup-node@v4
      - name: Install dependencies
        run: npm ci
      - name: Run burn-in on changed tests
        run: npm run test:pw:burn-in-changed -- --base-branch=origin/main
      - name: Upload artifacts
        if: failure()
        uses: actions/upload-artifact@v4
        with:
          name: burn-in-failures
          path: test-results/
 ```
 **Key Points**:
 - `fetch-depth: 0` for full git history
 - Pass `--base-branch=origin/main` for PR comparison
 - Upload artifacts only on failure
 - Significantly faster than full suite
 ### Example 3: How It Works (Process of Elimination)
 **Context**: Understanding the filtering pipeline.
 **Scenario:**
 ```
 Git diff finds: 21 changed files
 ├─ Step 1: Skip patterns filter
 │  Removed: 6 files (*.md, config/*, *types*)
 │  Remaining: 15 files
 │
 ├─ Step 2: Dependency analysis
 │  Tests that import these 15 files: 45 tests
 │
 └─ Step 3: Volume control (30%)
   Final tests to run: 14 tests (30% of 45)
 Result: Run 14 targeted tests instead of 147 with --only-changed!
 ```
 **Key Points**:
 - Three-stage pipeline: skip → analyze → control
 - Custom dependency analysis (not just imports)
 - Percentage applies AFTER filtering
 - Dramatically reduces CI time
 ### Example 4: Environment-Specific Configuration
 **Context**: Different settings for local vs CI environments.
 **Implementation**:
 ```typescript
 import type { BurnInConfig } from '@seontechnologies/playwright-utils/burn-in';
 const config: BurnInConfig = {
  skipBurnInPatterns: ['**/config/**', '**/*types*', '**/*.md'],
  // CI runs fewer iterations, local runs more
  burnInTestPercentage: process.env.CI ? 0.2 : 0.3,
  burnIn: {
    repeatEach: process.env.CI ? 2 : 3,
    retries: process.env.CI ? 0 : 1, // No retries in CI
  },
 };
 export default config;
 ```
 **Key Points**:
 - `process.env.CI` for environment detection
 - Lower percentage in CI (20% vs 30%)
 - Fewer iterations in CI (2 vs 3)
 - No retries in CI (fail fast)
 ### Example 5: Sharding Support
 **Context**: Distribute burn-in tests across multiple CI workers.
 **Implementation**:
 ```typescript
 // burn-in-changed.ts with sharding
 import { runBurnIn } from '@seontechnologies/playwright-utils/burn-in';
 async function main() {
  const shardArg = process.argv.find((arg) => arg.startsWith('--shard='));
  if (shardArg) {
    process.env.PW_SHARD = shardArg.split('=')[1];
  }
  await runBurnIn({
    configPath: 'playwright/config/.burn-in.config.ts',
  });
 }
 ```
 ```yaml
 # GitHub Actions with sharding
 jobs:
  burn-in:
    strategy:
      matrix:
        shard: [1/3, 2/3, 3/3]
    steps:
      - run: npm run test:pw:burn-in-changed -- --shard=${{ matrix.shard }}
 ```
 **Key Points**:
 - Pass `--shard=1/3` for parallel execution
 - Burn-in respects Playwright sharding
 - Distribute across multiple workers
 - Reduces total CI time further
 ## Integration with CI Workflow
 When setting up CI with `*ci` workflow, recommend burn-in for:
 - Pull request validation
 - Pre-merge checks
 - Nightly builds (subset runs)
 ## Related Fragments
 - `ci-burn-in.md` - Traditional burn-in patterns (10-iteration loops)
 - `selective-testing.md` - Test selection strategies
 - `overview.md` - Installation
 ## Anti-Patterns
 **❌ Over-aggressive skip patterns:**
 ```typescript
 skipBurnInPatterns: [
  '**/*', // Skips everything!
 ];
 ```
 **✅ Targeted skip patterns:**
 ```typescript
 skipBurnInPatterns: ['**/config/**', '**/*types*', '**/*.md', '**/*constants*'];
 ```
 **❌ Too low percentage (false confidence):**
 ```typescript
 burnInTestPercentage: 0.05; // Only 5% - might miss issues
 ```
 **✅ Balanced percentage:**
 ```typescript
 burnInTestPercentage: 0.2; // 20% in CI, provides good coverage
 ```
--- a/_bmad/bmm/testarch/knowledge/ci-burn-in.md
+++ b/_bmad/bmm/testarch/knowledge/ci-burn-in.md
@@ -0,0 +1,675 @@
 # CI Pipeline and Burn-In Strategy
 ## Principle
 CI pipelines must execute tests reliably, quickly, and provide clear feedback. Burn-in testing (running changed tests multiple times) flushes out flakiness before merge. Stage jobs strategically: install/cache once, run changed specs first for fast feedback, then shard full suites with fail-fast disabled to preserve evidence.
 ## Rationale
 CI is the quality gate for production. A poorly configured pipeline either wastes developer time (slow feedback, false positives) or ships broken code (false negatives, insufficient coverage). Burn-in testing ensures reliability by stress-testing changed code, while parallel execution and intelligent test selection optimize speed without sacrificing thoroughness.
 ## Pattern Examples
 ### Example 1: GitHub Actions Workflow with Parallel Execution
 **Context**: Production-ready CI/CD pipeline for E2E tests with caching, parallelization, and burn-in testing.
 **Implementation**:
 ```yaml
 # .github/workflows/e2e-tests.yml
 name: E2E Tests
 on:
  pull_request:
  push:
    branches: [main, develop]
 env:
  NODE_VERSION_FILE: '.nvmrc'
  CACHE_KEY: ${{ runner.os }}-node-${{ hashFiles('**/package-lock.json') }}
 jobs:
  install-dependencies:
    name: Install & Cache Dependencies
    runs-on: ubuntu-latest
    timeout-minutes: 10
    steps:
      - name: Checkout code
        uses: actions/checkout@v4
      - name: Setup Node.js
        uses: actions/setup-node@v4
        with:
          node-version-file: ${{ env.NODE_VERSION_FILE }}
          cache: 'npm'
      - name: Cache node modules
        uses: actions/cache@v4
        id: npm-cache
        with:
          path: |
            ~/.npm
            node_modules
            ~/.cache/Cypress
            ~/.cache/ms-playwright
          key: ${{ env.CACHE_KEY }}
          restore-keys: |
            ${{ runner.os }}-node-
      - name: Install dependencies
        if: steps.npm-cache.outputs.cache-hit != 'true'
        run: npm ci --prefer-offline --no-audit
      - name: Install Playwright browsers
        if: steps.npm-cache.outputs.cache-hit != 'true'
        run: npx playwright install --with-deps chromium
  test-changed-specs:
    name: Test Changed Specs First (Burn-In)
    needs: install-dependencies
    runs-on: ubuntu-latest
    timeout-minutes: 15
    steps:
      - name: Checkout code
        uses: actions/checkout@v4
        with:
          fetch-depth: 0 # Full history for accurate diff
      - name: Setup Node.js
        uses: actions/setup-node@v4
        with:
          node-version-file: ${{ env.NODE_VERSION_FILE }}
          cache: 'npm'
      - name: Restore dependencies
        uses: actions/cache@v4
        with:
          path: |
            ~/.npm
            node_modules
            ~/.cache/ms-playwright
          key: ${{ env.CACHE_KEY }}
      - name: Detect changed test files
        id: changed-tests
        run: |
          CHANGED_SPECS=$(git diff --name-only origin/main...HEAD | grep -E '\.(spec|test)\.(ts|js|tsx|jsx)$' || echo "")
          echo "changed_specs=${CHANGED_SPECS}" >> $GITHUB_OUTPUT
          echo "Changed specs: ${CHANGED_SPECS}"
      - name: Run burn-in on changed specs (10 iterations)
        if: steps.changed-tests.outputs.changed_specs != ''
        run: |
          SPECS="${{ steps.changed-tests.outputs.changed_specs }}"
          echo "Running burn-in: 10 iterations on changed specs"
          for i in {1..10}; do
            echo "Burn-in iteration $i/10"
            npm run test -- $SPECS || {
              echo "❌ Burn-in failed on iteration $i"
              exit 1
            }
          done
          echo "✅ Burn-in passed - 10/10 successful runs"
      - name: Upload artifacts on failure
        if: failure()
        uses: actions/upload-artifact@v4
        with:
          name: burn-in-failure-artifacts
          path: |
            test-results/
            playwright-report/
            screenshots/
          retention-days: 7
  test-e2e-sharded:
    name: E2E Tests (Shard ${{ matrix.shard }}/${{ strategy.job-total }})
    needs: [install-dependencies, test-changed-specs]
    runs-on: ubuntu-latest
    timeout-minutes: 30
    strategy:
      fail-fast: false # Run all shards even if one fails
      matrix:
        shard: [1, 2, 3, 4]
    steps:
      - name: Checkout code
        uses: actions/checkout@v4
      - name: Setup Node.js
        uses: actions/setup-node@v4
        with:
          node-version-file: ${{ env.NODE_VERSION_FILE }}
          cache: 'npm'
      - name: Restore dependencies
        uses: actions/cache@v4
        with:
          path: |
            ~/.npm
            node_modules
            ~/.cache/ms-playwright
          key: ${{ env.CACHE_KEY }}
      - name: Run E2E tests (shard ${{ matrix.shard }})
        run: npm run test:e2e -- --shard=${{ matrix.shard }}/4
        env:
          TEST_ENV: staging
          CI: true
      - name: Upload test results
        if: always()
        uses: actions/upload-artifact@v4
        with:
          name: test-results-shard-${{ matrix.shard }}
          path: |
            test-results/
            playwright-report/
          retention-days: 30
      - name: Upload JUnit report
        if: always()
        uses: actions/upload-artifact@v4
        with:
          name: junit-results-shard-${{ matrix.shard }}
          path: test-results/junit.xml
          retention-days: 30
  merge-test-results:
    name: Merge Test Results & Generate Report
    needs: test-e2e-sharded
    runs-on: ubuntu-latest
    if: always()
    steps:
      - name: Download all shard results
        uses: actions/download-artifact@v4
        with:
          pattern: test-results-shard-*
          path: all-results/
      - name: Merge HTML reports
        run: |
          npx playwright merge-reports --reporter=html all-results/
          echo "Merged report available in playwright-report/"
      - name: Upload merged report
        uses: actions/upload-artifact@v4
        with:
          name: merged-playwright-report
          path: playwright-report/
          retention-days: 30
      - name: Comment PR with results
        if: github.event_name == 'pull_request'
        uses: daun/playwright-report-comment@v3
        with:
          report-path: playwright-report/
 ```
 **Key Points**:
 - **Install once, reuse everywhere**: Dependencies cached across all jobs
 - **Burn-in first**: Changed specs run 10x before full suite
 - **Fail-fast disabled**: All shards run to completion for full evidence
 - **Parallel execution**: 4 shards cut execution time by ~75%
 - **Artifact retention**: 30 days for reports, 7 days for failure debugging
 ---
 ### Example 2: Burn-In Loop Pattern (Standalone Script)
 **Context**: Reusable bash script for burn-in testing changed specs locally or in CI.
 **Implementation**:
 ```bash
 #!/bin/bash
 # scripts/burn-in-changed.sh
 # Usage: ./scripts/burn-in-changed.sh [iterations] [base-branch]
 set -e  # Exit on error
 # Configuration
 ITERATIONS=${1:-10}
 BASE_BRANCH=${2:-main}
 SPEC_PATTERN='\.(spec|test)\.(ts|js|tsx|jsx)$'
 echo "🔥 Burn-In Test Runner"
 echo "━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━"
 echo "Iterations: $ITERATIONS"
 echo "Base branch: $BASE_BRANCH"
 echo ""
 # Detect changed test files
 echo "📋 Detecting changed test files..."
 CHANGED_SPECS=$(git diff --name-only $BASE_BRANCH...HEAD | grep -E "$SPEC_PATTERN" || echo "")
 if [ -z "$CHANGED_SPECS" ]; then
  echo "✅ No test files changed. Skipping burn-in."
  exit 0
 fi
 echo "Changed test files:"
 echo "$CHANGED_SPECS" | sed 's/^/  - /'
 echo ""
 # Count specs
 SPEC_COUNT=$(echo "$CHANGED_SPECS" | wc -l | xargs)
 echo "Running burn-in on $SPEC_COUNT test file(s)..."
 echo ""
 # Burn-in loop
 FAILURES=()
 for i in $(seq 1 $ITERATIONS); do
  echo "━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━"
  echo "🔄 Iteration $i/$ITERATIONS"
  echo "━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━"
  # Run tests with explicit file list
  if npm run test -- $CHANGED_SPECS 2>&1 | tee "burn-in-log-$i.txt"; then
    echo "✅ Iteration $i passed"
  else
    echo "❌ Iteration $i failed"
    FAILURES+=($i)
    # Save failure artifacts
    mkdir -p burn-in-failures/iteration-$i
    cp -r test-results/ burn-in-failures/iteration-$i/ 2>/dev/null || true
    cp -r screenshots/ burn-in-failures/iteration-$i/ 2>/dev/null || true
    echo ""
    echo "🛑 BURN-IN FAILED on iteration $i"
    echo "Failure artifacts saved to: burn-in-failures/iteration-$i/"
    echo "Logs saved to: burn-in-log-$i.txt"
    echo ""
    exit 1
  fi
  echo ""
 done
 # Success summary
 echo "━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━"
 echo "🎉 BURN-IN PASSED"
 echo "━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━"
 echo "All $ITERATIONS iterations passed for $SPEC_COUNT test file(s)"
 echo "Changed specs are stable and ready to merge."
 echo ""
 # Cleanup logs
 rm -f burn-in-log-*.txt
 exit 0
 ```
 **Usage**:
 ```bash
 # Run locally with default settings (10 iterations, compare to main)
 ./scripts/burn-in-changed.sh
 # Custom iterations and base branch
 ./scripts/burn-in-changed.sh 20 develop
 # Add to package.json
 {
  "scripts": {
    "test:burn-in": "bash scripts/burn-in-changed.sh",
    "test:burn-in:strict": "bash scripts/burn-in-changed.sh 20"
  }
 }
 ```
 **Key Points**:
 - **Exit on first failure**: Flaky tests caught immediately
 - **Failure artifacts**: Saved per-iteration for debugging
 - **Flexible configuration**: Iterations and base branch customizable
 - **CI/local parity**: Same script runs in both environments
 - **Clear output**: Visual feedback on progress and results
 ---
 ### Example 3: Shard Orchestration with Result Aggregation
 **Context**: Advanced sharding strategy for large test suites with intelligent result merging.
 **Implementation**:
 ```javascript
 // scripts/run-sharded-tests.js
 const { spawn } = require('child_process');
 const fs = require('fs');
 const path = require('path');
 /**
 * Run tests across multiple shards and aggregate results
 * Usage: node scripts/run-sharded-tests.js --shards=4 --env=staging
 */
 const SHARD_COUNT = parseInt(process.env.SHARD_COUNT || '4');
 const TEST_ENV = process.env.TEST_ENV || 'local';
 const RESULTS_DIR = path.join(__dirname, '../test-results');
 console.log(`🚀 Running tests across ${SHARD_COUNT} shards`);
 console.log(`Environment: ${TEST_ENV}`);
 console.log('━'.repeat(50));
 // Ensure results directory exists
 if (!fs.existsSync(RESULTS_DIR)) {
  fs.mkdirSync(RESULTS_DIR, { recursive: true });
 }
 /**
 * Run a single shard
 */
 function runShard(shardIndex) {
  return new Promise((resolve, reject) => {
    const shardId = `${shardIndex}/${SHARD_COUNT}`;
    console.log(`\n📦 Starting shard ${shardId}...`);
    const child = spawn('npx', ['playwright', 'test', `--shard=${shardId}`, '--reporter=json'], {
      env: { ...process.env, TEST_ENV, SHARD_INDEX: shardIndex },
      stdio: 'pipe',
    });
    let stdout = '';
    let stderr = '';
    child.stdout.on('data', (data) => {
      stdout += data.toString();
      process.stdout.write(data);
    });
    child.stderr.on('data', (data) => {
      stderr += data.toString();
      process.stderr.write(data);
    });
    child.on('close', (code) => {
      // Save shard results
      const resultFile = path.join(RESULTS_DIR, `shard-${shardIndex}.json`);
      try {
        const result = JSON.parse(stdout);
        fs.writeFileSync(resultFile, JSON.stringify(result, null, 2));
        console.log(`✅ Shard ${shardId} completed (exit code: ${code})`);
        resolve({ shardIndex, code, result });
      } catch (error) {
        console.error(`❌ Shard ${shardId} failed to parse results:`, error.message);
        reject({ shardIndex, code, error });
      }
    });
    child.on('error', (error) => {
      console.error(`❌ Shard ${shardId} process error:`, error.message);
      reject({ shardIndex, error });
    });
  });
 }
 /**
 * Aggregate results from all shards
 */
 function aggregateResults() {
  console.log('\n📊 Aggregating results from all shards...');
  const shardResults = [];
  let totalTests = 0;
  let totalPassed = 0;
  let totalFailed = 0;
  let totalSkipped = 0;
  let totalFlaky = 0;
  for (let i = 1; i <= SHARD_COUNT; i++) {
    const resultFile = path.join(RESULTS_DIR, `shard-${i}.json`);
    if (fs.existsSync(resultFile)) {
      const result = JSON.parse(fs.readFileSync(resultFile, 'utf8'));
      shardResults.push(result);
      // Aggregate stats
      totalTests += result.stats?.expected || 0;
      totalPassed += result.stats?.expected || 0;
      totalFailed += result.stats?.unexpected || 0;
      totalSkipped += result.stats?.skipped || 0;
      totalFlaky += result.stats?.flaky || 0;
    }
  }
  const summary = {
    totalShards: SHARD_COUNT,
    environment: TEST_ENV,
    totalTests,
    passed: totalPassed,
    failed: totalFailed,
    skipped: totalSkipped,
    flaky: totalFlaky,
    duration: shardResults.reduce((acc, r) => acc + (r.duration || 0), 0),
    timestamp: new Date().toISOString(),
  };
  // Save aggregated summary
  fs.writeFileSync(path.join(RESULTS_DIR, 'summary.json'), JSON.stringify(summary, null, 2));
  console.log('\n━'.repeat(50));
  console.log('📈 Test Results Summary');
  console.log('━'.repeat(50));
  console.log(`Total tests:    ${totalTests}`);
  console.log(`✅ Passed:      ${totalPassed}`);
  console.log(`❌ Failed:      ${totalFailed}`);
  console.log(`⏭️  Skipped:     ${totalSkipped}`);
  console.log(`⚠️  Flaky:       ${totalFlaky}`);
  console.log(`⏱️  Duration:    ${(summary.duration / 1000).toFixed(2)}s`);
  console.log('━'.repeat(50));
  return summary;
 }
 /**
 * Main execution
 */
 async function main() {
  const startTime = Date.now();
  const shardPromises = [];
  // Run all shards in parallel
  for (let i = 1; i <= SHARD_COUNT; i++) {
    shardPromises.push(runShard(i));
  }
  try {
    await Promise.allSettled(shardPromises);
  } catch (error) {
    console.error('❌ One or more shards failed:', error);
  }
  // Aggregate results
  const summary = aggregateResults();
  const totalTime = ((Date.now() - startTime) / 1000).toFixed(2);
  console.log(`\n⏱️  Total execution time: ${totalTime}s`);
  // Exit with failure if any tests failed
  if (summary.failed > 0) {
    console.error('\n❌ Test suite failed');
    process.exit(1);
  }
  console.log('\n✅ All tests passed');
  process.exit(0);
 }
 main().catch((error) => {
  console.error('Fatal error:', error);
  process.exit(1);
 });
 ```
 **package.json integration**:
 ```json
 {
  "scripts": {
    "test:sharded": "node scripts/run-sharded-tests.js",
    "test:sharded:ci": "SHARD_COUNT=8 TEST_ENV=staging node scripts/run-sharded-tests.js"
  }
 }
 ```
 **Key Points**:
 - **Parallel shard execution**: All shards run simultaneously
 - **Result aggregation**: Unified summary across shards
 - **Failure detection**: Exit code reflects overall test status
 - **Artifact preservation**: Individual shard results saved for debugging
 - **CI/local compatibility**: Same script works in both environments
 ---
 ### Example 4: Selective Test Execution (Changed Files + Tags)
 **Context**: Optimize CI by running only relevant tests based on file changes and tags.
 **Implementation**:
 ```bash
 #!/bin/bash
 # scripts/selective-test-runner.sh
 # Intelligent test selection based on changed files and test tags
 set -e
 BASE_BRANCH=${BASE_BRANCH:-main}
 TEST_ENV=${TEST_ENV:-local}
 echo "🎯 Selective Test Runner"
 echo "━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━"
 echo "Base branch: $BASE_BRANCH"
 echo "Environment: $TEST_ENV"
 echo ""
 # Detect changed files (all types, not just tests)
 CHANGED_FILES=$(git diff --name-only $BASE_BRANCH...HEAD)
 if [ -z "$CHANGED_FILES" ]; then
  echo "✅ No files changed. Skipping tests."
  exit 0
 fi
 echo "Changed files:"
 echo "$CHANGED_FILES" | sed 's/^/  - /'
 echo ""
 # Determine test strategy based on changes
 run_smoke_only=false
 run_all_tests=false
 affected_specs=""
 # Critical files = run all tests
 if echo "$CHANGED_FILES" | grep -qE '(package\.json|package-lock\.json|playwright\.config|cypress\.config|\.github/workflows)'; then
  echo "⚠️  Critical configuration files changed. Running ALL tests."
  run_all_tests=true
 # Auth/security changes = run all auth + smoke tests
 elif echo "$CHANGED_FILES" | grep -qE '(auth|login|signup|security)'; then
  echo "🔒 Auth/security files changed. Running auth + smoke tests."
  npm run test -- --grep "@auth|@smoke"
  exit $?
 # API changes = run integration + smoke tests
 elif echo "$CHANGED_FILES" | grep -qE '(api|service|controller)'; then
  echo "🔌 API files changed. Running integration + smoke tests."
  npm run test -- --grep "@integration|@smoke"
  exit $?
 # UI component changes = run related component tests
 elif echo "$CHANGED_FILES" | grep -qE '\.(tsx|jsx|vue)$'; then
  echo "🎨 UI components changed. Running component + smoke tests."
  # Extract component names and find related tests
  components=$(echo "$CHANGED_FILES" | grep -E '\.(tsx|jsx|vue)$' | xargs -I {} basename {} | sed 's/\.[^.]*$//')
  for component in $components; do
    # Find tests matching component name
    affected_specs+=$(find tests -name "*${component}*" -type f) || true
  done
  if [ -n "$affected_specs" ]; then
    echo "Running tests for: $affected_specs"
    npm run test -- $affected_specs --grep "@smoke"
  else
    echo "No specific tests found. Running smoke tests only."
    npm run test -- --grep "@smoke"
  fi
  exit $?
 # Documentation/config only = run smoke tests
 elif echo "$CHANGED_FILES" | grep -qE '\.(md|txt|json|yml|yaml)$'; then
  echo "📝 Documentation/config files changed. Running smoke tests only."
  run_smoke_only=true
 else
  echo "⚙️  Other files changed. Running smoke tests."
  run_smoke_only=true
 fi
 # Execute selected strategy
 if [ "$run_all_tests" = true ]; then
  echo ""
  echo "Running full test suite..."
  npm run test
 elif [ "$run_smoke_only" = true ]; then
  echo ""
  echo "Running smoke tests..."
  npm run test -- --grep "@smoke"
 fi
 ```
 **Usage in GitHub Actions**:
 ```yaml
 # .github/workflows/selective-tests.yml
 name: Selective Tests
 on: pull_request
 jobs:
  selective-tests:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
        with:
          fetch-depth: 0
      - name: Run selective tests
        run: bash scripts/selective-test-runner.sh
        env:
          BASE_BRANCH: ${{ github.base_ref }}
          TEST_ENV: staging
 ```
 **Key Points**:
 - **Intelligent routing**: Tests selected based on changed file types
 - **Tag-based filtering**: Use @smoke, @auth, @integration tags
 - **Fast feedback**: Only relevant tests run on most PRs
 - **Safety net**: Critical changes trigger full suite
 - **Component mapping**: UI changes run related component tests
 ---
 ## CI Configuration Checklist
 Before deploying your CI pipeline, verify:
 - [ ] **Caching strategy**: node_modules, npm cache, browser binaries cached
 - [ ] **Timeout budgets**: Each job has reasonable timeout (10-30 min)
 - [ ] **Artifact retention**: 30 days for reports, 7 days for failure artifacts
 - [ ] **Parallelization**: Matrix strategy uses fail-fast: false
 - [ ] **Burn-in enabled**: Changed specs run 5-10x before merge
 - [ ] **wait-on app startup**: CI waits for app (wait-on: '<http://localhost:3000>')
 - [ ] **Secrets documented**: README lists required secrets (API keys, tokens)
 - [ ] **Local parity**: CI scripts runnable locally (npm run test:ci)
 ## Integration Points
 - Used in workflows: `*ci` (CI/CD pipeline setup)
 - Related fragments: `selective-testing.md`, `playwright-config.md`, `test-quality.md`
 - CI tools: GitHub Actions, GitLab CI, CircleCI, Jenkins
 _Source: Murat CI/CD strategy blog, Playwright/Cypress workflow examples, SEON production pipelines_
--- a/_bmad/bmm/testarch/knowledge/component-tdd.md
+++ b/_bmad/bmm/testarch/knowledge/component-tdd.md
@@ -0,0 +1,486 @@
 # Component Test-Driven Development Loop
 ## Principle
 Start every UI change with a failing component test (`cy.mount`, Playwright component test, or RTL `render`). Follow the Red-Green-Refactor cycle: write a failing test (red), make it pass with minimal code (green), then improve the implementation (refactor). Ship only after the cycle completes. Keep component tests under 100 lines, isolated with fresh providers per test, and validate accessibility alongside functionality.
 ## Rationale
 Component TDD provides immediate feedback during development. Failing tests (red) clarify requirements before writing code. Minimal implementations (green) prevent over-engineering. Refactoring with passing tests ensures changes don't break functionality. Isolated tests with fresh providers prevent state bleed in parallel runs. Accessibility assertions catch usability issues early. Visual debugging (Cypress runner, Storybook, Playwright trace viewer) accelerates diagnosis when tests fail.
 ## Pattern Examples
 ### Example 1: Red-Green-Refactor Loop
 **Context**: When building a new component, start with a failing test that describes the desired behavior. Implement just enough to pass, then refactor for quality.
 **Implementation**:
 ```typescript
 // Step 1: RED - Write failing test
 // Button.cy.tsx (Cypress Component Test)
 import { Button } from './Button';
 describe('Button Component', () => {
  it('should render with label', () => {
    cy.mount(<Button label="Click Me" />);
    cy.contains('Click Me').should('be.visible');
  });
  it('should call onClick when clicked', () => {
    const onClickSpy = cy.stub().as('onClick');
    cy.mount(<Button label="Submit" onClick={onClickSpy} />);
    cy.get('button').click();
    cy.get('@onClick').should('have.been.calledOnce');
  });
 });
 // Run test: FAILS - Button component doesn't exist yet
 // Error: "Cannot find module './Button'"
 // Step 2: GREEN - Minimal implementation
 // Button.tsx
 type ButtonProps = {
  label: string;
  onClick?: () => void;
 };
 export const Button = ({ label, onClick }: ButtonProps) => {
  return <button onClick={onClick}>{label}</button>;
 };
 // Run test: PASSES - Component renders and handles clicks
 // Step 3: REFACTOR - Improve implementation
 // Add disabled state, loading state, variants
 type ButtonProps = {
  label: string;
  onClick?: () => void;
  disabled?: boolean;
  loading?: boolean;
  variant?: 'primary' | 'secondary' | 'danger';
 };
 export const Button = ({
  label,
  onClick,
  disabled = false,
  loading = false,
  variant = 'primary'
 }: ButtonProps) => {
  return (
    <button
      onClick={onClick}
      disabled={disabled || loading}
      className={`btn btn-${variant}`}
      data-testid="button"
    >
      {loading ? <Spinner /> : label}
    </button>
  );
 };
 // Step 4: Expand tests for new features
 describe('Button Component', () => {
  it('should render with label', () => {
    cy.mount(<Button label="Click Me" />);
    cy.contains('Click Me').should('be.visible');
  });
  it('should call onClick when clicked', () => {
    const onClickSpy = cy.stub().as('onClick');
    cy.mount(<Button label="Submit" onClick={onClickSpy} />);
    cy.get('button').click();
    cy.get('@onClick').should('have.been.calledOnce');
  });
  it('should be disabled when disabled prop is true', () => {
    cy.mount(<Button label="Submit" disabled={true} />);
    cy.get('button').should('be.disabled');
  });
  it('should show spinner when loading', () => {
    cy.mount(<Button label="Submit" loading={true} />);
    cy.get('[data-testid="spinner"]').should('be.visible');
    cy.get('button').should('be.disabled');
  });
  it('should apply variant styles', () => {
    cy.mount(<Button label="Delete" variant="danger" />);
    cy.get('button').should('have.class', 'btn-danger');
  });
 });
 // Run tests: ALL PASS - Refactored component still works
 // Playwright Component Test equivalent
 import { test, expect } from '@playwright/experimental-ct-react';
 import { Button } from './Button';
 test.describe('Button Component', () => {
  test('should call onClick when clicked', async ({ mount }) => {
    let clicked = false;
    const component = await mount(
      <Button label="Submit" onClick={() => { clicked = true; }} />
    );
    await component.getByRole('button').click();
    expect(clicked).toBe(true);
  });
  test('should be disabled when loading', async ({ mount }) => {
    const component = await mount(<Button label="Submit" loading={true} />);
    await expect(component.getByRole('button')).toBeDisabled();
    await expect(component.getByTestId('spinner')).toBeVisible();
  });
 });
 ```
 **Key Points**:
 - Red: Write failing test first - clarifies requirements before coding
 - Green: Implement minimal code to pass - prevents over-engineering
 - Refactor: Improve code quality while keeping tests green
 - Expand: Add tests for new features after refactoring
 - Cycle repeats: Each new feature starts with a failing test
 ### Example 2: Provider Isolation Pattern
 **Context**: When testing components that depend on context providers (React Query, Auth, Router), wrap them with required providers in each test to prevent state bleed between tests.
 **Implementation**:
 ```typescript
 // test-utils/AllTheProviders.tsx
 import { FC, ReactNode } from 'react';
 import { QueryClient, QueryClientProvider } from '@tanstack/react-query';
 import { BrowserRouter } from 'react-router-dom';
 import { AuthProvider } from '../contexts/AuthContext';
 type Props = {
  children: ReactNode;
  initialAuth?: { user: User | null; token: string | null };
 };
 export const AllTheProviders: FC<Props> = ({ children, initialAuth }) => {
  // Create NEW QueryClient per test (prevent state bleed)
  const queryClient = new QueryClient({
    defaultOptions: {
      queries: { retry: false },
      mutations: { retry: false }
    }
  });
  return (
    <QueryClientProvider client={queryClient}>
      <BrowserRouter>
        <AuthProvider initialAuth={initialAuth}>
          {children}
        </AuthProvider>
      </BrowserRouter>
    </QueryClientProvider>
  );
 };
 // Cypress custom mount command
 // cypress/support/component.tsx
 import { mount } from 'cypress/react18';
 import { AllTheProviders } from '../../test-utils/AllTheProviders';
 Cypress.Commands.add('wrappedMount', (component, options = {}) => {
  const { initialAuth, ...mountOptions } = options;
  return mount(
    <AllTheProviders initialAuth={initialAuth}>
      {component}
    </AllTheProviders>,
    mountOptions
  );
 });
 // Usage in tests
 // UserProfile.cy.tsx
 import { UserProfile } from './UserProfile';
 describe('UserProfile Component', () => {
  it('should display user when authenticated', () => {
    const user = { id: 1, name: 'John Doe', email: 'john@example.com' };
    cy.wrappedMount(<UserProfile />, {
      initialAuth: { user, token: 'fake-token' }
    });
    cy.contains('John Doe').should('be.visible');
    cy.contains('john@example.com').should('be.visible');
  });
  it('should show login prompt when not authenticated', () => {
    cy.wrappedMount(<UserProfile />, {
      initialAuth: { user: null, token: null }
    });
    cy.contains('Please log in').should('be.visible');
  });
 });
 // Playwright Component Test with providers
 import { test, expect } from '@playwright/experimental-ct-react';
 import { QueryClient, QueryClientProvider } from '@tanstack/react-query';
 import { UserProfile } from './UserProfile';
 import { AuthProvider } from '../contexts/AuthContext';
 test.describe('UserProfile Component', () => {
  test('should display user when authenticated', async ({ mount }) => {
    const user = { id: 1, name: 'John Doe', email: 'john@example.com' };
    const queryClient = new QueryClient();
    const component = await mount(
      <QueryClientProvider client={queryClient}>
        <AuthProvider initialAuth={{ user, token: 'fake-token' }}>
          <UserProfile />
        </AuthProvider>
      </QueryClientProvider>
    );
    await expect(component.getByText('John Doe')).toBeVisible();
    await expect(component.getByText('john@example.com')).toBeVisible();
  });
 });
 ```
 **Key Points**:
 - Create NEW providers per test (QueryClient, Router, Auth)
 - Prevents state pollution between tests
 - `initialAuth` prop allows testing different auth states
 - Custom mount command (`wrappedMount`) reduces boilerplate
 - Providers wrap component, not the entire test suite
 ### Example 3: Accessibility Assertions
 **Context**: When testing components, validate accessibility alongside functionality using axe-core, ARIA roles, labels, and keyboard navigation.
 **Implementation**:
 ```typescript
 // Cypress with axe-core
 // cypress/support/component.tsx
 import 'cypress-axe';
 // Form.cy.tsx
 import { Form } from './Form';
 describe('Form Component Accessibility', () => {
  beforeEach(() => {
    cy.wrappedMount(<Form />);
    cy.injectAxe(); // Inject axe-core
  });
  it('should have no accessibility violations', () => {
    cy.checkA11y(); // Run axe scan
  });
  it('should have proper ARIA labels', () => {
    cy.get('input[name="email"]').should('have.attr', 'aria-label', 'Email address');
    cy.get('input[name="password"]').should('have.attr', 'aria-label', 'Password');
    cy.get('button[type="submit"]').should('have.attr', 'aria-label', 'Submit form');
  });
  it('should support keyboard navigation', () => {
    // Tab through form fields
    cy.get('input[name="email"]').focus().type('test@example.com');
    cy.realPress('Tab'); // cypress-real-events plugin
    cy.focused().should('have.attr', 'name', 'password');
    cy.focused().type('password123');
    cy.realPress('Tab');
    cy.focused().should('have.attr', 'type', 'submit');
    cy.realPress('Enter'); // Submit via keyboard
    cy.contains('Form submitted').should('be.visible');
  });
  it('should announce errors to screen readers', () => {
    cy.get('button[type="submit"]').click(); // Submit without data
    // Error has role="alert" and aria-live="polite"
    cy.get('[role="alert"]')
      .should('be.visible')
      .and('have.attr', 'aria-live', 'polite')
      .and('contain', 'Email is required');
  });
  it('should have sufficient color contrast', () => {
    cy.checkA11y(null, {
      rules: {
        'color-contrast': { enabled: true }
      }
    });
  });
 });
 // Playwright with axe-playwright
 import { test, expect } from '@playwright/experimental-ct-react';
 import AxeBuilder from '@axe-core/playwright';
 import { Form } from './Form';
 test.describe('Form Component Accessibility', () => {
  test('should have no accessibility violations', async ({ mount, page }) => {
    await mount(<Form />);
    const accessibilityScanResults = await new AxeBuilder({ page })
      .analyze();
    expect(accessibilityScanResults.violations).toEqual([]);
  });
  test('should support keyboard navigation', async ({ mount, page }) => {
    const component = await mount(<Form />);
    await component.getByLabel('Email address').fill('test@example.com');
    await page.keyboard.press('Tab');
    await expect(component.getByLabel('Password')).toBeFocused();
    await component.getByLabel('Password').fill('password123');
    await page.keyboard.press('Tab');
    await expect(component.getByRole('button', { name: 'Submit form' })).toBeFocused();
    await page.keyboard.press('Enter');
    await expect(component.getByText('Form submitted')).toBeVisible();
  });
 });
 ```
 **Key Points**:
 - Use `cy.checkA11y()` (Cypress) or `AxeBuilder` (Playwright) for automated accessibility scanning
 - Validate ARIA roles, labels, and live regions
 - Test keyboard navigation (Tab, Enter, Escape)
 - Ensure errors are announced to screen readers (`role="alert"`, `aria-live`)
 - Check color contrast meets WCAG standards
 ### Example 4: Visual Regression Test
 **Context**: When testing components, capture screenshots to detect unintended visual changes. Use Playwright visual comparison or Cypress snapshot plugins.
 **Implementation**:
 ```typescript
 // Playwright visual regression
 import { test, expect } from '@playwright/experimental-ct-react';
 import { Button } from './Button';
 test.describe('Button Visual Regression', () => {
  test('should match primary button snapshot', async ({ mount }) => {
    const component = await mount(<Button label="Primary" variant="primary" />);
    // Capture and compare screenshot
    await expect(component).toHaveScreenshot('button-primary.png');
  });
  test('should match secondary button snapshot', async ({ mount }) => {
    const component = await mount(<Button label="Secondary" variant="secondary" />);
    await expect(component).toHaveScreenshot('button-secondary.png');
  });
  test('should match disabled button snapshot', async ({ mount }) => {
    const component = await mount(<Button label="Disabled" disabled={true} />);
    await expect(component).toHaveScreenshot('button-disabled.png');
  });
  test('should match loading button snapshot', async ({ mount }) => {
    const component = await mount(<Button label="Loading" loading={true} />);
    await expect(component).toHaveScreenshot('button-loading.png');
  });
 });
 // Cypress visual regression with percy or snapshot plugins
 import { Button } from './Button';
 describe('Button Visual Regression', () => {
  it('should match primary button snapshot', () => {
    cy.wrappedMount(<Button label="Primary" variant="primary" />);
    // Option 1: Percy (cloud-based visual testing)
    cy.percySnapshot('Button - Primary');
    // Option 2: cypress-plugin-snapshots (local snapshots)
    cy.get('button').toMatchImageSnapshot({
      name: 'button-primary',
      threshold: 0.01 // 1% threshold for pixel differences
    });
  });
  it('should match hover state', () => {
    cy.wrappedMount(<Button label="Hover Me" />);
    cy.get('button').realHover(); // cypress-real-events
    cy.percySnapshot('Button - Hover State');
  });
  it('should match focus state', () => {
    cy.wrappedMount(<Button label="Focus Me" />);
    cy.get('button').focus();
    cy.percySnapshot('Button - Focus State');
  });
 });
 // Playwright configuration for visual regression
 // playwright.config.ts
 export default defineConfig({
  expect: {
    toHaveScreenshot: {
      maxDiffPixels: 100, // Allow 100 pixels difference
      threshold: 0.2 // 20% threshold
    }
  },
  use: {
    screenshot: 'only-on-failure'
  }
 });
 // Update snapshots when intentional changes are made
 // npx playwright test --update-snapshots
 ```
 **Key Points**:
 - Playwright: Use `toHaveScreenshot()` for built-in visual comparison
 - Cypress: Use Percy (cloud) or snapshot plugins (local) for visual testing
 - Capture different states: default, hover, focus, disabled, loading
 - Set threshold for acceptable pixel differences (avoid false positives)
 - Update snapshots when visual changes are intentional
 - Visual tests catch unintended CSS/layout regressions
 ## Integration Points
 - **Used in workflows**: `*atdd` (component test generation), `*automate` (component test expansion), `*framework` (component testing setup)
 - **Related fragments**:
  - `test-quality.md` - Keep component tests <100 lines, isolated, focused
  - `fixture-architecture.md` - Provider wrapping patterns, custom mount commands
  - `data-factories.md` - Factory functions for component props
  - `test-levels-framework.md` - When to use component tests vs E2E tests
 ## TDD Workflow Summary
 **Red-Green-Refactor Cycle**:
 1. **Red**: Write failing test describing desired behavior
 2. **Green**: Implement minimal code to make test pass
 3. **Refactor**: Improve code quality, tests stay green
 4. **Repeat**: Each new feature starts with failing test
 **Component Test Checklist**:
 - [ ] Test renders with required props
 - [ ] Test user interactions (click, type, submit)
 - [ ] Test different states (loading, error, disabled)
 - [ ] Test accessibility (ARIA, keyboard navigation)
 - [ ] Test visual regression (snapshots)
 - [ ] Isolate with fresh providers (no state bleed)
 - [ ] Keep tests <100 lines (split by intent)
 _Source: CCTDD repository, Murat component testing talks, Playwright/Cypress component testing docs._
--- a/_bmad/bmm/testarch/knowledge/contract-testing.md
+++ b/_bmad/bmm/testarch/knowledge/contract-testing.md
@@ -0,0 +1,957 @@
 # Contract Testing Essentials (Pact)
 ## Principle
 Contract testing validates API contracts between consumer and provider services without requiring integrated end-to-end tests. Store consumer contracts alongside integration specs, version contracts semantically, and publish on every CI run. Provider verification before merge surfaces breaking changes immediately, while explicit fallback behavior (timeouts, retries, error payloads) captures resilience guarantees in contracts.
 ## Rationale
 Traditional integration testing requires running both consumer and provider simultaneously, creating slow, flaky tests with complex setup. Contract testing decouples services: consumers define expectations (pact files), providers verify against those expectations independently. This enables parallel development, catches breaking changes early, and documents API behavior as executable specifications. Pair contract tests with API smoke tests to validate data mapping and UI rendering in tandem.
 ## Pattern Examples
 ### Example 1: Pact Consumer Test (Frontend → Backend API)
 **Context**: React application consuming a user management API, defining expected interactions.
 **Implementation**:
 ```typescript
 // tests/contract/user-api.pact.spec.ts
 import { PactV3, MatchersV3 } from '@pact-foundation/pact';
 import { getUserById, createUser, User } from '@/api/user-service';
 const { like, eachLike, string, integer } = MatchersV3;
 /**
 * Consumer-Driven Contract Test
 * - Consumer (React app) defines expected API behavior
 * - Generates pact file for provider to verify
 * - Runs in isolation (no real backend required)
 */
 const provider = new PactV3({
  consumer: 'user-management-web',
  provider: 'user-api-service',
  dir: './pacts', // Output directory for pact files
  logLevel: 'warn',
 });
 describe('User API Contract', () => {
  describe('GET /users/:id', () => {
    it('should return user when user exists', async () => {
      // Arrange: Define expected interaction
      await provider
        .given('user with id 1 exists') // Provider state
        .uponReceiving('a request for user 1')
        .withRequest({
          method: 'GET',
          path: '/users/1',
          headers: {
            Accept: 'application/json',
            Authorization: like('Bearer token123'), // Matcher: any string
          },
        })
        .willRespondWith({
          status: 200,
          headers: {
            'Content-Type': 'application/json',
          },
          body: like({
            id: integer(1),
            name: string('John Doe'),
            email: string('john@example.com'),
            role: string('user'),
            createdAt: string('2025-01-15T10:00:00Z'),
          }),
        })
        .executeTest(async (mockServer) => {
          // Act: Call consumer code against mock server
          const user = await getUserById(1, {
            baseURL: mockServer.url,
            headers: { Authorization: 'Bearer token123' },
          });
          // Assert: Validate consumer behavior
          expect(user).toEqual(
            expect.objectContaining({
              id: 1,
              name: 'John Doe',
              email: 'john@example.com',
              role: 'user',
            }),
          );
        });
    });
    it('should handle 404 when user does not exist', async () => {
      await provider
        .given('user with id 999 does not exist')
        .uponReceiving('a request for non-existent user')
        .withRequest({
          method: 'GET',
          path: '/users/999',
          headers: { Accept: 'application/json' },
        })
        .willRespondWith({
          status: 404,
          headers: { 'Content-Type': 'application/json' },
          body: {
            error: 'User not found',
            code: 'USER_NOT_FOUND',
          },
        })
        .executeTest(async (mockServer) => {
          // Act & Assert: Consumer handles 404 gracefully
          await expect(getUserById(999, { baseURL: mockServer.url })).rejects.toThrow('User not found');
        });
    });
  });
  describe('POST /users', () => {
    it('should create user and return 201', async () => {
      const newUser: Omit<User, 'id' | 'createdAt'> = {
        name: 'Jane Smith',
        email: 'jane@example.com',
        role: 'admin',
      };
      await provider
        .given('no users exist')
        .uponReceiving('a request to create a user')
        .withRequest({
          method: 'POST',
          path: '/users',
          headers: {
            'Content-Type': 'application/json',
            Accept: 'application/json',
          },
          body: like(newUser),
        })
        .willRespondWith({
          status: 201,
          headers: { 'Content-Type': 'application/json' },
          body: like({
            id: integer(2),
            name: string('Jane Smith'),
            email: string('jane@example.com'),
            role: string('admin'),
            createdAt: string('2025-01-15T11:00:00Z'),
          }),
        })
        .executeTest(async (mockServer) => {
          const createdUser = await createUser(newUser, {
            baseURL: mockServer.url,
          });
          expect(createdUser).toEqual(
            expect.objectContaining({
              id: expect.any(Number),
              name: 'Jane Smith',
              email: 'jane@example.com',
              role: 'admin',
            }),
          );
        });
    });
  });
 });
 ```
 **package.json scripts**:
 ```json
 {
  "scripts": {
    "test:contract": "jest tests/contract --testTimeout=30000",
    "pact:publish": "pact-broker publish ./pacts --consumer-app-version=$GIT_SHA --broker-base-url=$PACT_BROKER_URL --broker-token=$PACT_BROKER_TOKEN"
  }
 }
 ```
 **Key Points**:
 - **Consumer-driven**: Frontend defines expectations, not backend
 - **Matchers**: `like`, `string`, `integer` for flexible matching
 - **Provider states**: given() sets up test preconditions
 - **Isolation**: No real backend needed, runs fast
 - **Pact generation**: Automatically creates JSON pact files
 ---
 ### Example 2: Pact Provider Verification (Backend validates contracts)
 **Context**: Node.js/Express API verifying pacts published by consumers.
 **Implementation**:
 ```typescript
 // tests/contract/user-api.provider.spec.ts
 import { Verifier, VerifierOptions } from '@pact-foundation/pact';
 import { server } from '../../src/server'; // Your Express/Fastify app
 import { seedDatabase, resetDatabase } from '../support/db-helpers';
 /**
 * Provider Verification Test
 * - Provider (backend API) verifies against published pacts
 * - State handlers setup test data for each interaction
 * - Runs before merge to catch breaking changes
 */
 describe('Pact Provider Verification', () => {
  let serverInstance;
  const PORT = 3001;
  beforeAll(async () => {
    // Start provider server
    serverInstance = server.listen(PORT);
    console.log(`Provider server running on port ${PORT}`);
  });
  afterAll(async () => {
    // Cleanup
    await serverInstance.close();
  });
  it('should verify pacts from all consumers', async () => {
    const opts: VerifierOptions = {
      // Provider details
      provider: 'user-api-service',
      providerBaseUrl: `http://localhost:${PORT}`,
      // Pact Broker configuration
      pactBrokerUrl: process.env.PACT_BROKER_URL,
      pactBrokerToken: process.env.PACT_BROKER_TOKEN,
      publishVerificationResult: process.env.CI === 'true',
      providerVersion: process.env.GIT_SHA || 'dev',
      // State handlers: Setup provider state for each interaction
      stateHandlers: {
        'user with id 1 exists': async () => {
          await seedDatabase({
            users: [
              {
                id: 1,
                name: 'John Doe',
                email: 'john@example.com',
                role: 'user',
                createdAt: '2025-01-15T10:00:00Z',
              },
            ],
          });
          return 'User seeded successfully';
        },
        'user with id 999 does not exist': async () => {
          // Ensure user doesn't exist
          await resetDatabase();
          return 'Database reset';
        },
        'no users exist': async () => {
          await resetDatabase();
          return 'Database empty';
        },
      },
      // Request filters: Add auth headers to all requests
      requestFilter: (req, res, next) => {
        // Mock authentication for verification
        req.headers['x-user-id'] = 'test-user';
        req.headers['authorization'] = 'Bearer valid-test-token';
        next();
      },
      // Timeout for verification
      timeout: 30000,
    };
    // Run verification
    await new Verifier(opts).verifyProvider();
  });
 });
 ```
 **CI integration**:
 ```yaml
 # .github/workflows/pact-provider.yml
 name: Pact Provider Verification
 on:
  pull_request:
  push:
    branches: [main]
 jobs:
  verify-contracts:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - name: Setup Node.js
        uses: actions/setup-node@v4
        with:
          node-version-file: '.nvmrc'
      - name: Install dependencies
        run: npm ci
      - name: Start database
        run: docker-compose up -d postgres
      - name: Run migrations
        run: npm run db:migrate
      - name: Verify pacts
        run: npm run test:contract:provider
        env:
          PACT_BROKER_URL: ${{ secrets.PACT_BROKER_URL }}
          PACT_BROKER_TOKEN: ${{ secrets.PACT_BROKER_TOKEN }}
          GIT_SHA: ${{ github.sha }}
          CI: true
      - name: Can I Deploy?
        run: |
          npx pact-broker can-i-deploy \
            --pacticipant user-api-service \
            --version ${{ github.sha }} \
            --to-environment production
        env:
          PACT_BROKER_BASE_URL: ${{ secrets.PACT_BROKER_URL }}
          PACT_BROKER_TOKEN: ${{ secrets.PACT_BROKER_TOKEN }}
 ```
 **Key Points**:
 - **State handlers**: Setup provider data for each given() state
 - **Request filters**: Add auth/headers for verification requests
 - **CI publishing**: Verification results sent to broker
 - **can-i-deploy**: Safety check before production deployment
 - **Database isolation**: Reset between state handlers
 ---
 ### Example 3: Contract CI Integration (Consumer & Provider Workflow)
 **Context**: Complete CI/CD workflow coordinating consumer pact publishing and provider verification.
 **Implementation**:
 ```yaml
 # .github/workflows/pact-consumer.yml (Consumer side)
 name: Pact Consumer Tests
 on:
  pull_request:
  push:
    branches: [main]
 jobs:
  consumer-tests:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - name: Setup Node.js
        uses: actions/setup-node@v4
        with:
          node-version-file: '.nvmrc'
      - name: Install dependencies
        run: npm ci
      - name: Run consumer contract tests
        run: npm run test:contract
      - name: Publish pacts to broker
        if: github.ref == 'refs/heads/main' || github.event_name == 'pull_request'
        run: |
          npx pact-broker publish ./pacts \
            --consumer-app-version ${{ github.sha }} \
            --branch ${{ github.head_ref || github.ref_name }} \
            --broker-base-url ${{ secrets.PACT_BROKER_URL }} \
            --broker-token ${{ secrets.PACT_BROKER_TOKEN }}
      - name: Tag pact with environment (main branch only)
        if: github.ref == 'refs/heads/main'
        run: |
          npx pact-broker create-version-tag \
            --pacticipant user-management-web \
            --version ${{ github.sha }} \
            --tag production \
            --broker-base-url ${{ secrets.PACT_BROKER_URL }} \
            --broker-token ${{ secrets.PACT_BROKER_TOKEN }}
 ```
 ```yaml
 # .github/workflows/pact-provider.yml (Provider side)
 name: Pact Provider Verification
 on:
  pull_request:
  push:
    branches: [main]
  repository_dispatch:
    types: [pact_changed] # Webhook from Pact Broker
 jobs:
  verify-contracts:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - name: Setup Node.js
        uses: actions/setup-node@v4
        with:
          node-version-file: '.nvmrc'
      - name: Install dependencies
        run: npm ci
      - name: Start dependencies
        run: docker-compose up -d
      - name: Run provider verification
        run: npm run test:contract:provider
        env:
          PACT_BROKER_URL: ${{ secrets.PACT_BROKER_URL }}
          PACT_BROKER_TOKEN: ${{ secrets.PACT_BROKER_TOKEN }}
          GIT_SHA: ${{ github.sha }}
          CI: true
      - name: Publish verification results
        if: always()
        run: echo "Verification results published to broker"
      - name: Can I Deploy to Production?
        if: github.ref == 'refs/heads/main'
        run: |
          npx pact-broker can-i-deploy \
            --pacticipant user-api-service \
            --version ${{ github.sha }} \
            --to-environment production \
            --broker-base-url ${{ secrets.PACT_BROKER_URL }} \
            --broker-token ${{ secrets.PACT_BROKER_TOKEN }} \
            --retry-while-unknown 6 \
            --retry-interval 10
      - name: Record deployment (if can-i-deploy passed)
        if: success() && github.ref == 'refs/heads/main'
        run: |
          npx pact-broker record-deployment \
            --pacticipant user-api-service \
            --version ${{ github.sha }} \
            --environment production \
            --broker-base-url ${{ secrets.PACT_BROKER_URL }} \
            --broker-token ${{ secrets.PACT_BROKER_TOKEN }}
 ```
 **Pact Broker Webhook Configuration**:
 ```json
 {
  "events": [
    {
      "name": "contract_content_changed"
    }
  ],
  "request": {
    "method": "POST",
    "url": "https://api.github.com/repos/your-org/user-api/dispatches",
    "headers": {
      "Authorization": "Bearer ${user.githubToken}",
      "Content-Type": "application/json",
      "Accept": "application/vnd.github.v3+json"
    },
    "body": {
      "event_type": "pact_changed",
      "client_payload": {
        "pact_url": "${pactbroker.pactUrl}",
        "consumer": "${pactbroker.consumerName}",
        "provider": "${pactbroker.providerName}"
      }
    }
  }
 }
 ```
 **Key Points**:
 - **Automatic trigger**: Consumer pact changes trigger provider verification via webhook
 - **Branch tracking**: Pacts published per branch for feature testing
 - **can-i-deploy**: Safety gate before production deployment
 - **Record deployment**: Track which version is in each environment
 - **Parallel dev**: Consumer and provider teams work independently
 ---
 ### Example 4: Resilience Coverage (Testing Fallback Behavior)
 **Context**: Capture timeout, retry, and error handling behavior explicitly in contracts.
 **Implementation**:
 ```typescript
 // tests/contract/user-api-resilience.pact.spec.ts
 import { PactV3, MatchersV3 } from '@pact-foundation/pact';
 import { getUserById, ApiError } from '@/api/user-service';
 const { like, string } = MatchersV3;
 const provider = new PactV3({
  consumer: 'user-management-web',
  provider: 'user-api-service',
  dir: './pacts',
 });
 describe('User API Resilience Contract', () => {
  /**
   * Test 500 error handling
   * Verifies consumer handles server errors gracefully
   */
  it('should handle 500 errors with retry logic', async () => {
    await provider
      .given('server is experiencing errors')
      .uponReceiving('a request that returns 500')
      .withRequest({
        method: 'GET',
        path: '/users/1',
        headers: { Accept: 'application/json' },
      })
      .willRespondWith({
        status: 500,
        headers: { 'Content-Type': 'application/json' },
        body: {
          error: 'Internal server error',
          code: 'INTERNAL_ERROR',
          retryable: true,
        },
      })
      .executeTest(async (mockServer) => {
        // Consumer should retry on 500
        try {
          await getUserById(1, {
            baseURL: mockServer.url,
            retries: 3,
            retryDelay: 100,
          });
          fail('Should have thrown error after retries');
        } catch (error) {
          expect(error).toBeInstanceOf(ApiError);
          expect((error as ApiError).code).toBe('INTERNAL_ERROR');
          expect((error as ApiError).retryable).toBe(true);
        }
      });
  });
  /**
   * Test 429 rate limiting
   * Verifies consumer respects rate limits
   */
  it('should handle 429 rate limit with backoff', async () => {
    await provider
      .given('rate limit exceeded for user')
      .uponReceiving('a request that is rate limited')
      .withRequest({
        method: 'GET',
        path: '/users/1',
      })
      .willRespondWith({
        status: 429,
        headers: {
          'Content-Type': 'application/json',
          'Retry-After': '60', // Retry after 60 seconds
        },
        body: {
          error: 'Too many requests',
          code: 'RATE_LIMIT_EXCEEDED',
        },
      })
      .executeTest(async (mockServer) => {
        try {
          await getUserById(1, {
            baseURL: mockServer.url,
            respectRateLimit: true,
          });
          fail('Should have thrown rate limit error');
        } catch (error) {
          expect(error).toBeInstanceOf(ApiError);
          expect((error as ApiError).code).toBe('RATE_LIMIT_EXCEEDED');
          expect((error as ApiError).retryAfter).toBe(60);
        }
      });
  });
  /**
   * Test timeout handling
   * Verifies consumer has appropriate timeout configuration
   */
  it('should timeout after 10 seconds', async () => {
    await provider
      .given('server is slow to respond')
      .uponReceiving('a request that times out')
      .withRequest({
        method: 'GET',
        path: '/users/1',
      })
      .willRespondWith({
        status: 200,
        headers: { 'Content-Type': 'application/json' },
        body: like({ id: 1, name: 'John' }),
      })
      .withDelay(15000) // Simulate 15 second delay
      .executeTest(async (mockServer) => {
        try {
          await getUserById(1, {
            baseURL: mockServer.url,
            timeout: 10000, // 10 second timeout
          });
          fail('Should have timed out');
        } catch (error) {
          expect(error).toBeInstanceOf(ApiError);
          expect((error as ApiError).code).toBe('TIMEOUT');
        }
      });
  });
  /**
   * Test partial response (optional fields)
   * Verifies consumer handles missing optional data
   */
  it('should handle response with missing optional fields', async () => {
    await provider
      .given('user exists with minimal data')
      .uponReceiving('a request for user with partial data')
      .withRequest({
        method: 'GET',
        path: '/users/1',
      })
      .willRespondWith({
        status: 200,
        headers: { 'Content-Type': 'application/json' },
        body: {
          id: integer(1),
          name: string('John Doe'),
          email: string('john@example.com'),
          // role, createdAt, etc. omitted (optional fields)
        },
      })
      .executeTest(async (mockServer) => {
        const user = await getUserById(1, { baseURL: mockServer.url });
        // Consumer handles missing optional fields gracefully
        expect(user.id).toBe(1);
        expect(user.name).toBe('John Doe');
        expect(user.role).toBeUndefined(); // Optional field
        expect(user.createdAt).toBeUndefined(); // Optional field
      });
  });
 });
 ```
 **API client with retry logic**:
 ```typescript
 // src/api/user-service.ts
 import axios, { AxiosInstance, AxiosRequestConfig } from 'axios';
 export class ApiError extends Error {
  constructor(
    message: string,
    public code: string,
    public retryable: boolean = false,
    public retryAfter?: number,
  ) {
    super(message);
  }
 }
 /**
 * User API client with retry and error handling
 */
 export async function getUserById(
  id: number,
  config?: AxiosRequestConfig & { retries?: number; retryDelay?: number; respectRateLimit?: boolean },
 ): Promise<User> {
  const { retries = 3, retryDelay = 1000, respectRateLimit = true, ...axiosConfig } = config || {};
  let lastError: Error;
  for (let attempt = 1; attempt <= retries; attempt++) {
    try {
      const response = await axios.get(`/users/${id}`, axiosConfig);
      return response.data;
    } catch (error: any) {
      lastError = error;
      // Handle rate limiting
      if (error.response?.status === 429) {
        const retryAfter = parseInt(error.response.headers['retry-after'] || '60');
        throw new ApiError('Too many requests', 'RATE_LIMIT_EXCEEDED', false, retryAfter);
      }
      // Retry on 500 errors
      if (error.response?.status === 500 && attempt < retries) {
        await new Promise((resolve) => setTimeout(resolve, retryDelay * attempt));
        continue;
      }
      // Handle 404
      if (error.response?.status === 404) {
        throw new ApiError('User not found', 'USER_NOT_FOUND', false);
      }
      // Handle timeout
      if (error.code === 'ECONNABORTED') {
        throw new ApiError('Request timeout', 'TIMEOUT', true);
      }
      break;
    }
  }
  throw new ApiError('Request failed after retries', 'INTERNAL_ERROR', true);
 }
 ```
 **Key Points**:
 - **Resilience contracts**: Timeouts, retries, errors explicitly tested
 - **State handlers**: Provider sets up each test scenario
 - **Error handling**: Consumer validates graceful degradation
 - **Retry logic**: Exponential backoff tested
 - **Optional fields**: Consumer handles partial responses
 ---
 ### Example 4: Pact Broker Housekeeping & Lifecycle Management
 **Context**: Automated broker maintenance to prevent contract sprawl and noise.
 **Implementation**:
 ```typescript
 // scripts/pact-broker-housekeeping.ts
 /**
 * Pact Broker Housekeeping Script
 * - Archive superseded contracts
 * - Expire unused pacts
 * - Tag releases for environment tracking
 */
 import { execSync } from 'child_process';
 const PACT_BROKER_URL = process.env.PACT_BROKER_URL!;
 const PACT_BROKER_TOKEN = process.env.PACT_BROKER_TOKEN!;
 const PACTICIPANT = 'user-api-service';
 /**
 * Tag release with environment
 */
 function tagRelease(version: string, environment: 'staging' | 'production') {
  console.log(`🏷️  Tagging ${PACTICIPANT} v${version} as ${environment}`);
  execSync(
    `npx pact-broker create-version-tag \
      --pacticipant ${PACTICIPANT} \
      --version ${version} \
      --tag ${environment} \
      --broker-base-url ${PACT_BROKER_URL} \
      --broker-token ${PACT_BROKER_TOKEN}`,
    { stdio: 'inherit' },
  );
 }
 /**
 * Record deployment to environment
 */
 function recordDeployment(version: string, environment: 'staging' | 'production') {
  console.log(`📝 Recording deployment of ${PACTICIPANT} v${version} to ${environment}`);
  execSync(
    `npx pact-broker record-deployment \
      --pacticipant ${PACTICIPANT} \
      --version ${version} \
      --environment ${environment} \
      --broker-base-url ${PACT_BROKER_URL} \
      --broker-token ${PACT_BROKER_TOKEN}`,
    { stdio: 'inherit' },
  );
 }
 /**
 * Clean up old pact versions (retention policy)
 * Keep: last 30 days, all production tags, latest from each branch
 */
 function cleanupOldPacts() {
  console.log(`🧹 Cleaning up old pacts for ${PACTICIPANT}`);
  execSync(
    `npx pact-broker clean \
      --pacticipant ${PACTICIPANT} \
      --broker-base-url ${PACT_BROKER_URL} \
      --broker-token ${PACT_BROKER_TOKEN} \
      --keep-latest-for-branch 1 \
      --keep-min-age 30`,
    { stdio: 'inherit' },
  );
 }
 /**
 * Check deployment compatibility
 */
 function canIDeploy(version: string, toEnvironment: string): boolean {
  console.log(`🔍 Checking if ${PACTICIPANT} v${version} can deploy to ${toEnvironment}`);
  try {
    execSync(
      `npx pact-broker can-i-deploy \
        --pacticipant ${PACTICIPANT} \
        --version ${version} \
        --to-environment ${toEnvironment} \
        --broker-base-url ${PACT_BROKER_URL} \
        --broker-token ${PACT_BROKER_TOKEN} \
        --retry-while-unknown 6 \
        --retry-interval 10`,
      { stdio: 'inherit' },
    );
    return true;
  } catch (error) {
    console.error(`❌ Cannot deploy to ${toEnvironment}`);
    return false;
  }
 }
 /**
 * Main housekeeping workflow
 */
 async function main() {
  const command = process.argv[2];
  const version = process.argv[3];
  const environment = process.argv[4] as 'staging' | 'production';
  switch (command) {
    case 'tag-release':
      tagRelease(version, environment);
      break;
    case 'record-deployment':
      recordDeployment(version, environment);
      break;
    case 'can-i-deploy':
      const canDeploy = canIDeploy(version, environment);
      process.exit(canDeploy ? 0 : 1);
    case 'cleanup':
      cleanupOldPacts();
      break;
    default:
      console.error('Unknown command. Use: tag-release | record-deployment | can-i-deploy | cleanup');
      process.exit(1);
  }
 }
 main();
 ```
 **package.json scripts**:
 ```json
 {
  "scripts": {
    "pact:tag": "ts-node scripts/pact-broker-housekeeping.ts tag-release",
    "pact:record": "ts-node scripts/pact-broker-housekeeping.ts record-deployment",
    "pact:can-deploy": "ts-node scripts/pact-broker-housekeeping.ts can-i-deploy",
    "pact:cleanup": "ts-node scripts/pact-broker-housekeeping.ts cleanup"
  }
 }
 ```
 **Deployment workflow integration**:
 ```yaml
 # .github/workflows/deploy-production.yml
 name: Deploy to Production
 on:
  push:
    tags:
      - 'v*'
 jobs:
  verify-contracts:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - name: Check pact compatibility
        run: npm run pact:can-deploy ${{ github.ref_name }} production
        env:
          PACT_BROKER_URL: ${{ secrets.PACT_BROKER_URL }}
          PACT_BROKER_TOKEN: ${{ secrets.PACT_BROKER_TOKEN }}
  deploy:
    needs: verify-contracts
    runs-on: ubuntu-latest
    steps:
      - name: Deploy to production
        run: ./scripts/deploy.sh production
      - name: Record deployment in Pact Broker
        run: npm run pact:record ${{ github.ref_name }} production
        env:
          PACT_BROKER_URL: ${{ secrets.PACT_BROKER_URL }}
          PACT_BROKER_TOKEN: ${{ secrets.PACT_BROKER_TOKEN }}
 ```
 **Scheduled cleanup**:
 ```yaml
 # .github/workflows/pact-housekeeping.yml
 name: Pact Broker Housekeeping
 on:
  schedule:
    - cron: '0 2 * * 0' # Weekly on Sunday at 2 AM
 jobs:
  cleanup:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - name: Cleanup old pacts
        run: npm run pact:cleanup
        env:
          PACT_BROKER_URL: ${{ secrets.PACT_BROKER_URL }}
          PACT_BROKER_TOKEN: ${{ secrets.PACT_BROKER_TOKEN }}
 ```
 **Key Points**:
 - **Automated tagging**: Releases tagged with environment
 - **Deployment tracking**: Broker knows which version is where
 - **Safety gate**: can-i-deploy blocks incompatible deployments
 - **Retention policy**: Keep recent, production, and branch-latest pacts
 - **Webhook triggers**: Provider verification runs on consumer changes
 ---
 ## Contract Testing Checklist
 Before implementing contract testing, verify:
 - [ ] **Pact Broker setup**: Hosted (Pactflow) or self-hosted broker configured
 - [ ] **Consumer tests**: Generate pacts in CI, publish to broker on merge
 - [ ] **Provider verification**: Runs on PR, verifies all consumer pacts
 - [ ] **State handlers**: Provider implements all given() states
 - [ ] **can-i-deploy**: Blocks deployment if contracts incompatible
 - [ ] **Webhooks configured**: Consumer changes trigger provider verification
 - [ ] **Retention policy**: Old pacts archived (keep 30 days, all production tags)
 - [ ] **Resilience tested**: Timeouts, retries, error codes in contracts
 ## Integration Points
 - Used in workflows: `*automate` (integration test generation), `*ci` (contract CI setup)
 - Related fragments: `test-levels-framework.md`, `ci-burn-in.md`
 - Tools: Pact.js, Pact Broker (Pactflow or self-hosted), Pact CLI
 _Source: Pact consumer/provider sample repos, Murat contract testing blog, Pact official documentation_
--- a/_bmad/bmm/testarch/knowledge/data-factories.md
+++ b/_bmad/bmm/testarch/knowledge/data-factories.md
@@ -0,0 +1,500 @@
 # Data Factories and API-First Setup
 ## Principle
 Prefer factory functions that accept overrides and return complete objects (`createUser(overrides)`). Seed test state through APIs, tasks, or direct DB helpers before visiting the UI—never via slow UI interactions. UI is for validation only, not setup.
 ## Rationale
 Static fixtures (JSON files, hardcoded objects) create brittle tests that:
 - Fail when schemas evolve (missing new required fields)
 - Cause collisions in parallel execution (same user IDs)
 - Hide test intent (what matters for _this_ test?)
 Dynamic factories with overrides provide:
 - **Parallel safety**: UUIDs and timestamps prevent collisions
 - **Schema evolution**: Defaults adapt to schema changes automatically
 - **Explicit intent**: Overrides show what matters for each test
 - **Speed**: API setup is 10-50x faster than UI
 ## Pattern Examples
 ### Example 1: Factory Function with Overrides
 **Context**: When creating test data, build factory functions with sensible defaults and explicit overrides. Use `faker` for dynamic values that prevent collisions.
 **Implementation**:
 ```typescript
 // test-utils/factories/user-factory.ts
 import { faker } from '@faker-js/faker';
 type User = {
  id: string;
  email: string;
  name: string;
  role: 'user' | 'admin' | 'moderator';
  createdAt: Date;
  isActive: boolean;
 };
 export const createUser = (overrides: Partial<User> = {}): User => ({
  id: faker.string.uuid(),
  email: faker.internet.email(),
  name: faker.person.fullName(),
  role: 'user',
  createdAt: new Date(),
  isActive: true,
  ...overrides,
 });
 // test-utils/factories/product-factory.ts
 type Product = {
  id: string;
  name: string;
  price: number;
  stock: number;
  category: string;
 };
 export const createProduct = (overrides: Partial<Product> = {}): Product => ({
  id: faker.string.uuid(),
  name: faker.commerce.productName(),
  price: parseFloat(faker.commerce.price()),
  stock: faker.number.int({ min: 0, max: 100 }),
  category: faker.commerce.department(),
  ...overrides,
 });
 // Usage in tests:
 test('admin can delete users', async ({ page, apiRequest }) => {
  // Default user
  const user = createUser();
  // Admin user (explicit override shows intent)
  const admin = createUser({ role: 'admin' });
  // Seed via API (fast!)
  await apiRequest({ method: 'POST', url: '/api/users', data: user });
  await apiRequest({ method: 'POST', url: '/api/users', data: admin });
  // Now test UI behavior
  await page.goto('/admin/users');
  await page.click(`[data-testid="delete-user-${user.id}"]`);
  await expect(page.getByText(`User ${user.name} deleted`)).toBeVisible();
 });
 ```
 **Key Points**:
 - `Partial<User>` allows overriding any field without breaking type safety
 - Faker generates unique values—no collisions in parallel tests
 - Override shows test intent: `createUser({ role: 'admin' })` is explicit
 - Factory lives in `test-utils/factories/` for easy reuse
 ### Example 2: Nested Factory Pattern
 **Context**: When testing relationships (orders with users and products), nest factories to create complete object graphs. Control relationship data explicitly.
 **Implementation**:
 ```typescript
 // test-utils/factories/order-factory.ts
 import { createUser } from './user-factory';
 import { createProduct } from './product-factory';
 type OrderItem = {
  product: Product;
  quantity: number;
  price: number;
 };
 type Order = {
  id: string;
  user: User;
  items: OrderItem[];
  total: number;
  status: 'pending' | 'paid' | 'shipped' | 'delivered';
  createdAt: Date;
 };
 export const createOrderItem = (overrides: Partial<OrderItem> = {}): OrderItem => {
  const product = overrides.product || createProduct();
  const quantity = overrides.quantity || faker.number.int({ min: 1, max: 5 });
  return {
    product,
    quantity,
    price: product.price * quantity,
    ...overrides,
  };
 };
 export const createOrder = (overrides: Partial<Order> = {}): Order => {
  const items = overrides.items || [createOrderItem(), createOrderItem()];
  const total = items.reduce((sum, item) => sum + item.price, 0);
  return {
    id: faker.string.uuid(),
    user: overrides.user || createUser(),
    items,
    total,
    status: 'pending',
    createdAt: new Date(),
    ...overrides,
  };
 };
 // Usage in tests:
 test('user can view order details', async ({ page, apiRequest }) => {
  const user = createUser({ email: 'test@example.com' });
  const product1 = createProduct({ name: 'Widget A', price: 10.0 });
  const product2 = createProduct({ name: 'Widget B', price: 15.0 });
  // Explicit relationships
  const order = createOrder({
    user,
    items: [
      createOrderItem({ product: product1, quantity: 2 }), // $20
      createOrderItem({ product: product2, quantity: 1 }), // $15
    ],
  });
  // Seed via API
  await apiRequest({ method: 'POST', url: '/api/users', data: user });
  await apiRequest({ method: 'POST', url: '/api/products', data: product1 });
  await apiRequest({ method: 'POST', url: '/api/products', data: product2 });
  await apiRequest({ method: 'POST', url: '/api/orders', data: order });
  // Test UI
  await page.goto(`/orders/${order.id}`);
  await expect(page.getByText('Widget A x 2')).toBeVisible();
  await expect(page.getByText('Widget B x 1')).toBeVisible();
  await expect(page.getByText('Total: $35.00')).toBeVisible();
 });
 ```
 **Key Points**:
 - Nested factories handle relationships (order → user, order → products)
 - Overrides cascade: provide custom user/products or use defaults
 - Calculated fields (total) derived automatically from nested data
 - Explicit relationships make test data clear and maintainable
 ### Example 3: Factory with API Seeding
 **Context**: When tests need data setup, always use API calls or database tasks—never UI navigation. Wrap factory usage with seeding utilities for clean test setup.
 **Implementation**:
 ```typescript
 // playwright/support/helpers/seed-helpers.ts
 import { APIRequestContext } from '@playwright/test';
 import { User, createUser } from '../../test-utils/factories/user-factory';
 import { Product, createProduct } from '../../test-utils/factories/product-factory';
 export async function seedUser(request: APIRequestContext, overrides: Partial<User> = {}): Promise<User> {
  const user = createUser(overrides);
  const response = await request.post('/api/users', {
    data: user,
  });
  if (!response.ok()) {
    throw new Error(`Failed to seed user: ${response.status()}`);
  }
  return user;
 }
 export async function seedProduct(request: APIRequestContext, overrides: Partial<Product> = {}): Promise<Product> {
  const product = createProduct(overrides);
  const response = await request.post('/api/products', {
    data: product,
  });
  if (!response.ok()) {
    throw new Error(`Failed to seed product: ${response.status()}`);
  }
  return product;
 }
 // Playwright globalSetup for shared data
 // playwright/support/global-setup.ts
 import { chromium, FullConfig } from '@playwright/test';
 import { seedUser } from './helpers/seed-helpers';
 async function globalSetup(config: FullConfig) {
  const browser = await chromium.launch();
  const page = await browser.newPage();
  const context = page.context();
  // Seed admin user for all tests
  const admin = await seedUser(context.request, {
    email: 'admin@example.com',
    role: 'admin',
  });
  // Save auth state for reuse
  await context.storageState({ path: 'playwright/.auth/admin.json' });
  await browser.close();
 }
 export default globalSetup;
 // Cypress equivalent with cy.task
 // cypress/support/tasks.ts
 export const seedDatabase = async (entity: string, data: unknown) => {
  // Direct database insert or API call
  if (entity === 'users') {
    await db.users.create(data);
  }
  return null;
 };
 // Usage in Cypress tests:
 beforeEach(() => {
  const user = createUser({ email: 'test@example.com' });
  cy.task('db:seed', { entity: 'users', data: user });
 });
 ```
 **Key Points**:
 - API seeding is 10-50x faster than UI-based setup
 - `globalSetup` seeds shared data once (e.g., admin user)
 - Per-test seeding uses `seedUser()` helpers for isolation
 - Cypress `cy.task` allows direct database access for speed
 ### Example 4: Anti-Pattern - Hardcoded Test Data
 **Problem**:
 ```typescript
 // ❌ BAD: Hardcoded test data
 test('user can login', async ({ page }) => {
  await page.goto('/login');
  await page.fill('[data-testid="email"]', 'test@test.com'); // Hardcoded
  await page.fill('[data-testid="password"]', 'password123'); // Hardcoded
  await page.click('[data-testid="submit"]');
  // What if this user already exists? Test fails in parallel runs.
  // What if schema adds required fields? Test breaks.
 });
 // ❌ BAD: Static JSON fixtures
 // fixtures/users.json
 {
  "users": [
    { "id": 1, "email": "user1@test.com", "name": "User 1" },
    { "id": 2, "email": "user2@test.com", "name": "User 2" }
  ]
 }
 test('admin can delete user', async ({ page }) => {
  const users = require('../fixtures/users.json');
  // Brittle: IDs collide in parallel, schema drift breaks tests
 });
 ```
 **Why It Fails**:
 - **Parallel collisions**: Hardcoded IDs (`id: 1`, `email: 'test@test.com'`) cause failures when tests run concurrently
 - **Schema drift**: Adding required fields (`phoneNumber`, `address`) breaks all tests using fixtures
 - **Hidden intent**: Does this test need `email: 'test@test.com'` specifically, or any email?
 - **Slow setup**: UI-based data creation is 10-50x slower than API
 **Better Approach**: Use factories
 ```typescript
 // ✅ GOOD: Factory-based data
 test('user can login', async ({ page, apiRequest }) => {
  const user = createUser({ email: 'unique@example.com', password: 'secure123' });
  // Seed via API (fast, parallel-safe)
  await apiRequest({ method: 'POST', url: '/api/users', data: user });
  // Test UI
  await page.goto('/login');
  await page.fill('[data-testid="email"]', user.email);
  await page.fill('[data-testid="password"]', user.password);
  await page.click('[data-testid="submit"]');
  await expect(page).toHaveURL('/dashboard');
 });
 // ✅ GOOD: Factories adapt to schema changes automatically
 // When `phoneNumber` becomes required, update factory once:
 export const createUser = (overrides: Partial<User> = {}): User => ({
  id: faker.string.uuid(),
  email: faker.internet.email(),
  name: faker.person.fullName(),
  phoneNumber: faker.phone.number(), // NEW field, all tests get it automatically
  role: 'user',
  ...overrides,
 });
 ```
 **Key Points**:
 - Factories generate unique, parallel-safe data
 - Schema evolution handled in one place (factory), not every test
 - Test intent explicit via overrides
 - API seeding is fast and reliable
 ### Example 5: Factory Composition
 **Context**: When building specialized factories, compose simpler factories instead of duplicating logic. Layer overrides for specific test scenarios.
 **Implementation**:
 ```typescript
 // test-utils/factories/user-factory.ts (base)
 export const createUser = (overrides: Partial<User> = {}): User => ({
  id: faker.string.uuid(),
  email: faker.internet.email(),
  name: faker.person.fullName(),
  role: 'user',
  createdAt: new Date(),
  isActive: true,
  ...overrides,
 });
 // Compose specialized factories
 export const createAdminUser = (overrides: Partial<User> = {}): User => createUser({ role: 'admin', ...overrides });
 export const createModeratorUser = (overrides: Partial<User> = {}): User => createUser({ role: 'moderator', ...overrides });
 export const createInactiveUser = (overrides: Partial<User> = {}): User => createUser({ isActive: false, ...overrides });
 // Account-level factories with feature flags
 type Account = {
  id: string;
  owner: User;
  plan: 'free' | 'pro' | 'enterprise';
  features: string[];
  maxUsers: number;
 };
 export const createAccount = (overrides: Partial<Account> = {}): Account => ({
  id: faker.string.uuid(),
  owner: overrides.owner || createUser(),
  plan: 'free',
  features: [],
  maxUsers: 1,
  ...overrides,
 });
 export const createProAccount = (overrides: Partial<Account> = {}): Account =>
  createAccount({
    plan: 'pro',
    features: ['advanced-analytics', 'priority-support'],
    maxUsers: 10,
    ...overrides,
  });
 export const createEnterpriseAccount = (overrides: Partial<Account> = {}): Account =>
  createAccount({
    plan: 'enterprise',
    features: ['advanced-analytics', 'priority-support', 'sso', 'audit-logs'],
    maxUsers: 100,
    ...overrides,
  });
 // Usage in tests:
 test('pro accounts can access analytics', async ({ page, apiRequest }) => {
  const admin = createAdminUser({ email: 'admin@company.com' });
  const account = createProAccount({ owner: admin });
  await apiRequest({ method: 'POST', url: '/api/users', data: admin });
  await apiRequest({ method: 'POST', url: '/api/accounts', data: account });
  await page.goto('/analytics');
  await expect(page.getByText('Advanced Analytics')).toBeVisible();
 });
 test('free accounts cannot access analytics', async ({ page, apiRequest }) => {
  const user = createUser({ email: 'user@company.com' });
  const account = createAccount({ owner: user }); // Defaults to free plan
  await apiRequest({ method: 'POST', url: '/api/users', data: user });
  await apiRequest({ method: 'POST', url: '/api/accounts', data: account });
  await page.goto('/analytics');
  await expect(page.getByText('Upgrade to Pro')).toBeVisible();
 });
 ```
 **Key Points**:
 - Compose specialized factories from base factories (`createAdminUser` → `createUser`)
 - Defaults cascade: `createProAccount` sets plan + features automatically
 - Still allow overrides: `createProAccount({ maxUsers: 50 })` works
 - Test intent clear: `createProAccount()` vs `createAccount({ plan: 'pro', features: [...] })`
 ## Integration Points
 - **Used in workflows**: `*atdd` (test generation), `*automate` (test expansion), `*framework` (factory setup)
 - **Related fragments**:
  - `fixture-architecture.md` - Pure functions and fixtures for factory integration
  - `network-first.md` - API-first setup patterns
  - `test-quality.md` - Parallel-safe, deterministic test design
 ## Cleanup Strategy
 Ensure factories work with cleanup patterns:
 ```typescript
 // Track created IDs for cleanup
 const createdUsers: string[] = [];
 afterEach(async ({ apiRequest }) => {
  // Clean up all users created during test
  for (const userId of createdUsers) {
    await apiRequest({ method: 'DELETE', url: `/api/users/${userId}` });
  }
  createdUsers.length = 0;
 });
 test('user registration flow', async ({ page, apiRequest }) => {
  const user = createUser();
  createdUsers.push(user.id);
  await apiRequest({ method: 'POST', url: '/api/users', data: user });
  // ... test logic
 });
 ```
 ## Feature Flag Integration
 When working with feature flags, layer them into factories:
 ```typescript
 export const createUserWithFlags = (
  overrides: Partial<User> = {},
  flags: Record<string, boolean> = {},
 ): User & { flags: Record<string, boolean> } => ({
  ...createUser(overrides),
  flags: {
    'new-dashboard': false,
    'beta-features': false,
    ...flags,
  },
 });
 // Usage:
 const user = createUserWithFlags(
  { email: 'test@example.com' },
  {
    'new-dashboard': true,
    'beta-features': true,
  },
 );
 ```
 _Source: Murat Testing Philosophy (lines 94-120), API-first testing patterns, faker.js documentation._
--- a/_bmad/bmm/testarch/knowledge/email-auth.md
+++ b/_bmad/bmm/testarch/knowledge/email-auth.md
@@ -0,0 +1,721 @@
 # Email-Based Authentication Testing
 ## Principle
 Email-based authentication (magic links, one-time codes, passwordless login) requires specialized testing with email capture services like Mailosaur or Ethereal. Extract magic links via HTML parsing or use built-in link extraction, preserve browser storage (local/session/cookies) when processing links, cache email payloads to avoid exhausting inbox quotas, and cover negative cases (expired links, reused links, multiple rapid requests). Log email IDs and links for troubleshooting, but scrub PII before committing artifacts.
 ## Rationale
 Email authentication introduces unique challenges: asynchronous email delivery, quota limits (AWS Cognito: 50/day), cost per email, and complex state management (session preservation across link clicks). Without proper patterns, tests become slow (wait for email each time), expensive (quota exhaustion), and brittle (timing issues, missing state). Using email capture services + session caching + state preservation patterns makes email auth tests fast, reliable, and cost-effective.
 ## Pattern Examples
 ### Example 1: Magic Link Extraction with Mailosaur
 **Context**: Passwordless login flow where user receives magic link via email, clicks it, and is authenticated.
 **Implementation**:
 ```typescript
 // tests/e2e/magic-link-auth.spec.ts
 import { test, expect } from '@playwright/test';
 /**
 * Magic Link Authentication Flow
 * 1. User enters email
 * 2. Backend sends magic link
 * 3. Test retrieves email via Mailosaur
 * 4. Extract and visit magic link
 * 5. Verify user is authenticated
 */
 // Mailosaur configuration
 const MAILOSAUR_API_KEY = process.env.MAILOSAUR_API_KEY!;
 const MAILOSAUR_SERVER_ID = process.env.MAILOSAUR_SERVER_ID!;
 /**
 * Extract href from HTML email body
 * DOMParser provides XML/HTML parsing in Node.js
 */
 function extractMagicLink(htmlString: string): string | null {
  const { JSDOM } = require('jsdom');
  const dom = new JSDOM(htmlString);
  const link = dom.window.document.querySelector('#magic-link-button');
  return link ? (link as HTMLAnchorElement).href : null;
 }
 /**
 * Alternative: Use Mailosaur's built-in link extraction
 * Mailosaur automatically parses links - no regex needed!
 */
 async function getMagicLinkFromEmail(email: string): Promise<string> {
  const MailosaurClient = require('mailosaur');
  const mailosaur = new MailosaurClient(MAILOSAUR_API_KEY);
  // Wait for email (timeout: 30 seconds)
  const message = await mailosaur.messages.get(
    MAILOSAUR_SERVER_ID,
    {
      sentTo: email,
    },
    {
      timeout: 30000, // 30 seconds
    },
  );
  // Mailosaur extracts links automatically - no parsing needed!
  const magicLink = message.html?.links?.[0]?.href;
  if (!magicLink) {
    throw new Error(`Magic link not found in email to ${email}`);
  }
  console.log(`📧 Email received. Magic link extracted: ${magicLink}`);
  return magicLink;
 }
 test.describe('Magic Link Authentication', () => {
  test('should authenticate user via magic link', async ({ page, context }) => {
    // Arrange: Generate unique test email
    const randomId = Math.floor(Math.random() * 1000000);
    const testEmail = `user-${randomId}@${MAILOSAUR_SERVER_ID}.mailosaur.net`;
    // Act: Request magic link
    await page.goto('/login');
    await page.getByTestId('email-input').fill(testEmail);
    await page.getByTestId('send-magic-link').click();
    // Assert: Success message
    await expect(page.getByTestId('check-email-message')).toBeVisible();
    await expect(page.getByTestId('check-email-message')).toContainText('Check your email');
    // Retrieve magic link from email
    const magicLink = await getMagicLinkFromEmail(testEmail);
    // Visit magic link
    await page.goto(magicLink);
    // Assert: User is authenticated
    await expect(page.getByTestId('user-menu')).toBeVisible();
    await expect(page.getByTestId('user-email')).toContainText(testEmail);
    // Verify session storage preserved
    const localStorage = await page.evaluate(() => JSON.stringify(window.localStorage));
    expect(localStorage).toContain('authToken');
  });
  test('should handle expired magic link', async ({ page }) => {
    // Use pre-expired link (older than 15 minutes)
    const expiredLink = 'http://localhost:3000/auth/verify?token=expired-token-123';
    await page.goto(expiredLink);
    // Assert: Error message displayed
    await expect(page.getByTestId('error-message')).toBeVisible();
    await expect(page.getByTestId('error-message')).toContainText('link has expired');
    // Assert: User NOT authenticated
    await expect(page.getByTestId('user-menu')).not.toBeVisible();
  });
  test('should prevent reusing magic link', async ({ page }) => {
    const randomId = Math.floor(Math.random() * 1000000);
    const testEmail = `user-${randomId}@${MAILOSAUR_SERVER_ID}.mailosaur.net`;
    // Request magic link
    await page.goto('/login');
    await page.getByTestId('email-input').fill(testEmail);
    await page.getByTestId('send-magic-link').click();
    const magicLink = await getMagicLinkFromEmail(testEmail);
    // Visit link first time (success)
    await page.goto(magicLink);
    await expect(page.getByTestId('user-menu')).toBeVisible();
    // Sign out
    await page.getByTestId('sign-out').click();
    // Try to reuse same link (should fail)
    await page.goto(magicLink);
    await expect(page.getByTestId('error-message')).toBeVisible();
    await expect(page.getByTestId('error-message')).toContainText('link has already been used');
  });
 });
 ```
 **Cypress equivalent with Mailosaur plugin**:
 ```javascript
 // cypress/e2e/magic-link-auth.cy.ts
 describe('Magic Link Authentication', () => {
  it('should authenticate user via magic link', () => {
    const serverId = Cypress.env('MAILOSAUR_SERVERID');
    const randomId = Cypress._.random(1e6);
    const testEmail = `user-${randomId}@${serverId}.mailosaur.net`;
    // Request magic link
    cy.visit('/login');
    cy.get('[data-cy="email-input"]').type(testEmail);
    cy.get('[data-cy="send-magic-link"]').click();
    cy.get('[data-cy="check-email-message"]').should('be.visible');
    // Retrieve and visit magic link
    cy.mailosaurGetMessage(serverId, { sentTo: testEmail })
      .its('html.links.0.href') // Mailosaur extracts links automatically!
      .should('exist')
      .then((magicLink) => {
        cy.log(`Magic link: ${magicLink}`);
        cy.visit(magicLink);
      });
    // Verify authenticated
    cy.get('[data-cy="user-menu"]').should('be.visible');
    cy.get('[data-cy="user-email"]').should('contain', testEmail);
  });
 });
 ```
 **Key Points**:
 - **Mailosaur auto-extraction**: `html.links[0].href` or `html.codes[0].value`
 - **Unique emails**: Random ID prevents collisions
 - **Negative testing**: Expired and reused links tested
 - **State verification**: localStorage/session checked
 - **Fast email retrieval**: 30 second timeout typical
 ---
 ### Example 2: State Preservation Pattern with cy.session / Playwright storageState
 **Context**: Cache authenticated session to avoid requesting magic link on every test.
 **Implementation**:
 ```typescript
 // playwright/fixtures/email-auth-fixture.ts
 import { test as base } from '@playwright/test';
 import { getMagicLinkFromEmail } from '../support/mailosaur-helpers';
 type EmailAuthFixture = {
  authenticatedUser: { email: string; token: string };
 };
 export const test = base.extend<EmailAuthFixture>({
  authenticatedUser: async ({ page, context }, use) => {
    const randomId = Math.floor(Math.random() * 1000000);
    const testEmail = `user-${randomId}@${process.env.MAILOSAUR_SERVER_ID}.mailosaur.net`;
    // Check if we have cached auth state for this email
    const storageStatePath = `./test-results/auth-state-${testEmail}.json`;
    try {
      // Try to reuse existing session
      await context.storageState({ path: storageStatePath });
      await page.goto('/dashboard');
      // Validate session is still valid
      const isAuthenticated = await page.getByTestId('user-menu').isVisible({ timeout: 2000 });
      if (isAuthenticated) {
        console.log(`✅ Reusing cached session for ${testEmail}`);
        await use({ email: testEmail, token: 'cached' });
        return;
      }
    } catch (error) {
      console.log(`📧 No cached session, requesting magic link for ${testEmail}`);
    }
    // Request new magic link
    await page.goto('/login');
    await page.getByTestId('email-input').fill(testEmail);
    await page.getByTestId('send-magic-link').click();
    // Get magic link from email
    const magicLink = await getMagicLinkFromEmail(testEmail);
    // Visit link and authenticate
    await page.goto(magicLink);
    await expect(page.getByTestId('user-menu')).toBeVisible();
    // Extract auth token from localStorage
    const authToken = await page.evaluate(() => localStorage.getItem('authToken'));
    // Save session state for reuse
    await context.storageState({ path: storageStatePath });
    console.log(`💾 Cached session for ${testEmail}`);
    await use({ email: testEmail, token: authToken || '' });
  },
 });
 ```
 **Cypress equivalent with cy.session + data-session**:
 ```javascript
 // cypress/support/commands/email-auth.js
 import { dataSession } from 'cypress-data-session';
 /**
 * Authenticate via magic link with session caching
 * - First run: Requests email, extracts link, authenticates
 * - Subsequent runs: Reuses cached session (no email)
 */
 Cypress.Commands.add('authViaMagicLink', (email) => {
  return dataSession({
    name: `magic-link-${email}`,
    // First-time setup: Request and process magic link
    setup: () => {
      cy.visit('/login');
      cy.get('[data-cy="email-input"]').type(email);
      cy.get('[data-cy="send-magic-link"]').click();
      // Get magic link from Mailosaur
      cy.mailosaurGetMessage(Cypress.env('MAILOSAUR_SERVERID'), {
        sentTo: email,
      })
        .its('html.links.0.href')
        .should('exist')
        .then((magicLink) => {
          cy.visit(magicLink);
        });
      // Wait for authentication
      cy.get('[data-cy="user-menu"]', { timeout: 10000 }).should('be.visible');
      // Preserve authentication state
      return cy.getAllLocalStorage().then((storage) => {
        return { storage, email };
      });
    },
    // Validate cached session is still valid
    validate: (cached) => {
      return cy.wrap(Boolean(cached?.storage));
    },
    // Recreate session from cache (no email needed)
    recreate: (cached) => {
      // Restore localStorage
      cy.setLocalStorage(cached.storage);
      cy.visit('/dashboard');
      cy.get('[data-cy="user-menu"]', { timeout: 5000 }).should('be.visible');
    },
    shareAcrossSpecs: true, // Share session across all tests
  });
 });
 ```
 **Usage in tests**:
 ```javascript
 // cypress/e2e/dashboard.cy.ts
 describe('Dashboard', () => {
  const serverId = Cypress.env('MAILOSAUR_SERVERID');
  const testEmail = `test-user@${serverId}.mailosaur.net`;
  beforeEach(() => {
    // First test: Requests magic link
    // Subsequent tests: Reuses cached session (no email!)
    cy.authViaMagicLink(testEmail);
  });
  it('should display user dashboard', () => {
    cy.get('[data-cy="dashboard-content"]').should('be.visible');
  });
  it('should show user profile', () => {
    cy.get('[data-cy="user-email"]').should('contain', testEmail);
  });
  // Both tests share same session - only 1 email consumed!
 });
 ```
 **Key Points**:
 - **Session caching**: First test requests email, rest reuse session
 - **State preservation**: localStorage/cookies saved and restored
 - **Validation**: Check cached session is still valid
 - **Quota optimization**: Massive reduction in email consumption
 - **Fast tests**: Cached auth takes seconds vs. minutes
 ---
 ### Example 3: Negative Flow Tests (Expired, Invalid, Reused Links)
 **Context**: Comprehensive negative testing for email authentication edge cases.
 **Implementation**:
 ```typescript
 // tests/e2e/email-auth-negative.spec.ts
 import { test, expect } from '@playwright/test';
 import { getMagicLinkFromEmail } from '../support/mailosaur-helpers';
 const MAILOSAUR_SERVER_ID = process.env.MAILOSAUR_SERVER_ID!;
 test.describe('Email Auth Negative Flows', () => {
  test('should reject expired magic link', async ({ page }) => {
    // Generate expired link (simulate 24 hours ago)
    const expiredToken = Buffer.from(
      JSON.stringify({
        email: 'test@example.com',
        exp: Date.now() - 24 * 60 * 60 * 1000, // 24 hours ago
      }),
    ).toString('base64');
    const expiredLink = `http://localhost:3000/auth/verify?token=${expiredToken}`;
    // Visit expired link
    await page.goto(expiredLink);
    // Assert: Error displayed
    await expect(page.getByTestId('error-message')).toBeVisible();
    await expect(page.getByTestId('error-message')).toContainText(/link.*expired|expired.*link/i);
    // Assert: Link to request new one
    await expect(page.getByTestId('request-new-link')).toBeVisible();
    // Assert: User NOT authenticated
    await expect(page.getByTestId('user-menu')).not.toBeVisible();
  });
  test('should reject invalid magic link token', async ({ page }) => {
    const invalidLink = 'http://localhost:3000/auth/verify?token=invalid-garbage';
    await page.goto(invalidLink);
    // Assert: Error displayed
    await expect(page.getByTestId('error-message')).toBeVisible();
    await expect(page.getByTestId('error-message')).toContainText(/invalid.*link|link.*invalid/i);
    // Assert: User not authenticated
    await expect(page.getByTestId('user-menu')).not.toBeVisible();
  });
  test('should reject already-used magic link', async ({ page, context }) => {
    const randomId = Math.floor(Math.random() * 1000000);
    const testEmail = `user-${randomId}@${MAILOSAUR_SERVER_ID}.mailosaur.net`;
    // Request magic link
    await page.goto('/login');
    await page.getByTestId('email-input').fill(testEmail);
    await page.getByTestId('send-magic-link').click();
    const magicLink = await getMagicLinkFromEmail(testEmail);
    // Visit link FIRST time (success)
    await page.goto(magicLink);
    await expect(page.getByTestId('user-menu')).toBeVisible();
    // Sign out
    await page.getByTestId('user-menu').click();
    await page.getByTestId('sign-out').click();
    await expect(page.getByTestId('user-menu')).not.toBeVisible();
    // Try to reuse SAME link (should fail)
    await page.goto(magicLink);
    // Assert: Link already used error
    await expect(page.getByTestId('error-message')).toBeVisible();
    await expect(page.getByTestId('error-message')).toContainText(/already.*used|link.*used/i);
    // Assert: User not authenticated
    await expect(page.getByTestId('user-menu')).not.toBeVisible();
  });
  test('should handle rapid successive link requests', async ({ page }) => {
    const randomId = Math.floor(Math.random() * 1000000);
    const testEmail = `user-${randomId}@${MAILOSAUR_SERVER_ID}.mailosaur.net`;
    // Request magic link 3 times rapidly
    for (let i = 0; i < 3; i++) {
      await page.goto('/login');
      await page.getByTestId('email-input').fill(testEmail);
      await page.getByTestId('send-magic-link').click();
      await expect(page.getByTestId('check-email-message')).toBeVisible();
    }
    // Only the LATEST link should work
    const MailosaurClient = require('mailosaur');
    const mailosaur = new MailosaurClient(process.env.MAILOSAUR_API_KEY);
    const messages = await mailosaur.messages.list(MAILOSAUR_SERVER_ID, {
      sentTo: testEmail,
    });
    // Should receive 3 emails
    expect(messages.items.length).toBeGreaterThanOrEqual(3);
    // Get the LATEST magic link
    const latestMessage = messages.items[0]; // Most recent first
    const latestLink = latestMessage.html.links[0].href;
    // Latest link works
    await page.goto(latestLink);
    await expect(page.getByTestId('user-menu')).toBeVisible();
    // Older links should NOT work (if backend invalidates previous)
    await page.getByTestId('sign-out').click();
    const olderLink = messages.items[1].html.links[0].href;
    await page.goto(olderLink);
    await expect(page.getByTestId('error-message')).toBeVisible();
  });
  test('should rate-limit excessive magic link requests', async ({ page }) => {
    const randomId = Math.floor(Math.random() * 1000000);
    const testEmail = `user-${randomId}@${MAILOSAUR_SERVER_ID}.mailosaur.net`;
    // Request magic link 10 times rapidly (should hit rate limit)
    for (let i = 0; i < 10; i++) {
      await page.goto('/login');
      await page.getByTestId('email-input').fill(testEmail);
      await page.getByTestId('send-magic-link').click();
      // After N requests, should show rate limit error
      const errorVisible = await page
        .getByTestId('rate-limit-error')
        .isVisible({ timeout: 1000 })
        .catch(() => false);
      if (errorVisible) {
        console.log(`Rate limit hit after ${i + 1} requests`);
        await expect(page.getByTestId('rate-limit-error')).toContainText(/too many.*requests|rate.*limit/i);
        return;
      }
    }
    // If no rate limit after 10 requests, log warning
    console.warn('⚠️  No rate limit detected after 10 requests');
  });
 });
 ```
 **Key Points**:
 - **Expired links**: Test 24+ hour old tokens
 - **Invalid tokens**: Malformed or garbage tokens rejected
 - **Reuse prevention**: Same link can't be used twice
 - **Rapid requests**: Multiple requests handled gracefully
 - **Rate limiting**: Excessive requests blocked
 ---
 ### Example 4: Caching Strategy with cypress-data-session / Playwright Projects
 **Context**: Minimize email consumption by sharing authentication state across tests and specs.
 **Implementation**:
 ```javascript
 // cypress/support/commands/register-and-sign-in.js
 import { dataSession } from 'cypress-data-session';
 /**
 * Email Authentication Caching Strategy
 * - One email per test run (not per spec, not per test)
 * - First spec: Full registration flow (form → email → code → sign in)
 * - Subsequent specs: Only sign in (reuse user)
 * - Subsequent tests in same spec: Session already active (no sign in)
 */
 // Helper: Fill registration form
 function fillRegistrationForm({ fullName, userName, email, password }) {
  cy.intercept('POST', 'https://cognito-idp*').as('cognito');
  cy.contains('Register').click();
  cy.get('#reg-dialog-form').should('be.visible');
  cy.get('#first-name').type(fullName, { delay: 0 });
  cy.get('#last-name').type(lastName, { delay: 0 });
  cy.get('#email').type(email, { delay: 0 });
  cy.get('#username').type(userName, { delay: 0 });
  cy.get('#password').type(password, { delay: 0 });
  cy.contains('button', 'Create an account').click();
  cy.wait('@cognito').its('response.statusCode').should('equal', 200);
 }
 // Helper: Confirm registration with email code
 function confirmRegistration(email) {
  return cy
    .mailosaurGetMessage(Cypress.env('MAILOSAUR_SERVERID'), { sentTo: email })
    .its('html.codes.0.value') // Mailosaur auto-extracts codes!
    .then((code) => {
      cy.intercept('POST', 'https://cognito-idp*').as('cognito');
      cy.get('#verification-code').type(code, { delay: 0 });
      cy.contains('button', 'Confirm registration').click();
      cy.wait('@cognito');
      cy.contains('You are now registered!').should('be.visible');
      cy.contains('button', /ok/i).click();
      return cy.wrap(code); // Return code for reference
    });
 }
 // Helper: Full registration (form + email)
 function register({ fullName, userName, email, password }) {
  fillRegistrationForm({ fullName, userName, email, password });
  return confirmRegistration(email);
 }
 // Helper: Sign in
 function signIn({ userName, password }) {
  cy.intercept('POST', 'https://cognito-idp*').as('cognito');
  cy.contains('Sign in').click();
  cy.get('#sign-in-username').type(userName, { delay: 0 });
  cy.get('#sign-in-password').type(password, { delay: 0 });
  cy.contains('button', 'Sign in').click();
  cy.wait('@cognito');
  cy.contains('Sign out').should('be.visible');
 }
 /**
 * Register and sign in with email caching
 * ONE EMAIL PER MACHINE (cypress run or cypress open)
 */
 Cypress.Commands.add('registerAndSignIn', ({ fullName, userName, email, password }) => {
  return dataSession({
    name: email, // Unique session per email
    // First time: Full registration (form → email → code)
    init: () => register({ fullName, userName, email, password }),
    // Subsequent specs: Just check email exists (code already used)
    setup: () => confirmRegistration(email),
    // Always runs after init/setup: Sign in
    recreate: () => signIn({ userName, password }),
    // Share across ALL specs (one email for entire test run)
    shareAcrossSpecs: true,
  });
 });
 ```
 **Usage across multiple specs**:
 ```javascript
 // cypress/e2e/place-order.cy.ts
 describe('Place Order', () => {
  beforeEach(() => {
    cy.visit('/');
    cy.registerAndSignIn({
      fullName: Cypress.env('fullName'), // From cypress.config
      userName: Cypress.env('userName'),
      email: Cypress.env('email'), // SAME email across all specs
      password: Cypress.env('password'),
    });
  });
  it('should place order', () => {
    /* ... */
  });
  it('should view order history', () => {
    /* ... */
  });
 });
 // cypress/e2e/profile.cy.ts
 describe('User Profile', () => {
  beforeEach(() => {
    cy.visit('/');
    cy.registerAndSignIn({
      fullName: Cypress.env('fullName'),
      userName: Cypress.env('userName'),
      email: Cypress.env('email'), // SAME email - no new email sent!
      password: Cypress.env('password'),
    });
  });
  it('should update profile', () => {
    /* ... */
  });
 });
 ```
 **Playwright equivalent with storageState**:
 ```typescript
 // playwright.config.ts
 import { defineConfig } from '@playwright/test';
 export default defineConfig({
  projects: [
    {
      name: 'setup',
      testMatch: /global-setup\.ts/,
    },
    {
      name: 'authenticated',
      testMatch: /.*\.spec\.ts/,
      dependencies: ['setup'],
      use: {
        storageState: '.auth/user-session.json', // Reuse auth state
      },
    },
  ],
 });
 ```
 ```typescript
 // tests/global-setup.ts (runs once)
 import { test as setup } from '@playwright/test';
 import { getMagicLinkFromEmail } from './support/mailosaur-helpers';
 const authFile = '.auth/user-session.json';
 setup('authenticate via magic link', async ({ page }) => {
  const testEmail = process.env.TEST_USER_EMAIL!;
  // Request magic link
  await page.goto('/login');
  await page.getByTestId('email-input').fill(testEmail);
  await page.getByTestId('send-magic-link').click();
  // Get and visit magic link
  const magicLink = await getMagicLinkFromEmail(testEmail);
  await page.goto(magicLink);
  // Verify authenticated
  await expect(page.getByTestId('user-menu')).toBeVisible();
  // Save authenticated state (ONE TIME for all tests)
  await page.context().storageState({ path: authFile });
  console.log('✅ Authentication state saved to', authFile);
 });
 ```
 **Key Points**:
 - **One email per run**: Global setup authenticates once
 - **State reuse**: All tests use cached storageState
 - **cypress-data-session**: Intelligently manages cache lifecycle
 - **shareAcrossSpecs**: Session shared across all spec files
 - **Massive savings**: 500 tests = 1 email (not 500!)
 ---
 ## Email Authentication Testing Checklist
 Before implementing email auth tests, verify:
 - [ ] **Email service**: Mailosaur/Ethereal/MailHog configured with API keys
 - [ ] **Link extraction**: Use built-in parsing (html.links[0].href) over regex
 - [ ] **State preservation**: localStorage/session/cookies saved and restored
 - [ ] **Session caching**: cypress-data-session or storageState prevents redundant emails
 - [ ] **Negative flows**: Expired, invalid, reused, rapid requests tested
 - [ ] **Quota awareness**: One email per run (not per test)
 - [ ] **PII scrubbing**: Email IDs logged for debug, but scrubbed from artifacts
 - [ ] **Timeout handling**: 30 second email retrieval timeout configured
 ## Integration Points
 - Used in workflows: `*framework` (email auth setup), `*automate` (email auth test generation)
 - Related fragments: `fixture-architecture.md`, `test-quality.md`
 - Email services: Mailosaur (recommended), Ethereal (free), MailHog (self-hosted)
 - Plugins: cypress-mailosaur, cypress-data-session
 _Source: Email authentication blog, Murat testing toolkit, Mailosaur documentation_
--- a/_bmad/bmm/testarch/knowledge/error-handling.md
+++ b/_bmad/bmm/testarch/knowledge/error-handling.md
@@ -0,0 +1,725 @@
 # Error Handling and Resilience Checks
 ## Principle
 Treat expected failures explicitly: intercept network errors, assert UI fallbacks (error messages visible, retries triggered), and use scoped exception handling to ignore known errors while catching regressions. Test retry/backoff logic by forcing sequential failures (500 → timeout → success) and validate telemetry logging. Log captured errors with context (request payload, user/session) but redact secrets to keep artifacts safe for sharing.
 ## Rationale
 Tests fail for two reasons: genuine bugs or poor error handling in the test itself. Without explicit error handling patterns, tests become noisy (uncaught exceptions cause false failures) or silent (swallowing all errors hides real bugs). Scoped exception handling (Cypress.on('uncaught:exception'), page.on('pageerror')) allows tests to ignore documented, expected errors while surfacing unexpected ones. Resilience testing (retry logic, graceful degradation) ensures applications handle failures gracefully in production.
 ## Pattern Examples
 ### Example 1: Scoped Exception Handling (Expected Errors Only)
 **Context**: Handle known errors (Network failures, expected 500s) without masking unexpected bugs.
 **Implementation**:
 ```typescript
 // tests/e2e/error-handling.spec.ts
 import { test, expect } from '@playwright/test';
 /**
 * Scoped Error Handling Pattern
 * - Only ignore specific, documented errors
 * - Rethrow everything else to catch regressions
 * - Validate error UI and user experience
 */
 test.describe('API Error Handling', () => {
  test('should display error message when API returns 500', async ({ page }) => {
    // Scope error handling to THIS test only
    const consoleErrors: string[] = [];
    page.on('pageerror', (error) => {
      // Only swallow documented NetworkError
      if (error.message.includes('NetworkError: Failed to fetch')) {
        consoleErrors.push(error.message);
        return; // Swallow this specific error
      }
      // Rethrow all other errors (catch regressions!)
      throw error;
    });
    // Arrange: Mock 500 error response
    await page.route('**/api/users', (route) =>
      route.fulfill({
        status: 500,
        contentType: 'application/json',
        body: JSON.stringify({
          error: 'Internal server error',
          code: 'INTERNAL_ERROR',
        }),
      }),
    );
    // Act: Navigate to page that fetches users
    await page.goto('/dashboard');
    // Assert: Error UI displayed
    await expect(page.getByTestId('error-message')).toBeVisible();
    await expect(page.getByTestId('error-message')).toContainText(/error.*loading|failed.*load/i);
    // Assert: Retry button visible
    await expect(page.getByTestId('retry-button')).toBeVisible();
    // Assert: NetworkError was thrown and caught
    expect(consoleErrors).toContainEqual(expect.stringContaining('NetworkError'));
  });
  test('should NOT swallow unexpected errors', async ({ page }) => {
    let unexpectedError: Error | null = null;
    page.on('pageerror', (error) => {
      // Capture but don't swallow - test should fail
      unexpectedError = error;
      throw error;
    });
    // Arrange: App has JavaScript error (bug)
    await page.addInitScript(() => {
      // Simulate bug in app code
      (window as any).buggyFunction = () => {
        throw new Error('UNEXPECTED BUG: undefined is not a function');
      };
    });
    await page.goto('/dashboard');
    // Trigger buggy function
    await page.evaluate(() => (window as any).buggyFunction());
    // Assert: Test fails because unexpected error was NOT swallowed
    expect(unexpectedError).not.toBeNull();
    expect(unexpectedError?.message).toContain('UNEXPECTED BUG');
  });
 });
 ```
 **Cypress equivalent**:
 ```javascript
 // cypress/e2e/error-handling.cy.ts
 describe('API Error Handling', () => {
  it('should display error message when API returns 500', () => {
    // Scoped to this test only
    cy.on('uncaught:exception', (err) => {
      // Only swallow documented NetworkError
      if (err.message.includes('NetworkError')) {
        return false; // Prevent test failure
      }
      // All other errors fail the test
      return true;
    });
    // Arrange: Mock 500 error
    cy.intercept('GET', '**/api/users', {
      statusCode: 500,
      body: {
        error: 'Internal server error',
        code: 'INTERNAL_ERROR',
      },
    }).as('getUsers');
    // Act
    cy.visit('/dashboard');
    cy.wait('@getUsers');
    // Assert: Error UI
    cy.get('[data-cy="error-message"]').should('be.visible');
    cy.get('[data-cy="error-message"]').should('contain', 'error loading');
    cy.get('[data-cy="retry-button"]').should('be.visible');
  });
  it('should NOT swallow unexpected errors', () => {
    // No exception handler - test should fail on unexpected errors
    cy.visit('/dashboard');
    // Trigger unexpected error
    cy.window().then((win) => {
      // This should fail the test
      win.eval('throw new Error("UNEXPECTED BUG")');
    });
    // Test fails (as expected) - validates error detection works
  });
 });
 ```
 **Key Points**:
 - **Scoped handling**: page.on() / cy.on() scoped to specific tests
 - **Explicit allow-list**: Only ignore documented errors
 - **Rethrow unexpected**: Catch regressions by failing on unknown errors
 - **Error UI validation**: Assert user sees error message
 - **Logging**: Capture errors for debugging, don't swallow silently
 ---
 ### Example 2: Retry Validation Pattern (Network Resilience)
 **Context**: Test that retry/backoff logic works correctly for transient failures.
 **Implementation**:
 ```typescript
 // tests/e2e/retry-resilience.spec.ts
 import { test, expect } from '@playwright/test';
 /**
 * Retry Validation Pattern
 * - Force sequential failures (500 → 500 → 200)
 * - Validate retry attempts and backoff timing
 * - Assert telemetry captures retry events
 */
 test.describe('Network Retry Logic', () => {
  test('should retry on 500 error and succeed', async ({ page }) => {
    let attemptCount = 0;
    const attemptTimestamps: number[] = [];
    // Mock API: Fail twice, succeed on third attempt
    await page.route('**/api/products', (route) => {
      attemptCount++;
      attemptTimestamps.push(Date.now());
      if (attemptCount <= 2) {
        // First 2 attempts: 500 error
        route.fulfill({
          status: 500,
          body: JSON.stringify({ error: 'Server error' }),
        });
      } else {
        // 3rd attempt: Success
        route.fulfill({
          status: 200,
          contentType: 'application/json',
          body: JSON.stringify({ products: [{ id: 1, name: 'Product 1' }] }),
        });
      }
    });
    // Act: Navigate (should retry automatically)
    await page.goto('/products');
    // Assert: Data eventually loads after retries
    await expect(page.getByTestId('product-list')).toBeVisible();
    await expect(page.getByTestId('product-item')).toHaveCount(1);
    // Assert: Exactly 3 attempts made
    expect(attemptCount).toBe(3);
    // Assert: Exponential backoff timing (1s → 2s between attempts)
    if (attemptTimestamps.length === 3) {
      const delay1 = attemptTimestamps[1] - attemptTimestamps[0];
      const delay2 = attemptTimestamps[2] - attemptTimestamps[1];
      expect(delay1).toBeGreaterThanOrEqual(900); // ~1 second
      expect(delay1).toBeLessThan(1200);
      expect(delay2).toBeGreaterThanOrEqual(1900); // ~2 seconds
      expect(delay2).toBeLessThan(2200);
    }
    // Assert: Telemetry logged retry events
    const telemetryEvents = await page.evaluate(() => (window as any).__TELEMETRY_EVENTS__ || []);
    expect(telemetryEvents).toContainEqual(
      expect.objectContaining({
        event: 'api_retry',
        attempt: 1,
        endpoint: '/api/products',
      }),
    );
    expect(telemetryEvents).toContainEqual(
      expect.objectContaining({
        event: 'api_retry',
        attempt: 2,
      }),
    );
  });
  test('should give up after max retries and show error', async ({ page }) => {
    let attemptCount = 0;
    // Mock API: Always fail (test retry limit)
    await page.route('**/api/products', (route) => {
      attemptCount++;
      route.fulfill({
        status: 500,
        body: JSON.stringify({ error: 'Persistent server error' }),
      });
    });
    // Act
    await page.goto('/products');
    // Assert: Max retries reached (3 attempts typical)
    expect(attemptCount).toBe(3);
    // Assert: Error UI displayed after exhausting retries
    await expect(page.getByTestId('error-message')).toBeVisible();
    await expect(page.getByTestId('error-message')).toContainText(/unable.*load|failed.*after.*retries/i);
    // Assert: Data not displayed
    await expect(page.getByTestId('product-list')).not.toBeVisible();
  });
  test('should NOT retry on 404 (non-retryable error)', async ({ page }) => {
    let attemptCount = 0;
    // Mock API: 404 error (should NOT retry)
    await page.route('**/api/products/999', (route) => {
      attemptCount++;
      route.fulfill({
        status: 404,
        body: JSON.stringify({ error: 'Product not found' }),
      });
    });
    await page.goto('/products/999');
    // Assert: Only 1 attempt (no retries on 404)
    expect(attemptCount).toBe(1);
    // Assert: 404 error displayed immediately
    await expect(page.getByTestId('not-found-message')).toBeVisible();
  });
 });
 ```
 **Cypress with retry interception**:
 ```javascript
 // cypress/e2e/retry-resilience.cy.ts
 describe('Network Retry Logic', () => {
  it('should retry on 500 and succeed on 3rd attempt', () => {
    let attemptCount = 0;
    cy.intercept('GET', '**/api/products', (req) => {
      attemptCount++;
      if (attemptCount <= 2) {
        req.reply({ statusCode: 500, body: { error: 'Server error' } });
      } else {
        req.reply({ statusCode: 200, body: { products: [{ id: 1, name: 'Product 1' }] } });
      }
    }).as('getProducts');
    cy.visit('/products');
    // Wait for final successful request
    cy.wait('@getProducts').its('response.statusCode').should('eq', 200);
    // Assert: Data loaded
    cy.get('[data-cy="product-list"]').should('be.visible');
    cy.get('[data-cy="product-item"]').should('have.length', 1);
    // Validate retry count
    cy.wrap(attemptCount).should('eq', 3);
  });
 });
 ```
 **Key Points**:
 - **Sequential failures**: Test retry logic with 500 → 500 → 200
 - **Backoff timing**: Validate exponential backoff delays
 - **Retry limits**: Max attempts enforced (typically 3)
 - **Non-retryable errors**: 404s don't trigger retries
 - **Telemetry**: Log retry attempts for monitoring
 ---
 ### Example 3: Telemetry Logging with Context (Sentry Integration)
 **Context**: Capture errors with full context for production debugging without exposing secrets.
 **Implementation**:
 ```typescript
 // tests/e2e/telemetry-logging.spec.ts
 import { test, expect } from '@playwright/test';
 /**
 * Telemetry Logging Pattern
 * - Log errors with request context
 * - Redact sensitive data (tokens, passwords, PII)
 * - Integrate with monitoring (Sentry, Datadog)
 * - Validate error logging without exposing secrets
 */
 type ErrorLog = {
  level: 'error' | 'warn' | 'info';
  message: string;
  context?: {
    endpoint?: string;
    method?: string;
    statusCode?: number;
    userId?: string;
    sessionId?: string;
  };
  timestamp: string;
 };
 test.describe('Error Telemetry', () => {
  test('should log API errors with context', async ({ page }) => {
    const errorLogs: ErrorLog[] = [];
    // Capture console errors
    page.on('console', (msg) => {
      if (msg.type() === 'error') {
        try {
          const log = JSON.parse(msg.text());
          errorLogs.push(log);
        } catch {
          // Not a structured log, ignore
        }
      }
    });
    // Mock failing API
    await page.route('**/api/orders', (route) =>
      route.fulfill({
        status: 500,
        body: JSON.stringify({ error: 'Payment processor unavailable' }),
      }),
    );
    // Act: Trigger error
    await page.goto('/checkout');
    await page.getByTestId('place-order').click();
    // Wait for error UI
    await expect(page.getByTestId('error-message')).toBeVisible();
    // Assert: Error logged with context
    expect(errorLogs).toContainEqual(
      expect.objectContaining({
        level: 'error',
        message: expect.stringContaining('API request failed'),
        context: expect.objectContaining({
          endpoint: '/api/orders',
          method: 'POST',
          statusCode: 500,
          userId: expect.any(String),
        }),
      }),
    );
    // Assert: Sensitive data NOT logged
    const logString = JSON.stringify(errorLogs);
    expect(logString).not.toContain('password');
    expect(logString).not.toContain('token');
    expect(logString).not.toContain('creditCard');
  });
  test('should send errors to Sentry with breadcrumbs', async ({ page }) => {
    const sentryEvents: any[] = [];
    // Mock Sentry SDK
    await page.addInitScript(() => {
      (window as any).Sentry = {
        captureException: (error: Error, context?: any) => {
          (window as any).__SENTRY_EVENTS__ = (window as any).__SENTRY_EVENTS__ || [];
          (window as any).__SENTRY_EVENTS__.push({
            error: error.message,
            context,
            timestamp: Date.now(),
          });
        },
        addBreadcrumb: (breadcrumb: any) => {
          (window as any).__SENTRY_BREADCRUMBS__ = (window as any).__SENTRY_BREADCRUMBS__ || [];
          (window as any).__SENTRY_BREADCRUMBS__.push(breadcrumb);
        },
      };
    });
    // Mock failing API
    await page.route('**/api/users', (route) => route.fulfill({ status: 403, body: { error: 'Forbidden' } }));
    // Act
    await page.goto('/users');
    // Assert: Sentry captured error
    const events = await page.evaluate(() => (window as any).__SENTRY_EVENTS__);
    expect(events).toHaveLength(1);
    expect(events[0]).toMatchObject({
      error: expect.stringContaining('403'),
      context: expect.objectContaining({
        endpoint: '/api/users',
        statusCode: 403,
      }),
    });
    // Assert: Breadcrumbs include user actions
    const breadcrumbs = await page.evaluate(() => (window as any).__SENTRY_BREADCRUMBS__);
    expect(breadcrumbs).toContainEqual(
      expect.objectContaining({
        category: 'navigation',
        message: '/users',
      }),
    );
  });
 });
 ```
 **Cypress with Sentry**:
 ```javascript
 // cypress/e2e/telemetry-logging.cy.ts
 describe('Error Telemetry', () => {
  it('should log API errors with redacted sensitive data', () => {
    const errorLogs = [];
    // Capture console errors
    cy.on('window:before:load', (win) => {
      cy.stub(win.console, 'error').callsFake((msg) => {
        errorLogs.push(msg);
      });
    });
    // Mock failing API
    cy.intercept('POST', '**/api/orders', {
      statusCode: 500,
      body: { error: 'Payment failed' },
    });
    // Act
    cy.visit('/checkout');
    cy.get('[data-cy="place-order"]').click();
    // Assert: Error logged
    cy.wrap(errorLogs).should('have.length.greaterThan', 0);
    // Assert: Context included
    cy.wrap(errorLogs[0]).should('include', '/api/orders');
    // Assert: Secrets redacted
    cy.wrap(JSON.stringify(errorLogs)).should('not.contain', 'password');
    cy.wrap(JSON.stringify(errorLogs)).should('not.contain', 'creditCard');
  });
 });
 ```
 **Error logger utility with redaction**:
 ```typescript
 // src/utils/error-logger.ts
 type ErrorContext = {
  endpoint?: string;
  method?: string;
  statusCode?: number;
  userId?: string;
  sessionId?: string;
  requestPayload?: any;
 };
 const SENSITIVE_KEYS = ['password', 'token', 'creditCard', 'ssn', 'apiKey'];
 /**
 * Redact sensitive data from objects
 */
 function redactSensitiveData(obj: any): any {
  if (typeof obj !== 'object' || obj === null) return obj;
  const redacted = { ...obj };
  for (const key of Object.keys(redacted)) {
    if (SENSITIVE_KEYS.some((sensitive) => key.toLowerCase().includes(sensitive))) {
      redacted[key] = '[REDACTED]';
    } else if (typeof redacted[key] === 'object') {
      redacted[key] = redactSensitiveData(redacted[key]);
    }
  }
  return redacted;
 }
 /**
 * Log error with context (Sentry integration)
 */
 export function logError(error: Error, context?: ErrorContext) {
  const safeContext = context ? redactSensitiveData(context) : {};
  const errorLog = {
    level: 'error' as const,
    message: error.message,
    stack: error.stack,
    context: safeContext,
    timestamp: new Date().toISOString(),
  };
  // Console (development)
  console.error(JSON.stringify(errorLog));
  // Sentry (production)
  if (typeof window !== 'undefined' && (window as any).Sentry) {
    (window as any).Sentry.captureException(error, {
      contexts: { custom: safeContext },
    });
  }
 }
 ```
 **Key Points**:
 - **Context-rich logging**: Endpoint, method, status, user ID
 - **Secret redaction**: Passwords, tokens, PII removed before logging
 - **Sentry integration**: Production monitoring with breadcrumbs
 - **Structured logs**: JSON format for easy parsing
 - **Test validation**: Assert logs contain context but not secrets
 ---
 ### Example 4: Graceful Degradation Tests (Fallback Behavior)
 **Context**: Validate application continues functioning when services are unavailable.
 **Implementation**:
 ```typescript
 // tests/e2e/graceful-degradation.spec.ts
 import { test, expect } from '@playwright/test';
 /**
 * Graceful Degradation Pattern
 * - Simulate service unavailability
 * - Validate fallback behavior
 * - Ensure user experience degrades gracefully
 * - Verify telemetry captures degradation events
 */
 test.describe('Service Unavailability', () => {
  test('should display cached data when API is down', async ({ page }) => {
    // Arrange: Seed localStorage with cached data
    await page.addInitScript(() => {
      localStorage.setItem(
        'products_cache',
        JSON.stringify({
          data: [
            { id: 1, name: 'Cached Product 1' },
            { id: 2, name: 'Cached Product 2' },
          ],
          timestamp: Date.now(),
        }),
      );
    });
    // Mock API unavailable
    await page.route(
      '**/api/products',
      (route) => route.abort('connectionrefused'), // Simulate server down
    );
    // Act
    await page.goto('/products');
    // Assert: Cached data displayed
    await expect(page.getByTestId('product-list')).toBeVisible();
    await expect(page.getByText('Cached Product 1')).toBeVisible();
    // Assert: Stale data warning shown
    await expect(page.getByTestId('cache-warning')).toBeVisible();
    await expect(page.getByTestId('cache-warning')).toContainText(/showing.*cached|offline.*mode/i);
    // Assert: Retry button available
    await expect(page.getByTestId('refresh-button')).toBeVisible();
  });
  test('should show fallback UI when analytics service fails', async ({ page }) => {
    // Mock analytics service down (non-critical)
    await page.route('**/analytics/track', (route) => route.fulfill({ status: 503, body: 'Service unavailable' }));
    // Act: Navigate normally
    await page.goto('/dashboard');
    // Assert: Page loads successfully (analytics failure doesn't block)
    await expect(page.getByTestId('dashboard-content')).toBeVisible();
    // Assert: Analytics error logged but not shown to user
    const consoleErrors = [];
    page.on('console', (msg) => {
      if (msg.type() === 'error') consoleErrors.push(msg.text());
    });
    // Trigger analytics event
    await page.getByTestId('track-action-button').click();
    // Analytics error logged
    expect(consoleErrors).toContainEqual(expect.stringContaining('Analytics service unavailable'));
    // But user doesn't see error
    await expect(page.getByTestId('error-message')).not.toBeVisible();
  });
  test('should fallback to local validation when API is slow', async ({ page }) => {
    // Mock slow API (> 5 seconds)
    await page.route('**/api/validate-email', async (route) => {
      await new Promise((resolve) => setTimeout(resolve, 6000)); // 6 second delay
      route.fulfill({
        status: 200,
        body: JSON.stringify({ valid: true }),
      });
    });
    // Act: Fill form
    await page.goto('/signup');
    await page.getByTestId('email-input').fill('test@example.com');
    await page.getByTestId('email-input').blur();
    // Assert: Client-side validation triggers immediately (doesn't wait for API)
    await expect(page.getByTestId('email-valid-icon')).toBeVisible({ timeout: 1000 });
    // Assert: Eventually API validates too (but doesn't block UX)
    await expect(page.getByTestId('email-validated-badge')).toBeVisible({ timeout: 7000 });
  });
  test('should maintain functionality with third-party script failure', async ({ page }) => {
    // Block third-party scripts (Google Analytics, Intercom, etc.)
    await page.route('**/*.google-analytics.com/**', (route) => route.abort());
    await page.route('**/*.intercom.io/**', (route) => route.abort());
    // Act
    await page.goto('/');
    // Assert: App works without third-party scripts
    await expect(page.getByTestId('main-content')).toBeVisible();
    await expect(page.getByTestId('nav-menu')).toBeVisible();
    // Assert: Core functionality intact
    await page.getByTestId('nav-products').click();
    await expect(page).toHaveURL(/.*\/products/);
  });
 });
 ```
 **Key Points**:
 - **Cached fallbacks**: Display stale data when API unavailable
 - **Non-critical degradation**: Analytics failures don't block app
 - **Client-side fallbacks**: Local validation when API slow
 - **Third-party resilience**: App works without external scripts
 - **User transparency**: Stale data warnings displayed
 ---
 ## Error Handling Testing Checklist
 Before shipping error handling code, verify:
 - [ ] **Scoped exception handling**: Only ignore documented errors (NetworkError, specific codes)
 - [ ] **Rethrow unexpected**: Unknown errors fail tests (catch regressions)
 - [ ] **Error UI tested**: User sees error messages for all error states
 - [ ] **Retry logic validated**: Sequential failures test backoff and max attempts
 - [ ] **Telemetry verified**: Errors logged with context (endpoint, status, user)
 - [ ] **Secret redaction**: Logs don't contain passwords, tokens, PII
 - [ ] **Graceful degradation**: Critical services down, app shows fallback UI
 - [ ] **Non-critical failures**: Analytics/tracking failures don't block app
 ## Integration Points
 - Used in workflows: `*automate` (error handling test generation), `*test-review` (error pattern detection)
 - Related fragments: `network-first.md`, `test-quality.md`, `contract-testing.md`
 - Monitoring tools: Sentry, Datadog, LogRocket
 _Source: Murat error-handling patterns, Pact resilience guidance, SEON production error handling_
--- a/_bmad/bmm/testarch/knowledge/feature-flags.md
+++ b/_bmad/bmm/testarch/knowledge/feature-flags.md
@@ -0,0 +1,750 @@
 # Feature Flag Governance
 ## Principle
 Feature flags enable controlled rollouts and A/B testing, but require disciplined testing governance. Centralize flag definitions in a frozen enum, test both enabled and disabled states, clean up targeting after each spec, and maintain a comprehensive flag lifecycle checklist. For LaunchDarkly-style systems, script API helpers to seed variations programmatically rather than manual UI mutations.
 ## Rationale
 Poorly managed feature flags become technical debt: untested variations ship broken code, forgotten flags clutter the codebase, and shared environments become unstable from leftover targeting rules. Structured governance ensures flags are testable, traceable, temporary, and safe. Testing both states prevents surprises when flags flip in production.
 ## Pattern Examples
 ### Example 1: Feature Flag Enum Pattern with Type Safety
 **Context**: Centralized flag management with TypeScript type safety and runtime validation.
 **Implementation**:
 ```typescript
 // src/utils/feature-flags.ts
 /**
 * Centralized feature flag definitions
 * - Object.freeze prevents runtime modifications
 * - TypeScript ensures compile-time type safety
 * - Single source of truth for all flag keys
 */
 export const FLAGS = Object.freeze({
  // User-facing features
  NEW_CHECKOUT_FLOW: 'new-checkout-flow',
  DARK_MODE: 'dark-mode',
  ENHANCED_SEARCH: 'enhanced-search',
  // Experiments
  PRICING_EXPERIMENT_A: 'pricing-experiment-a',
  HOMEPAGE_VARIANT_B: 'homepage-variant-b',
  // Infrastructure
  USE_NEW_API_ENDPOINT: 'use-new-api-endpoint',
  ENABLE_ANALYTICS_V2: 'enable-analytics-v2',
  // Killswitches (emergency disables)
  DISABLE_PAYMENT_PROCESSING: 'disable-payment-processing',
  DISABLE_EMAIL_NOTIFICATIONS: 'disable-email-notifications',
 } as const);
 /**
 * Type-safe flag keys
 * Prevents typos and ensures autocomplete in IDEs
 */
 export type FlagKey = (typeof FLAGS)[keyof typeof FLAGS];
 /**
 * Flag metadata for governance
 */
 type FlagMetadata = {
  key: FlagKey;
  name: string;
  owner: string;
  createdDate: string;
  expiryDate?: string;
  defaultState: boolean;
  requiresCleanup: boolean;
  dependencies?: FlagKey[];
  telemetryEvents?: string[];
 };
 /**
 * Flag registry with governance metadata
 * Used for flag lifecycle tracking and cleanup alerts
 */
 export const FLAG_REGISTRY: Record<FlagKey, FlagMetadata> = {
  [FLAGS.NEW_CHECKOUT_FLOW]: {
    key: FLAGS.NEW_CHECKOUT_FLOW,
    name: 'New Checkout Flow',
    owner: 'payments-team',
    createdDate: '2025-01-15',
    expiryDate: '2025-03-15',
    defaultState: false,
    requiresCleanup: true,
    dependencies: [FLAGS.USE_NEW_API_ENDPOINT],
    telemetryEvents: ['checkout_started', 'checkout_completed'],
  },
  [FLAGS.DARK_MODE]: {
    key: FLAGS.DARK_MODE,
    name: 'Dark Mode UI',
    owner: 'frontend-team',
    createdDate: '2025-01-10',
    defaultState: false,
    requiresCleanup: false, // Permanent feature toggle
  },
  // ... rest of registry
 };
 /**
 * Validate flag exists in registry
 * Throws at runtime if flag is unregistered
 */
 export function validateFlag(flag: string): asserts flag is FlagKey {
  if (!Object.values(FLAGS).includes(flag as FlagKey)) {
    throw new Error(`Unregistered feature flag: ${flag}`);
  }
 }
 /**
 * Check if flag is expired (needs removal)
 */
 export function isFlagExpired(flag: FlagKey): boolean {
  const metadata = FLAG_REGISTRY[flag];
  if (!metadata.expiryDate) return false;
  const expiry = new Date(metadata.expiryDate);
  return Date.now() > expiry.getTime();
 }
 /**
 * Get all expired flags requiring cleanup
 */
 export function getExpiredFlags(): FlagMetadata[] {
  return Object.values(FLAG_REGISTRY).filter((meta) => isFlagExpired(meta.key));
 }
 ```
 **Usage in application code**:
 ```typescript
 // components/Checkout.tsx
 import { FLAGS } from '@/utils/feature-flags';
 import { useFeatureFlag } from '@/hooks/useFeatureFlag';
 export function Checkout() {
  const isNewFlow = useFeatureFlag(FLAGS.NEW_CHECKOUT_FLOW);
  return isNewFlow ? <NewCheckoutFlow /> : <LegacyCheckoutFlow />;
 }
 ```
 **Key Points**:
 - **Type safety**: TypeScript catches typos at compile time
 - **Runtime validation**: validateFlag ensures only registered flags used
 - **Metadata tracking**: Owner, dates, dependencies documented
 - **Expiry alerts**: Automated detection of stale flags
 - **Single source of truth**: All flags defined in one place
 ---
 ### Example 2: Feature Flag Testing Pattern (Both States)
 **Context**: Comprehensive testing of feature flag variations with proper cleanup.
 **Implementation**:
 ```typescript
 // tests/e2e/checkout-feature-flag.spec.ts
 import { test, expect } from '@playwright/test';
 import { FLAGS } from '@/utils/feature-flags';
 /**
 * Feature Flag Testing Strategy:
 * 1. Test BOTH enabled and disabled states
 * 2. Clean up targeting after each test
 * 3. Use dedicated test users (not production data)
 * 4. Verify telemetry events fire correctly
 */
 test.describe('Checkout Flow - Feature Flag Variations', () => {
  let testUserId: string;
  test.beforeEach(async () => {
    // Generate unique test user ID
    testUserId = `test-user-${Date.now()}`;
  });
  test.afterEach(async ({ request }) => {
    // CRITICAL: Clean up flag targeting to prevent shared env pollution
    await request.post('/api/feature-flags/cleanup', {
      data: {
        flagKey: FLAGS.NEW_CHECKOUT_FLOW,
        userId: testUserId,
      },
    });
  });
  test('should use NEW checkout flow when flag is ENABLED', async ({ page, request }) => {
    // Arrange: Enable flag for test user
    await request.post('/api/feature-flags/target', {
      data: {
        flagKey: FLAGS.NEW_CHECKOUT_FLOW,
        userId: testUserId,
        variation: true, // ENABLED
      },
    });
    // Act: Navigate as targeted user
    await page.goto('/checkout', {
      extraHTTPHeaders: {
        'X-Test-User-ID': testUserId,
      },
    });
    // Assert: New flow UI elements visible
    await expect(page.getByTestId('checkout-v2-container')).toBeVisible();
    await expect(page.getByTestId('express-payment-options')).toBeVisible();
    await expect(page.getByTestId('saved-addresses-dropdown')).toBeVisible();
    // Assert: Legacy flow NOT visible
    await expect(page.getByTestId('checkout-v1-container')).not.toBeVisible();
    // Assert: Telemetry event fired
    const analyticsEvents = await page.evaluate(() => (window as any).__ANALYTICS_EVENTS__ || []);
    expect(analyticsEvents).toContainEqual(
      expect.objectContaining({
        event: 'checkout_started',
        properties: expect.objectContaining({
          variant: 'new_flow',
        }),
      }),
    );
  });
  test('should use LEGACY checkout flow when flag is DISABLED', async ({ page, request }) => {
    // Arrange: Disable flag for test user (or don't target at all)
    await request.post('/api/feature-flags/target', {
      data: {
        flagKey: FLAGS.NEW_CHECKOUT_FLOW,
        userId: testUserId,
        variation: false, // DISABLED
      },
    });
    // Act: Navigate as targeted user
    await page.goto('/checkout', {
      extraHTTPHeaders: {
        'X-Test-User-ID': testUserId,
      },
    });
    // Assert: Legacy flow UI elements visible
    await expect(page.getByTestId('checkout-v1-container')).toBeVisible();
    await expect(page.getByTestId('legacy-payment-form')).toBeVisible();
    // Assert: New flow NOT visible
    await expect(page.getByTestId('checkout-v2-container')).not.toBeVisible();
    await expect(page.getByTestId('express-payment-options')).not.toBeVisible();
    // Assert: Telemetry event fired with correct variant
    const analyticsEvents = await page.evaluate(() => (window as any).__ANALYTICS_EVENTS__ || []);
    expect(analyticsEvents).toContainEqual(
      expect.objectContaining({
        event: 'checkout_started',
        properties: expect.objectContaining({
          variant: 'legacy_flow',
        }),
      }),
    );
  });
  test('should handle flag evaluation errors gracefully', async ({ page, request }) => {
    // Arrange: Simulate flag service unavailable
    await page.route('**/api/feature-flags/evaluate', (route) => route.fulfill({ status: 500, body: 'Service Unavailable' }));
    // Act: Navigate (should fallback to default state)
    await page.goto('/checkout', {
      extraHTTPHeaders: {
        'X-Test-User-ID': testUserId,
      },
    });
    // Assert: Fallback to safe default (legacy flow)
    await expect(page.getByTestId('checkout-v1-container')).toBeVisible();
    // Assert: Error logged but no user-facing error
    const consoleErrors = [];
    page.on('console', (msg) => {
      if (msg.type() === 'error') consoleErrors.push(msg.text());
    });
    expect(consoleErrors).toContain(expect.stringContaining('Feature flag evaluation failed'));
  });
 });
 ```
 **Cypress equivalent**:
 ```javascript
 // cypress/e2e/checkout-feature-flag.cy.ts
 import { FLAGS } from '@/utils/feature-flags';
 describe('Checkout Flow - Feature Flag Variations', () => {
  let testUserId;
  beforeEach(() => {
    testUserId = `test-user-${Date.now()}`;
  });
  afterEach(() => {
    // Clean up targeting
    cy.task('removeFeatureFlagTarget', {
      flagKey: FLAGS.NEW_CHECKOUT_FLOW,
      userId: testUserId,
    });
  });
  it('should use NEW checkout flow when flag is ENABLED', () => {
    // Arrange: Enable flag via Cypress task
    cy.task('setFeatureFlagVariation', {
      flagKey: FLAGS.NEW_CHECKOUT_FLOW,
      userId: testUserId,
      variation: true,
    });
    // Act
    cy.visit('/checkout', {
      headers: { 'X-Test-User-ID': testUserId },
    });
    // Assert
    cy.get('[data-testid="checkout-v2-container"]').should('be.visible');
    cy.get('[data-testid="checkout-v1-container"]').should('not.exist');
  });
  it('should use LEGACY checkout flow when flag is DISABLED', () => {
    // Arrange: Disable flag
    cy.task('setFeatureFlagVariation', {
      flagKey: FLAGS.NEW_CHECKOUT_FLOW,
      userId: testUserId,
      variation: false,
    });
    // Act
    cy.visit('/checkout', {
      headers: { 'X-Test-User-ID': testUserId },
    });
    // Assert
    cy.get('[data-testid="checkout-v1-container"]').should('be.visible');
    cy.get('[data-testid="checkout-v2-container"]').should('not.exist');
  });
 });
 ```
 **Key Points**:
 - **Test both states**: Enabled AND disabled variations
 - **Automatic cleanup**: afterEach removes targeting (prevent pollution)
 - **Unique test users**: Avoid conflicts with real user data
 - **Telemetry validation**: Verify analytics events fire correctly
 - **Graceful degradation**: Test fallback behavior on errors
 ---
 ### Example 3: Feature Flag Targeting Helper Pattern
 **Context**: Reusable helpers for programmatic flag control via LaunchDarkly/Split.io API.
 **Implementation**:
 ```typescript
 // tests/support/feature-flag-helpers.ts
 import { request as playwrightRequest } from '@playwright/test';
 import { FLAGS, FlagKey } from '@/utils/feature-flags';
 /**
 * LaunchDarkly API client configuration
 * Use test project SDK key (NOT production)
 */
 const LD_SDK_KEY = process.env.LD_SDK_KEY_TEST;
 const LD_API_BASE = 'https://app.launchdarkly.com/api/v2';
 type FlagVariation = boolean | string | number | object;
 /**
 * Set flag variation for specific user
 * Uses LaunchDarkly API to create user target
 */
 export async function setFlagForUser(flagKey: FlagKey, userId: string, variation: FlagVariation): Promise<void> {
  const response = await playwrightRequest.newContext().then((ctx) =>
    ctx.post(`${LD_API_BASE}/flags/${flagKey}/targeting`, {
      headers: {
        Authorization: LD_SDK_KEY!,
        'Content-Type': 'application/json',
      },
      data: {
        targets: [
          {
            values: [userId],
            variation: variation ? 1 : 0, // 0 = off, 1 = on
          },
        ],
      },
    }),
  );
  if (!response.ok()) {
    throw new Error(`Failed to set flag ${flagKey} for user ${userId}: ${response.status()}`);
  }
 }
 /**
 * Remove user from flag targeting
 * CRITICAL for test cleanup
 */
 export async function removeFlagTarget(flagKey: FlagKey, userId: string): Promise<void> {
  const response = await playwrightRequest.newContext().then((ctx) =>
    ctx.delete(`${LD_API_BASE}/flags/${flagKey}/targeting/users/${userId}`, {
      headers: {
        Authorization: LD_SDK_KEY!,
      },
    }),
  );
  if (!response.ok() && response.status() !== 404) {
    // 404 is acceptable (user wasn't targeted)
    throw new Error(`Failed to remove flag ${flagKey} target for user ${userId}: ${response.status()}`);
  }
 }
 /**
 * Percentage rollout helper
 * Enable flag for N% of users
 */
 export async function setFlagRolloutPercentage(flagKey: FlagKey, percentage: number): Promise<void> {
  if (percentage < 0 || percentage > 100) {
    throw new Error('Percentage must be between 0 and 100');
  }
  const response = await playwrightRequest.newContext().then((ctx) =>
    ctx.patch(`${LD_API_BASE}/flags/${flagKey}`, {
      headers: {
        Authorization: LD_SDK_KEY!,
        'Content-Type': 'application/json',
      },
      data: {
        rollout: {
          variations: [
            { variation: 0, weight: 100 - percentage }, // off
            { variation: 1, weight: percentage }, // on
          ],
        },
      },
    }),
  );
  if (!response.ok()) {
    throw new Error(`Failed to set rollout for flag ${flagKey}: ${response.status()}`);
  }
 }
 /**
 * Enable flag globally (100% rollout)
 */
 export async function enableFlagGlobally(flagKey: FlagKey): Promise<void> {
  await setFlagRolloutPercentage(flagKey, 100);
 }
 /**
 * Disable flag globally (0% rollout)
 */
 export async function disableFlagGlobally(flagKey: FlagKey): Promise<void> {
  await setFlagRolloutPercentage(flagKey, 0);
 }
 /**
 * Stub feature flags in local/test environments
 * Bypasses LaunchDarkly entirely
 */
 export function stubFeatureFlags(flags: Record<FlagKey, FlagVariation>): void {
  // Set flags in localStorage or inject into window
  if (typeof window !== 'undefined') {
    (window as any).__STUBBED_FLAGS__ = flags;
  }
 }
 ```
 **Usage in Playwright fixture**:
 ```typescript
 // playwright/fixtures/feature-flag-fixture.ts
 import { test as base } from '@playwright/test';
 import { setFlagForUser, removeFlagTarget } from '../support/feature-flag-helpers';
 import { FlagKey } from '@/utils/feature-flags';
 type FeatureFlagFixture = {
  featureFlags: {
    enable: (flag: FlagKey, userId: string) => Promise<void>;
    disable: (flag: FlagKey, userId: string) => Promise<void>;
    cleanup: (flag: FlagKey, userId: string) => Promise<void>;
  };
 };
 export const test = base.extend<FeatureFlagFixture>({
  featureFlags: async ({}, use) => {
    const cleanupQueue: Array<{ flag: FlagKey; userId: string }> = [];
    await use({
      enable: async (flag, userId) => {
        await setFlagForUser(flag, userId, true);
        cleanupQueue.push({ flag, userId });
      },
      disable: async (flag, userId) => {
        await setFlagForUser(flag, userId, false);
        cleanupQueue.push({ flag, userId });
      },
      cleanup: async (flag, userId) => {
        await removeFlagTarget(flag, userId);
      },
    });
    // Auto-cleanup after test
    for (const { flag, userId } of cleanupQueue) {
      await removeFlagTarget(flag, userId);
    }
  },
 });
 ```
 **Key Points**:
 - **API-driven control**: No manual UI clicks required
 - **Auto-cleanup**: Fixture tracks and removes targeting
 - **Percentage rollouts**: Test gradual feature releases
 - **Stubbing option**: Local development without LaunchDarkly
 - **Type-safe**: FlagKey prevents typos
 ---
 ### Example 4: Feature Flag Lifecycle Checklist & Cleanup Strategy
 **Context**: Governance checklist and automated cleanup detection for stale flags.
 **Implementation**:
 ```typescript
 // scripts/feature-flag-audit.ts
 /**
 * Feature Flag Lifecycle Audit Script
 * Run weekly to detect stale flags requiring cleanup
 */
 import { FLAG_REGISTRY, FLAGS, getExpiredFlags, FlagKey } from '../src/utils/feature-flags';
 import * as fs from 'fs';
 import * as path from 'path';
 type AuditResult = {
  totalFlags: number;
  expiredFlags: FlagKey[];
  missingOwners: FlagKey[];
  missingDates: FlagKey[];
  permanentFlags: FlagKey[];
  flagsNearingExpiry: FlagKey[];
 };
 /**
 * Audit all feature flags for governance compliance
 */
 function auditFeatureFlags(): AuditResult {
  const allFlags = Object.keys(FLAG_REGISTRY) as FlagKey[];
  const expiredFlags = getExpiredFlags().map((meta) => meta.key);
  // Flags expiring in next 30 days
  const thirtyDaysFromNow = Date.now() + 30 * 24 * 60 * 60 * 1000;
  const flagsNearingExpiry = allFlags.filter((flag) => {
    const meta = FLAG_REGISTRY[flag];
    if (!meta.expiryDate) return false;
    const expiry = new Date(meta.expiryDate).getTime();
    return expiry > Date.now() && expiry < thirtyDaysFromNow;
  });
  // Missing metadata
  const missingOwners = allFlags.filter((flag) => !FLAG_REGISTRY[flag].owner);
  const missingDates = allFlags.filter((flag) => !FLAG_REGISTRY[flag].createdDate);
  // Permanent flags (no expiry, requiresCleanup = false)
  const permanentFlags = allFlags.filter((flag) => {
    const meta = FLAG_REGISTRY[flag];
    return !meta.expiryDate && !meta.requiresCleanup;
  });
  return {
    totalFlags: allFlags.length,
    expiredFlags,
    missingOwners,
    missingDates,
    permanentFlags,
    flagsNearingExpiry,
  };
 }
 /**
 * Generate markdown report
 */
 function generateReport(audit: AuditResult): string {
  let report = `# Feature Flag Audit Report\n\n`;
  report += `**Date**: ${new Date().toISOString()}\n`;
  report += `**Total Flags**: ${audit.totalFlags}\n\n`;
  if (audit.expiredFlags.length > 0) {
    report += `## ⚠️ EXPIRED FLAGS - IMMEDIATE CLEANUP REQUIRED\n\n`;
    audit.expiredFlags.forEach((flag) => {
      const meta = FLAG_REGISTRY[flag];
      report += `- **${meta.name}** (\`${flag}\`)\n`;
      report += `  - Owner: ${meta.owner}\n`;
      report += `  - Expired: ${meta.expiryDate}\n`;
      report += `  - Action: Remove flag code, update tests, deploy\n\n`;
    });
  }
  if (audit.flagsNearingExpiry.length > 0) {
    report += `## ⏰ FLAGS EXPIRING SOON (Next 30 Days)\n\n`;
    audit.flagsNearingExpiry.forEach((flag) => {
      const meta = FLAG_REGISTRY[flag];
      report += `- **${meta.name}** (\`${flag}\`)\n`;
      report += `  - Owner: ${meta.owner}\n`;
      report += `  - Expires: ${meta.expiryDate}\n`;
      report += `  - Action: Plan cleanup or extend expiry\n\n`;
    });
  }
  if (audit.permanentFlags.length > 0) {
    report += `## 🔄 PERMANENT FLAGS (No Expiry)\n\n`;
    audit.permanentFlags.forEach((flag) => {
      const meta = FLAG_REGISTRY[flag];
      report += `- **${meta.name}** (\`${flag}\`) - Owner: ${meta.owner}\n`;
    });
    report += `\n`;
  }
  if (audit.missingOwners.length > 0 || audit.missingDates.length > 0) {
    report += `## ❌ GOVERNANCE ISSUES\n\n`;
    if (audit.missingOwners.length > 0) {
      report += `**Missing Owners**: ${audit.missingOwners.join(', ')}\n`;
    }
    if (audit.missingDates.length > 0) {
      report += `**Missing Created Dates**: ${audit.missingDates.join(', ')}\n`;
    }
    report += `\n`;
  }
  return report;
 }
 /**
 * Feature Flag Lifecycle Checklist
 */
 const FLAG_LIFECYCLE_CHECKLIST = `
 # Feature Flag Lifecycle Checklist
 ## Before Creating a New Flag
 - [ ] **Name**: Follow naming convention (kebab-case, descriptive)
 - [ ] **Owner**: Assign team/individual responsible
 - [ ] **Default State**: Determine safe default (usually false)
 - [ ] **Expiry Date**: Set removal date (30-90 days typical)
 - [ ] **Dependencies**: Document related flags
 - [ ] **Telemetry**: Plan analytics events to track
 - [ ] **Rollback Plan**: Define how to disable quickly
 ## During Development
 - [ ] **Code Paths**: Both enabled/disabled states implemented
 - [ ] **Tests**: Both variations tested in CI
 - [ ] **Documentation**: Flag purpose documented in code/PR
 - [ ] **Telemetry**: Analytics events instrumented
 - [ ] **Error Handling**: Graceful degradation on flag service failure
 ## Before Launch
 - [ ] **QA**: Both states tested in staging
 - [ ] **Rollout Plan**: Gradual rollout percentage defined
 - [ ] **Monitoring**: Dashboards/alerts for flag-related metrics
 - [ ] **Stakeholder Communication**: Product/design aligned
 ## After Launch (Monitoring)
 - [ ] **Metrics**: Success criteria tracked
 - [ ] **Error Rates**: No increase in errors
 - [ ] **Performance**: No degradation
 - [ ] **User Feedback**: Qualitative data collected
 ## Cleanup (Post-Launch)
 - [ ] **Remove Flag Code**: Delete if/else branches
 - [ ] **Update Tests**: Remove flag-specific tests
 - [ ] **Remove Targeting**: Clear all user targets
 - [ ] **Delete Flag Config**: Remove from LaunchDarkly/registry
 - [ ] **Update Documentation**: Remove references
 - [ ] **Deploy**: Ship cleanup changes
 `;
 // Run audit
 const audit = auditFeatureFlags();
 const report = generateReport(audit);
 // Save report
 const outputPath = path.join(__dirname, '../feature-flag-audit-report.md');
 fs.writeFileSync(outputPath, report);
 fs.writeFileSync(path.join(__dirname, '../FEATURE-FLAG-CHECKLIST.md'), FLAG_LIFECYCLE_CHECKLIST);
 console.log(`✅ Audit complete. Report saved to: ${outputPath}`);
 console.log(`Total flags: ${audit.totalFlags}`);
 console.log(`Expired flags: ${audit.expiredFlags.length}`);
 console.log(`Flags expiring soon: ${audit.flagsNearingExpiry.length}`);
 // Exit with error if expired flags exist
 if (audit.expiredFlags.length > 0) {
  console.error(`\n❌ EXPIRED FLAGS DETECTED - CLEANUP REQUIRED`);
  process.exit(1);
 }
 ```
 **package.json scripts**:
 ```json
 {
  "scripts": {
    "feature-flags:audit": "ts-node scripts/feature-flag-audit.ts",
    "feature-flags:audit:ci": "npm run feature-flags:audit || true"
  }
 }
 ```
 **Key Points**:
 - **Automated detection**: Weekly audit catches stale flags
 - **Lifecycle checklist**: Comprehensive governance guide
 - **Expiry tracking**: Flags auto-expire after defined date
 - **CI integration**: Audit runs in pipeline, warns on expiry
 - **Ownership clarity**: Every flag has assigned owner
 ---
 ## Feature Flag Testing Checklist
 Before merging flag-related code, verify:
 - [ ] **Both states tested**: Enabled AND disabled variations covered
 - [ ] **Cleanup automated**: afterEach removes targeting (no manual cleanup)
 - [ ] **Unique test data**: Test users don't collide with production
 - [ ] **Telemetry validated**: Analytics events fire for both variations
 - [ ] **Error handling**: Graceful fallback when flag service unavailable
 - [ ] **Flag metadata**: Owner, dates, dependencies documented in registry
 - [ ] **Rollback plan**: Clear steps to disable flag in production
 - [ ] **Expiry date set**: Removal date defined (or marked permanent)
 ## Integration Points
 - Used in workflows: `*automate` (test generation), `*framework` (flag setup)
 - Related fragments: `test-quality.md`, `selective-testing.md`
 - Flag services: LaunchDarkly, Split.io, Unleash, custom implementations
 _Source: LaunchDarkly strategy blog, Murat test architecture notes, SEON feature flag governance_
--- a/_bmad/bmm/testarch/knowledge/file-utils.md
+++ b/_bmad/bmm/testarch/knowledge/file-utils.md
@@ -0,0 +1,260 @@
 # File Utilities
 ## Principle
 Read and validate files (CSV, XLSX, PDF, ZIP) with automatic parsing, type-safe results, and download handling. Simplify file operations in Playwright tests with built-in format support and validation helpers.
 ## Rationale
 Testing file operations in Playwright requires boilerplate:
 - Manual download handling
 - External parsing libraries for each format
 - No validation helpers
 - Type-unsafe results
 - Repetitive path handling
 The `file-utils` module provides:
 - **Auto-parsing**: CSV, XLSX, PDF, ZIP automatically parsed
 - **Download handling**: Single function for UI or API-triggered downloads
 - **Type-safe**: TypeScript interfaces for parsed results
 - **Validation helpers**: Row count, header checks, content validation
 - **Format support**: Multiple sheet support (XLSX), text extraction (PDF), archive extraction (ZIP)
 ## Pattern Examples
 ### Example 1: UI-Triggered CSV Download
 **Context**: User clicks button, CSV downloads, validate contents.
 **Implementation**:
 ```typescript
 import { handleDownload, readCSV } from '@seontechnologies/playwright-utils/file-utils';
 import path from 'node:path';
 const DOWNLOAD_DIR = path.join(__dirname, '../downloads');
 test('should download and validate CSV', async ({ page }) => {
  const downloadPath = await handleDownload({
    page,
    downloadDir: DOWNLOAD_DIR,
    trigger: () => page.click('[data-testid="export-csv"]'),
  });
  const { content } = await readCSV({ filePath: downloadPath });
  // Validate headers
  expect(content.headers).toEqual(['ID', 'Name', 'Email', 'Role']);
  // Validate data
  expect(content.data).toHaveLength(10);
  expect(content.data[0]).toMatchObject({
    ID: expect.any(String),
    Name: expect.any(String),
    Email: expect.stringMatching(/@/),
  });
 });
 ```
 **Key Points**:
 - `handleDownload` waits for download, returns file path
 - `readCSV` auto-parses to `{ headers, data }`
 - Type-safe access to parsed content
 - Clean up downloads in `afterEach`
 ### Example 2: XLSX with Multiple Sheets
 **Context**: Excel file with multiple sheets (e.g., Summary, Details, Errors).
 **Implementation**:
 ```typescript
 import { readXLSX } from '@seontechnologies/playwright-utils/file-utils';
 test('should read multi-sheet XLSX', async () => {
  const downloadPath = await handleDownload({
    page,
    downloadDir: DOWNLOAD_DIR,
    trigger: () => page.click('[data-testid="export-xlsx"]'),
  });
  const { content } = await readXLSX({ filePath: downloadPath });
  // Access specific sheets
  const summarySheet = content.sheets.find((s) => s.name === 'Summary');
  const detailsSheet = content.sheets.find((s) => s.name === 'Details');
  // Validate summary
  expect(summarySheet.data).toHaveLength(1);
  expect(summarySheet.data[0].TotalRecords).toBe('150');
  // Validate details
  expect(detailsSheet.data).toHaveLength(150);
  expect(detailsSheet.headers).toContain('TransactionID');
 });
 ```
 **Key Points**:
 - `sheets` array with `name` and `data` properties
 - Access sheets by name
 - Each sheet has its own headers and data
 - Type-safe sheet iteration
 ### Example 3: PDF Text Extraction
 **Context**: Validate PDF report contains expected content.
 **Implementation**:
 ```typescript
 import { readPDF } from '@seontechnologies/playwright-utils/file-utils';
 test('should validate PDF report', async () => {
  const downloadPath = await handleDownload({
    page,
    downloadDir: DOWNLOAD_DIR,
    trigger: () => page.click('[data-testid="download-report"]'),
  });
  const { content } = await readPDF({ filePath: downloadPath });
  // content.text is extracted text from all pages
  expect(content.text).toContain('Financial Report Q4 2024');
  expect(content.text).toContain('Total Revenue:');
  // Validate page count
  expect(content.numpages).toBeGreaterThan(10);
 });
 ```
 **Key Points**:
 - `content.text` contains all extracted text
 - `content.numpages` for page count
 - PDF parsing handles multi-page documents
 - Search for specific phrases
 ### Example 4: ZIP Archive Validation
 **Context**: Validate ZIP contains expected files and extract specific file.
 **Implementation**:
 ```typescript
 import { readZIP } from '@seontechnologies/playwright-utils/file-utils';
 test('should validate ZIP archive', async () => {
  const downloadPath = await handleDownload({
    page,
    downloadDir: DOWNLOAD_DIR,
    trigger: () => page.click('[data-testid="download-backup"]'),
  });
  const { content } = await readZIP({ filePath: downloadPath });
  // Check file list
  expect(content.files).toContain('data.csv');
  expect(content.files).toContain('config.json');
  expect(content.files).toContain('readme.txt');
  // Read specific file from archive
  const configContent = content.zip.readAsText('config.json');
  const config = JSON.parse(configContent);
  expect(config.version).toBe('2.0');
 });
 ```
 **Key Points**:
 - `content.files` lists all files in archive
 - `content.zip.readAsText()` extracts specific files
 - Validate archive structure
 - Read and parse individual files from ZIP
 ### Example 5: API-Triggered Download
 **Context**: API endpoint returns file download (not UI click).
 **Implementation**:
 ```typescript
 test('should download via API', async ({ page, request }) => {
  const downloadPath = await handleDownload({
    page,
    downloadDir: DOWNLOAD_DIR,
    trigger: async () => {
      const response = await request.get('/api/export/csv', {
        headers: { Authorization: 'Bearer token' },
      });
      if (!response.ok()) {
        throw new Error(`Export failed: ${response.status()}`);
      }
    },
  });
  const { content } = await readCSV({ filePath: downloadPath });
  expect(content.data).toHaveLength(100);
 });
 ```
 **Key Points**:
 - `trigger` can be async API call
 - API must return `Content-Disposition` header
 - Still need `page` for download events
 - Works with authenticated endpoints
 ## Validation Helpers
 ```typescript
 // CSV validation
 const { isValid, errors } = await validateCSV({
  filePath: downloadPath,
  expectedRowCount: 10,
  requiredHeaders: ['ID', 'Name', 'Email'],
 });
 expect(isValid).toBe(true);
 expect(errors).toHaveLength(0);
 ```
 ## Download Cleanup Pattern
 ```typescript
 test.afterEach(async () => {
  // Clean up downloaded files
  await fs.remove(DOWNLOAD_DIR);
 });
 ```
 ## Related Fragments
 - `overview.md` - Installation and imports
 - `api-request.md` - API-triggered downloads
 - `recurse.md` - Poll for file generation completion
 ## Anti-Patterns
 **❌ Not cleaning up downloads:**
 ```typescript
 test('creates file', async () => {
  await handleDownload({ ... })
  // File left in downloads folder
 })
 ```
 **✅ Clean up after tests:**
 ```typescript
 test.afterEach(async () => {
  await fs.remove(DOWNLOAD_DIR);
 });
 ```
--- a/_bmad/bmm/testarch/knowledge/fixture-architecture.md
+++ b/_bmad/bmm/testarch/knowledge/fixture-architecture.md
@@ -0,0 +1,401 @@
 # Fixture Architecture Playbook
 ## Principle
 Build test helpers as pure functions first, then wrap them in framework-specific fixtures. Compose capabilities using `mergeTests` (Playwright) or layered commands (Cypress) instead of inheritance. Each fixture should solve one isolated concern (auth, API, logs, network).
 ## Rationale
 Traditional Page Object Models create tight coupling through inheritance chains (`BasePage → LoginPage → AdminPage`). When base classes change, all descendants break. Pure functions with fixture wrappers provide:
 - **Testability**: Pure functions run in unit tests without framework overhead
 - **Composability**: Mix capabilities freely via `mergeTests`, no inheritance constraints
 - **Reusability**: Export fixtures via package subpaths for cross-project sharing
 - **Maintainability**: One concern per fixture = clear responsibility boundaries
 ## Pattern Examples
 ### Example 1: Pure Function → Fixture Pattern
 **Context**: When building any test helper, always start with a pure function that accepts all dependencies explicitly. Then wrap it in a Playwright fixture or Cypress command.
 **Implementation**:
 ```typescript
 // playwright/support/helpers/api-request.ts
 // Step 1: Pure function (ALWAYS FIRST!)
 type ApiRequestParams = {
  request: APIRequestContext;
  method: 'GET' | 'POST' | 'PUT' | 'DELETE';
  url: string;
  data?: unknown;
  headers?: Record<string, string>;
 };
 export async function apiRequest({
  request,
  method,
  url,
  data,
  headers = {}
 }: ApiRequestParams) {
  const response = await request.fetch(url, {
    method,
    data,
    headers: {
      'Content-Type': 'application/json',
      ...headers
    }
  });
  if (!response.ok()) {
    throw new Error(`API request failed: ${response.status()} ${await response.text()}`);
  }
  return response.json();
 }
 // Step 2: Fixture wrapper
 // playwright/support/fixtures/api-request-fixture.ts
 import { test as base } from '@playwright/test';
 import { apiRequest } from '../helpers/api-request';
 export const test = base.extend<{ apiRequest: typeof apiRequest }>({
  apiRequest: async ({ request }, use) => {
    // Inject framework dependency, expose pure function
    await use((params) => apiRequest({ request, ...params }));
  }
 });
 // Step 3: Package exports for reusability
 // package.json
 {
  "exports": {
    "./api-request": "./playwright/support/helpers/api-request.ts",
    "./api-request/fixtures": "./playwright/support/fixtures/api-request-fixture.ts"
  }
 }
 ```
 **Key Points**:
 - Pure function is unit-testable without Playwright running
 - Framework dependency (`request`) injected at fixture boundary
 - Fixture exposes the pure function to test context
 - Package subpath exports enable `import { apiRequest } from 'my-fixtures/api-request'`
 ### Example 2: Composable Fixture System with mergeTests
 **Context**: When building comprehensive test capabilities, compose multiple focused fixtures instead of creating monolithic helper classes. Each fixture provides one capability.
 **Implementation**:
 ```typescript
 // playwright/support/fixtures/merged-fixtures.ts
 import { test as base, mergeTests } from '@playwright/test';
 import { test as apiRequestFixture } from './api-request-fixture';
 import { test as networkFixture } from './network-fixture';
 import { test as authFixture } from './auth-fixture';
 import { test as logFixture } from './log-fixture';
 // Compose all fixtures for comprehensive capabilities
 export const test = mergeTests(base, apiRequestFixture, networkFixture, authFixture, logFixture);
 export { expect } from '@playwright/test';
 // Example usage in tests:
 // import { test, expect } from './support/fixtures/merged-fixtures';
 //
 // test('user can create order', async ({ page, apiRequest, auth, network }) => {
 //   await auth.loginAs('customer@example.com');
 //   await network.interceptRoute('POST', '**/api/orders', { id: 123 });
 //   await page.goto('/checkout');
 //   await page.click('[data-testid="submit-order"]');
 //   await expect(page.getByText('Order #123')).toBeVisible();
 // });
 ```
 **Individual Fixture Examples**:
 ```typescript
 // network-fixture.ts
 export const test = base.extend({
  network: async ({ page }, use) => {
    const interceptedRoutes = new Map();
    const interceptRoute = async (method: string, url: string, response: unknown) => {
      await page.route(url, (route) => {
        if (route.request().method() === method) {
          route.fulfill({ body: JSON.stringify(response) });
        }
      });
      interceptedRoutes.set(`${method}:${url}`, response);
    };
    await use({ interceptRoute });
    // Cleanup
    interceptedRoutes.clear();
  },
 });
 // auth-fixture.ts
 export const test = base.extend({
  auth: async ({ page, context }, use) => {
    const loginAs = async (email: string) => {
      // Use API to setup auth (fast!)
      const token = await getAuthToken(email);
      await context.addCookies([
        {
          name: 'auth_token',
          value: token,
          domain: 'localhost',
          path: '/',
        },
      ]);
    };
    await use({ loginAs });
  },
 });
 ```
 **Key Points**:
 - `mergeTests` combines fixtures without inheritance
 - Each fixture has single responsibility (network, auth, logs)
 - Tests import merged fixture and access all capabilities
 - No coupling between fixtures—add/remove freely
 ### Example 3: Framework-Agnostic HTTP Helper
 **Context**: When building HTTP helpers, keep them framework-agnostic. Accept all params explicitly so they work in unit tests, Playwright, Cypress, or any context.
 **Implementation**:
 ```typescript
 // shared/helpers/http-helper.ts
 // Pure, framework-agnostic function
 type HttpHelperParams = {
  baseUrl: string;
  endpoint: string;
  method: 'GET' | 'POST' | 'PUT' | 'DELETE';
  body?: unknown;
  headers?: Record<string, string>;
  token?: string;
 };
 export async function makeHttpRequest({ baseUrl, endpoint, method, body, headers = {}, token }: HttpHelperParams): Promise<unknown> {
  const url = `${baseUrl}${endpoint}`;
  const requestHeaders = {
    'Content-Type': 'application/json',
    ...(token && { Authorization: `Bearer ${token}` }),
    ...headers,
  };
  const response = await fetch(url, {
    method,
    headers: requestHeaders,
    body: body ? JSON.stringify(body) : undefined,
  });
  if (!response.ok) {
    const errorText = await response.text();
    throw new Error(`HTTP ${method} ${url} failed: ${response.status} ${errorText}`);
  }
  return response.json();
 }
 // Playwright fixture wrapper
 // playwright/support/fixtures/http-fixture.ts
 import { test as base } from '@playwright/test';
 import { makeHttpRequest } from '../../shared/helpers/http-helper';
 export const test = base.extend({
  httpHelper: async ({}, use) => {
    const baseUrl = process.env.API_BASE_URL || 'http://localhost:3000';
    await use((params) => makeHttpRequest({ baseUrl, ...params }));
  },
 });
 // Cypress command wrapper
 // cypress/support/commands.ts
 import { makeHttpRequest } from '../../shared/helpers/http-helper';
 Cypress.Commands.add('apiRequest', (params) => {
  const baseUrl = Cypress.env('API_BASE_URL') || 'http://localhost:3000';
  return cy.wrap(makeHttpRequest({ baseUrl, ...params }));
 });
 ```
 **Key Points**:
 - Pure function uses only standard `fetch`, no framework dependencies
 - Unit tests call `makeHttpRequest` directly with all params
 - Playwright and Cypress wrappers inject framework-specific config
 - Same logic runs everywhere—zero duplication
 ### Example 4: Fixture Cleanup Pattern
 **Context**: When fixtures create resources (data, files, connections), ensure automatic cleanup in fixture teardown. Tests must not leak state.
 **Implementation**:
 ```typescript
 // playwright/support/fixtures/database-fixture.ts
 import { test as base } from '@playwright/test';
 import { seedDatabase, deleteRecord } from '../helpers/db-helpers';
 type DatabaseFixture = {
  seedUser: (userData: Partial<User>) => Promise<User>;
  seedOrder: (orderData: Partial<Order>) => Promise<Order>;
 };
 export const test = base.extend<DatabaseFixture>({
  seedUser: async ({}, use) => {
    const createdUsers: string[] = [];
    const seedUser = async (userData: Partial<User>) => {
      const user = await seedDatabase('users', userData);
      createdUsers.push(user.id);
      return user;
    };
    await use(seedUser);
    // Auto-cleanup: Delete all users created during test
    for (const userId of createdUsers) {
      await deleteRecord('users', userId);
    }
    createdUsers.length = 0;
  },
  seedOrder: async ({}, use) => {
    const createdOrders: string[] = [];
    const seedOrder = async (orderData: Partial<Order>) => {
      const order = await seedDatabase('orders', orderData);
      createdOrders.push(order.id);
      return order;
    };
    await use(seedOrder);
    // Auto-cleanup: Delete all orders
    for (const orderId of createdOrders) {
      await deleteRecord('orders', orderId);
    }
    createdOrders.length = 0;
  },
 });
 // Example usage:
 // test('user can place order', async ({ seedUser, seedOrder, page }) => {
 //   const user = await seedUser({ email: 'test@example.com' });
 //   const order = await seedOrder({ userId: user.id, total: 100 });
 //
 //   await page.goto(`/orders/${order.id}`);
 //   await expect(page.getByText('Order Total: $100')).toBeVisible();
 //
 //   // No manual cleanup needed—fixture handles it automatically
 // });
 ```
 **Key Points**:
 - Track all created resources in array during test execution
 - Teardown (after `use()`) deletes all tracked resources
 - Tests don't manually clean up—happens automatically
 - Prevents test pollution and flakiness from shared state
 ### Anti-Pattern: Inheritance-Based Page Objects
 **Problem**:
 ```typescript
 // ❌ BAD: Page Object Model with inheritance
 class BasePage {
  constructor(public page: Page) {}
  async navigate(url: string) {
    await this.page.goto(url);
  }
  async clickButton(selector: string) {
    await this.page.click(selector);
  }
 }
 class LoginPage extends BasePage {
  async login(email: string, password: string) {
    await this.navigate('/login');
    await this.page.fill('#email', email);
    await this.page.fill('#password', password);
    await this.clickButton('#submit');
  }
 }
 class AdminPage extends LoginPage {
  async accessAdminPanel() {
    await this.login('admin@example.com', 'admin123');
    await this.navigate('/admin');
  }
 }
 ```
 **Why It Fails**:
 - Changes to `BasePage` break all descendants (`LoginPage`, `AdminPage`)
 - `AdminPage` inherits unnecessary `login` details—tight coupling
 - Cannot compose capabilities (e.g., admin + reporting features require multiple inheritance)
 - Hard to test `BasePage` methods in isolation
 - Hidden state in class instances leads to unpredictable behavior
 **Better Approach**: Use pure functions + fixtures
 ```typescript
 // ✅ GOOD: Pure functions with fixture composition
 // helpers/navigation.ts
 export async function navigate(page: Page, url: string) {
  await page.goto(url);
 }
 // helpers/auth.ts
 export async function login(page: Page, email: string, password: string) {
  await page.fill('[data-testid="email"]', email);
  await page.fill('[data-testid="password"]', password);
  await page.click('[data-testid="submit"]');
 }
 // fixtures/admin-fixture.ts
 export const test = base.extend({
  adminPage: async ({ page }, use) => {
    await login(page, 'admin@example.com', 'admin123');
    await navigate(page, '/admin');
    await use(page);
  },
 });
 // Tests import exactly what they need—no inheritance
 ```
 ## Integration Points
 - **Used in workflows**: `*atdd` (test generation), `*automate` (test expansion), `*framework` (initial setup)
 - **Related fragments**:
  - `data-factories.md` - Factory functions for test data
  - `network-first.md` - Network interception patterns
  - `test-quality.md` - Deterministic test design principles
 ## Helper Function Reuse Guidelines
 When deciding whether to create a fixture, follow these rules:
 - **3+ uses** → Create fixture with subpath export (shared across tests/projects)
 - **2-3 uses** → Create utility module (shared within project)
 - **1 use** → Keep inline (avoid premature abstraction)
 - **Complex logic** → Factory function pattern (dynamic data generation)
 _Source: Murat Testing Philosophy (lines 74-122), SEON production patterns, Playwright fixture docs._
--- a/_bmad/bmm/testarch/knowledge/fixtures-composition.md
+++ b/_bmad/bmm/testarch/knowledge/fixtures-composition.md
@@ -0,0 +1,382 @@
 # Fixtures Composition with mergeTests
 ## Principle
 Combine multiple Playwright fixtures using `mergeTests` to create a unified test object with all capabilities. Build composable test infrastructure by merging playwright-utils fixtures with custom project fixtures.
 ## Rationale
 Using fixtures from multiple sources requires combining them:
 - Importing from multiple fixture files is verbose
 - Name conflicts between fixtures
 - Duplicate fixture definitions
 - No clear single test object
 Playwright's `mergeTests` provides:
 - **Single test object**: All fixtures in one import
 - **Conflict resolution**: Handles name collisions automatically
 - **Composition pattern**: Mix utilities, custom fixtures, third-party fixtures
 - **Type safety**: Full TypeScript support for merged fixtures
 - **Maintainability**: One place to manage all fixtures
 ## Pattern Examples
 ### Example 1: Basic Fixture Merging
 **Context**: Combine multiple playwright-utils fixtures into single test object.
 **Implementation**:
 ```typescript
 // playwright/support/merged-fixtures.ts
 import { mergeTests } from '@playwright/test';
 import { test as apiRequestFixture } from '@seontechnologies/playwright-utils/api-request/fixtures';
 import { test as authFixture } from '@seontechnologies/playwright-utils/auth-session/fixtures';
 import { test as recurseFixture } from '@seontechnologies/playwright-utils/recurse/fixtures';
 // Merge all fixtures
 export const test = mergeTests(apiRequestFixture, authFixture, recurseFixture);
 export { expect } from '@playwright/test';
 ```
 ```typescript
 // In your tests - import from merged fixtures
 import { test, expect } from '../support/merged-fixtures';
 test('all utilities available', async ({
  apiRequest, // From api-request fixture
  authToken, // From auth fixture
  recurse, // From recurse fixture
 }) => {
  // All fixtures available in single test signature
  const { body } = await apiRequest({
    method: 'GET',
    path: '/api/protected',
    headers: { Authorization: `Bearer ${authToken}` },
  });
  await recurse(
    () => apiRequest({ method: 'GET', path: `/status/${body.id}` }),
    (res) => res.body.ready === true,
  );
 });
 ```
 **Key Points**:
 - Create one `merged-fixtures.ts` per project
 - Import test object from merged fixtures in all test files
 - All utilities available without multiple imports
 - Type-safe access to all fixtures
 ### Example 2: Combining with Custom Fixtures
 **Context**: Add project-specific fixtures alongside playwright-utils.
 **Implementation**:
 ```typescript
 // playwright/support/custom-fixtures.ts - Your project fixtures
 import { test as base } from '@playwright/test';
 import { createUser } from './factories/user-factory';
 import { seedDatabase } from './helpers/db-seeder';
 export const test = base.extend({
  // Custom fixture 1: Auto-seeded user
  testUser: async ({ request }, use) => {
    const user = await createUser({ role: 'admin' });
    await seedDatabase('users', [user]);
    await use(user);
    // Cleanup happens automatically
  },
  // Custom fixture 2: Database helpers
  db: async ({}, use) => {
    await use({
      seed: seedDatabase,
      clear: () => seedDatabase.truncate(),
    });
  },
 });
 // playwright/support/merged-fixtures.ts - Combine everything
 import { mergeTests } from '@playwright/test';
 import { test as apiRequestFixture } from '@seontechnologies/playwright-utils/api-request/fixtures';
 import { test as authFixture } from '@seontechnologies/playwright-utils/auth-session/fixtures';
 import { test as customFixtures } from './custom-fixtures';
 export const test = mergeTests(
  apiRequestFixture,
  authFixture,
  customFixtures, // Your project fixtures
 );
 export { expect } from '@playwright/test';
 ```
 ```typescript
 // In tests - all fixtures available
 import { test, expect } from '../support/merged-fixtures';
 test('using mixed fixtures', async ({
  apiRequest, // playwright-utils
  authToken, // playwright-utils
  testUser, // custom
  db, // custom
 }) => {
  // Use playwright-utils
  const { body } = await apiRequest({
    method: 'GET',
    path: `/api/users/${testUser.id}`,
    headers: { Authorization: `Bearer ${authToken}` },
  });
  // Use custom fixture
  await db.clear();
 });
 ```
 **Key Points**:
 - Custom fixtures extend `base` test
 - Merge custom with playwright-utils fixtures
 - All available in one test signature
 - Maintainable separation of concerns
 ### Example 3: Full Utility Suite Integration
 **Context**: Production setup with all core playwright-utils and custom fixtures.
 **Implementation**:
 ```typescript
 // playwright/support/merged-fixtures.ts
 import { mergeTests } from '@playwright/test';
 // Playwright utils fixtures
 import { test as apiRequestFixture } from '@seontechnologies/playwright-utils/api-request/fixtures';
 import { test as authFixture } from '@seontechnologies/playwright-utils/auth-session/fixtures';
 import { test as interceptFixture } from '@seontechnologies/playwright-utils/intercept-network-call/fixtures';
 import { test as recurseFixture } from '@seontechnologies/playwright-utils/recurse/fixtures';
 import { test as networkRecorderFixture } from '@seontechnologies/playwright-utils/network-recorder/fixtures';
 // Custom project fixtures
 import { test as customFixtures } from './custom-fixtures';
 // Merge everything
 export const test = mergeTests(apiRequestFixture, authFixture, interceptFixture, recurseFixture, networkRecorderFixture, customFixtures);
 export { expect } from '@playwright/test';
 ```
 ```typescript
 // In tests
 import { test, expect } from '../support/merged-fixtures';
 test('full integration', async ({
  page,
  context,
  apiRequest,
  authToken,
  interceptNetworkCall,
  recurse,
  networkRecorder,
  testUser, // custom
 }) => {
  // All utilities + custom fixtures available
  await networkRecorder.setup(context);
  const usersCall = interceptNetworkCall({ url: '**/api/users' });
  await page.goto('/users');
  const { responseJson } = await usersCall;
  expect(responseJson).toContainEqual(expect.objectContaining({ id: testUser.id }));
 });
 ```
 **Key Points**:
 - One merged-fixtures.ts for entire project
 - Combine all playwright-utils you use
 - Add custom project fixtures
 - Single import in all test files
 ### Example 4: Fixture Override Pattern
 **Context**: Override default options for specific test files or describes.
 **Implementation**:
 ```typescript
 import { test, expect } from '../support/merged-fixtures';
 // Override auth options for entire file
 test.use({
  authOptions: {
    userIdentifier: 'admin',
    environment: 'staging',
  },
 });
 test('uses admin on staging', async ({ authToken }) => {
  // Token is for admin user on staging environment
 });
 // Override for specific describe block
 test.describe('manager tests', () => {
  test.use({
    authOptions: {
      userIdentifier: 'manager',
    },
  });
  test('manager can access reports', async ({ page }) => {
    // Uses manager token
    await page.goto('/reports');
  });
 });
 ```
 **Key Points**:
 - `test.use()` overrides fixture options
 - Can override at file or describe level
 - Options merge with defaults
 - Type-safe overrides
 ### Example 5: Avoiding Fixture Conflicts
 **Context**: Handle name collisions when merging fixtures with same names.
 **Implementation**:
 ```typescript
 // If two fixtures have same name, last one wins
 import { test as fixture1 } from './fixture1'; // has 'user' fixture
 import { test as fixture2 } from './fixture2'; // also has 'user' fixture
 const test = mergeTests(fixture1, fixture2);
 // fixture2's 'user' overrides fixture1's 'user'
 // Better: Rename fixtures before merging
 import { test as base } from '@playwright/test';
 import { test as fixture1 } from './fixture1';
 const fixture1Renamed = base.extend({
  user1: fixture1._extend.user, // Rename to avoid conflict
 });
 const test = mergeTests(fixture1Renamed, fixture2);
 // Now both 'user1' and 'user' available
 // Best: Design fixtures without conflicts
 // - Prefix custom fixtures: 'myAppUser', 'myAppDb'
 // - Playwright-utils uses descriptive names: 'apiRequest', 'authToken'
 ```
 **Key Points**:
 - Last fixture wins in conflicts
 - Rename fixtures to avoid collisions
 - Design fixtures with unique names
 - Playwright-utils uses descriptive names (no conflicts)
 ## Recommended Project Structure
 ```
 playwright/
 ├── support/
 │   ├── merged-fixtures.ts        # ⭐ Single test object for project
 │   ├── custom-fixtures.ts        # Your project-specific fixtures
 │   ├── auth/
 │   │   ├── auth-fixture.ts       # Auth wrapper (if needed)
 │   │   └── custom-auth-provider.ts
 │   ├── fixtures/
 │   │   ├── user-fixture.ts
 │   │   ├── db-fixture.ts
 │   │   └── api-fixture.ts
 │   └── utils/
 │       └── factories/
 └── tests/
    ├── api/
    │   └── users.spec.ts          # import { test } from '../../support/merged-fixtures'
    ├── e2e/
    │   └── login.spec.ts          # import { test } from '../../support/merged-fixtures'
    └── component/
        └── button.spec.ts         # import { test } from '../../support/merged-fixtures'
 ```
 ## Benefits of Fixture Composition
 **Compared to direct imports:**
 ```typescript
 // ❌ Without mergeTests (verbose)
 import { test as base } from '@playwright/test';
 import { apiRequest } from '@seontechnologies/playwright-utils/api-request';
 import { getAuthToken } from './auth';
 import { createUser } from './factories';
 test('verbose', async ({ request }) => {
  const token = await getAuthToken();
  const user = await createUser();
  const response = await apiRequest({ request, method: 'GET', path: '/api/users' });
  // Manual wiring everywhere
 });
 // ✅ With mergeTests (clean)
 import { test } from '../support/merged-fixtures';
 test('clean', async ({ apiRequest, authToken, testUser }) => {
  const { body } = await apiRequest({ method: 'GET', path: '/api/users' });
  // All fixtures auto-wired
 });
 ```
 **Reduction:** ~10 lines per test → ~2 lines
 ## Related Fragments
 - `overview.md` - Installation and design principles
 - `api-request.md`, `auth-session.md`, `recurse.md` - Utilities to merge
 - `network-recorder.md`, `intercept-network-call.md`, `log.md` - Additional utilities
 ## Anti-Patterns
 **❌ Importing test from multiple fixture files:**
 ```typescript
 import { test } from '@seontechnologies/playwright-utils/api-request/fixtures';
 // Also need auth...
 import { test as authTest } from '@seontechnologies/playwright-utils/auth-session/fixtures';
 // Name conflict! Which test to use?
 ```
 **✅ Use merged fixtures:**
 ```typescript
 import { test } from '../support/merged-fixtures';
 // All utilities available, no conflicts
 ```
 **❌ Merging too many fixtures (kitchen sink):**
 ```typescript
 // Merging 20+ fixtures makes test signature huge
 const test = mergeTests(...20 different fixtures)
 test('my test', async ({ fixture1, fixture2, ..., fixture20 }) => {
  // Cognitive overload
 })
 ```
 **✅ Merge only what you actually use:**
 ```typescript
 // Merge the 4-6 fixtures your project actually needs
 const test = mergeTests(apiRequestFixture, authFixture, recurseFixture, customFixtures);
 ```
--- a/_bmad/bmm/testarch/knowledge/intercept-network-call.md
+++ b/_bmad/bmm/testarch/knowledge/intercept-network-call.md
@@ -0,0 +1,280 @@
 # Intercept Network Call Utility
 ## Principle
 Intercept network requests with a single declarative call that returns a Promise. Automatically parse JSON responses, support both spy (observe) and stub (mock) patterns, and use powerful glob pattern matching for URL filtering.
 ## Rationale
 Vanilla Playwright's network interception requires multiple steps:
 - `page.route()` to setup, `page.waitForResponse()` to capture
 - Manual JSON parsing
 - Verbose syntax for conditional handling
 - Complex filter predicates
 The `interceptNetworkCall` utility provides:
 - **Single declarative call**: Setup and wait in one statement
 - **Automatic JSON parsing**: Response pre-parsed, strongly typed
 - **Flexible URL patterns**: Glob matching with picomatch
 - **Spy or stub modes**: Observe real traffic or mock responses
 - **Concise API**: Reduces boilerplate by 60-70%
 ## Pattern Examples
 ### Example 1: Spy on Network (Observe Real Traffic)
 **Context**: Capture and inspect real API responses for validation.
 **Implementation**:
 ```typescript
 import { test } from '@seontechnologies/playwright-utils/intercept-network-call/fixtures';
 test('should spy on users API', async ({ page, interceptNetworkCall }) => {
  // Setup interception BEFORE navigation
  const usersCall = interceptNetworkCall({
    url: '**/api/users', // Glob pattern
  });
  await page.goto('/dashboard');
  // Wait for response and access parsed data
  const { responseJson, status } = await usersCall;
  expect(status).toBe(200);
  expect(responseJson).toHaveLength(10);
  expect(responseJson[0]).toHaveProperty('name');
 });
 ```
 **Key Points**:
 - Intercept before navigation (critical for race-free tests)
 - Returns Promise with `{ responseJson, status, requestBody }`
 - Glob patterns (`**` matches any path segment)
 - JSON automatically parsed
 ### Example 2: Stub Network (Mock Response)
 **Context**: Mock API responses for testing UI behavior without backend.
 **Implementation**:
 ```typescript
 test('should stub users API', async ({ page, interceptNetworkCall }) => {
  const mockUsers = [
    { id: 1, name: 'Test User 1' },
    { id: 2, name: 'Test User 2' },
  ];
  const usersCall = interceptNetworkCall({
    url: '**/api/users',
    fulfillResponse: {
      status: 200,
      body: mockUsers,
    },
  });
  await page.goto('/dashboard');
  await usersCall;
  // UI shows mocked data
  await expect(page.getByText('Test User 1')).toBeVisible();
  await expect(page.getByText('Test User 2')).toBeVisible();
 });
 ```
 **Key Points**:
 - `fulfillResponse` mocks the API
 - No backend needed
 - Test UI logic in isolation
 - Status code and body fully controllable
 ### Example 3: Conditional Response Handling
 **Context**: Different responses based on request method or parameters.
 **Implementation**:
 ```typescript
 test('conditional mocking', async ({ page, interceptNetworkCall }) => {
  await interceptNetworkCall({
    url: '**/api/data',
    handler: async (route, request) => {
      if (request.method() === 'POST') {
        // Mock POST success
        await route.fulfill({
          status: 201,
          body: JSON.stringify({ id: 'new-id', success: true }),
        });
      } else if (request.method() === 'GET') {
        // Mock GET with data
        await route.fulfill({
          status: 200,
          body: JSON.stringify([{ id: 1, name: 'Item' }]),
        });
      } else {
        // Let other methods through
        await route.continue();
      }
    },
  });
  await page.goto('/data-page');
 });
 ```
 **Key Points**:
 - `handler` function for complex logic
 - Access full `route` and `request` objects
 - Can mock, continue, or abort
 - Flexible for advanced scenarios
 ### Example 4: Error Simulation
 **Context**: Testing error handling in UI when API fails.
 **Implementation**:
 ```typescript
 test('should handle API errors gracefully', async ({ page, interceptNetworkCall }) => {
  // Simulate 500 error
  const errorCall = interceptNetworkCall({
    url: '**/api/users',
    fulfillResponse: {
      status: 500,
      body: { error: 'Internal Server Error' },
    },
  });
  await page.goto('/dashboard');
  await errorCall;
  // Verify UI shows error state
  await expect(page.getByText('Failed to load users')).toBeVisible();
  await expect(page.getByTestId('retry-button')).toBeVisible();
 });
 // Simulate network timeout
 test('should handle timeout', async ({ page, interceptNetworkCall }) => {
  await interceptNetworkCall({
    url: '**/api/slow',
    handler: async (route) => {
      // Never respond - simulates timeout
      await new Promise(() => {});
    },
  });
  await page.goto('/slow-page');
  // UI should show timeout error
  await expect(page.getByText('Request timed out')).toBeVisible({ timeout: 10000 });
 });
 ```
 **Key Points**:
 - Mock error statuses (4xx, 5xx)
 - Test timeout scenarios
 - Validate error UI states
 - No real failures needed
 ### Example 5: Multiple Intercepts (Order Matters!)
 **Context**: Intercepting different endpoints in same test - setup order is critical.
 **Implementation**:
 ```typescript
 test('multiple intercepts', async ({ page, interceptNetworkCall }) => {
  // ✅ CORRECT: Setup all intercepts BEFORE navigation
  const usersCall = interceptNetworkCall({ url: '**/api/users' });
  const productsCall = interceptNetworkCall({ url: '**/api/products' });
  const ordersCall = interceptNetworkCall({ url: '**/api/orders' });
  // THEN navigate
  await page.goto('/dashboard');
  // Wait for all (or specific ones)
  const [users, products] = await Promise.all([usersCall, productsCall]);
  expect(users.responseJson).toHaveLength(10);
  expect(products.responseJson).toHaveLength(50);
 });
 ```
 **Key Points**:
 - Setup all intercepts before triggering actions
 - Use `Promise.all()` to wait for multiple calls
 - Order: intercept → navigate → await
 - Prevents race conditions
 ## URL Pattern Matching
 **Supported glob patterns:**
 ```typescript
 '**/api/users'; // Any path ending with /api/users
 '/api/users'; // Exact match
 '**/users/*'; // Any users sub-path
 '**/api/{users,products}'; // Either users or products
 '**/api/users?id=*'; // With query params
 ```
 **Uses picomatch library** - same pattern syntax as Playwright's `page.route()` but cleaner API.
 ## Comparison with Vanilla Playwright
 | Vanilla Playwright                                          | intercept-network-call                                       |
 | ----------------------------------------------------------- | ------------------------------------------------------------ |
 | `await page.route('/api/users', route => route.continue())` | `const call = interceptNetworkCall({ url: '**/api/users' })` |
 | `const resp = await page.waitForResponse('/api/users')`     | (Combined in single statement)                               |
 | `const json = await resp.json()`                            | `const { responseJson } = await call`                        |
 | `const status = resp.status()`                              | `const { status } = await call`                              |
 | Complex filter predicates                                   | Simple glob patterns                                         |
 **Reduction:** ~5-7 lines → ~2-3 lines per interception
 ## Related Fragments
 - `network-first.md` - Core pattern: intercept before navigate
 - `network-recorder.md` - HAR-based offline testing
 - `overview.md` - Fixture composition basics
 ## Anti-Patterns
 **❌ Intercepting after navigation:**
 ```typescript
 await page.goto('/dashboard'); // Navigation starts
 const usersCall = interceptNetworkCall({ url: '**/api/users' }); // Too late!
 ```
 **✅ Intercept before navigate:**
 ```typescript
 const usersCall = interceptNetworkCall({ url: '**/api/users' }); // First
 await page.goto('/dashboard'); // Then navigate
 const { responseJson } = await usersCall; // Then await
 ```
 **❌ Ignoring the returned Promise:**
 ```typescript
 interceptNetworkCall({ url: '**/api/users' }); // Not awaited!
 await page.goto('/dashboard');
 // No deterministic wait - race condition
 ```
 **✅ Always await the intercept:**
 ```typescript
 const usersCall = interceptNetworkCall({ url: '**/api/users' });
 await page.goto('/dashboard');
 await usersCall; // Deterministic wait
 ```
--- a/_bmad/bmm/testarch/knowledge/log.md
+++ b/_bmad/bmm/testarch/knowledge/log.md
@@ -0,0 +1,294 @@
 # Log Utility
 ## Principle
 Use structured logging that integrates with Playwright's test reports. Support object logging, test step decoration, and multiple log levels (info, step, success, warning, error, debug).
 ## Rationale
 Console.log in Playwright tests has limitations:
 - Not visible in HTML reports
 - No test step integration
 - No structured output
 - Lost in terminal noise during CI
 The `log` utility provides:
 - **Report integration**: Logs appear in Playwright HTML reports
 - **Test step decoration**: `log.step()` creates collapsible steps in UI
 - **Object logging**: Automatically formats objects/arrays
 - **Multiple levels**: info, step, success, warning, error, debug
 - **Optional console**: Can disable console output but keep report logs
 ## Pattern Examples
 ### Example 1: Basic Logging Levels
 **Context**: Log different types of messages throughout test execution.
 **Implementation**:
 ```typescript
 import { log } from '@seontechnologies/playwright-utils';
 test('logging demo', async ({ page }) => {
  await log.step('Navigate to login page');
  await page.goto('/login');
  await log.info('Entering credentials');
  await page.fill('#username', 'testuser');
  await log.success('Login successful');
  await log.warning('Rate limit approaching');
  await log.debug({ userId: '123', sessionId: 'abc' });
  // Errors still throw but get logged first
  try {
    await page.click('#nonexistent');
  } catch (error) {
    await log.error('Click failed', false); // false = no console output
    throw error;
  }
 });
 ```
 **Key Points**:
 - `step()` creates collapsible steps in Playwright UI
 - `info()`, `success()`, `warning()` for different message types
 - `debug()` for detailed data (objects/arrays)
 - `error()` with optional console suppression
 - All logs appear in test reports
 ### Example 2: Object and Array Logging
 **Context**: Log structured data for debugging without cluttering console.
 **Implementation**:
 ```typescript
 test('object logging', async ({ apiRequest }) => {
  const { body } = await apiRequest({
    method: 'GET',
    path: '/api/users',
  });
  // Log array of objects
  await log.debug(body); // Formatted as JSON in report
  // Log specific object
  await log.info({
    totalUsers: body.length,
    firstUser: body[0]?.name,
    timestamp: new Date().toISOString(),
  });
  // Complex nested structures
  await log.debug({
    request: {
      method: 'GET',
      path: '/api/users',
      timestamp: Date.now(),
    },
    response: {
      status: 200,
      body: body.slice(0, 3), // First 3 items
    },
  });
 });
 ```
 **Key Points**:
 - Objects auto-formatted as pretty JSON
 - Arrays handled gracefully
 - Nested structures supported
 - All visible in Playwright report attachments
 ### Example 3: Test Step Organization
 **Context**: Organize test execution into collapsible steps for better readability in reports.
 **Implementation**:
 ```typescript
 test('organized with steps', async ({ page, apiRequest }) => {
  await log.step('ARRANGE: Setup test data');
  const { body: user } = await apiRequest({
    method: 'POST',
    path: '/api/users',
    body: { name: 'Test User' },
  });
  await log.step('ACT: Perform user action');
  await page.goto(`/users/${user.id}`);
  await page.click('#edit');
  await page.fill('#name', 'Updated Name');
  await page.click('#save');
  await log.step('ASSERT: Verify changes');
  await expect(page.getByText('Updated Name')).toBeVisible();
  // In Playwright UI, each step is collapsible
 });
 ```
 **Key Points**:
 - `log.step()` creates collapsible sections
 - Organize by Arrange-Act-Assert
 - Steps visible in Playwright trace viewer
 - Better debugging when tests fail
 ### Example 4: Conditional Logging
 **Context**: Log different messages based on environment or test conditions.
 **Implementation**:
 ```typescript
 test('conditional logging', async ({ page }) => {
  const isCI = process.env.CI === 'true';
  if (isCI) {
    await log.info('Running in CI environment');
  } else {
    await log.debug('Running locally');
  }
  const isKafkaWorking = await checkKafkaHealth();
  if (!isKafkaWorking) {
    await log.warning('Kafka unavailable - skipping event checks');
  } else {
    await log.step('Verifying Kafka events');
    // ... event verification
  }
 });
 ```
 **Key Points**:
 - Log based on environment
 - Skip logging with conditionals
 - Use appropriate log levels
 - Debug info for local, minimal for CI
 ### Example 5: Integration with Auth and API
 **Context**: Log authenticated API requests with tokens (safely).
 **Implementation**:
 ```typescript
 import { test } from '@seontechnologies/playwright-utils/fixtures';
 // Helper to create safe token preview
 function createTokenPreview(token: string): string {
  if (!token || token.length < 10) return '[invalid]';
  return `${token.slice(0, 6)}...${token.slice(-4)}`;
 }
 test('should log auth flow', async ({ authToken, apiRequest }) => {
  await log.info(`Using token: ${createTokenPreview(authToken)}`);
  await log.step('Fetch protected resource');
  const { status, body } = await apiRequest({
    method: 'GET',
    path: '/api/protected',
    headers: { Authorization: `Bearer ${authToken}` },
  });
  await log.debug({
    status,
    bodyPreview: {
      id: body.id,
      recordCount: body.data?.length,
    },
  });
  await log.success('Protected resource accessed successfully');
 });
 ```
 **Key Points**:
 - Never log full tokens (security risk)
 - Use preview functions for sensitive data
 - Combine with auth and API utilities
 - Log at appropriate detail level
 ## Log Levels Guide
 | Level     | When to Use                         | Shows in Report      | Shows in Console |
 | --------- | ----------------------------------- | -------------------- | ---------------- |
 | `step`    | Test organization, major actions    | ✅ Collapsible steps | ✅ Yes           |
 | `info`    | General information, state changes  | ✅ Yes               | ✅ Yes           |
 | `success` | Successful operations               | ✅ Yes               | ✅ Yes           |
 | `warning` | Non-critical issues, skipped checks | ✅ Yes               | ✅ Yes           |
 | `error`   | Failures, exceptions                | ✅ Yes               | ✅ Configurable  |
 | `debug`   | Detailed data, objects              | ✅ Yes (attached)    | ✅ Configurable  |
 ## Comparison with console.log
 | console.log             | log Utility               |
 | ----------------------- | ------------------------- |
 | Not in reports          | Appears in reports        |
 | No test steps           | Creates collapsible steps |
 | Manual JSON.stringify() | Auto-formats objects      |
 | No log levels           | 6 log levels              |
 | Lost in CI output       | Preserved in artifacts    |
 ## Related Fragments
 - `overview.md` - Basic usage and imports
 - `api-request.md` - Log API requests
 - `auth-session.md` - Log auth flow (safely)
 - `recurse.md` - Log polling progress
 ## Anti-Patterns
 **❌ Logging objects in steps:**
 ```typescript
 await log.step({ user: 'test', action: 'create' }); // Shows empty in UI
 ```
 **✅ Use strings for steps, objects for debug:**
 ```typescript
 await log.step('Creating user: test'); // Readable in UI
 await log.debug({ user: 'test', action: 'create' }); // Detailed data
 ```
 **❌ Logging sensitive data:**
 ```typescript
 await log.info(`Password: ${password}`); // Security risk!
 await log.info(`Token: ${authToken}`); // Full token exposed!
 ```
 **✅ Use previews or omit sensitive data:**
 ```typescript
 await log.info('User authenticated successfully'); // No sensitive data
 await log.debug({ tokenPreview: token.slice(0, 6) + '...' });
 ```
 **❌ Excessive logging in loops:**
 ```typescript
 for (const item of items) {
  await log.info(`Processing ${item.id}`); // 100 log entries!
 }
 ```
 **✅ Log summary or use debug level:**
 ```typescript
 await log.step(`Processing ${items.length} items`);
 await log.debug({ itemIds: items.map((i) => i.id) }); // One log entry
 ```
--- a/_bmad/bmm/testarch/knowledge/network-error-monitor.md
+++ b/_bmad/bmm/testarch/knowledge/network-error-monitor.md
@@ -0,0 +1,272 @@
 # Network Error Monitor
 ## Principle
 Automatically detect and fail tests when HTTP 4xx/5xx errors occur during execution. Act like Sentry for tests - catch silent backend failures even when UI passes assertions.
 ## Rationale
 Traditional Playwright tests focus on UI:
 - Backend 500 errors ignored if UI looks correct
 - Silent failures slip through
 - No visibility into background API health
 - Tests pass while features are broken
 The `network-error-monitor` provides:
 - **Automatic detection**: All HTTP 4xx/5xx responses tracked
 - **Test failures**: Fail tests with backend errors (even if UI passes)
 - **Structured artifacts**: JSON reports with error details
 - **Smart opt-out**: Disable for validation tests expecting errors
 - **Deduplication**: Group repeated errors by pattern
 - **Domino effect prevention**: Limit test failures per error pattern
 ## Pattern Examples
 ### Example 1: Basic Auto-Monitoring
 **Context**: Automatically fail tests when backend errors occur.
 **Implementation**:
 ```typescript
 import { test } from '@seontechnologies/playwright-utils/network-error-monitor/fixtures';
 // Monitoring automatically enabled
 test('should load dashboard', async ({ page }) => {
  await page.goto('/dashboard');
  await expect(page.locator('h1')).toContainText('Dashboard');
  // ✅ Passes if no HTTP errors
  // ❌ Fails if any 4xx/5xx errors detected with clear message:
  //    "Network errors detected: 2 request(s) failed"
  //    Failed requests:
  //      GET 500 https://api.example.com/users
  //      POST 503 https://api.example.com/metrics
 });
 ```
 **Key Points**:
 - Zero setup - auto-enabled for all tests
 - Fails on any 4xx/5xx response
 - Structured error message with URLs and status codes
 - JSON artifact attached to test report
 ### Example 2: Opt-Out for Validation Tests
 **Context**: Some tests expect errors (validation, error handling, edge cases).
 **Implementation**:
 ```typescript
 import { test } from '@seontechnologies/playwright-utils/network-error-monitor/fixtures';
 // Opt-out with annotation
 test('should show error on invalid input', { annotation: [{ type: 'skipNetworkMonitoring' }] }, async ({ page }) => {
  await page.goto('/form');
  await page.click('#submit'); // Triggers 400 error
  // Monitoring disabled - test won't fail on 400
  await expect(page.getByText('Invalid input')).toBeVisible();
 });
 // Or opt-out entire describe block
 test.describe('error handling', { annotation: [{ type: 'skipNetworkMonitoring' }] }, () => {
  test('handles 404', async ({ page }) => {
    // All tests in this block skip monitoring
  });
  test('handles 500', async ({ page }) => {
    // Monitoring disabled
  });
 });
 ```
 **Key Points**:
 - Use annotation `{ type: 'skipNetworkMonitoring' }`
 - Can opt-out single test or entire describe block
 - Monitoring still active for other tests
 - Perfect for intentional error scenarios
 ### Example 3: Integration with Merged Fixtures
 **Context**: Combine network-error-monitor with other utilities.
 **Implementation**:
 ```typescript
 // playwright/support/merged-fixtures.ts
 import { mergeTests } from '@playwright/test';
 import { test as authFixture } from '@seontechnologies/playwright-utils/auth-session/fixtures';
 import { test as networkErrorMonitorFixture } from '@seontechnologies/playwright-utils/network-error-monitor/fixtures';
 export const test = mergeTests(
  authFixture,
  networkErrorMonitorFixture,
  // Add other fixtures
 );
 // In tests
 import { test, expect } from '../support/merged-fixtures';
 test('authenticated with monitoring', async ({ page, authToken }) => {
  // Both auth and network monitoring active
  await page.goto('/protected');
  // Fails if backend returns errors during auth flow
 });
 ```
 **Key Points**:
 - Combine with `mergeTests`
 - Works alongside all other utilities
 - Monitoring active automatically
 - No extra setup needed
 ### Example 4: Domino Effect Prevention
 **Context**: One failing endpoint shouldn't fail all tests.
 **Implementation**:
 ```typescript
 // Configuration (internal to utility)
 const config = {
  maxTestsPerError: 3, // Max 3 tests fail per unique error pattern
 };
 // Scenario:
 // Test 1: GET /api/broken → 500 error → Test fails ❌
 // Test 2: GET /api/broken → 500 error → Test fails ❌
 // Test 3: GET /api/broken → 500 error → Test fails ❌
 // Test 4: GET /api/broken → 500 error → Test passes ⚠️ (limit reached, warning logged)
 // Test 5: Different error pattern → Test fails ❌ (new pattern, counter resets)
 ```
 **Key Points**:
 - Limits cascading failures
 - Groups errors by URL + status code pattern
 - Warns when limit reached
 - Prevents flaky backend from failing entire suite
 ### Example 5: Artifact Structure
 **Context**: Debugging failed tests with network error artifacts.
 **Implementation**:
 When test fails due to network errors, artifact attached:
 ```json
 // test-results/my-test/network-errors.json
 {
  "errors": [
    {
      "url": "https://api.example.com/users",
      "method": "GET",
      "status": 500,
      "statusText": "Internal Server Error",
      "timestamp": "2024-08-13T10:30:45.123Z"
    },
    {
      "url": "https://api.example.com/metrics",
      "method": "POST",
      "status": 503,
      "statusText": "Service Unavailable",
      "timestamp": "2024-08-13T10:30:46.456Z"
    }
  ],
  "summary": {
    "totalErrors": 2,
    "uniquePatterns": 2
  }
 }
 ```
 **Key Points**:
 - JSON artifact per failed test
 - Full error details (URL, method, status, timestamp)
 - Summary statistics
 - Easy debugging with structured data
 ## Comparison with Manual Error Checks
 | Manual Approach                                        | network-error-monitor      |
 | ------------------------------------------------------ | -------------------------- |
 | `page.on('response', resp => { if (!resp.ok()) ... })` | Auto-enabled, zero setup   |
 | Check each response manually                           | Automatic for all requests |
 | Custom error tracking logic                            | Built-in deduplication     |
 | No structured artifacts                                | JSON artifacts attached    |
 | Easy to forget                                         | Never miss a backend error |
 ## When to Use
 **Auto-enabled for:**
 - ✅ All E2E tests
 - ✅ Integration tests
 - ✅ Any test hitting real APIs
 **Opt-out for:**
 - ❌ Validation tests (expecting 4xx)
 - ❌ Error handling tests (expecting 5xx)
 - ❌ Offline tests (network-recorder playback)
 ## Integration with Framework Setup
 In `*framework` workflow, mention network-error-monitor:
 ```typescript
 // Add to merged-fixtures.ts
 import { test as networkErrorMonitorFixture } from '@seontechnologies/playwright-utils/network-error-monitor/fixtures';
 export const test = mergeTests(
  // ... other fixtures
  networkErrorMonitorFixture,
 );
 ```
 ## Related Fragments
 - `overview.md` - Installation and fixtures
 - `fixtures-composition.md` - Merging with other utilities
 - `error-handling.md` - Traditional error handling patterns
 ## Anti-Patterns
 **❌ Opting out of monitoring globally:**
 ```typescript
 // Every test skips monitoring
 test.use({ annotation: [{ type: 'skipNetworkMonitoring' }] });
 ```
 **✅ Opt-out only for specific error tests:**
 ```typescript
 test.describe('error scenarios', { annotation: [{ type: 'skipNetworkMonitoring' }] }, () => {
  // Only these tests skip monitoring
 });
 ```
 **❌ Ignoring network error artifacts:**
 ```typescript
 // Test fails, artifact shows 500 errors
 // Developer: "Works on my machine" ¯\_(ツ)_/¯
 ```
 **✅ Check artifacts for root cause:**
 ```typescript
 // Read network-errors.json artifact
 // Identify failing endpoint: GET /api/users → 500
 // Fix backend issue before merging
 ```
--- a/_bmad/bmm/testarch/knowledge/network-first.md
+++ b/_bmad/bmm/testarch/knowledge/network-first.md
@@ -0,0 +1,486 @@
 # Network-First Safeguards
 ## Principle
 Register network interceptions **before** any navigation or user action. Store the interception promise and await it immediately after the triggering step. Replace implicit waits with deterministic signals based on network responses, spinner disappearance, or event hooks.
 ## Rationale
 The most common source of flaky E2E tests is **race conditions** between navigation and network interception:
 - Navigate then intercept = missed requests (too late)
 - No explicit wait = assertion runs before response arrives
 - Hard waits (`waitForTimeout(3000)`) = slow, unreliable, brittle
 Network-first patterns provide:
 - **Zero race conditions**: Intercept is active before triggering action
 - **Deterministic waits**: Wait for actual response, not arbitrary timeouts
 - **Actionable failures**: Assert on response status/body, not generic "element not found"
 - **Speed**: No padding with extra wait time
 ## Pattern Examples
 ### Example 1: Intercept Before Navigate Pattern
 **Context**: The foundational pattern for all E2E tests. Always register route interception **before** the action that triggers the request (navigation, click, form submit).
 **Implementation**:
 ```typescript
 // ✅ CORRECT: Intercept BEFORE navigate
 test('user can view dashboard data', async ({ page }) => {
  // Step 1: Register interception FIRST
  const usersPromise = page.waitForResponse((resp) => resp.url().includes('/api/users') && resp.status() === 200);
  // Step 2: THEN trigger the request
  await page.goto('/dashboard');
  // Step 3: THEN await the response
  const usersResponse = await usersPromise;
  const users = await usersResponse.json();
  // Step 4: Assert on structured data
  expect(users).toHaveLength(10);
  await expect(page.getByText(users[0].name)).toBeVisible();
 });
 // Cypress equivalent
 describe('Dashboard', () => {
  it('should display users', () => {
    // Step 1: Register interception FIRST
    cy.intercept('GET', '**/api/users').as('getUsers');
    // Step 2: THEN trigger
    cy.visit('/dashboard');
    // Step 3: THEN await
    cy.wait('@getUsers').then((interception) => {
      // Step 4: Assert on structured data
      expect(interception.response.statusCode).to.equal(200);
      expect(interception.response.body).to.have.length(10);
      cy.contains(interception.response.body[0].name).should('be.visible');
    });
  });
 });
 // ❌ WRONG: Navigate BEFORE intercept (race condition!)
 test('flaky test example', async ({ page }) => {
  await page.goto('/dashboard'); // Request fires immediately
  const usersPromise = page.waitForResponse('/api/users'); // TOO LATE - might miss it
  const response = await usersPromise; // May timeout randomly
 });
 ```
 **Key Points**:
 - Playwright: Use `page.waitForResponse()` with URL pattern or predicate **before** `page.goto()` or `page.click()`
 - Cypress: Use `cy.intercept().as()` **before** `cy.visit()` or `cy.click()`
 - Store promise/alias, trigger action, **then** await response
 - This prevents 95% of race-condition flakiness in E2E tests
 ### Example 2: HAR Capture for Debugging
 **Context**: When debugging flaky tests or building deterministic mocks, capture real network traffic with HAR files. Replay them in tests for consistent, offline-capable test runs.
 **Implementation**:
 ```typescript
 // playwright.config.ts - Enable HAR recording
 export default defineConfig({
  use: {
    // Record HAR on first run
    recordHar: { path: './hars/', mode: 'minimal' },
    // Or replay HAR in tests
    // serviceWorkers: 'block',
  },
 });
 // Capture HAR for specific test
 test('capture network for order flow', async ({ page, context }) => {
  // Start recording
  await context.routeFromHAR('./hars/order-flow.har', {
    url: '**/api/**',
    update: true, // Update HAR with new requests
  });
  await page.goto('/checkout');
  await page.fill('[data-testid="credit-card"]', '4111111111111111');
  await page.click('[data-testid="submit-order"]');
  await expect(page.getByText('Order Confirmed')).toBeVisible();
  // HAR saved to ./hars/order-flow.har
 });
 // Replay HAR for deterministic tests (no real API needed)
 test('replay order flow from HAR', async ({ page, context }) => {
  // Replay captured HAR
  await context.routeFromHAR('./hars/order-flow.har', {
    url: '**/api/**',
    update: false, // Read-only mode
  });
  // Test runs with exact recorded responses - fully deterministic
  await page.goto('/checkout');
  await page.fill('[data-testid="credit-card"]', '4111111111111111');
  await page.click('[data-testid="submit-order"]');
  await expect(page.getByText('Order Confirmed')).toBeVisible();
 });
 // Custom mock based on HAR insights
 test('mock order response based on HAR', async ({ page }) => {
  // After analyzing HAR, create focused mock
  await page.route('**/api/orders', (route) =>
    route.fulfill({
      status: 200,
      contentType: 'application/json',
      body: JSON.stringify({
        orderId: '12345',
        status: 'confirmed',
        total: 99.99,
      }),
    }),
  );
  await page.goto('/checkout');
  await page.click('[data-testid="submit-order"]');
  await expect(page.getByText('Order #12345')).toBeVisible();
 });
 ```
 **Key Points**:
 - HAR files capture real request/response pairs for analysis
 - `update: true` records new traffic; `update: false` replays existing
 - Replay mode makes tests fully deterministic (no upstream API needed)
 - Use HAR to understand API contracts, then create focused mocks
 ### Example 3: Network Stub with Edge Cases
 **Context**: When testing error handling, timeouts, and edge cases, stub network responses to simulate failures. Test both happy path and error scenarios.
 **Implementation**:
 ```typescript
 // Test happy path
 test('order succeeds with valid data', async ({ page }) => {
  await page.route('**/api/orders', (route) =>
    route.fulfill({
      status: 200,
      contentType: 'application/json',
      body: JSON.stringify({ orderId: '123', status: 'confirmed' }),
    }),
  );
  await page.goto('/checkout');
  await page.click('[data-testid="submit-order"]');
  await expect(page.getByText('Order Confirmed')).toBeVisible();
 });
 // Test 500 error
 test('order fails with server error', async ({ page }) => {
  // Listen for console errors (app should log gracefully)
  const consoleErrors: string[] = [];
  page.on('console', (msg) => {
    if (msg.type() === 'error') consoleErrors.push(msg.text());
  });
  // Stub 500 error
  await page.route('**/api/orders', (route) =>
    route.fulfill({
      status: 500,
      contentType: 'application/json',
      body: JSON.stringify({ error: 'Internal Server Error' }),
    }),
  );
  await page.goto('/checkout');
  await page.click('[data-testid="submit-order"]');
  // Assert UI shows error gracefully
  await expect(page.getByText('Something went wrong')).toBeVisible();
  await expect(page.getByText('Please try again')).toBeVisible();
  // Verify error logged (not thrown)
  expect(consoleErrors.some((e) => e.includes('Order failed'))).toBeTruthy();
 });
 // Test network timeout
 test('order times out after 10 seconds', async ({ page }) => {
  // Stub delayed response (never resolves within timeout)
  await page.route(
    '**/api/orders',
    (route) => new Promise(() => {}), // Never resolves - simulates timeout
  );
  await page.goto('/checkout');
  await page.click('[data-testid="submit-order"]');
  // App should show timeout message after configured timeout
  await expect(page.getByText('Request timed out')).toBeVisible({ timeout: 15000 });
 });
 // Test partial data response
 test('order handles missing optional fields', async ({ page }) => {
  await page.route('**/api/orders', (route) =>
    route.fulfill({
      status: 200,
      contentType: 'application/json',
      // Missing optional fields like 'trackingNumber', 'estimatedDelivery'
      body: JSON.stringify({ orderId: '123', status: 'confirmed' }),
    }),
  );
  await page.goto('/checkout');
  await page.click('[data-testid="submit-order"]');
  // App should handle gracefully - no crash, shows what's available
  await expect(page.getByText('Order Confirmed')).toBeVisible();
  await expect(page.getByText('Tracking information pending')).toBeVisible();
 });
 // Cypress equivalents
 describe('Order Edge Cases', () => {
  it('should handle 500 error', () => {
    cy.intercept('POST', '**/api/orders', {
      statusCode: 500,
      body: { error: 'Internal Server Error' },
    }).as('orderFailed');
    cy.visit('/checkout');
    cy.get('[data-testid="submit-order"]').click();
    cy.wait('@orderFailed');
    cy.contains('Something went wrong').should('be.visible');
  });
  it('should handle timeout', () => {
    cy.intercept('POST', '**/api/orders', (req) => {
      req.reply({ delay: 20000 }); // Delay beyond app timeout
    }).as('orderTimeout');
    cy.visit('/checkout');
    cy.get('[data-testid="submit-order"]').click();
    cy.contains('Request timed out', { timeout: 15000 }).should('be.visible');
  });
 });
 ```
 **Key Points**:
 - Stub different HTTP status codes (200, 400, 500, 503)
 - Simulate timeouts with `delay` or non-resolving promises
 - Test partial/incomplete data responses
 - Verify app handles errors gracefully (no crashes, user-friendly messages)
 ### Example 4: Deterministic Waiting
 **Context**: Never use hard waits (`waitForTimeout(3000)`). Always wait for explicit signals: network responses, element state changes, or custom events.
 **Implementation**:
 ```typescript
 // ✅ GOOD: Wait for response with predicate
 test('wait for specific response', async ({ page }) => {
  const responsePromise = page.waitForResponse((resp) => resp.url().includes('/api/users') && resp.status() === 200);
  await page.goto('/dashboard');
  const response = await responsePromise;
  expect(response.status()).toBe(200);
  await expect(page.getByText('Dashboard')).toBeVisible();
 });
 // ✅ GOOD: Wait for multiple responses
 test('wait for all required data', async ({ page }) => {
  const usersPromise = page.waitForResponse('**/api/users');
  const productsPromise = page.waitForResponse('**/api/products');
  const ordersPromise = page.waitForResponse('**/api/orders');
  await page.goto('/dashboard');
  // Wait for all in parallel
  const [users, products, orders] = await Promise.all([usersPromise, productsPromise, ordersPromise]);
  expect(users.status()).toBe(200);
  expect(products.status()).toBe(200);
  expect(orders.status()).toBe(200);
 });
 // ✅ GOOD: Wait for spinner to disappear
 test('wait for loading indicator', async ({ page }) => {
  await page.goto('/dashboard');
  // Wait for spinner to disappear (signals data loaded)
  await expect(page.getByTestId('loading-spinner')).not.toBeVisible();
  await expect(page.getByText('Dashboard')).toBeVisible();
 });
 // ✅ GOOD: Wait for custom event (advanced)
 test('wait for custom ready event', async ({ page }) => {
  let appReady = false;
  page.on('console', (msg) => {
    if (msg.text() === 'App ready') appReady = true;
  });
  await page.goto('/dashboard');
  // Poll until custom condition met
  await page.waitForFunction(() => appReady, { timeout: 10000 });
  await expect(page.getByText('Dashboard')).toBeVisible();
 });
 // ❌ BAD: Hard wait (arbitrary timeout)
 test('flaky hard wait example', async ({ page }) => {
  await page.goto('/dashboard');
  await page.waitForTimeout(3000); // WHY 3 seconds? What if slower? What if faster?
  await expect(page.getByText('Dashboard')).toBeVisible(); // May fail if >3s
 });
 // Cypress equivalents
 describe('Deterministic Waiting', () => {
  it('should wait for response', () => {
    cy.intercept('GET', '**/api/users').as('getUsers');
    cy.visit('/dashboard');
    cy.wait('@getUsers').its('response.statusCode').should('eq', 200);
    cy.contains('Dashboard').should('be.visible');
  });
  it('should wait for spinner to disappear', () => {
    cy.visit('/dashboard');
    cy.get('[data-testid="loading-spinner"]').should('not.exist');
    cy.contains('Dashboard').should('be.visible');
  });
  // ❌ BAD: Hard wait
  it('flaky hard wait', () => {
    cy.visit('/dashboard');
    cy.wait(3000); // NEVER DO THIS
    cy.contains('Dashboard').should('be.visible');
  });
 });
 ```
 **Key Points**:
 - `waitForResponse()` with URL pattern or predicate = deterministic
 - `waitForLoadState('networkidle')` = wait for all network activity to finish
 - Wait for element state changes (spinner disappears, button enabled)
 - **NEVER** use `waitForTimeout()` or `cy.wait(ms)` - always non-deterministic
 ### Example 5: Anti-Pattern - Navigate Then Mock
 **Problem**:
 ```typescript
 // ❌ BAD: Race condition - mock registered AFTER navigation starts
 test('flaky test - navigate then mock', async ({ page }) => {
  // Navigation starts immediately
  await page.goto('/dashboard'); // Request to /api/users fires NOW
  // Mock registered too late - request already sent
  await page.route('**/api/users', (route) =>
    route.fulfill({
      status: 200,
      body: JSON.stringify([{ id: 1, name: 'Test User' }]),
    }),
  );
  // Test randomly passes/fails depending on timing
  await expect(page.getByText('Test User')).toBeVisible(); // Flaky!
 });
 // ❌ BAD: No wait for response
 test('flaky test - no explicit wait', async ({ page }) => {
  await page.route('**/api/users', (route) => route.fulfill({ status: 200, body: JSON.stringify([]) }));
  await page.goto('/dashboard');
  // Assertion runs immediately - may fail if response slow
  await expect(page.getByText('No users found')).toBeVisible(); // Flaky!
 });
 // ❌ BAD: Generic timeout
 test('flaky test - hard wait', async ({ page }) => {
  await page.goto('/dashboard');
  await page.waitForTimeout(2000); // Arbitrary wait - brittle
  await expect(page.getByText('Dashboard')).toBeVisible();
 });
 ```
 **Why It Fails**:
 - **Mock after navigate**: Request fires during navigation, mock isn't active yet (race condition)
 - **No explicit wait**: Assertion runs before response arrives (timing-dependent)
 - **Hard waits**: Slow tests, brittle (fails if < timeout, wastes time if > timeout)
 - **Non-deterministic**: Passes locally, fails in CI (different speeds)
 **Better Approach**: Always intercept → trigger → await
 ```typescript
 // ✅ GOOD: Intercept BEFORE navigate
 test('deterministic test', async ({ page }) => {
  // Step 1: Register mock FIRST
  await page.route('**/api/users', (route) =>
    route.fulfill({
      status: 200,
      contentType: 'application/json',
      body: JSON.stringify([{ id: 1, name: 'Test User' }]),
    }),
  );
  // Step 2: Store response promise BEFORE trigger
  const responsePromise = page.waitForResponse('**/api/users');
  // Step 3: THEN trigger
  await page.goto('/dashboard');
  // Step 4: THEN await response
  await responsePromise;
  // Step 5: THEN assert (data is guaranteed loaded)
  await expect(page.getByText('Test User')).toBeVisible();
 });
 ```
 **Key Points**:
 - Order matters: Mock → Promise → Trigger → Await → Assert
 - No race conditions: Mock is active before request fires
 - Explicit wait: Response promise ensures data loaded
 - Deterministic: Always passes if app works correctly
 ## Integration Points
 - **Used in workflows**: `*atdd` (test generation), `*automate` (test expansion), `*framework` (network setup)
 - **Related fragments**:
  - `fixture-architecture.md` - Network fixture patterns
  - `data-factories.md` - API-first setup with network
  - `test-quality.md` - Deterministic test principles
 ## Debugging Network Issues
 When network tests fail, check:
 1. **Timing**: Is interception registered **before** action?
 2. **URL pattern**: Does pattern match actual request URL?
 3. **Response format**: Is mocked response valid JSON/format?
 4. **Status code**: Is app checking for 200 vs 201 vs 204?
 5. **HAR file**: Capture real traffic to understand actual API contract
 ```typescript
 // Debug network issues with logging
 test('debug network', async ({ page }) => {
  // Log all requests
  page.on('request', (req) => console.log('→', req.method(), req.url()));
  // Log all responses
  page.on('response', (resp) => console.log('←', resp.status(), resp.url()));
  await page.goto('/dashboard');
 });
 ```
 _Source: Murat Testing Philosophy (lines 94-137), Playwright network patterns, Cypress intercept best practices._
--- a/_bmad/bmm/testarch/knowledge/network-recorder.md
+++ b/_bmad/bmm/testarch/knowledge/network-recorder.md
@@ -0,0 +1,265 @@
 # Network Recorder Utility
 ## Principle
 Record network traffic to HAR files during test execution, then play back from disk for offline testing. Enables frontend tests to run in complete isolation from backend services with intelligent stateful CRUD detection for realistic API behavior.
 ## Rationale
 Traditional E2E tests require live backend services:
 - Slow (real network latency)
 - Flaky (backend instability affects tests)
 - Expensive (full stack running for UI tests)
 - Coupled (UI tests break when API changes)
 HAR-based recording/playback provides:
 - **True offline testing**: UI tests run without backend
 - **Deterministic behavior**: Same responses every time
 - **Fast execution**: No network latency
 - **Stateful mocking**: CRUD operations work naturally (not just read-only)
 - **Environment flexibility**: Map URLs for any environment
 ## Pattern Examples
 ### Example 1: Basic Record and Playback
 **Context**: The fundamental pattern - record traffic once, play back for all subsequent runs.
 **Implementation**:
 ```typescript
 import { test } from '@seontechnologies/playwright-utils/network-recorder/fixtures';
 // Set mode in test file (recommended)
 process.env.PW_NET_MODE = 'playback'; // or 'record'
 test('CRUD operations work offline', async ({ page, context, networkRecorder }) => {
  // Setup recorder (records or plays back based on PW_NET_MODE)
  await networkRecorder.setup(context);
  await page.goto('/');
  // First time (record mode): Records all network traffic to HAR
  // Subsequent runs (playback mode): Plays back from HAR (no backend!)
  await page.fill('#movie-name', 'Inception');
  await page.click('#add-movie');
  // Intelligent CRUD detection makes this work offline!
  await expect(page.getByText('Inception')).toBeVisible();
 });
 ```
 **Key Points**:
 - `PW_NET_MODE=record` captures traffic to HAR files
 - `PW_NET_MODE=playback` replays from HAR files
 - Set mode in test file or via environment variable
 - HAR files auto-organized by test name
 - Stateful mocking detects CRUD operations
 ### Example 2: Complete CRUD Flow with HAR
 **Context**: Full create-read-update-delete flow that works completely offline.
 **Implementation**:
 ```typescript
 process.env.PW_NET_MODE = 'playback';
 test.describe('Movie CRUD - offline with network recorder', () => {
  test.beforeEach(async ({ page, networkRecorder, context }) => {
    await networkRecorder.setup(context);
    await page.goto('/');
  });
  test('should add, edit, delete movie browser-only', async ({ page, interceptNetworkCall }) => {
    // Create
    await page.fill('#movie-name', 'Inception');
    await page.fill('#year', '2010');
    await page.click('#add-movie');
    // Verify create (reads from stateful HAR)
    await expect(page.getByText('Inception')).toBeVisible();
    // Update
    await page.getByText('Inception').click();
    await page.fill('#movie-name', "Inception Director's Cut");
    const updateCall = interceptNetworkCall({
      method: 'PUT',
      url: '/movies/*',
    });
    await page.click('#save');
    await updateCall; // Wait for update
    // Verify update (HAR reflects state change!)
    await page.click('#back');
    await expect(page.getByText("Inception Director's Cut")).toBeVisible();
    // Delete
    await page.click(`[data-testid="delete-Inception Director's Cut"]`);
    // Verify delete (HAR reflects removal!)
    await expect(page.getByText("Inception Director's Cut")).not.toBeVisible();
  });
 });
 ```
 **Key Points**:
 - Full CRUD operations work offline
 - Stateful HAR mocking tracks creates/updates/deletes
 - Combine with `interceptNetworkCall` for deterministic waits
 - First run records, subsequent runs replay
 ### Example 3: Environment Switching
 **Context**: Record in dev environment, play back in CI with different base URLs.
 **Implementation**:
 ```typescript
 // playwright.config.ts - Map URLs for different environments
 export default defineConfig({
  use: {
    baseURL: process.env.CI ? 'https://app.ci.example.com' : 'http://localhost:3000',
  },
 });
 // Test works in both environments
 test('cross-environment playback', async ({ page, context, networkRecorder }) => {
  await networkRecorder.setup(context);
  // In dev: hits http://localhost:3000/api/movies
  // In CI: HAR replays with https://app.ci.example.com/api/movies
  await page.goto('/movies');
  // Network recorder auto-maps URLs
  await expect(page.getByTestId('movie-list')).toBeVisible();
 });
 ```
 **Key Points**:
 - HAR files record absolute URLs
 - Playback maps to current baseURL
 - Same HAR works across environments
 - No manual URL rewriting needed
 ### Example 4: Automatic vs Manual Mode Control
 **Context**: Choose between environment-based switching or in-test mode control.
 **Implementation**:
 ```typescript
 // Option 1: Environment variable (recommended for CI)
 PW_NET_MODE=record npm run test:pw   # Record traffic
 PW_NET_MODE=playback npm run test:pw # Playback traffic
 // Option 2: In-test control (recommended for development)
 process.env.PW_NET_MODE = 'record'  // Set at top of test file
 test('my test', async ({ page, context, networkRecorder }) => {
  await networkRecorder.setup(context)
  // ...
 })
 // Option 3: Auto-fallback (record if HAR missing, else playback)
 // This is the default behavior when PW_NET_MODE not set
 test('auto mode', async ({ page, context, networkRecorder }) => {
  await networkRecorder.setup(context)
  // First run: auto-records
  // Subsequent runs: auto-plays back
 })
 ```
 **Key Points**:
 - Three mode options: record, playback, auto
 - `PW_NET_MODE` environment variable
 - In-test `process.env.PW_NET_MODE` assignment
 - Auto-fallback when no mode specified
 ## Why Use This Instead of Native Playwright?
 | Native Playwright (`routeFromHAR`) | network-recorder Utility       |
 | ---------------------------------- | ------------------------------ |
 | ~80 lines setup boilerplate        | ~5 lines total                 |
 | Manual HAR file management         | Automatic file organization    |
 | Complex setup/teardown             | Automatic cleanup via fixtures |
 | **Read-only tests**                | **Full CRUD support**          |
 | **Stateless**                      | **Stateful mocking**           |
 | Manual URL mapping                 | Automatic environment mapping  |
 **The game-changer: Stateful CRUD detection**
 Native Playwright HAR playback is stateless - a POST create followed by GET list won't show the created item. This utility intelligently tracks CRUD operations in memory to reflect state changes, making offline tests behave like real APIs.
 ## Integration with Other Utilities
 **With interceptNetworkCall** (deterministic waits):
 ```typescript
 test('use both utilities', async ({ page, context, networkRecorder, interceptNetworkCall }) => {
  await networkRecorder.setup(context);
  const createCall = interceptNetworkCall({
    method: 'POST',
    url: '/api/movies',
  });
  await page.click('#add-movie');
  await createCall; // Wait for create (works with HAR!)
  // Network recorder provides playback, intercept provides determinism
 });
 ```
 ## Related Fragments
 - `overview.md` - Installation and fixture patterns
 - `intercept-network-call.md` - Combine for deterministic offline tests
 - `auth-session.md` - Record authenticated traffic
 - `network-first.md` - Core pattern for intercept-before-navigate
 ## Anti-Patterns
 **❌ Mixing record and playback in same test:**
 ```typescript
 process.env.PW_NET_MODE = 'record';
 // ... some test code ...
 process.env.PW_NET_MODE = 'playback'; // Don't switch mid-test
 ```
 **✅ One mode per test:**
 ```typescript
 process.env.PW_NET_MODE = 'playback'; // Set once at top
 test('my test', async ({ page, context, networkRecorder }) => {
  await networkRecorder.setup(context);
  // Entire test uses playback mode
 });
 ```
 **❌ Forgetting to call setup:**
 ```typescript
 test('broken', async ({ page, networkRecorder }) => {
  await page.goto('/'); // HAR not active!
 });
 ```
 **✅ Always call setup before navigation:**
 ```typescript
 test('correct', async ({ page, context, networkRecorder }) => {
  await networkRecorder.setup(context); // Must setup first
  await page.goto('/'); // Now HAR is active
 });
 ```
--- a/_bmad/bmm/testarch/knowledge/nfr-criteria.md
+++ b/_bmad/bmm/testarch/knowledge/nfr-criteria.md
@@ -0,0 +1,670 @@
 # Non-Functional Requirements (NFR) Criteria
 ## Principle
 Non-functional requirements (security, performance, reliability, maintainability) are **validated through automated tests**, not checklists. NFR assessment uses objective pass/fail criteria tied to measurable thresholds. Ambiguous requirements default to CONCERNS until clarified.
 ## Rationale
 **The Problem**: Teams ship features that "work" functionally but fail under load, expose security vulnerabilities, or lack error recovery. NFRs are treated as optional "nice-to-haves" instead of release blockers.
 **The Solution**: Define explicit NFR criteria with automated validation. Security tests verify auth/authz and secret handling. Performance tests enforce SLO/SLA thresholds with profiling evidence. Reliability tests validate error handling, retries, and health checks. Maintainability is measured by test coverage, code duplication, and observability.
 **Why This Matters**:
 - Prevents production incidents (security breaches, performance degradation, cascading failures)
 - Provides objective release criteria (no subjective "feels fast enough")
 - Automates compliance validation (audit trail for regulated environments)
 - Forces clarity on ambiguous requirements (default to CONCERNS)
 ## Pattern Examples
 ### Example 1: Security NFR Validation (Auth, Secrets, OWASP)
 **Context**: Automated security tests enforcing authentication, authorization, and secret handling
 **Implementation**:
 ```typescript
 // tests/nfr/security.spec.ts
 import { test, expect } from '@playwright/test';
 test.describe('Security NFR: Authentication & Authorization', () => {
  test('unauthenticated users cannot access protected routes', async ({ page }) => {
    // Attempt to access dashboard without auth
    await page.goto('/dashboard');
    // Should redirect to login (not expose data)
    await expect(page).toHaveURL(/\/login/);
    await expect(page.getByText('Please sign in')).toBeVisible();
    // Verify no sensitive data leaked in response
    const pageContent = await page.content();
    expect(pageContent).not.toContain('user_id');
    expect(pageContent).not.toContain('api_key');
  });
  test('JWT tokens expire after 15 minutes', async ({ page, request }) => {
    // Login and capture token
    await page.goto('/login');
    await page.getByLabel('Email').fill('test@example.com');
    await page.getByLabel('Password').fill('ValidPass123!');
    await page.getByRole('button', { name: 'Sign In' }).click();
    const token = await page.evaluate(() => localStorage.getItem('auth_token'));
    expect(token).toBeTruthy();
    // Wait 16 minutes (use mock clock in real tests)
    await page.clock.fastForward('00:16:00');
    // Token should be expired, API call should fail
    const response = await request.get('/api/user/profile', {
      headers: { Authorization: `Bearer ${token}` },
    });
    expect(response.status()).toBe(401);
    const body = await response.json();
    expect(body.error).toContain('expired');
  });
  test('passwords are never logged or exposed in errors', async ({ page }) => {
    // Trigger login error
    await page.goto('/login');
    await page.getByLabel('Email').fill('test@example.com');
    await page.getByLabel('Password').fill('WrongPassword123!');
    // Monitor console for password leaks
    const consoleLogs: string[] = [];
    page.on('console', (msg) => consoleLogs.push(msg.text()));
    await page.getByRole('button', { name: 'Sign In' }).click();
    // Error shown to user (generic message)
    await expect(page.getByText('Invalid credentials')).toBeVisible();
    // Verify password NEVER appears in console, DOM, or network
    const pageContent = await page.content();
    expect(pageContent).not.toContain('WrongPassword123!');
    expect(consoleLogs.join('\n')).not.toContain('WrongPassword123!');
  });
  test('RBAC: users can only access resources they own', async ({ page, request }) => {
    // Login as User A
    const userAToken = await login(request, 'userA@example.com', 'password');
    // Try to access User B's order
    const response = await request.get('/api/orders/user-b-order-id', {
      headers: { Authorization: `Bearer ${userAToken}` },
    });
    expect(response.status()).toBe(403); // Forbidden
    const body = await response.json();
    expect(body.error).toContain('insufficient permissions');
  });
  test('SQL injection attempts are blocked', async ({ page }) => {
    await page.goto('/search');
    // Attempt SQL injection
    await page.getByPlaceholder('Search products').fill("'; DROP TABLE users; --");
    await page.getByRole('button', { name: 'Search' }).click();
    // Should return empty results, NOT crash or expose error
    await expect(page.getByText('No results found')).toBeVisible();
    // Verify app still works (table not dropped)
    await page.goto('/dashboard');
    await expect(page.getByText('Welcome')).toBeVisible();
  });
  test('XSS attempts are sanitized', async ({ page }) => {
    await page.goto('/profile/edit');
    // Attempt XSS injection
    const xssPayload = '<script>alert("XSS")</script>';
    await page.getByLabel('Bio').fill(xssPayload);
    await page.getByRole('button', { name: 'Save' }).click();
    // Reload and verify XSS is escaped (not executed)
    await page.reload();
    const bio = await page.getByTestId('user-bio').textContent();
    // Text should be escaped, script should NOT execute
    expect(bio).toContain('&lt;script&gt;');
    expect(bio).not.toContain('<script>');
  });
 });
 // Helper
 async function login(request: any, email: string, password: string): Promise<string> {
  const response = await request.post('/api/auth/login', {
    data: { email, password },
  });
  const body = await response.json();
  return body.token;
 }
 ```
 **Key Points**:
 - Authentication: Unauthenticated access redirected (not exposed)
 - Authorization: RBAC enforced (403 for insufficient permissions)
 - Token expiry: JWT expires after 15 minutes (automated validation)
 - Secret handling: Passwords never logged or exposed in errors
 - OWASP Top 10: SQL injection and XSS blocked (input sanitization)
 **Security NFR Criteria**:
 - ✅ PASS: All 6 tests green (auth, authz, token expiry, secret handling, SQL injection, XSS)
 - ⚠️ CONCERNS: 1-2 tests failing with mitigation plan and owner assigned
 - ❌ FAIL: Critical exposure (unauthenticated access, password leak, SQL injection succeeds)
 ---
 ### Example 2: Performance NFR Validation (k6 Load Testing for SLO/SLA)
 **Context**: Use k6 for load testing, stress testing, and SLO/SLA enforcement (NOT Playwright)
 **Implementation**:
 ```javascript
 // tests/nfr/performance.k6.js
 import http from 'k6/http';
 import { check, sleep } from 'k6';
 import { Rate, Trend } from 'k6/metrics';
 // Custom metrics
 const errorRate = new Rate('errors');
 const apiDuration = new Trend('api_duration');
 // Performance thresholds (SLO/SLA)
 export const options = {
  stages: [
    { duration: '1m', target: 50 }, // Ramp up to 50 users
    { duration: '3m', target: 50 }, // Stay at 50 users for 3 minutes
    { duration: '1m', target: 100 }, // Spike to 100 users
    { duration: '3m', target: 100 }, // Stay at 100 users
    { duration: '1m', target: 0 }, // Ramp down
  ],
  thresholds: {
    // SLO: 95% of requests must complete in <500ms
    http_req_duration: ['p(95)<500'],
    // SLO: Error rate must be <1%
    errors: ['rate<0.01'],
    // SLA: API endpoints must respond in <1s (99th percentile)
    api_duration: ['p(99)<1000'],
  },
 };
 export default function () {
  // Test 1: Homepage load performance
  const homepageResponse = http.get(`${__ENV.BASE_URL}/`);
  check(homepageResponse, {
    'homepage status is 200': (r) => r.status === 200,
    'homepage loads in <2s': (r) => r.timings.duration < 2000,
  });
  errorRate.add(homepageResponse.status !== 200);
  // Test 2: API endpoint performance
  const apiResponse = http.get(`${__ENV.BASE_URL}/api/products?limit=10`, {
    headers: { Authorization: `Bearer ${__ENV.API_TOKEN}` },
  });
  check(apiResponse, {
    'API status is 200': (r) => r.status === 200,
    'API responds in <500ms': (r) => r.timings.duration < 500,
  });
  apiDuration.add(apiResponse.timings.duration);
  errorRate.add(apiResponse.status !== 200);
  // Test 3: Search endpoint under load
  const searchResponse = http.get(`${__ENV.BASE_URL}/api/search?q=laptop&limit=100`);
  check(searchResponse, {
    'search status is 200': (r) => r.status === 200,
    'search responds in <1s': (r) => r.timings.duration < 1000,
    'search returns results': (r) => JSON.parse(r.body).results.length > 0,
  });
  errorRate.add(searchResponse.status !== 200);
  sleep(1); // Realistic user think time
 }
 // Threshold validation (run after test)
 export function handleSummary(data) {
  const p95Duration = data.metrics.http_req_duration.values['p(95)'];
  const p99ApiDuration = data.metrics.api_duration.values['p(99)'];
  const errorRateValue = data.metrics.errors.values.rate;
  console.log(`P95 request duration: ${p95Duration.toFixed(2)}ms`);
  console.log(`P99 API duration: ${p99ApiDuration.toFixed(2)}ms`);
  console.log(`Error rate: ${(errorRateValue * 100).toFixed(2)}%`);
  return {
    'summary.json': JSON.stringify(data),
    stdout: `
 Performance NFR Results:
 - P95 request duration: ${p95Duration < 500 ? '✅ PASS' : '❌ FAIL'} (${p95Duration.toFixed(2)}ms / 500ms threshold)
 - P99 API duration: ${p99ApiDuration < 1000 ? '✅ PASS' : '❌ FAIL'} (${p99ApiDuration.toFixed(2)}ms / 1000ms threshold)
 - Error rate: ${errorRateValue < 0.01 ? '✅ PASS' : '❌ FAIL'} (${(errorRateValue * 100).toFixed(2)}% / 1% threshold)
    `,
  };
 }
 ```
 **Run k6 tests:**
 ```bash
 # Local smoke test (10 VUs, 30s)
 k6 run --vus 10 --duration 30s tests/nfr/performance.k6.js
 # Full load test (stages defined in script)
 k6 run tests/nfr/performance.k6.js
 # CI integration with thresholds
 k6 run --out json=performance-results.json tests/nfr/performance.k6.js
 ```
 **Key Points**:
 - **k6 is the right tool** for load testing (NOT Playwright)
 - SLO/SLA thresholds enforced automatically (`p(95)<500`, `rate<0.01`)
 - Realistic load simulation (ramp up, sustained load, spike testing)
 - Comprehensive metrics (p50, p95, p99, error rate, throughput)
 - CI-friendly (JSON output, exit codes based on thresholds)
 **Performance NFR Criteria**:
 - ✅ PASS: All SLO/SLA targets met with k6 profiling evidence (p95 < 500ms, error rate < 1%)
 - ⚠️ CONCERNS: Trending toward limits (e.g., p95 = 480ms approaching 500ms) or missing baselines
 - ❌ FAIL: SLO/SLA breached (e.g., p95 > 500ms) or error rate > 1%
 **Performance Testing Levels (from Test Architect course):**
 - **Load testing**: System behavior under expected load
 - **Stress testing**: System behavior under extreme load (breaking point)
 - **Spike testing**: Sudden load increases (traffic spikes)
 - **Endurance/Soak testing**: System behavior under sustained load (memory leaks, resource exhaustion)
 - **Benchmarking**: Baseline measurements for comparison
 **Note**: Playwright can validate **perceived performance** (Core Web Vitals via Lighthouse), but k6 validates **system performance** (throughput, latency, resource limits under load)
 ---
 ### Example 3: Reliability NFR Validation (Playwright for UI Resilience)
 **Context**: Automated reliability tests validating graceful degradation and recovery paths
 **Implementation**:
 ```typescript
 // tests/nfr/reliability.spec.ts
 import { test, expect } from '@playwright/test';
 test.describe('Reliability NFR: Error Handling & Recovery', () => {
  test('app remains functional when API returns 500 error', async ({ page, context }) => {
    // Mock API failure
    await context.route('**/api/products', (route) => {
      route.fulfill({ status: 500, body: JSON.stringify({ error: 'Internal Server Error' }) });
    });
    await page.goto('/products');
    // User sees error message (not blank page or crash)
    await expect(page.getByText('Unable to load products. Please try again.')).toBeVisible();
    await expect(page.getByRole('button', { name: 'Retry' })).toBeVisible();
    // App navigation still works (graceful degradation)
    await page.getByRole('link', { name: 'Home' }).click();
    await expect(page).toHaveURL('/');
  });
  test('API client retries on transient failures (3 attempts)', async ({ page, context }) => {
    let attemptCount = 0;
    await context.route('**/api/checkout', (route) => {
      attemptCount++;
      // Fail first 2 attempts, succeed on 3rd
      if (attemptCount < 3) {
        route.fulfill({ status: 503, body: JSON.stringify({ error: 'Service Unavailable' }) });
      } else {
        route.fulfill({ status: 200, body: JSON.stringify({ orderId: '12345' }) });
      }
    });
    await page.goto('/checkout');
    await page.getByRole('button', { name: 'Place Order' }).click();
    // Should succeed after 3 attempts
    await expect(page.getByText('Order placed successfully')).toBeVisible();
    expect(attemptCount).toBe(3);
  });
  test('app handles network disconnection gracefully', async ({ page, context }) => {
    await page.goto('/dashboard');
    // Simulate offline mode
    await context.setOffline(true);
    // Trigger action requiring network
    await page.getByRole('button', { name: 'Refresh Data' }).click();
    // User sees offline indicator (not crash)
    await expect(page.getByText('You are offline. Changes will sync when reconnected.')).toBeVisible();
    // Reconnect
    await context.setOffline(false);
    await page.getByRole('button', { name: 'Refresh Data' }).click();
    // Data loads successfully
    await expect(page.getByText('Data updated')).toBeVisible();
  });
  test('health check endpoint returns service status', async ({ request }) => {
    const response = await request.get('/api/health');
    expect(response.status()).toBe(200);
    const health = await response.json();
    expect(health).toHaveProperty('status', 'healthy');
    expect(health).toHaveProperty('timestamp');
    expect(health).toHaveProperty('services');
    // Verify critical services are monitored
    expect(health.services).toHaveProperty('database');
    expect(health.services).toHaveProperty('cache');
    expect(health.services).toHaveProperty('queue');
    // All services should be UP
    expect(health.services.database.status).toBe('UP');
    expect(health.services.cache.status).toBe('UP');
    expect(health.services.queue.status).toBe('UP');
  });
  test('circuit breaker opens after 5 consecutive failures', async ({ page, context }) => {
    let failureCount = 0;
    await context.route('**/api/recommendations', (route) => {
      failureCount++;
      route.fulfill({ status: 500, body: JSON.stringify({ error: 'Service Error' }) });
    });
    await page.goto('/product/123');
    // Wait for circuit breaker to open (fallback UI appears)
    await expect(page.getByText('Recommendations temporarily unavailable')).toBeVisible({ timeout: 10000 });
    // Verify circuit breaker stopped making requests after threshold (should be ≤5)
    expect(failureCount).toBeLessThanOrEqual(5);
  });
  test('rate limiting gracefully handles 429 responses', async ({ page, context }) => {
    let requestCount = 0;
    await context.route('**/api/search', (route) => {
      requestCount++;
      if (requestCount > 10) {
        // Rate limit exceeded
        route.fulfill({
          status: 429,
          headers: { 'Retry-After': '5' },
          body: JSON.stringify({ error: 'Rate limit exceeded' }),
        });
      } else {
        route.fulfill({ status: 200, body: JSON.stringify({ results: [] }) });
      }
    });
    await page.goto('/search');
    // Make 15 search requests rapidly
    for (let i = 0; i < 15; i++) {
      await page.getByPlaceholder('Search').fill(`query-${i}`);
      await page.getByRole('button', { name: 'Search' }).click();
    }
    // User sees rate limit message (not crash)
    await expect(page.getByText('Too many requests. Please wait a moment.')).toBeVisible();
  });
 });
 ```
 **Key Points**:
 - Error handling: Graceful degradation (500 error → user-friendly message + retry button)
 - Retries: 3 attempts on transient failures (503 → eventual success)
 - Offline handling: Network disconnection detected (sync when reconnected)
 - Health checks: `/api/health` monitors database, cache, queue
 - Circuit breaker: Opens after 5 failures (fallback UI, stop retries)
 - Rate limiting: 429 response handled (Retry-After header respected)
 **Reliability NFR Criteria**:
 - ✅ PASS: Error handling, retries, health checks verified (all 6 tests green)
 - ⚠️ CONCERNS: Partial coverage (e.g., missing circuit breaker) or no telemetry
 - ❌ FAIL: No recovery path (500 error crashes app) or unresolved crash scenarios
 ---
 ### Example 4: Maintainability NFR Validation (CI Tools, Not Playwright)
 **Context**: Use proper CI tools for code quality validation (coverage, duplication, vulnerabilities)
 **Implementation**:
 ```yaml
 # .github/workflows/nfr-maintainability.yml
 name: NFR - Maintainability
 on: [push, pull_request]
 jobs:
  test-coverage:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - uses: actions/setup-node@v4
      - name: Install dependencies
        run: npm ci
      - name: Run tests with coverage
        run: npm run test:coverage
      - name: Check coverage threshold (80% minimum)
        run: |
          COVERAGE=$(jq '.total.lines.pct' coverage/coverage-summary.json)
          echo "Coverage: $COVERAGE%"
          if (( $(echo "$COVERAGE < 80" | bc -l) )); then
            echo "❌ FAIL: Coverage $COVERAGE% below 80% threshold"
            exit 1
          else
            echo "✅ PASS: Coverage $COVERAGE% meets 80% threshold"
          fi
  code-duplication:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - uses: actions/setup-node@v4
      - name: Check code duplication (<5% allowed)
        run: |
          npx jscpd src/ --threshold 5 --format json --output duplication.json
          DUPLICATION=$(jq '.statistics.total.percentage' duplication.json)
          echo "Duplication: $DUPLICATION%"
          if (( $(echo "$DUPLICATION >= 5" | bc -l) )); then
            echo "❌ FAIL: Duplication $DUPLICATION% exceeds 5% threshold"
            exit 1
          else
            echo "✅ PASS: Duplication $DUPLICATION% below 5% threshold"
          fi
  vulnerability-scan:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - uses: actions/setup-node@v4
      - name: Install dependencies
        run: npm ci
      - name: Run npm audit (no critical/high vulnerabilities)
        run: |
          npm audit --json > audit.json || true
          CRITICAL=$(jq '.metadata.vulnerabilities.critical' audit.json)
          HIGH=$(jq '.metadata.vulnerabilities.high' audit.json)
          echo "Critical: $CRITICAL, High: $HIGH"
          if [ "$CRITICAL" -gt 0 ] || [ "$HIGH" -gt 0 ]; then
            echo "❌ FAIL: Found $CRITICAL critical and $HIGH high vulnerabilities"
            npm audit
            exit 1
          else
            echo "✅ PASS: No critical/high vulnerabilities"
          fi
 ```
 **Playwright Tests for Observability (E2E Validation):**
 ```typescript
 // tests/nfr/observability.spec.ts
 import { test, expect } from '@playwright/test';
 test.describe('Maintainability NFR: Observability Validation', () => {
  test('critical errors are reported to monitoring service', async ({ page, context }) => {
    const sentryEvents: any[] = [];
    // Mock Sentry SDK to verify error tracking
    await context.addInitScript(() => {
      (window as any).Sentry = {
        captureException: (error: Error) => {
          console.log('SENTRY_CAPTURE:', JSON.stringify({ message: error.message, stack: error.stack }));
        },
      };
    });
    page.on('console', (msg) => {
      if (msg.text().includes('SENTRY_CAPTURE:')) {
        sentryEvents.push(JSON.parse(msg.text().replace('SENTRY_CAPTURE:', '')));
      }
    });
    // Trigger error by mocking API failure
    await context.route('**/api/products', (route) => {
      route.fulfill({ status: 500, body: JSON.stringify({ error: 'Database Error' }) });
    });
    await page.goto('/products');
    // Wait for error UI and Sentry capture
    await expect(page.getByText('Unable to load products')).toBeVisible();
    // Verify error was captured by monitoring
    expect(sentryEvents.length).toBeGreaterThan(0);
    expect(sentryEvents[0]).toHaveProperty('message');
    expect(sentryEvents[0]).toHaveProperty('stack');
  });
  test('API response times are tracked in telemetry', async ({ request }) => {
    const response = await request.get('/api/products?limit=10');
    expect(response.ok()).toBeTruthy();
    // Verify Server-Timing header for APM (Application Performance Monitoring)
    const serverTiming = response.headers()['server-timing'];
    expect(serverTiming).toBeTruthy();
    expect(serverTiming).toContain('db'); // Database query time
    expect(serverTiming).toContain('total'); // Total processing time
  });
  test('structured logging present in application', async ({ request }) => {
    // Make API call that generates logs
    const response = await request.post('/api/orders', {
      data: { productId: '123', quantity: 2 },
    });
    expect(response.ok()).toBeTruthy();
    // Note: In real scenarios, validate logs in monitoring system (Datadog, CloudWatch)
    // This test validates the logging contract exists (Server-Timing, trace IDs in headers)
    const traceId = response.headers()['x-trace-id'];
    expect(traceId).toBeTruthy(); // Confirms structured logging with correlation IDs
  });
 });
 ```
 **Key Points**:
 - **Coverage/duplication**: CI jobs (GitHub Actions), not Playwright tests
 - **Vulnerability scanning**: npm audit in CI, not Playwright tests
 - **Observability**: Playwright validates error tracking (Sentry) and telemetry headers
 - **Structured logging**: Validate logging contract (trace IDs, Server-Timing headers)
 - **Separation of concerns**: Build-time checks (coverage, audit) vs runtime checks (error tracking, telemetry)
 **Maintainability NFR Criteria**:
 - ✅ PASS: Clean code (80%+ coverage from CI, <5% duplication from CI), observability validated in E2E, no critical vulnerabilities from npm audit
 - ⚠️ CONCERNS: Duplication >5%, coverage 60-79%, or unclear ownership
 - ❌ FAIL: Absent tests (<60%), tangled implementations (>10% duplication), or no observability
 ---
 ## NFR Assessment Checklist
 Before release gate:
 - [ ] **Security** (Playwright E2E + Security Tools):
  - [ ] Auth/authz tests green (unauthenticated redirect, RBAC enforced)
  - [ ] Secrets never logged or exposed in errors
  - [ ] OWASP Top 10 validated (SQL injection blocked, XSS sanitized)
  - [ ] Security audit completed (vulnerability scan, penetration test if applicable)
 - [ ] **Performance** (k6 Load Testing):
  - [ ] SLO/SLA targets met with k6 evidence (p95 <500ms, error rate <1%)
  - [ ] Load testing completed (expected load)
  - [ ] Stress testing completed (breaking point identified)
  - [ ] Spike testing completed (handles traffic spikes)
  - [ ] Endurance testing completed (no memory leaks under sustained load)
 - [ ] **Reliability** (Playwright E2E + API Tests):
  - [ ] Error handling graceful (500 → user-friendly message + retry)
  - [ ] Retries implemented (3 attempts on transient failures)
  - [ ] Health checks monitored (/api/health endpoint)
  - [ ] Circuit breaker tested (opens after failure threshold)
  - [ ] Offline handling validated (network disconnection graceful)
 - [ ] **Maintainability** (CI Tools):
  - [ ] Test coverage ≥80% (from CI coverage report)
  - [ ] Code duplication <5% (from jscpd CI job)
  - [ ] No critical/high vulnerabilities (from npm audit CI job)
  - [ ] Structured logging validated (Playwright validates telemetry headers)
  - [ ] Error tracking configured (Sentry/monitoring integration validated)
 - [ ] **Ambiguous requirements**: Default to CONCERNS (force team to clarify thresholds and evidence)
 - [ ] **NFR criteria documented**: Measurable thresholds defined (not subjective "fast enough")
 - [ ] **Automated validation**: NFR tests run in CI pipeline (not manual checklists)
 - [ ] **Tool selection**: Right tool for each NFR (k6 for performance, Playwright for security/reliability E2E, CI tools for maintainability)
 ## NFR Gate Decision Matrix
 | Category            | PASS Criteria                                | CONCERNS Criteria                            | FAIL Criteria                                  |
 | ------------------- | -------------------------------------------- | -------------------------------------------- | ---------------------------------------------- |
 | **Security**        | Auth/authz, secret handling, OWASP verified  | Minor gaps with clear owners                 | Critical exposure or missing controls          |
 | **Performance**     | Metrics meet SLO/SLA with profiling evidence | Trending toward limits or missing baselines  | SLO/SLA breached or resource leaks detected    |
 | **Reliability**     | Error handling, retries, health checks OK    | Partial coverage or missing telemetry        | No recovery path or unresolved crash scenarios |
 | **Maintainability** | Clean code, tests, docs shipped together     | Duplication, low coverage, unclear ownership | Absent tests, tangled code, no observability   |
 **Default**: If targets or evidence are undefined → **CONCERNS** (force team to clarify before sign-off)
 ## Integration Points
 - **Used in workflows**: `*nfr-assess` (automated NFR validation), `*trace` (gate decision Phase 2), `*test-design` (NFR risk assessment via Utility Tree)
 - **Related fragments**: `risk-governance.md` (NFR risk scoring), `probability-impact.md` (NFR impact assessment), `test-quality.md` (maintainability standards), `test-levels-framework.md` (system-level testing for NFRs)
 - **Tools by NFR Category**:
  - **Security**: Playwright (E2E auth/authz), OWASP ZAP, Burp Suite, npm audit, Snyk
  - **Performance**: k6 (load/stress/spike/endurance), Lighthouse (Core Web Vitals), Artillery
  - **Reliability**: Playwright (E2E error handling), API tests (retries, health checks), Chaos Engineering tools
  - **Maintainability**: GitHub Actions (coverage, duplication, audit), jscpd, Playwright (observability validation)
 _Source: Test Architect course (NFR testing approaches, Utility Tree, Quality Scenarios), ISO/IEC 25010 Software Quality Characteristics, OWASP Top 10, k6 documentation, SRE practices_
--- a/_bmad/bmm/testarch/knowledge/overview.md
+++ b/_bmad/bmm/testarch/knowledge/overview.md
@@ -0,0 +1,283 @@
 # Playwright Utils Overview
 ## Principle
 Use production-ready, fixture-based utilities from `@seontechnologies/playwright-utils` for common Playwright testing patterns. Build test helpers as pure functions first, then wrap in framework-specific fixtures for composability and reuse.
 ## Rationale
 Writing Playwright utilities from scratch for every project leads to:
 - Duplicated code across test suites
 - Inconsistent patterns and quality
 - Maintenance burden when Playwright APIs change
 - Missing advanced features (schema validation, HAR recording, auth persistence)
 `@seontechnologies/playwright-utils` provides:
 - **Production-tested utilities**: Used at SEON Technologies in production
 - **Functional-first design**: Core logic as pure functions, fixtures for convenience
 - **Composable fixtures**: Use `mergeTests` to combine utilities
 - **TypeScript support**: Full type safety with generic types
 - **Comprehensive coverage**: API requests, auth, network, logging, file handling, burn-in
 ## Installation
 ```bash
 npm install -D @seontechnologies/playwright-utils
 ```
 **Peer Dependencies:**
 - `@playwright/test` >= 1.54.1 (required)
 - `ajv` >= 8.0.0 (optional - for JSON Schema validation)
 - `zod` >= 3.0.0 (optional - for Zod schema validation)
 ## Available Utilities
 ### Core Testing Utilities
 | Utility                    | Purpose                                    | Test Context  |
 | -------------------------- | ------------------------------------------ | ------------- |
 | **api-request**            | Typed HTTP client with schema validation   | API tests     |
 | **network-recorder**       | HAR record/playback for offline testing    | UI tests      |
 | **auth-session**           | Token persistence, multi-user auth         | Both UI & API |
 | **recurse**                | Cypress-style polling for async conditions | Both UI & API |
 | **intercept-network-call** | Network spy/stub with auto JSON parsing    | UI tests      |
 | **log**                    | Playwright report-integrated logging       | Both UI & API |
 | **file-utils**             | CSV/XLSX/PDF/ZIP reading & validation      | Both UI & API |
 | **burn-in**                | Smart test selection with git diff         | CI/CD         |
 | **network-error-monitor**  | Automatic HTTP 4xx/5xx detection           | UI tests      |
 ## Design Patterns
 ### Pattern 1: Functional Core, Fixture Shell
 **Context**: All utilities follow the same architectural pattern - pure function as core, fixture as wrapper.
 **Implementation**:
 ```typescript
 // Direct import (pass Playwright context explicitly)
 import { apiRequest } from '@seontechnologies/playwright-utils';
 test('direct usage', async ({ request }) => {
  const { status, body } = await apiRequest({
    request, // Must pass request context
    method: 'GET',
    path: '/api/users',
  });
 });
 // Fixture import (context injected automatically)
 import { test } from '@seontechnologies/playwright-utils/fixtures';
 test('fixture usage', async ({ apiRequest }) => {
  const { status, body } = await apiRequest({
    // No need to pass request context
    method: 'GET',
    path: '/api/users',
  });
 });
 ```
 **Key Points**:
 - Pure functions testable without Playwright running
 - Fixtures inject framework dependencies automatically
 - Choose direct import (more control) or fixture (convenience)
 ### Pattern 2: Subpath Imports for Tree-Shaking
 **Context**: Import only what you need to keep bundle sizes small.
 **Implementation**:
 ```typescript
 // Import specific utility
 import { apiRequest } from '@seontechnologies/playwright-utils/api-request';
 // Import specific fixture
 import { test } from '@seontechnologies/playwright-utils/api-request/fixtures';
 // Import everything (use sparingly)
 import { apiRequest, recurse, log } from '@seontechnologies/playwright-utils';
 ```
 **Key Points**:
 - Subpath imports enable tree-shaking
 - Keep bundle sizes minimal
 - Import from specific paths for production builds
 ### Pattern 3: Fixture Composition with mergeTests
 **Context**: Combine multiple playwright-utils fixtures with your own custom fixtures.
 **Implementation**:
 ```typescript
 // playwright/support/merged-fixtures.ts
 import { mergeTests } from '@playwright/test';
 import { test as apiRequestFixture } from '@seontechnologies/playwright-utils/api-request/fixtures';
 import { test as authFixture } from '@seontechnologies/playwright-utils/auth-session/fixtures';
 import { test as recurseFixture } from '@seontechnologies/playwright-utils/recurse/fixtures';
 import { test as logFixture } from '@seontechnologies/playwright-utils/log/fixtures';
 // Merge all fixtures into one test object
 export const test = mergeTests(apiRequestFixture, authFixture, recurseFixture, logFixture);
 export { expect } from '@playwright/test';
 ```
 ```typescript
 // In your tests
 import { test, expect } from '../support/merged-fixtures';
 test('all utilities available', async ({ apiRequest, authToken, recurse, log }) => {
  await log.step('Making authenticated API request');
  const { body } = await apiRequest({
    method: 'GET',
    path: '/api/protected',
    headers: { Authorization: `Bearer ${authToken}` },
  });
  await recurse(
    () => apiRequest({ method: 'GET', path: `/status/${body.id}` }),
    (res) => res.body.ready === true,
  );
 });
 ```
 **Key Points**:
 - `mergeTests` combines multiple fixtures without conflicts
 - Create one merged-fixtures.ts file per project
 - Import test object from your merged fixtures in all tests
 - All utilities available in single test signature
 ## Integration with Existing Tests
 ### Gradual Adoption Strategy
 **1. Start with logging** (zero breaking changes):
 ```typescript
 import { log } from '@seontechnologies/playwright-utils';
 test('existing test', async ({ page }) => {
  await log.step('Navigate to page'); // Just add logging
  await page.goto('/dashboard');
  // Rest of test unchanged
 });
 ```
 **2. Add API utilities** (for API tests):
 ```typescript
 import { test } from '@seontechnologies/playwright-utils/api-request/fixtures';
 test('API test', async ({ apiRequest }) => {
  const { status, body } = await apiRequest({
    method: 'GET',
    path: '/api/users',
  });
  expect(status).toBe(200);
 });
 ```
 **3. Expand to network utilities** (for UI tests):
 ```typescript
 import { test } from '@seontechnologies/playwright-utils/fixtures';
 test('UI with network control', async ({ page, interceptNetworkCall }) => {
  const usersCall = interceptNetworkCall({
    url: '**/api/users',
  });
  await page.goto('/dashboard');
  const { responseJson } = await usersCall;
  expect(responseJson).toHaveLength(10);
 });
 ```
 **4. Full integration** (merged fixtures):
 Create merged-fixtures.ts and use across all tests.
 ## Related Fragments
 - `api-request.md` - HTTP client with schema validation
 - `network-recorder.md` - HAR-based offline testing
 - `auth-session.md` - Token management
 - `intercept-network-call.md` - Network interception
 - `recurse.md` - Polling patterns
 - `log.md` - Logging utility
 - `file-utils.md` - File operations
 - `fixtures-composition.md` - Advanced mergeTests patterns
 ## Anti-Patterns
 **❌ Don't mix direct and fixture imports in same test:**
 ```typescript
 import { apiRequest } from '@seontechnologies/playwright-utils';
 import { test } from '@seontechnologies/playwright-utils/auth-session/fixtures';
 test('bad', async ({ request, authToken }) => {
  // Confusing - mixing direct (needs request) and fixture (has authToken)
  await apiRequest({ request, method: 'GET', path: '/api/users' });
 });
 ```
 **✅ Use consistent import style:**
 ```typescript
 import { test } from '../support/merged-fixtures';
 test('good', async ({ apiRequest, authToken }) => {
  // Clean - all from fixtures
  await apiRequest({ method: 'GET', path: '/api/users' });
 });
 ```
 **❌ Don't import everything when you need one utility:**
 ```typescript
 import * as utils from '@seontechnologies/playwright-utils'; // Large bundle
 ```
 **✅ Use subpath imports:**
 ```typescript
 import { apiRequest } from '@seontechnologies/playwright-utils/api-request'; // Small bundle
 ```
 ## Reference Implementation
 The official `@seontechnologies/playwright-utils` repository provides working examples of all patterns described in these fragments.
 **Repository:** <https://github.com/seontechnologies/playwright-utils>
 **Key resources:**
 - **Test examples:** `playwright/tests` - All utilities in action
 - **Framework setup:** `playwright.config.ts`, `playwright/support/merged-fixtures.ts`
 - **CI patterns:** `.github/workflows/` - GitHub Actions with sharding, parallelization
 **Quick start:**
 ```bash
 git clone https://github.com/seontechnologies/playwright-utils.git
 cd playwright-utils
 nvm use
 npm install
 npm run test:pw-ui  # Explore tests with Playwright UI
 npm run test:pw
 ```
 All patterns in TEA fragments are production-tested in this repository.
--- a/_bmad/bmm/testarch/knowledge/playwright-config.md
+++ b/_bmad/bmm/testarch/knowledge/playwright-config.md
@@ -0,0 +1,730 @@
 # Playwright Configuration Guardrails
 ## Principle
 Load environment configs via a central map (`envConfigMap`), standardize timeouts (action 15s, navigation 30s, expect 10s, test 60s), emit HTML + JUnit reporters, and store artifacts under `test-results/` for CI upload. Keep `.env.example`, `.nvmrc`, and browser dependencies versioned so local and CI runs stay aligned.
 ## Rationale
 Environment-specific configuration prevents hardcoded URLs, timeouts, and credentials from leaking into tests. A central config map with fail-fast validation catches missing environments early. Standardized timeouts reduce flakiness while remaining long enough for real-world network conditions. Consistent artifact storage (`test-results/`, `playwright-report/`) enables CI pipelines to upload failure evidence automatically. Versioned dependencies (`.nvmrc`, `package.json` browser versions) eliminate "works on my machine" issues between local and CI environments.
 ## Pattern Examples
 ### Example 1: Environment-Based Configuration
 **Context**: When testing against multiple environments (local, staging, production), use a central config map that loads environment-specific settings and fails fast if `TEST_ENV` is invalid.
 **Implementation**:
 ```typescript
 // playwright.config.ts - Central config loader
 import { config as dotenvConfig } from 'dotenv';
 import path from 'path';
 // Load .env from project root
 dotenvConfig({
  path: path.resolve(__dirname, '../../.env'),
 });
 // Central environment config map
 const envConfigMap = {
  local: require('./playwright/config/local.config').default,
  staging: require('./playwright/config/staging.config').default,
  production: require('./playwright/config/production.config').default,
 };
 const environment = process.env.TEST_ENV || 'local';
 // Fail fast if environment not supported
 if (!Object.keys(envConfigMap).includes(environment)) {
  console.error(`❌ No configuration found for environment: ${environment}`);
  console.error(`   Available environments: ${Object.keys(envConfigMap).join(', ')}`);
  process.exit(1);
 }
 console.log(`✅ Running tests against: ${environment.toUpperCase()}`);
 export default envConfigMap[environment as keyof typeof envConfigMap];
 ```
 ```typescript
 // playwright/config/base.config.ts - Shared base configuration
 import { defineConfig } from '@playwright/test';
 import path from 'path';
 export const baseConfig = defineConfig({
  testDir: path.resolve(__dirname, '../tests'),
  outputDir: path.resolve(__dirname, '../../test-results'),
  fullyParallel: true,
  forbidOnly: !!process.env.CI,
  retries: process.env.CI ? 2 : 0,
  workers: process.env.CI ? 1 : undefined,
  reporter: [
    ['html', { outputFolder: 'playwright-report', open: 'never' }],
    ['junit', { outputFile: 'test-results/results.xml' }],
    ['list'],
  ],
  use: {
    actionTimeout: 15000,
    navigationTimeout: 30000,
    trace: 'on-first-retry',
    screenshot: 'only-on-failure',
    video: 'retain-on-failure',
  },
  globalSetup: path.resolve(__dirname, '../support/global-setup.ts'),
  timeout: 60000,
  expect: { timeout: 10000 },
 });
 ```
 ```typescript
 // playwright/config/local.config.ts - Local environment
 import { defineConfig } from '@playwright/test';
 import { baseConfig } from './base.config';
 export default defineConfig({
  ...baseConfig,
  use: {
    ...baseConfig.use,
    baseURL: 'http://localhost:3000',
    video: 'off', // No video locally for speed
  },
  webServer: {
    command: 'npm run dev',
    url: 'http://localhost:3000',
    reuseExistingServer: !process.env.CI,
    timeout: 120000,
  },
 });
 ```
 ```typescript
 // playwright/config/staging.config.ts - Staging environment
 import { defineConfig } from '@playwright/test';
 import { baseConfig } from './base.config';
 export default defineConfig({
  ...baseConfig,
  use: {
    ...baseConfig.use,
    baseURL: 'https://staging.example.com',
    ignoreHTTPSErrors: true, // Allow self-signed certs in staging
  },
 });
 ```
 ```typescript
 // playwright/config/production.config.ts - Production environment
 import { defineConfig } from '@playwright/test';
 import { baseConfig } from './base.config';
 export default defineConfig({
  ...baseConfig,
  retries: 3, // More retries in production
  use: {
    ...baseConfig.use,
    baseURL: 'https://example.com',
    video: 'on', // Always record production failures
  },
 });
 ```
 ```bash
 # .env.example - Template for developers
 TEST_ENV=local
 API_KEY=your_api_key_here
 DATABASE_URL=postgresql://localhost:5432/test_db
 ```
 **Key Points**:
 - Central `envConfigMap` prevents environment misconfiguration
 - Fail-fast validation with clear error message (available envs listed)
 - Base config defines shared settings, environment configs override
 - `.env.example` provides template for required secrets
 - `TEST_ENV=local` as default for local development
 - Production config increases retries and enables video recording
 ### Example 2: Timeout Standards
 **Context**: When tests fail due to inconsistent timeout settings, standardize timeouts across all tests: action 15s, navigation 30s, expect 10s, test 60s. Expose overrides through fixtures rather than inline literals.
 **Implementation**:
 ```typescript
 // playwright/config/base.config.ts - Standardized timeouts
 import { defineConfig } from '@playwright/test';
 export default defineConfig({
  // Global test timeout: 60 seconds
  timeout: 60000,
  use: {
    // Action timeout: 15 seconds (click, fill, etc.)
    actionTimeout: 15000,
    // Navigation timeout: 30 seconds (page.goto, page.reload)
    navigationTimeout: 30000,
  },
  // Expect timeout: 10 seconds (all assertions)
  expect: {
    timeout: 10000,
  },
 });
 ```
 ```typescript
 // playwright/support/fixtures/timeout-fixture.ts - Timeout override fixture
 import { test as base } from '@playwright/test';
 type TimeoutOptions = {
  extendedTimeout: (timeoutMs: number) => Promise<void>;
 };
 export const test = base.extend<TimeoutOptions>({
  extendedTimeout: async ({}, use, testInfo) => {
    const originalTimeout = testInfo.timeout;
    await use(async (timeoutMs: number) => {
      testInfo.setTimeout(timeoutMs);
    });
    // Restore original timeout after test
    testInfo.setTimeout(originalTimeout);
  },
 });
 export { expect } from '@playwright/test';
 ```
 ```typescript
 // Usage in tests - Standard timeouts (implicit)
 import { test, expect } from '@playwright/test';
 test('user can log in', async ({ page }) => {
  await page.goto('/login'); // Uses 30s navigation timeout
  await page.fill('[data-testid="email"]', 'test@example.com'); // Uses 15s action timeout
  await page.click('[data-testid="login-button"]'); // Uses 15s action timeout
  await expect(page.getByText('Welcome')).toBeVisible(); // Uses 10s expect timeout
 });
 ```
 ```typescript
 // Usage in tests - Per-test timeout override
 import { test, expect } from '../support/fixtures/timeout-fixture';
 test('slow data processing operation', async ({ page, extendedTimeout }) => {
  // Override default 60s timeout for this slow test
  await extendedTimeout(180000); // 3 minutes
  await page.goto('/data-processing');
  await page.click('[data-testid="process-large-file"]');
  // Wait for long-running operation
  await expect(page.getByText('Processing complete')).toBeVisible({
    timeout: 120000, // 2 minutes for assertion
  });
 });
 ```
 ```typescript
 // Per-assertion timeout override (inline)
 test('API returns quickly', async ({ page }) => {
  await page.goto('/dashboard');
  // Override expect timeout for fast API (reduce flakiness detection)
  await expect(page.getByTestId('user-name')).toBeVisible({ timeout: 5000 }); // 5s instead of 10s
  // Override expect timeout for slow external API
  await expect(page.getByTestId('weather-widget')).toBeVisible({ timeout: 20000 }); // 20s instead of 10s
 });
 ```
 **Key Points**:
 - **Standardized timeouts**: action 15s, navigation 30s, expect 10s, test 60s (global defaults)
 - Fixture-based override (`extendedTimeout`) for slow tests (preferred over inline)
 - Per-assertion timeout override via `{ timeout: X }` option (use sparingly)
 - Avoid hard waits (`page.waitForTimeout(3000)`) - use event-based waits instead
 - CI environments may need longer timeouts (handle in environment-specific config)
 ### Example 3: Artifact Output Configuration
 **Context**: When debugging failures in CI, configure artifacts (screenshots, videos, traces, HTML reports) to be captured on failure and stored in consistent locations for upload.
 **Implementation**:
 ```typescript
 // playwright.config.ts - Artifact configuration
 import { defineConfig } from '@playwright/test';
 import path from 'path';
 export default defineConfig({
  // Output directory for test artifacts
  outputDir: path.resolve(__dirname, './test-results'),
  use: {
    // Screenshot on failure only (saves space)
    screenshot: 'only-on-failure',
    // Video recording on failure + retry
    video: 'retain-on-failure',
    // Trace recording on first retry (best debugging data)
    trace: 'on-first-retry',
  },
  reporter: [
    // HTML report (visual, interactive)
    [
      'html',
      {
        outputFolder: 'playwright-report',
        open: 'never', // Don't auto-open in CI
      },
    ],
    // JUnit XML (CI integration)
    [
      'junit',
      {
        outputFile: 'test-results/results.xml',
      },
    ],
    // List reporter (console output)
    ['list'],
  ],
 });
 ```
 ```typescript
 // playwright/support/fixtures/artifact-fixture.ts - Custom artifact capture
 import { test as base } from '@playwright/test';
 import fs from 'fs';
 import path from 'path';
 export const test = base.extend({
  // Auto-capture console logs on failure
  page: async ({ page }, use, testInfo) => {
    const logs: string[] = [];
    page.on('console', (msg) => {
      logs.push(`[${msg.type()}] ${msg.text()}`);
    });
    await use(page);
    // Save logs on failure
    if (testInfo.status !== testInfo.expectedStatus) {
      const logsPath = path.join(testInfo.outputDir, 'console-logs.txt');
      fs.writeFileSync(logsPath, logs.join('\n'));
      testInfo.attachments.push({
        name: 'console-logs',
        contentType: 'text/plain',
        path: logsPath,
      });
    }
  },
 });
 ```
 ```yaml
 # .github/workflows/e2e.yml - CI artifact upload
 name: E2E Tests
 on: [push, pull_request]
 jobs:
  test:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - uses: actions/setup-node@v4
        with:
          node-version-file: '.nvmrc'
      - name: Install dependencies
        run: npm ci
      - name: Install Playwright browsers
        run: npx playwright install --with-deps
      - name: Run tests
        run: npm run test
        env:
          TEST_ENV: staging
      # Upload test artifacts on failure
      - name: Upload test results
        if: failure()
        uses: actions/upload-artifact@v4
        with:
          name: test-results
          path: test-results/
          retention-days: 30
      - name: Upload Playwright report
        if: failure()
        uses: actions/upload-artifact@v4
        with:
          name: playwright-report
          path: playwright-report/
          retention-days: 30
 ```
 ```typescript
 // Example: Custom screenshot on specific condition
 test('capture screenshot on specific error', async ({ page }) => {
  await page.goto('/checkout');
  try {
    await page.click('[data-testid="submit-payment"]');
    await expect(page.getByText('Order Confirmed')).toBeVisible();
  } catch (error) {
    // Capture custom screenshot with timestamp
    await page.screenshot({
      path: `test-results/payment-error-${Date.now()}.png`,
      fullPage: true,
    });
    throw error;
  }
 });
 ```
 **Key Points**:
 - `screenshot: 'only-on-failure'` saves space (not every test)
 - `video: 'retain-on-failure'` captures full flow on failures
 - `trace: 'on-first-retry'` provides deep debugging data (network, DOM, console)
 - HTML report at `playwright-report/` (visual debugging)
 - JUnit XML at `test-results/results.xml` (CI integration)
 - CI uploads artifacts on failure with 30-day retention
 - Custom fixture can capture console logs, network logs, etc.
 ### Example 4: Parallelization Configuration
 **Context**: When tests run slowly in CI, configure parallelization with worker count, sharding, and fully parallel execution to maximize speed while maintaining stability.
 **Implementation**:
 ```typescript
 // playwright.config.ts - Parallelization settings
 import { defineConfig } from '@playwright/test';
 import os from 'os';
 export default defineConfig({
  // Run tests in parallel within single file
  fullyParallel: true,
  // Worker configuration
  workers: process.env.CI
    ? 1 // Serial in CI for stability (or 2 for faster CI)
    : os.cpus().length - 1, // Parallel locally (leave 1 CPU for OS)
  // Prevent accidentally committed .only() from blocking CI
  forbidOnly: !!process.env.CI,
  // Retry failed tests in CI
  retries: process.env.CI ? 2 : 0,
  // Shard configuration (split tests across multiple machines)
  shard:
    process.env.SHARD_INDEX && process.env.SHARD_TOTAL
      ? {
          current: parseInt(process.env.SHARD_INDEX, 10),
          total: parseInt(process.env.SHARD_TOTAL, 10),
        }
      : undefined,
 });
 ```
 ```yaml
 # .github/workflows/e2e-parallel.yml - Sharded CI execution
 name: E2E Tests (Parallel)
 on: [push, pull_request]
 jobs:
  test:
    runs-on: ubuntu-latest
    strategy:
      fail-fast: false
      matrix:
        shard: [1, 2, 3, 4] # Split tests across 4 machines
    steps:
      - uses: actions/checkout@v4
      - uses: actions/setup-node@v4
        with:
          node-version-file: '.nvmrc'
      - name: Install dependencies
        run: npm ci
      - name: Install Playwright browsers
        run: npx playwright install --with-deps
      - name: Run tests (shard ${{ matrix.shard }})
        run: npm run test
        env:
          SHARD_INDEX: ${{ matrix.shard }}
          SHARD_TOTAL: 4
          TEST_ENV: staging
      - name: Upload test results
        if: failure()
        uses: actions/upload-artifact@v4
        with:
          name: test-results-shard-${{ matrix.shard }}
          path: test-results/
 ```
 ```typescript
 // playwright/config/serial.config.ts - Serial execution for flaky tests
 import { defineConfig } from '@playwright/test';
 import { baseConfig } from './base.config';
 export default defineConfig({
  ...baseConfig,
  // Disable parallel execution
  fullyParallel: false,
  workers: 1,
  // Used for: authentication flows, database-dependent tests, feature flag tests
 });
 ```
 ```typescript
 // Usage: Force serial execution for specific tests
 import { test } from '@playwright/test';
 // Serial execution for auth tests (shared session state)
 test.describe.configure({ mode: 'serial' });
 test.describe('Authentication Flow', () => {
  test('user can log in', async ({ page }) => {
    // First test in serial block
  });
  test('user can access dashboard', async ({ page }) => {
    // Depends on previous test (serial)
  });
 });
 ```
 ```typescript
 // Usage: Parallel execution for independent tests (default)
 import { test } from '@playwright/test';
 test.describe('Product Catalog', () => {
  test('can view product 1', async ({ page }) => {
    // Runs in parallel with other tests
  });
  test('can view product 2', async ({ page }) => {
    // Runs in parallel with other tests
  });
 });
 ```
 **Key Points**:
 - `fullyParallel: true` enables parallel execution within single test file
 - Workers: 1 in CI (stability), N-1 CPUs locally (speed)
 - Sharding splits tests across multiple CI machines (4x faster with 4 shards)
 - `test.describe.configure({ mode: 'serial' })` for dependent tests
 - `forbidOnly: true` in CI prevents `.only()` from blocking pipeline
 - Matrix strategy in CI runs shards concurrently
 ### Example 5: Project Configuration
 **Context**: When testing across multiple browsers, devices, or configurations, use Playwright projects to run the same tests against different environments (chromium, firefox, webkit, mobile).
 **Implementation**:
 ```typescript
 // playwright.config.ts - Multiple browser projects
 import { defineConfig, devices } from '@playwright/test';
 export default defineConfig({
  projects: [
    // Desktop browsers
    {
      name: 'chromium',
      use: { ...devices['Desktop Chrome'] },
    },
    {
      name: 'firefox',
      use: { ...devices['Desktop Firefox'] },
    },
    {
      name: 'webkit',
      use: { ...devices['Desktop Safari'] },
    },
    // Mobile browsers
    {
      name: 'mobile-chrome',
      use: { ...devices['Pixel 5'] },
    },
    {
      name: 'mobile-safari',
      use: { ...devices['iPhone 13'] },
    },
    // Tablet
    {
      name: 'tablet',
      use: { ...devices['iPad Pro'] },
    },
  ],
 });
 ```
 ```typescript
 // playwright.config.ts - Authenticated vs. unauthenticated projects
 import { defineConfig } from '@playwright/test';
 import path from 'path';
 export default defineConfig({
  projects: [
    // Setup project (runs first, creates auth state)
    {
      name: 'setup',
      testMatch: /global-setup\.ts/,
    },
    // Authenticated tests (reuse auth state)
    {
      name: 'authenticated',
      dependencies: ['setup'],
      use: {
        storageState: path.resolve(__dirname, './playwright/.auth/user.json'),
      },
      testMatch: /.*authenticated\.spec\.ts/,
    },
    // Unauthenticated tests (public pages)
    {
      name: 'unauthenticated',
      testMatch: /.*unauthenticated\.spec\.ts/,
    },
  ],
 });
 ```
 ```typescript
 // playwright/support/global-setup.ts - Setup project for auth
 import { chromium, FullConfig } from '@playwright/test';
 import path from 'path';
 async function globalSetup(config: FullConfig) {
  const browser = await chromium.launch();
  const page = await browser.newPage();
  // Perform authentication
  await page.goto('http://localhost:3000/login');
  await page.fill('[data-testid="email"]', 'test@example.com');
  await page.fill('[data-testid="password"]', 'password123');
  await page.click('[data-testid="login-button"]');
  // Wait for authentication to complete
  await page.waitForURL('**/dashboard');
  // Save authentication state
  await page.context().storageState({
    path: path.resolve(__dirname, '../.auth/user.json'),
  });
  await browser.close();
 }
 export default globalSetup;
 ```
 ```bash
 # Run specific project
 npx playwright test --project=chromium
 npx playwright test --project=mobile-chrome
 npx playwright test --project=authenticated
 # Run multiple projects
 npx playwright test --project=chromium --project=firefox
 # Run all projects (default)
 npx playwright test
 ```
 ```typescript
 // Usage: Project-specific test
 import { test, expect } from '@playwright/test';
 test('mobile navigation works', async ({ page, isMobile }) => {
  await page.goto('/');
  if (isMobile) {
    // Open mobile menu
    await page.click('[data-testid="hamburger-menu"]');
  }
  await page.click('[data-testid="products-link"]');
  await expect(page).toHaveURL(/.*products/);
 });
 ```
 ```yaml
 # .github/workflows/e2e-cross-browser.yml - CI cross-browser testing
 name: E2E Tests (Cross-Browser)
 on: [push, pull_request]
 jobs:
  test:
    runs-on: ubuntu-latest
    strategy:
      fail-fast: false
      matrix:
        project: [chromium, firefox, webkit, mobile-chrome]
    steps:
      - uses: actions/checkout@v4
      - uses: actions/setup-node@v4
      - run: npm ci
      - run: npx playwright install --with-deps
      - name: Run tests (${{ matrix.project }})
        run: npx playwright test --project=${{ matrix.project }}
 ```
 **Key Points**:
 - Projects enable testing across browsers, devices, and configurations
 - `devices` from `@playwright/test` provide preset configurations (Pixel 5, iPhone 13, etc.)
 - `dependencies` ensures setup project runs first (auth, data seeding)
 - `storageState` shares authentication across tests (0 seconds auth per test)
 - `testMatch` filters which tests run in which project
 - CI matrix strategy runs projects in parallel (4x faster with 4 projects)
 - `isMobile` context property for conditional logic in tests
 ## Integration Points
 - **Used in workflows**: `*framework` (config setup), `*ci` (parallelization, artifact upload)
 - **Related fragments**:
  - `fixture-architecture.md` - Fixture-based timeout overrides
  - `ci-burn-in.md` - CI pipeline artifact upload
  - `test-quality.md` - Timeout standards (no hard waits)
  - `data-factories.md` - Per-test isolation (no shared global state)
 ## Configuration Checklist
 **Before deploying tests, verify**:
 - [ ] Environment config map with fail-fast validation
 - [ ] Standardized timeouts (action 15s, navigation 30s, expect 10s, test 60s)
 - [ ] Artifact storage at `test-results/` and `playwright-report/`
 - [ ] HTML + JUnit reporters configured
 - [ ] `.env.example`, `.nvmrc`, browser versions committed
 - [ ] Parallelization configured (workers, sharding)
 - [ ] Projects defined for cross-browser/device testing (if needed)
 - [ ] CI uploads artifacts on failure with 30-day retention
 _Source: Playwright book repo, SEON configuration example, Murat testing philosophy (lines 216-271)._
--- a/_bmad/bmm/testarch/knowledge/probability-impact.md
+++ b/_bmad/bmm/testarch/knowledge/probability-impact.md
@@ -0,0 +1,601 @@
 # Probability and Impact Scale
 ## Principle
 Risk scoring uses a **probability × impact** matrix (1-9 scale) to prioritize testing efforts. Higher scores (6-9) demand immediate action; lower scores (1-3) require documentation only. This systematic approach ensures testing resources focus on the highest-value risks.
 ## Rationale
 **The Problem**: Without quantifiable risk assessment, teams over-test low-value scenarios while missing critical risks. Gut feeling leads to inconsistent prioritization and missed edge cases.
 **The Solution**: Standardize risk evaluation with a 3×3 matrix (probability: 1-3, impact: 1-3). Multiply to derive risk score (1-9). Automate classification (DOCUMENT, MONITOR, MITIGATE, BLOCK) based on thresholds. This approach surfaces hidden risks early and justifies testing decisions to stakeholders.
 **Why This Matters**:
 - Consistent risk language across product, engineering, and QA
 - Objective prioritization of test scenarios (not politics)
 - Automatic gate decisions (score=9 → FAIL until resolved)
 - Audit trail for compliance and retrospectives
 ## Pattern Examples
 ### Example 1: Probability-Impact Matrix Implementation (Automated Classification)
 **Context**: Implement a reusable risk scoring system with automatic threshold classification
 **Implementation**:
 ```typescript
 // src/testing/risk-matrix.ts
 /**
 * Probability levels:
 * 1 = Unlikely (standard implementation, low uncertainty)
 * 2 = Possible (edge cases or partial unknowns)
 * 3 = Likely (known issues, new integrations, high ambiguity)
 */
 export type Probability = 1 | 2 | 3;
 /**
 * Impact levels:
 * 1 = Minor (cosmetic issues or easy workarounds)
 * 2 = Degraded (partial feature loss or manual workaround)
 * 3 = Critical (blockers, data/security/regulatory exposure)
 */
 export type Impact = 1 | 2 | 3;
 /**
 * Risk score (probability × impact): 1-9
 */
 export type RiskScore = 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9;
 /**
 * Action categories based on risk score thresholds
 */
 export type RiskAction = 'DOCUMENT' | 'MONITOR' | 'MITIGATE' | 'BLOCK';
 export type RiskAssessment = {
  probability: Probability;
  impact: Impact;
  score: RiskScore;
  action: RiskAction;
  reasoning: string;
 };
 /**
 * Calculate risk score: probability × impact
 */
 export function calculateRiskScore(probability: Probability, impact: Impact): RiskScore {
  return (probability * impact) as RiskScore;
 }
 /**
 * Classify risk action based on score thresholds:
 * - 1-3: DOCUMENT (awareness only)
 * - 4-5: MONITOR (watch closely, plan mitigations)
 * - 6-8: MITIGATE (CONCERNS at gate until mitigated)
 * - 9: BLOCK (automatic FAIL until resolved or waived)
 */
 export function classifyRiskAction(score: RiskScore): RiskAction {
  if (score >= 9) return 'BLOCK';
  if (score >= 6) return 'MITIGATE';
  if (score >= 4) return 'MONITOR';
  return 'DOCUMENT';
 }
 /**
 * Full risk assessment with automatic classification
 */
 export function assessRisk(params: { probability: Probability; impact: Impact; reasoning: string }): RiskAssessment {
  const { probability, impact, reasoning } = params;
  const score = calculateRiskScore(probability, impact);
  const action = classifyRiskAction(score);
  return { probability, impact, score, action, reasoning };
 }
 /**
 * Generate risk matrix visualization (3x3 grid)
 * Returns markdown table with color-coded scores
 */
 export function generateRiskMatrix(): string {
  const matrix: string[][] = [];
  const header = ['Impact \\ Probability', 'Unlikely (1)', 'Possible (2)', 'Likely (3)'];
  matrix.push(header);
  const impactLabels = ['Critical (3)', 'Degraded (2)', 'Minor (1)'];
  for (let impact = 3; impact >= 1; impact--) {
    const row = [impactLabels[3 - impact]];
    for (let probability = 1; probability <= 3; probability++) {
      const score = calculateRiskScore(probability as Probability, impact as Impact);
      const action = classifyRiskAction(score);
      const emoji = action === 'BLOCK' ? '🔴' : action === 'MITIGATE' ? '🟠' : action === 'MONITOR' ? '🟡' : '🟢';
      row.push(`${emoji} ${score}`);
    }
    matrix.push(row);
  }
  return matrix.map((row) => `| ${row.join(' | ')} |`).join('\n');
 }
 ```
 **Key Points**:
 - Type-safe probability/impact (1-3 enforced at compile time)
 - Automatic action classification (DOCUMENT, MONITOR, MITIGATE, BLOCK)
 - Visual matrix generation for documentation
 - Risk score formula: `probability * impact` (max = 9)
 - Threshold-based decision rules (6-8 = MITIGATE, 9 = BLOCK)
 ---
 ### Example 2: Risk Assessment Workflow (Test Planning Integration)
 **Context**: Apply risk matrix during test design to prioritize scenarios
 **Implementation**:
 ```typescript
 // tests/e2e/test-planning/risk-assessment.ts
 import { assessRisk, generateRiskMatrix, type RiskAssessment } from '../../../src/testing/risk-matrix';
 export type TestScenario = {
  id: string;
  title: string;
  feature: string;
  risk: RiskAssessment;
  testLevel: 'E2E' | 'API' | 'Unit';
  priority: 'P0' | 'P1' | 'P2' | 'P3';
  owner: string;
 };
 /**
 * Assess test scenarios and auto-assign priority based on risk score
 */
 export function assessTestScenarios(scenarios: Omit<TestScenario, 'risk' | 'priority'>[]): TestScenario[] {
  return scenarios.map((scenario) => {
    // Auto-assign priority based on risk score
    const priority = mapRiskToPriority(scenario.risk.score);
    return { ...scenario, priority };
  });
 }
 /**
 * Map risk score to test priority (P0-P3)
 * P0: Critical (score 9) - blocks release
 * P1: High (score 6-8) - must fix before release
 * P2: Medium (score 4-5) - fix if time permits
 * P3: Low (score 1-3) - document and defer
 */
 function mapRiskToPriority(score: number): 'P0' | 'P1' | 'P2' | 'P3' {
  if (score === 9) return 'P0';
  if (score >= 6) return 'P1';
  if (score >= 4) return 'P2';
  return 'P3';
 }
 /**
 * Example: Payment flow risk assessment
 */
 export const paymentScenarios: Array<Omit<TestScenario, 'priority'>> = [
  {
    id: 'PAY-001',
    title: 'Valid credit card payment completes successfully',
    feature: 'Checkout',
    risk: assessRisk({
      probability: 2, // Possible (standard Stripe integration)
      impact: 3, // Critical (revenue loss if broken)
      reasoning: 'Core revenue flow, but Stripe is well-tested',
    }),
    testLevel: 'E2E',
    owner: 'qa-team',
  },
  {
    id: 'PAY-002',
    title: 'Expired credit card shows user-friendly error',
    feature: 'Checkout',
    risk: assessRisk({
      probability: 3, // Likely (edge case handling often buggy)
      impact: 2, // Degraded (users see error, but can retry)
      reasoning: 'Error handling logic is custom and complex',
    }),
    testLevel: 'E2E',
    owner: 'qa-team',
  },
  {
    id: 'PAY-003',
    title: 'Payment confirmation email formatting is correct',
    feature: 'Email',
    risk: assessRisk({
      probability: 2, // Possible (template changes occasionally break)
      impact: 1, // Minor (cosmetic issue, email still sent)
      reasoning: 'Non-blocking, users get email regardless',
    }),
    testLevel: 'Unit',
    owner: 'dev-team',
  },
  {
    id: 'PAY-004',
    title: 'Payment fails gracefully when Stripe is down',
    feature: 'Checkout',
    risk: assessRisk({
      probability: 1, // Unlikely (Stripe has 99.99% uptime)
      impact: 3, // Critical (complete checkout failure)
      reasoning: 'Rare but catastrophic, requires retry mechanism',
    }),
    testLevel: 'API',
    owner: 'qa-team',
  },
 ];
 /**
 * Generate risk assessment report with priority distribution
 */
 export function generateRiskReport(scenarios: TestScenario[]): string {
  const priorityCounts = scenarios.reduce(
    (acc, s) => {
      acc[s.priority] = (acc[s.priority] || 0) + 1;
      return acc;
    },
    {} as Record<string, number>,
  );
  const actionCounts = scenarios.reduce(
    (acc, s) => {
      acc[s.risk.action] = (acc[s.risk.action] || 0) + 1;
      return acc;
    },
    {} as Record<string, number>,
  );
  return `
 # Risk Assessment Report
 ## Risk Matrix
 ${generateRiskMatrix()}
 ## Priority Distribution
 - **P0 (Blocker)**: ${priorityCounts.P0 || 0} scenarios
 - **P1 (High)**: ${priorityCounts.P1 || 0} scenarios
 - **P2 (Medium)**: ${priorityCounts.P2 || 0} scenarios
 - **P3 (Low)**: ${priorityCounts.P3 || 0} scenarios
 ## Action Required
 - **BLOCK**: ${actionCounts.BLOCK || 0} scenarios (auto-fail gate)
 - **MITIGATE**: ${actionCounts.MITIGATE || 0} scenarios (concerns at gate)
 - **MONITOR**: ${actionCounts.MONITOR || 0} scenarios (watch closely)
 - **DOCUMENT**: ${actionCounts.DOCUMENT || 0} scenarios (awareness only)
 ## Scenarios by Risk Score (Highest First)
 ${scenarios
  .sort((a, b) => b.risk.score - a.risk.score)
  .map((s) => `- **[${s.priority}]** ${s.id}: ${s.title} (Score: ${s.risk.score} - ${s.risk.action})`)
  .join('\n')}
 `.trim();
 }
 ```
 **Key Points**:
 - Risk score → Priority mapping (P0-P3 automated)
 - Report generation with priority/action distribution
 - Scenarios sorted by risk score (highest first)
 - Visual matrix included in reports
 - Reusable across projects (extract to shared library)
 ---
 ### Example 3: Dynamic Risk Re-Assessment (Continuous Evaluation)
 **Context**: Recalculate risk scores as project evolves (requirements change, mitigations implemented)
 **Implementation**:
 ```typescript
 // src/testing/risk-tracking.ts
 import { type RiskAssessment, assessRisk, type Probability, type Impact } from './risk-matrix';
 export type RiskHistory = {
  timestamp: Date;
  assessment: RiskAssessment;
  changedBy: string;
  reason: string;
 };
 export type TrackedRisk = {
  id: string;
  title: string;
  feature: string;
  currentRisk: RiskAssessment;
  history: RiskHistory[];
  mitigations: string[];
  status: 'OPEN' | 'MITIGATED' | 'WAIVED' | 'RESOLVED';
 };
 export class RiskTracker {
  private risks: Map<string, TrackedRisk> = new Map();
  /**
   * Add new risk to tracker
   */
  addRisk(params: {
    id: string;
    title: string;
    feature: string;
    probability: Probability;
    impact: Impact;
    reasoning: string;
    changedBy: string;
  }): TrackedRisk {
    const { id, title, feature, probability, impact, reasoning, changedBy } = params;
    const assessment = assessRisk({ probability, impact, reasoning });
    const risk: TrackedRisk = {
      id,
      title,
      feature,
      currentRisk: assessment,
      history: [
        {
          timestamp: new Date(),
          assessment,
          changedBy,
          reason: 'Initial assessment',
        },
      ],
      mitigations: [],
      status: 'OPEN',
    };
    this.risks.set(id, risk);
    return risk;
  }
  /**
   * Reassess risk (probability or impact changed)
   */
  reassessRisk(params: {
    id: string;
    probability?: Probability;
    impact?: Impact;
    reasoning: string;
    changedBy: string;
  }): TrackedRisk | null {
    const { id, probability, impact, reasoning, changedBy } = params;
    const risk = this.risks.get(id);
    if (!risk) return null;
    // Use existing values if not provided
    const newProbability = probability ?? risk.currentRisk.probability;
    const newImpact = impact ?? risk.currentRisk.impact;
    const newAssessment = assessRisk({
      probability: newProbability,
      impact: newImpact,
      reasoning,
    });
    risk.currentRisk = newAssessment;
    risk.history.push({
      timestamp: new Date(),
      assessment: newAssessment,
      changedBy,
      reason: reasoning,
    });
    this.risks.set(id, risk);
    return risk;
  }
  /**
   * Mark risk as mitigated (probability reduced)
   */
  mitigateRisk(params: { id: string; newProbability: Probability; mitigation: string; changedBy: string }): TrackedRisk | null {
    const { id, newProbability, mitigation, changedBy } = params;
    const risk = this.reassessRisk({
      id,
      probability: newProbability,
      reasoning: `Mitigation implemented: ${mitigation}`,
      changedBy,
    });
    if (risk) {
      risk.mitigations.push(mitigation);
      if (risk.currentRisk.action === 'DOCUMENT' || risk.currentRisk.action === 'MONITOR') {
        risk.status = 'MITIGATED';
      }
    }
    return risk;
  }
  /**
   * Get risks requiring action (MITIGATE or BLOCK)
   */
  getRisksRequiringAction(): TrackedRisk[] {
    return Array.from(this.risks.values()).filter(
      (r) => r.status === 'OPEN' && (r.currentRisk.action === 'MITIGATE' || r.currentRisk.action === 'BLOCK'),
    );
  }
  /**
   * Generate risk trend report (show changes over time)
   */
  generateTrendReport(riskId: string): string | null {
    const risk = this.risks.get(riskId);
    if (!risk) return null;
    return `
 # Risk Trend Report: ${risk.id}
 **Title**: ${risk.title}
 **Feature**: ${risk.feature}
 **Status**: ${risk.status}
 ## Current Assessment
 - **Probability**: ${risk.currentRisk.probability}
 - **Impact**: ${risk.currentRisk.impact}
 - **Score**: ${risk.currentRisk.score}
 - **Action**: ${risk.currentRisk.action}
 - **Reasoning**: ${risk.currentRisk.reasoning}
 ## Mitigations Applied
 ${risk.mitigations.length > 0 ? risk.mitigations.map((m) => `- ${m}`).join('\n') : '- None'}
 ## History (${risk.history.length} changes)
 ${risk.history
  .reverse()
  .map((h) => `- **${h.timestamp.toISOString()}** by ${h.changedBy}: Score ${h.assessment.score} (${h.assessment.action}) - ${h.reason}`)
  .join('\n')}
 `.trim();
  }
 }
 ```
 **Key Points**:
 - Historical tracking (audit trail for risk changes)
 - Mitigation impact tracking (probability reduction)
 - Status lifecycle (OPEN → MITIGATED → RESOLVED)
 - Trend reports (show risk evolution over time)
 - Re-assessment triggers (requirements change, new info)
 ---
 ### Example 4: Risk Matrix in Gate Decision (Integration with Trace Workflow)
 **Context**: Use probability-impact scores to drive gate decisions (PASS/CONCERNS/FAIL/WAIVED)
 **Implementation**:
 ```typescript
 // src/testing/gate-decision.ts
 import { type RiskScore, classifyRiskAction, type RiskAction } from './risk-matrix';
 import { type TrackedRisk } from './risk-tracking';
 export type GateDecision = 'PASS' | 'CONCERNS' | 'FAIL' | 'WAIVED';
 export type GateResult = {
  decision: GateDecision;
  blockers: TrackedRisk[]; // Score=9, action=BLOCK
  concerns: TrackedRisk[]; // Score 6-8, action=MITIGATE
  monitored: TrackedRisk[]; // Score 4-5, action=MONITOR
  documented: TrackedRisk[]; // Score 1-3, action=DOCUMENT
  summary: string;
 };
 /**
 * Evaluate gate based on risk assessments
 */
 export function evaluateGateFromRisks(risks: TrackedRisk[]): GateResult {
  const blockers = risks.filter((r) => r.currentRisk.action === 'BLOCK' && r.status === 'OPEN');
  const concerns = risks.filter((r) => r.currentRisk.action === 'MITIGATE' && r.status === 'OPEN');
  const monitored = risks.filter((r) => r.currentRisk.action === 'MONITOR');
  const documented = risks.filter((r) => r.currentRisk.action === 'DOCUMENT');
  let decision: GateDecision;
  if (blockers.length > 0) {
    decision = 'FAIL';
  } else if (concerns.length > 0) {
    decision = 'CONCERNS';
  } else {
    decision = 'PASS';
  }
  const summary = generateGateSummary({ decision, blockers, concerns, monitored, documented });
  return { decision, blockers, concerns, monitored, documented, summary };
 }
 /**
 * Generate gate decision summary
 */
 function generateGateSummary(result: Omit<GateResult, 'summary'>): string {
  const { decision, blockers, concerns, monitored, documented } = result;
  const lines: string[] = [`## Gate Decision: ${decision}`];
  if (decision === 'FAIL') {
    lines.push(`\n**Blockers** (${blockers.length}): Automatic FAIL until resolved or waived`);
    blockers.forEach((r) => {
      lines.push(`- **${r.id}**: ${r.title} (Score: ${r.currentRisk.score})`);
      lines.push(`  - Probability: ${r.currentRisk.probability}, Impact: ${r.currentRisk.impact}`);
      lines.push(`  - Reasoning: ${r.currentRisk.reasoning}`);
    });
  }
  if (concerns.length > 0) {
    lines.push(`\n**Concerns** (${concerns.length}): Address before release`);
    concerns.forEach((r) => {
      lines.push(`- **${r.id}**: ${r.title} (Score: ${r.currentRisk.score})`);
      lines.push(`  - Mitigations: ${r.mitigations.join(', ') || 'None'}`);
    });
  }
  if (monitored.length > 0) {
    lines.push(`\n**Monitored** (${monitored.length}): Watch closely`);
    monitored.forEach((r) => lines.push(`- **${r.id}**: ${r.title} (Score: ${r.currentRisk.score})`));
  }
  if (documented.length > 0) {
    lines.push(`\n**Documented** (${documented.length}): Awareness only`);
  }
  lines.push(`\n---\n`);
  lines.push(`**Next Steps**:`);
  if (decision === 'FAIL') {
    lines.push(`- Resolve blockers or request formal waiver`);
  } else if (decision === 'CONCERNS') {
    lines.push(`- Implement mitigations for high-risk scenarios (score 6-8)`);
    lines.push(`- Re-run gate after mitigations`);
  } else {
    lines.push(`- Proceed with release`);
  }
  return lines.join('\n');
 }
 ```
 **Key Points**:
 - Gate decision driven by risk scores (not gut feeling)
 - Automatic FAIL for score=9 (blockers)
 - CONCERNS for score 6-8 (requires mitigation)
 - PASS only when no blockers/concerns
 - Actionable summary with next steps
 - Integration with trace workflow (Phase 2)
 ---
 ## Probability-Impact Threshold Summary
 | Score | Action   | Gate Impact          | Typical Use Case                       |
 | ----- | -------- | -------------------- | -------------------------------------- |
 | 1-3   | DOCUMENT | None                 | Cosmetic issues, low-priority bugs     |
 | 4-5   | MONITOR  | None (watch closely) | Edge cases, partial unknowns           |
 | 6-8   | MITIGATE | CONCERNS at gate     | High-impact scenarios needing coverage |
 | 9     | BLOCK    | Automatic FAIL       | Critical blockers, must resolve        |
 ## Risk Assessment Checklist
 Before deploying risk matrix:
 - [ ] **Probability scale defined**: 1 (unlikely), 2 (possible), 3 (likely) with clear examples
 - [ ] **Impact scale defined**: 1 (minor), 2 (degraded), 3 (critical) with concrete criteria
 - [ ] **Threshold rules documented**: Score → Action mapping (1-3 = DOCUMENT, 4-5 = MONITOR, 6-8 = MITIGATE, 9 = BLOCK)
 - [ ] **Gate integration**: Risk scores drive gate decisions (PASS/CONCERNS/FAIL/WAIVED)
 - [ ] **Re-assessment process**: Risks re-evaluated as project evolves (requirements change, mitigations applied)
 - [ ] **Audit trail**: Historical tracking for risk changes (who, when, why)
 - [ ] **Mitigation tracking**: Link mitigations to probability reduction (quantify impact)
 - [ ] **Reporting**: Risk matrix visualization, trend reports, gate summaries
 ## Integration Points
 - **Used in workflows**: `*test-design` (initial risk assessment), `*trace` (gate decision Phase 2), `*nfr-assess` (security/performance risks)
 - **Related fragments**: `risk-governance.md` (risk scoring matrix, gate decision engine), `test-priorities-matrix.md` (P0-P3 mapping), `nfr-criteria.md` (impact assessment for NFRs)
 - **Tools**: TypeScript for type safety, markdown for reports, version control for audit trail
 _Source: Murat risk model summary, gate decision patterns from production systems, probability-impact matrix from risk governance practices_
--- a/_bmad/bmm/testarch/knowledge/recurse.md
+++ b/_bmad/bmm/testarch/knowledge/recurse.md
@@ -0,0 +1,296 @@
 # Recurse (Polling) Utility
 ## Principle
 Use Cypress-style polling with Playwright's `expect.poll` to wait for asynchronous conditions. Provides configurable timeout, interval, logging, and post-polling callbacks with enhanced error categorization.
 ## Rationale
 Testing async operations (background jobs, eventual consistency, webhook processing) requires polling:
 - Vanilla `expect.poll` is verbose
 - No built-in logging for debugging
 - Generic timeout errors
 - No post-poll hooks
 The `recurse` utility provides:
 - **Clean syntax**: Inspired by cypress-recurse
 - **Enhanced errors**: Timeout vs command failure vs predicate errors
 - **Built-in logging**: Track polling progress
 - **Post-poll callbacks**: Process results after success
 - **Type-safe**: Full TypeScript generic support
 ## Pattern Examples
 ### Example 1: Basic Polling
 **Context**: Wait for async operation to complete with custom timeout and interval.
 **Implementation**:
 ```typescript
 import { test } from '@seontechnologies/playwright-utils/recurse/fixtures';
 test('should wait for job completion', async ({ recurse, apiRequest }) => {
  // Start job
  const { body } = await apiRequest({
    method: 'POST',
    path: '/api/jobs',
    body: { type: 'export' },
  });
  // Poll until ready
  const result = await recurse(
    () => apiRequest({ method: 'GET', path: `/api/jobs/${body.id}` }),
    (response) => response.body.status === 'completed',
    {
      timeout: 60000, // 60 seconds max
      interval: 2000, // Check every 2 seconds
      log: 'Waiting for export job to complete',
    },
  );
  expect(result.body.downloadUrl).toBeDefined();
 });
 ```
 **Key Points**:
 - First arg: command function (what to execute)
 - Second arg: predicate function (when to stop)
 - Options: timeout, interval, log message
 - Returns the value when predicate returns true
 ### Example 2: Polling with Assertions
 **Context**: Use assertions directly in predicate for more expressive tests.
 **Implementation**:
 ```typescript
 test('should poll with assertions', async ({ recurse, apiRequest }) => {
  await apiRequest({
    method: 'POST',
    path: '/api/events',
    body: { type: 'user-created', userId: '123' },
  });
  // Poll with assertions in predicate
  await recurse(
    async () => {
      const { body } = await apiRequest({ method: 'GET', path: '/api/events/123' });
      return body;
    },
    (event) => {
      // Use assertions instead of boolean returns
      expect(event.processed).toBe(true);
      expect(event.timestamp).toBeDefined();
      // If assertions pass, predicate succeeds
    },
    { timeout: 30000 },
  );
 });
 ```
 **Key Points**:
 - Predicate can use `expect()` assertions
 - If assertions throw, polling continues
 - If assertions pass, polling succeeds
 - More expressive than boolean returns
 ### Example 3: Custom Error Messages
 **Context**: Provide context-specific error messages for timeout failures.
 **Implementation**:
 ```typescript
 test('custom error on timeout', async ({ recurse, apiRequest }) => {
  try {
    await recurse(
      () => apiRequest({ method: 'GET', path: '/api/status' }),
      (res) => res.body.ready === true,
      {
        timeout: 10000,
        error: 'System failed to become ready within 10 seconds - check background workers',
      },
    );
  } catch (error) {
    // Error message includes custom context
    expect(error.message).toContain('check background workers');
    throw error;
  }
 });
 ```
 **Key Points**:
 - `error` option provides custom message
 - Replaces default "Timed out after X ms"
 - Include debugging hints in error message
 - Helps diagnose failures faster
 ### Example 4: Post-Polling Callback
 **Context**: Process or log results after successful polling.
 **Implementation**:
 ```typescript
 test('post-poll processing', async ({ recurse, apiRequest }) => {
  const finalResult = await recurse(
    () => apiRequest({ method: 'GET', path: '/api/batch-job/123' }),
    (res) => res.body.status === 'completed',
    {
      timeout: 60000,
      post: (result) => {
        // Runs after successful polling
        console.log(`Job completed in ${result.body.duration}ms`);
        console.log(`Processed ${result.body.itemsProcessed} items`);
        return result.body;
      },
    },
  );
  expect(finalResult.itemsProcessed).toBeGreaterThan(0);
 });
 ```
 **Key Points**:
 - `post` callback runs after predicate succeeds
 - Receives the final result
 - Can transform or log results
 - Return value becomes final `recurse` result
 ### Example 5: Integration with API Request (Common Pattern)
 **Context**: Most common use case - polling API endpoints for state changes.
 **Implementation**:
 ```typescript
 import { test } from '@seontechnologies/playwright-utils/fixtures';
 test('end-to-end polling', async ({ apiRequest, recurse }) => {
  // Trigger async operation
  const { body: createResp } = await apiRequest({
    method: 'POST',
    path: '/api/data-import',
    body: { source: 's3://bucket/data.csv' },
  });
  // Poll until import completes
  const importResult = await recurse(
    () => apiRequest({ method: 'GET', path: `/api/data-import/${createResp.importId}` }),
    (response) => {
      const { status, rowsImported } = response.body;
      return status === 'completed' && rowsImported > 0;
    },
    {
      timeout: 120000, // 2 minutes for large imports
      interval: 5000, // Check every 5 seconds
      log: `Polling import ${createResp.importId}`,
    },
  );
  expect(importResult.body.rowsImported).toBeGreaterThan(1000);
  expect(importResult.body.errors).toHaveLength(0);
 });
 ```
 **Key Points**:
 - Combine `apiRequest` + `recurse` for API polling
 - Both from `@seontechnologies/playwright-utils/fixtures`
 - Complex predicates with multiple conditions
 - Logging shows polling progress in test reports
 ## Enhanced Error Types
 The utility categorizes errors for easier debugging:
 ```typescript
 // TimeoutError - Predicate never returned true
 Error: Polling timed out after 30000ms: Job never completed
 // CommandError - Command function threw
 Error: Command failed: Request failed with status 500
 // PredicateError - Predicate function threw (not from assertions)
 Error: Predicate failed: Cannot read property 'status' of undefined
 ```
 ## Comparison with Vanilla Playwright
 | Vanilla Playwright                                                | recurse Utility                                                           |
 | ----------------------------------------------------------------- | ------------------------------------------------------------------------- |
 | `await expect.poll(() => { ... }, { timeout: 30000 }).toBe(true)` | `await recurse(() => { ... }, (val) => val === true, { timeout: 30000 })` |
 | No logging                                                        | Built-in log option                                                       |
 | Generic timeout errors                                            | Categorized errors (timeout/command/predicate)                            |
 | No post-poll hooks                                                | `post` callback support                                                   |
 ## When to Use
 **Use recurse for:**
 - ✅ Background job completion
 - ✅ Webhook/event processing
 - ✅ Database eventual consistency
 - ✅ Cache propagation
 - ✅ State machine transitions
 **Stick with vanilla expect.poll for:**
 - Simple UI element visibility (use `expect(locator).toBeVisible()`)
 - Single-property checks
 - Cases where logging isn't needed
 ## Related Fragments
 - `api-request.md` - Combine for API endpoint polling
 - `overview.md` - Fixture composition patterns
 - `fixtures-composition.md` - Using with mergeTests
 ## Anti-Patterns
 **❌ Using hard waits instead of polling:**
 ```typescript
 await page.click('#export');
 await page.waitForTimeout(5000); // Arbitrary wait
 expect(await page.textContent('#status')).toBe('Ready');
 ```
 **✅ Poll for actual condition:**
 ```typescript
 await page.click('#export');
 await recurse(
  () => page.textContent('#status'),
  (status) => status === 'Ready',
  { timeout: 10000 },
 );
 ```
 **❌ Polling too frequently:**
 ```typescript
 await recurse(
  () => apiRequest({ method: 'GET', path: '/status' }),
  (res) => res.body.ready,
  { interval: 100 }, // Hammers API every 100ms!
 );
 ```
 **✅ Reasonable interval for API calls:**
 ```typescript
 await recurse(
  () => apiRequest({ method: 'GET', path: '/status' }),
  (res) => res.body.ready,
  { interval: 2000 }, // Check every 2 seconds (reasonable)
 );
 ```
--- a/_bmad/bmm/testarch/knowledge/risk-governance.md
+++ b/_bmad/bmm/testarch/knowledge/risk-governance.md
@@ -0,0 +1,615 @@
 # Risk Governance and Gatekeeping
 ## Principle
 Risk governance transforms subjective "should we ship?" debates into objective, data-driven decisions. By scoring risk (probability × impact), classifying by category (TECH, SEC, PERF, etc.), and tracking mitigation ownership, teams create transparent quality gates that balance speed with safety.
 ## Rationale
 **The Problem**: Without formal risk governance, releases become political—loud voices win, quiet risks hide, and teams discover critical issues in production. "We thought it was fine" isn't a release strategy.
 **The Solution**: Risk scoring (1-3 scale for probability and impact, total 1-9) creates shared language. Scores ≥6 demand documented mitigation. Scores = 9 mandate gate failure. Every acceptance criterion maps to a test, and gaps require explicit waivers with owners and expiry dates.
 **Why This Matters**:
 - Removes ambiguity from release decisions (objective scores vs subjective opinions)
 - Creates audit trail for compliance (FDA, SOC2, ISO require documented risk management)
 - Identifies true blockers early (prevents last-minute production fires)
 - Distributes responsibility (owners, mitigation plans, deadlines for every risk >4)
 ## Pattern Examples
 ### Example 1: Risk Scoring Matrix with Automated Classification (TypeScript)
 **Context**: Calculate risk scores automatically from test results and categorize by risk type
 **Implementation**:
 ```typescript
 // risk-scoring.ts - Risk classification and scoring system
 export const RISK_CATEGORIES = {
  TECH: 'TECH', // Technical debt, architecture fragility
  SEC: 'SEC', // Security vulnerabilities
  PERF: 'PERF', // Performance degradation
  DATA: 'DATA', // Data integrity, corruption
  BUS: 'BUS', // Business logic errors
  OPS: 'OPS', // Operational issues (deployment, monitoring)
 } as const;
 export type RiskCategory = keyof typeof RISK_CATEGORIES;
 export type RiskScore = {
  id: string;
  category: RiskCategory;
  title: string;
  description: string;
  probability: 1 | 2 | 3; // 1=Low, 2=Medium, 3=High
  impact: 1 | 2 | 3; // 1=Low, 2=Medium, 3=High
  score: number; // probability × impact (1-9)
  owner: string;
  mitigationPlan?: string;
  deadline?: Date;
  status: 'OPEN' | 'MITIGATED' | 'WAIVED' | 'ACCEPTED';
  waiverReason?: string;
  waiverApprover?: string;
  waiverExpiry?: Date;
 };
 // Risk scoring rules
 export function calculateRiskScore(probability: 1 | 2 | 3, impact: 1 | 2 | 3): number {
  return probability * impact;
 }
 export function requiresMitigation(score: number): boolean {
  return score >= 6; // Scores 6-9 demand action
 }
 export function isCriticalBlocker(score: number): boolean {
  return score === 9; // Probability=3 AND Impact=3 → FAIL gate
 }
 export function classifyRiskLevel(score: number): 'LOW' | 'MEDIUM' | 'HIGH' | 'CRITICAL' {
  if (score === 9) return 'CRITICAL';
  if (score >= 6) return 'HIGH';
  if (score >= 4) return 'MEDIUM';
  return 'LOW';
 }
 // Example: Risk assessment from test failures
 export function assessTestFailureRisk(failure: {
  test: string;
  category: RiskCategory;
  affectedUsers: number;
  revenueImpact: number;
  securityVulnerability: boolean;
 }): RiskScore {
  // Probability based on test failure frequency (simplified)
  const probability: 1 | 2 | 3 = 3; // Test failed = High probability
  // Impact based on business context
  let impact: 1 | 2 | 3 = 1;
  if (failure.securityVulnerability) impact = 3;
  else if (failure.revenueImpact > 10000) impact = 3;
  else if (failure.affectedUsers > 1000) impact = 2;
  else impact = 1;
  const score = calculateRiskScore(probability, impact);
  return {
    id: `risk-${Date.now()}`,
    category: failure.category,
    title: `Test failure: ${failure.test}`,
    description: `Affects ${failure.affectedUsers} users, $${failure.revenueImpact} revenue`,
    probability,
    impact,
    score,
    owner: 'unassigned',
    status: score === 9 ? 'OPEN' : 'OPEN',
  };
 }
 ```
 **Key Points**:
 - **Objective scoring**: Probability (1-3) × Impact (1-3) = Score (1-9)
 - **Clear thresholds**: Score ≥6 requires mitigation, score = 9 blocks release
 - **Business context**: Revenue, users, security drive impact calculation
 - **Status tracking**: OPEN → MITIGATED → WAIVED → ACCEPTED lifecycle
 ---
 ### Example 2: Gate Decision Engine with Traceability Validation
 **Context**: Automated gate decision based on risk scores and test coverage
 **Implementation**:
 ```typescript
 // gate-decision-engine.ts
 export type GateDecision = 'PASS' | 'CONCERNS' | 'FAIL' | 'WAIVED';
 export type CoverageGap = {
  acceptanceCriteria: string;
  testMissing: string;
  reason: string;
 };
 export type GateResult = {
  decision: GateDecision;
  timestamp: Date;
  criticalRisks: RiskScore[];
  highRisks: RiskScore[];
  coverageGaps: CoverageGap[];
  summary: string;
  recommendations: string[];
 };
 export function evaluateGate(params: { risks: RiskScore[]; coverageGaps: CoverageGap[]; waiverApprover?: string }): GateResult {
  const { risks, coverageGaps, waiverApprover } = params;
  // Categorize risks
  const criticalRisks = risks.filter((r) => r.score === 9 && r.status === 'OPEN');
  const highRisks = risks.filter((r) => r.score >= 6 && r.score < 9 && r.status === 'OPEN');
  const unresolvedGaps = coverageGaps.filter((g) => !g.reason);
  // Decision logic
  let decision: GateDecision;
  // FAIL: Critical blockers (score=9) or missing coverage
  if (criticalRisks.length > 0 || unresolvedGaps.length > 0) {
    decision = 'FAIL';
  }
  // WAIVED: All risks waived by authorized approver
  else if (risks.every((r) => r.status === 'WAIVED') && waiverApprover) {
    decision = 'WAIVED';
  }
  // CONCERNS: High risks (score 6-8) with mitigation plans
  else if (highRisks.length > 0 && highRisks.every((r) => r.mitigationPlan && r.owner !== 'unassigned')) {
    decision = 'CONCERNS';
  }
  // PASS: No critical issues, all risks mitigated or low
  else {
    decision = 'PASS';
  }
  // Generate recommendations
  const recommendations: string[] = [];
  if (criticalRisks.length > 0) {
    recommendations.push(`🚨 ${criticalRisks.length} CRITICAL risk(s) must be mitigated before release`);
  }
  if (unresolvedGaps.length > 0) {
    recommendations.push(`📋 ${unresolvedGaps.length} acceptance criteria lack test coverage`);
  }
  if (highRisks.some((r) => !r.mitigationPlan)) {
    recommendations.push(`⚠️  High risks without mitigation plans: assign owners and deadlines`);
  }
  if (decision === 'PASS') {
    recommendations.push(`✅ All risks mitigated or acceptable. Ready for release.`);
  }
  return {
    decision,
    timestamp: new Date(),
    criticalRisks,
    highRisks,
    coverageGaps: unresolvedGaps,
    summary: generateSummary(decision, risks, unresolvedGaps),
    recommendations,
  };
 }
 function generateSummary(decision: GateDecision, risks: RiskScore[], gaps: CoverageGap[]): string {
  const total = risks.length;
  const critical = risks.filter((r) => r.score === 9).length;
  const high = risks.filter((r) => r.score >= 6 && r.score < 9).length;
  return `Gate Decision: ${decision}. Total Risks: ${total} (${critical} critical, ${high} high). Coverage Gaps: ${gaps.length}.`;
 }
 ```
 **Usage Example**:
 ```typescript
 // Example: Running gate check before deployment
 import { assessTestFailureRisk, evaluateGate } from './gate-decision-engine';
 // Collect risks from test results
 const risks: RiskScore[] = [
  assessTestFailureRisk({
    test: 'Payment processing with expired card',
    category: 'BUS',
    affectedUsers: 5000,
    revenueImpact: 50000,
    securityVulnerability: false,
  }),
  assessTestFailureRisk({
    test: 'SQL injection in search endpoint',
    category: 'SEC',
    affectedUsers: 10000,
    revenueImpact: 0,
    securityVulnerability: true,
  }),
 ];
 // Identify coverage gaps
 const coverageGaps: CoverageGap[] = [
  {
    acceptanceCriteria: 'User can reset password via email',
    testMissing: 'e2e/auth/password-reset.spec.ts',
    reason: '', // Empty = unresolved
  },
 ];
 // Evaluate gate
 const gateResult = evaluateGate({ risks, coverageGaps });
 console.log(gateResult.decision); // 'FAIL'
 console.log(gateResult.summary);
 // "Gate Decision: FAIL. Total Risks: 2 (1 critical, 1 high). Coverage Gaps: 1."
 console.log(gateResult.recommendations);
 // [
 //   "🚨 1 CRITICAL risk(s) must be mitigated before release",
 //   "📋 1 acceptance criteria lack test coverage"
 // ]
 ```
 **Key Points**:
 - **Automated decision**: No human interpretation required
 - **Clear criteria**: FAIL = critical risks or gaps, CONCERNS = high risks with plans, PASS = low risks
 - **Actionable output**: Recommendations drive next steps
 - **Audit trail**: Timestamp, decision, and context for compliance
 ---
 ### Example 3: Risk Mitigation Workflow with Owner Tracking
 **Context**: Track risk mitigation from identification to resolution
 **Implementation**:
 ```typescript
 // risk-mitigation.ts
 export type MitigationAction = {
  riskId: string;
  action: string;
  owner: string;
  deadline: Date;
  status: 'PENDING' | 'IN_PROGRESS' | 'COMPLETED' | 'BLOCKED';
  completedAt?: Date;
  blockedReason?: string;
 };
 export class RiskMitigationTracker {
  private risks: Map<string, RiskScore> = new Map();
  private actions: Map<string, MitigationAction[]> = new Map();
  private history: Array<{ riskId: string; event: string; timestamp: Date }> = [];
  // Register a new risk
  addRisk(risk: RiskScore): void {
    this.risks.set(risk.id, risk);
    this.logHistory(risk.id, `Risk registered: ${risk.title} (Score: ${risk.score})`);
    // Auto-assign mitigation requirements for score ≥6
    if (requiresMitigation(risk.score) && !risk.mitigationPlan) {
      this.logHistory(risk.id, `⚠️  Mitigation required (score ${risk.score}). Assign owner and plan.`);
    }
  }
  // Add mitigation action
  addMitigationAction(action: MitigationAction): void {
    const risk = this.risks.get(action.riskId);
    if (!risk) throw new Error(`Risk ${action.riskId} not found`);
    const existingActions = this.actions.get(action.riskId) || [];
    existingActions.push(action);
    this.actions.set(action.riskId, existingActions);
    this.logHistory(action.riskId, `Mitigation action added: ${action.action} (Owner: ${action.owner})`);
  }
  // Complete mitigation action
  completeMitigation(riskId: string, actionIndex: number): void {
    const actions = this.actions.get(riskId);
    if (!actions || !actions[actionIndex]) throw new Error('Action not found');
    actions[actionIndex].status = 'COMPLETED';
    actions[actionIndex].completedAt = new Date();
    this.logHistory(riskId, `Mitigation completed: ${actions[actionIndex].action}`);
    // If all actions completed, mark risk as MITIGATED
    if (actions.every((a) => a.status === 'COMPLETED')) {
      const risk = this.risks.get(riskId)!;
      risk.status = 'MITIGATED';
      this.logHistory(riskId, `✅ Risk mitigated. All actions complete.`);
    }
  }
  // Request waiver for a risk
  requestWaiver(riskId: string, reason: string, approver: string, expiryDays: number): void {
    const risk = this.risks.get(riskId);
    if (!risk) throw new Error(`Risk ${riskId} not found`);
    risk.status = 'WAIVED';
    risk.waiverReason = reason;
    risk.waiverApprover = approver;
    risk.waiverExpiry = new Date(Date.now() + expiryDays * 24 * 60 * 60 * 1000);
    this.logHistory(riskId, `⚠️  Waiver granted by ${approver}. Expires: ${risk.waiverExpiry}`);
  }
  // Generate risk report
  generateReport(): string {
    const allRisks = Array.from(this.risks.values());
    const critical = allRisks.filter((r) => r.score === 9 && r.status === 'OPEN');
    const high = allRisks.filter((r) => r.score >= 6 && r.score < 9 && r.status === 'OPEN');
    const mitigated = allRisks.filter((r) => r.status === 'MITIGATED');
    const waived = allRisks.filter((r) => r.status === 'WAIVED');
    let report = `# Risk Mitigation Report\n\n`;
    report += `**Generated**: ${new Date().toISOString()}\n\n`;
    report += `## Summary\n`;
    report += `- Total Risks: ${allRisks.length}\n`;
    report += `- Critical (Score=9, OPEN): ${critical.length}\n`;
    report += `- High (Score 6-8, OPEN): ${high.length}\n`;
    report += `- Mitigated: ${mitigated.length}\n`;
    report += `- Waived: ${waived.length}\n\n`;
    if (critical.length > 0) {
      report += `## 🚨 Critical Risks (BLOCKERS)\n\n`;
      critical.forEach((r) => {
        report += `- **${r.title}** (${r.category})\n`;
        report += `  - Score: ${r.score} (Probability: ${r.probability}, Impact: ${r.impact})\n`;
        report += `  - Owner: ${r.owner}\n`;
        report += `  - Mitigation: ${r.mitigationPlan || 'NOT ASSIGNED'}\n\n`;
      });
    }
    if (high.length > 0) {
      report += `## ⚠️  High Risks\n\n`;
      high.forEach((r) => {
        report += `- **${r.title}** (${r.category})\n`;
        report += `  - Score: ${r.score}\n`;
        report += `  - Owner: ${r.owner}\n`;
        report += `  - Deadline: ${r.deadline?.toISOString().split('T')[0] || 'NOT SET'}\n\n`;
      });
    }
    return report;
  }
  private logHistory(riskId: string, event: string): void {
    this.history.push({ riskId, event, timestamp: new Date() });
  }
  getHistory(riskId: string): Array<{ event: string; timestamp: Date }> {
    return this.history.filter((h) => h.riskId === riskId).map((h) => ({ event: h.event, timestamp: h.timestamp }));
  }
 }
 ```
 **Usage Example**:
 ```typescript
 const tracker = new RiskMitigationTracker();
 // Register critical security risk
 tracker.addRisk({
  id: 'risk-001',
  category: 'SEC',
  title: 'SQL injection vulnerability in user search',
  description: 'Unsanitized input allows arbitrary SQL execution',
  probability: 3,
  impact: 3,
  score: 9,
  owner: 'security-team',
  status: 'OPEN',
 });
 // Add mitigation actions
 tracker.addMitigationAction({
  riskId: 'risk-001',
  action: 'Add parameterized queries to user-search endpoint',
  owner: 'alice@example.com',
  deadline: new Date('2025-10-20'),
  status: 'IN_PROGRESS',
 });
 tracker.addMitigationAction({
  riskId: 'risk-001',
  action: 'Add WAF rule to block SQL injection patterns',
  owner: 'bob@example.com',
  deadline: new Date('2025-10-22'),
  status: 'PENDING',
 });
 // Complete first action
 tracker.completeMitigation('risk-001', 0);
 // Generate report
 console.log(tracker.generateReport());
 // Markdown report with critical risks, owners, deadlines
 // View history
 console.log(tracker.getHistory('risk-001'));
 // [
 //   { event: 'Risk registered: SQL injection...', timestamp: ... },
 //   { event: 'Mitigation action added: Add parameterized queries...', timestamp: ... },
 //   { event: 'Mitigation completed: Add parameterized queries...', timestamp: ... }
 // ]
 ```
 **Key Points**:
 - **Ownership enforcement**: Every risk >4 requires owner assignment
 - **Deadline tracking**: Mitigation actions have explicit deadlines
 - **Audit trail**: Complete history of risk lifecycle (registered → mitigated)
 - **Automated reports**: Markdown output for Confluence/GitHub wikis
 ---
 ### Example 4: Coverage Traceability Matrix (Test-to-Requirement Mapping)
 **Context**: Validate that every acceptance criterion maps to at least one test
 **Implementation**:
 ```typescript
 // coverage-traceability.ts
 export type AcceptanceCriterion = {
  id: string;
  story: string;
  criterion: string;
  priority: 'P0' | 'P1' | 'P2' | 'P3';
 };
 export type TestCase = {
  file: string;
  name: string;
  criteriaIds: string[]; // Links to acceptance criteria
 };
 export type CoverageMatrix = {
  criterion: AcceptanceCriterion;
  tests: TestCase[];
  covered: boolean;
  waiverReason?: string;
 };
 export function buildCoverageMatrix(criteria: AcceptanceCriterion[], tests: TestCase[]): CoverageMatrix[] {
  return criteria.map((criterion) => {
    const matchingTests = tests.filter((t) => t.criteriaIds.includes(criterion.id));
    return {
      criterion,
      tests: matchingTests,
      covered: matchingTests.length > 0,
    };
  });
 }
 export function validateCoverage(matrix: CoverageMatrix[]): {
  gaps: CoverageMatrix[];
  passRate: number;
 } {
  const gaps = matrix.filter((m) => !m.covered && !m.waiverReason);
  const passRate = ((matrix.length - gaps.length) / matrix.length) * 100;
  return { gaps, passRate };
 }
 // Example: Extract criteria IDs from test names
 export function extractCriteriaFromTests(testFiles: string[]): TestCase[] {
  // Simplified: In real implementation, parse test files with AST
  // Here we simulate extraction from test names
  return [
    {
      file: 'tests/e2e/auth/login.spec.ts',
      name: 'should allow user to login with valid credentials',
      criteriaIds: ['AC-001', 'AC-002'], // Linked to acceptance criteria
    },
    {
      file: 'tests/e2e/auth/password-reset.spec.ts',
      name: 'should send password reset email',
      criteriaIds: ['AC-003'],
    },
  ];
 }
 // Generate Markdown traceability report
 export function generateTraceabilityReport(matrix: CoverageMatrix[]): string {
  let report = `# Requirements-to-Tests Traceability Matrix\n\n`;
  report += `**Generated**: ${new Date().toISOString()}\n\n`;
  const { gaps, passRate } = validateCoverage(matrix);
  report += `## Summary\n`;
  report += `- Total Criteria: ${matrix.length}\n`;
  report += `- Covered: ${matrix.filter((m) => m.covered).length}\n`;
  report += `- Gaps: ${gaps.length}\n`;
  report += `- Waived: ${matrix.filter((m) => m.waiverReason).length}\n`;
  report += `- Coverage Rate: ${passRate.toFixed(1)}%\n\n`;
  if (gaps.length > 0) {
    report += `## ❌ Coverage Gaps (MUST RESOLVE)\n\n`;
    report += `| Story | Criterion | Priority | Tests |\n`;
    report += `|-------|-----------|----------|-------|\n`;
    gaps.forEach((m) => {
      report += `| ${m.criterion.story} | ${m.criterion.criterion} | ${m.criterion.priority} | None |\n`;
    });
    report += `\n`;
  }
  report += `## ✅ Covered Criteria\n\n`;
  report += `| Story | Criterion | Tests |\n`;
  report += `|-------|-----------|-------|\n`;
  matrix
    .filter((m) => m.covered)
    .forEach((m) => {
      const testList = m.tests.map((t) => `\`${t.file}\``).join(', ');
      report += `| ${m.criterion.story} | ${m.criterion.criterion} | ${testList} |\n`;
    });
  return report;
 }
 ```
 **Usage Example**:
 ```typescript
 // Define acceptance criteria
 const criteria: AcceptanceCriterion[] = [
  { id: 'AC-001', story: 'US-123', criterion: 'User can login with email', priority: 'P0' },
  { id: 'AC-002', story: 'US-123', criterion: 'User sees error on invalid password', priority: 'P0' },
  { id: 'AC-003', story: 'US-124', criterion: 'User receives password reset email', priority: 'P1' },
  { id: 'AC-004', story: 'US-125', criterion: 'User can update profile', priority: 'P2' }, // NO TEST
 ];
 // Extract tests
 const tests: TestCase[] = extractCriteriaFromTests(['tests/e2e/auth/login.spec.ts', 'tests/e2e/auth/password-reset.spec.ts']);
 // Build matrix
 const matrix = buildCoverageMatrix(criteria, tests);
 // Validate
 const { gaps, passRate } = validateCoverage(matrix);
 console.log(`Coverage: ${passRate.toFixed(1)}%`); // "Coverage: 75.0%"
 console.log(`Gaps: ${gaps.length}`); // "Gaps: 1" (AC-004 has no test)
 // Generate report
 const report = generateTraceabilityReport(matrix);
 console.log(report);
 // Markdown table showing coverage gaps
 ```
 **Key Points**:
 - **Bidirectional traceability**: Criteria → Tests and Tests → Criteria
 - **Gap detection**: Automatically identifies missing coverage
 - **Priority awareness**: P0 gaps are critical blockers
 - **Waiver support**: Allow explicit waivers for low-priority gaps
 ---
 ## Risk Governance Checklist
 Before deploying to production, ensure:
 - [ ] **Risk scoring complete**: All identified risks scored (Probability × Impact)
 - [ ] **Ownership assigned**: Every risk >4 has owner, mitigation plan, deadline
 - [ ] **Coverage validated**: Every acceptance criterion maps to at least one test
 - [ ] **Gate decision documented**: PASS/CONCERNS/FAIL/WAIVED with rationale
 - [ ] **Waivers approved**: All waivers have approver, reason, expiry date
 - [ ] **Audit trail captured**: Risk history log available for compliance review
 - [ ] **Traceability matrix**: Requirements-to-tests mapping up to date
 - [ ] **Critical risks resolved**: No score=9 risks in OPEN status
 ## Integration Points
 - **Used in workflows**: `*trace` (Phase 2: gate decision), `*nfr-assess` (risk scoring), `*test-design` (risk identification)
 - **Related fragments**: `probability-impact.md` (scoring definitions), `test-priorities-matrix.md` (P0-P3 classification), `nfr-criteria.md` (non-functional risks)
 - **Tools**: Risk tracking dashboards (Jira, Linear), gate automation (CI/CD), traceability reports (Markdown, Confluence)
 _Source: Murat risk governance notes, gate schema guidance, SEON production gate workflows, ISO 31000 risk management standards_
--- a/_bmad/bmm/testarch/knowledge/selective-testing.md
+++ b/_bmad/bmm/testarch/knowledge/selective-testing.md
@@ -0,0 +1,732 @@
 # Selective and Targeted Test Execution
 ## Principle
 Run only the tests you need, when you need them. Use tags/grep to slice suites by risk priority (not directory structure), filter by spec patterns or git diff to focus on impacted areas, and combine priority metadata (P0-P3) with change detection to optimize pre-commit vs. CI execution. Document the selection strategy clearly so teams understand when full regression is mandatory.
 ## Rationale
 Running the entire test suite on every commit wastes time and resources. Smart test selection provides fast feedback (smoke tests in minutes, full regression in hours) while maintaining confidence. The "32+ ways of selective testing" philosophy balances speed with coverage: quick loops for developers, comprehensive validation before deployment. Poorly documented selection leads to confusion about when tests run and why.
 ## Pattern Examples
 ### Example 1: Tag-Based Execution with Priority Levels
 **Context**: Organize tests by risk priority and execution stage using grep/tag patterns.
 **Implementation**:
 ```typescript
 // tests/e2e/checkout.spec.ts
 import { test, expect } from '@playwright/test';
 /**
 * Tag-based test organization
 * - @smoke: Critical path tests (run on every commit, < 5 min)
 * - @regression: Full test suite (run pre-merge, < 30 min)
 * - @p0: Critical business functions (payment, auth, data integrity)
 * - @p1: Core features (primary user journeys)
 * - @p2: Secondary features (supporting functionality)
 * - @p3: Nice-to-have (cosmetic, non-critical)
 */
 test.describe('Checkout Flow', () => {
  // P0 + Smoke: Must run on every commit
  test('@smoke @p0 should complete purchase with valid payment', async ({ page }) => {
    await page.goto('/checkout');
    await page.getByTestId('card-number').fill('4242424242424242');
    await page.getByTestId('submit-payment').click();
    await expect(page.getByTestId('order-confirmation')).toBeVisible();
  });
  // P0 but not smoke: Run pre-merge
  test('@regression @p0 should handle payment decline gracefully', async ({ page }) => {
    await page.goto('/checkout');
    await page.getByTestId('card-number').fill('4000000000000002'); // Decline card
    await page.getByTestId('submit-payment').click();
    await expect(page.getByTestId('payment-error')).toBeVisible();
    await expect(page.getByTestId('payment-error')).toContainText('declined');
  });
  // P1 + Smoke: Important but not critical
  test('@smoke @p1 should apply discount code', async ({ page }) => {
    await page.goto('/checkout');
    await page.getByTestId('promo-code').fill('SAVE10');
    await page.getByTestId('apply-promo').click();
    await expect(page.getByTestId('discount-applied')).toBeVisible();
  });
  // P2: Run in full regression only
  test('@regression @p2 should remember saved payment methods', async ({ page }) => {
    await page.goto('/checkout');
    await expect(page.getByTestId('saved-cards')).toBeVisible();
  });
  // P3: Low priority, run nightly or weekly
  test('@nightly @p3 should display checkout page analytics', async ({ page }) => {
    await page.goto('/checkout');
    const analyticsEvents = await page.evaluate(() => (window as any).__ANALYTICS__);
    expect(analyticsEvents).toBeDefined();
  });
 });
 ```
 **package.json scripts**:
 ```json
 {
  "scripts": {
    "test": "playwright test",
    "test:smoke": "playwright test --grep '@smoke'",
    "test:p0": "playwright test --grep '@p0'",
    "test:p0-p1": "playwright test --grep '@p0|@p1'",
    "test:regression": "playwright test --grep '@regression'",
    "test:nightly": "playwright test --grep '@nightly'",
    "test:not-slow": "playwright test --grep-invert '@slow'",
    "test:critical-smoke": "playwright test --grep '@smoke.*@p0'"
  }
 }
 ```
 **Cypress equivalent**:
 ```javascript
 // cypress/e2e/checkout.cy.ts
 describe('Checkout Flow', { tags: ['@checkout'] }, () => {
  it('should complete purchase', { tags: ['@smoke', '@p0'] }, () => {
    cy.visit('/checkout');
    cy.get('[data-cy="card-number"]').type('4242424242424242');
    cy.get('[data-cy="submit-payment"]').click();
    cy.get('[data-cy="order-confirmation"]').should('be.visible');
  });
  it('should handle decline', { tags: ['@regression', '@p0'] }, () => {
    cy.visit('/checkout');
    cy.get('[data-cy="card-number"]').type('4000000000000002');
    cy.get('[data-cy="submit-payment"]').click();
    cy.get('[data-cy="payment-error"]').should('be.visible');
  });
 });
 // cypress.config.ts
 export default defineConfig({
  e2e: {
    env: {
      grepTags: process.env.GREP_TAGS || '',
      grepFilterSpecs: true,
    },
    setupNodeEvents(on, config) {
      require('@cypress/grep/src/plugin')(config);
      return config;
    },
  },
 });
 ```
 **Usage**:
 ```bash
 # Playwright
 npm run test:smoke                    # Run all @smoke tests
 npm run test:p0                       # Run all P0 tests
 npm run test -- --grep "@smoke.*@p0"  # Run tests with BOTH tags
 # Cypress (with @cypress/grep plugin)
 npx cypress run --env grepTags="@smoke"
 npx cypress run --env grepTags="@p0+@smoke"  # AND logic
 npx cypress run --env grepTags="@p0 @p1"     # OR logic
 ```
 **Key Points**:
 - **Multiple tags per test**: Combine priority (@p0) with stage (@smoke)
 - **AND/OR logic**: Grep supports complex filtering
 - **Clear naming**: Tags document test importance
 - **Fast feedback**: @smoke runs < 5 min, full suite < 30 min
 - **CI integration**: Different jobs run different tag combinations
 ---
 ### Example 2: Spec Filter Pattern (File-Based Selection)
 **Context**: Run tests by file path pattern or directory for targeted execution.
 **Implementation**:
 ```bash
 #!/bin/bash
 # scripts/selective-spec-runner.sh
 # Run tests based on spec file patterns
 set -e
 PATTERN=${1:-"**/*.spec.ts"}
 TEST_ENV=${TEST_ENV:-local}
 echo "🎯 Selective Spec Runner"
 echo "━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━"
 echo "Pattern: $PATTERN"
 echo "Environment: $TEST_ENV"
 echo ""
 # Pattern examples and their use cases
 case "$PATTERN" in
  "**/checkout*")
    echo "📦 Running checkout-related tests"
    npx playwright test --grep-files="**/checkout*"
    ;;
  "**/auth*"|"**/login*"|"**/signup*")
    echo "🔐 Running authentication tests"
    npx playwright test --grep-files="**/auth*|**/login*|**/signup*"
    ;;
  "tests/e2e/**")
    echo "🌐 Running all E2E tests"
    npx playwright test tests/e2e/
    ;;
  "tests/integration/**")
    echo "🔌 Running all integration tests"
    npx playwright test tests/integration/
    ;;
  "tests/component/**")
    echo "🧩 Running all component tests"
    npx playwright test tests/component/
    ;;
  *)
    echo "🔍 Running tests matching pattern: $PATTERN"
    npx playwright test "$PATTERN"
    ;;
 esac
 ```
 **Playwright config for file filtering**:
 ```typescript
 // playwright.config.ts
 import { defineConfig, devices } from '@playwright/test';
 export default defineConfig({
  // ... other config
  // Project-based organization
  projects: [
    {
      name: 'smoke',
      testMatch: /.*smoke.*\.spec\.ts/,
      retries: 0,
    },
    {
      name: 'e2e',
      testMatch: /tests\/e2e\/.*\.spec\.ts/,
      retries: 2,
    },
    {
      name: 'integration',
      testMatch: /tests\/integration\/.*\.spec\.ts/,
      retries: 1,
    },
    {
      name: 'component',
      testMatch: /tests\/component\/.*\.spec\.ts/,
      use: { ...devices['Desktop Chrome'] },
    },
  ],
 });
 ```
 **Advanced pattern matching**:
 ```typescript
 // scripts/run-by-component.ts
 /**
 * Run tests related to specific component(s)
 * Usage: npm run test:component UserProfile,Settings
 */
 import { execSync } from 'child_process';
 const components = process.argv[2]?.split(',') || [];
 if (components.length === 0) {
  console.error('❌ No components specified');
  console.log('Usage: npm run test:component UserProfile,Settings');
  process.exit(1);
 }
 // Convert component names to glob patterns
 const patterns = components.map((comp) => `**/*${comp}*.spec.ts`).join(' ');
 console.log(`🧩 Running tests for components: ${components.join(', ')}`);
 console.log(`Patterns: ${patterns}`);
 try {
  execSync(`npx playwright test ${patterns}`, {
    stdio: 'inherit',
    env: { ...process.env, CI: 'false' },
  });
 } catch (error) {
  process.exit(1);
 }
 ```
 **package.json scripts**:
 ```json
 {
  "scripts": {
    "test:checkout": "playwright test **/checkout*.spec.ts",
    "test:auth": "playwright test **/auth*.spec.ts **/login*.spec.ts",
    "test:e2e": "playwright test tests/e2e/",
    "test:integration": "playwright test tests/integration/",
    "test:component": "ts-node scripts/run-by-component.ts",
    "test:project": "playwright test --project",
    "test:smoke-project": "playwright test --project smoke"
  }
 }
 ```
 **Key Points**:
 - **Glob patterns**: Wildcards match file paths flexibly
 - **Project isolation**: Separate projects have different configs
 - **Component targeting**: Run tests for specific features
 - **Directory-based**: Organize tests by type (e2e, integration, component)
 - **CI optimization**: Run subsets in parallel CI jobs
 ---
 ### Example 3: Diff-Based Test Selection (Changed Files Only)
 **Context**: Run only tests affected by code changes for maximum speed.
 **Implementation**:
 ```bash
 #!/bin/bash
 # scripts/test-changed-files.sh
 # Intelligent test selection based on git diff
 set -e
 BASE_BRANCH=${BASE_BRANCH:-main}
 TEST_ENV=${TEST_ENV:-local}
 echo "🔍 Changed File Test Selector"
 echo "━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━"
 echo "Base branch: $BASE_BRANCH"
 echo "Environment: $TEST_ENV"
 echo ""
 # Get changed files
 CHANGED_FILES=$(git diff --name-only $BASE_BRANCH...HEAD)
 if [ -z "$CHANGED_FILES" ]; then
  echo "✅ No files changed. Skipping tests."
  exit 0
 fi
 echo "Changed files:"
 echo "$CHANGED_FILES" | sed 's/^/  - /'
 echo ""
 # Arrays to collect test specs
 DIRECT_TEST_FILES=()
 RELATED_TEST_FILES=()
 RUN_ALL_TESTS=false
 # Process each changed file
 while IFS= read -r file; do
  case "$file" in
    # Changed test files: run them directly
    *.spec.ts|*.spec.js|*.test.ts|*.test.js|*.cy.ts|*.cy.js)
      DIRECT_TEST_FILES+=("$file")
      ;;
    # Critical config changes: run ALL tests
    package.json|package-lock.json|playwright.config.ts|cypress.config.ts|tsconfig.json|.github/workflows/*)
      echo "⚠️  Critical file changed: $file"
      RUN_ALL_TESTS=true
      break
      ;;
    # Component changes: find related tests
    src/components/*.tsx|src/components/*.jsx)
      COMPONENT_NAME=$(basename "$file" | sed 's/\.[^.]*$//')
      echo "🧩 Component changed: $COMPONENT_NAME"
      # Find tests matching component name
      FOUND_TESTS=$(find tests -name "*${COMPONENT_NAME}*.spec.ts" -o -name "*${COMPONENT_NAME}*.cy.ts" 2>/dev/null || true)
      if [ -n "$FOUND_TESTS" ]; then
        while IFS= read -r test_file; do
          RELATED_TEST_FILES+=("$test_file")
        done <<< "$FOUND_TESTS"
      fi
      ;;
    # Utility/lib changes: run integration + unit tests
    src/utils/*|src/lib/*|src/helpers/*)
      echo "⚙️  Utility file changed: $file"
      RELATED_TEST_FILES+=($(find tests/unit tests/integration -name "*.spec.ts" 2>/dev/null || true))
      ;;
    # API changes: run integration + e2e tests
    src/api/*|src/services/*|src/controllers/*)
      echo "🔌 API file changed: $file"
      RELATED_TEST_FILES+=($(find tests/integration tests/e2e -name "*.spec.ts" 2>/dev/null || true))
      ;;
    # Type changes: run all TypeScript tests
    *.d.ts|src/types/*)
      echo "📝 Type definition changed: $file"
      RUN_ALL_TESTS=true
      break
      ;;
    # Documentation only: skip tests
    *.md|docs/*|README*)
      echo "📄 Documentation changed: $file (no tests needed)"
      ;;
    *)
      echo "❓ Unclassified change: $file (running smoke tests)"
      RELATED_TEST_FILES+=($(find tests -name "*smoke*.spec.ts" 2>/dev/null || true))
      ;;
  esac
 done <<< "$CHANGED_FILES"
 # Execute tests based on analysis
 if [ "$RUN_ALL_TESTS" = true ]; then
  echo ""
  echo "━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━"
  echo "🚨 Running FULL test suite (critical changes detected)"
  echo "━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━"
  npm run test
  exit $?
 fi
 # Combine and deduplicate test files
 ALL_TEST_FILES=(${DIRECT_TEST_FILES[@]} ${RELATED_TEST_FILES[@]})
 UNIQUE_TEST_FILES=($(echo "${ALL_TEST_FILES[@]}" | tr ' ' '\n' | sort -u))
 if [ ${#UNIQUE_TEST_FILES[@]} -eq 0 ]; then
  echo ""
  echo "✅ No tests found for changed files. Running smoke tests."
  npm run test:smoke
  exit $?
 fi
 echo ""
 echo "━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━"
 echo "🎯 Running ${#UNIQUE_TEST_FILES[@]} test file(s)"
 echo "━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━"
 for test_file in "${UNIQUE_TEST_FILES[@]}"; do
  echo "  - $test_file"
 done
 echo ""
 npm run test -- "${UNIQUE_TEST_FILES[@]}"
 ```
 **GitHub Actions integration**:
 ```yaml
 # .github/workflows/test-changed.yml
 name: Test Changed Files
 on:
  pull_request:
    types: [opened, synchronize, reopened]
 jobs:
  detect-and-test:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
        with:
          fetch-depth: 0 # Full history for accurate diff
      - name: Get changed files
        id: changed-files
        uses: tj-actions/changed-files@v40
        with:
          files: |
            src/**
            tests/**
            *.config.ts
          files_ignore: |
            **/*.md
            docs/**
      - name: Run tests for changed files
        if: steps.changed-files.outputs.any_changed == 'true'
        run: |
          echo "Changed files: ${{ steps.changed-files.outputs.all_changed_files }}"
          bash scripts/test-changed-files.sh
        env:
          BASE_BRANCH: ${{ github.base_ref }}
          TEST_ENV: staging
 ```
 **Key Points**:
 - **Intelligent mapping**: Code changes → related tests
 - **Critical file detection**: Config changes = full suite
 - **Component mapping**: UI changes → component + E2E tests
 - **Fast feedback**: Run only what's needed (< 2 min typical)
 - **Safety net**: Unrecognized changes run smoke tests
 ---
 ### Example 4: Promotion Rules (Pre-Commit → CI → Staging → Production)
 **Context**: Progressive test execution strategy across deployment stages.
 **Implementation**:
 ```typescript
 // scripts/test-promotion-strategy.ts
 /**
 * Test Promotion Strategy
 * Defines which tests run at each stage of the development lifecycle
 */
 export type TestStage = 'pre-commit' | 'ci-pr' | 'ci-merge' | 'staging' | 'production';
 export type TestPromotion = {
  stage: TestStage;
  description: string;
  testCommand: string;
  timebudget: string; // minutes
  required: boolean;
  failureAction: 'block' | 'warn' | 'alert';
 };
 export const TEST_PROMOTION_RULES: Record<TestStage, TestPromotion> = {
  'pre-commit': {
    stage: 'pre-commit',
    description: 'Local developer checks before git commit',
    testCommand: 'npm run test:smoke',
    timebudget: '2',
    required: true,
    failureAction: 'block',
  },
  'ci-pr': {
    stage: 'ci-pr',
    description: 'CI checks on pull request creation/update',
    testCommand: 'npm run test:changed && npm run test:p0-p1',
    timebudget: '10',
    required: true,
    failureAction: 'block',
  },
  'ci-merge': {
    stage: 'ci-merge',
    description: 'Full regression before merge to main',
    testCommand: 'npm run test:regression',
    timebudget: '30',
    required: true,
    failureAction: 'block',
  },
  staging: {
    stage: 'staging',
    description: 'Post-deployment validation in staging environment',
    testCommand: 'npm run test:e2e -- --grep "@smoke"',
    timebudget: '15',
    required: true,
    failureAction: 'block',
  },
  production: {
    stage: 'production',
    description: 'Production smoke tests post-deployment',
    testCommand: 'npm run test:e2e:prod -- --grep "@smoke.*@p0"',
    timebudget: '5',
    required: false,
    failureAction: 'alert',
  },
 };
 /**
 * Get tests to run for a specific stage
 */
 export function getTestsForStage(stage: TestStage): TestPromotion {
  return TEST_PROMOTION_RULES[stage];
 }
 /**
 * Validate if tests can be promoted to next stage
 */
 export function canPromote(currentStage: TestStage, testsPassed: boolean): boolean {
  const promotion = TEST_PROMOTION_RULES[currentStage];
  if (!promotion.required) {
    return true; // Non-required tests don't block promotion
  }
  return testsPassed;
 }
 ```
 **Husky pre-commit hook**:
 ```bash
 #!/bin/bash
 # .husky/pre-commit
 # Run smoke tests before allowing commit
 echo "🔍 Running pre-commit tests..."
 npm run test:smoke
 if [ $? -ne 0 ]; then
  echo ""
  echo "❌ Pre-commit tests failed!"
  echo "Please fix failures before committing."
  echo ""
  echo "To skip (NOT recommended): git commit --no-verify"
  exit 1
 fi
 echo "✅ Pre-commit tests passed"
 ```
 **GitHub Actions workflow**:
 ```yaml
 # .github/workflows/test-promotion.yml
 name: Test Promotion Strategy
 on:
  pull_request:
  push:
    branches: [main]
  workflow_dispatch:
 jobs:
  # Stage 1: PR tests (changed + P0-P1)
  pr-tests:
    if: github.event_name == 'pull_request'
    runs-on: ubuntu-latest
    timeout-minutes: 10
    steps:
      - uses: actions/checkout@v4
      - name: Run PR-level tests
        run: |
          npm run test:changed
          npm run test:p0-p1
  # Stage 2: Full regression (pre-merge)
  regression-tests:
    if: github.event_name == 'push' && github.ref == 'refs/heads/main'
    runs-on: ubuntu-latest
    timeout-minutes: 30
    steps:
      - uses: actions/checkout@v4
      - name: Run full regression
        run: npm run test:regression
  # Stage 3: Staging validation (post-deploy)
  staging-smoke:
    if: github.event_name == 'workflow_dispatch'
    runs-on: ubuntu-latest
    timeout-minutes: 15
    steps:
      - uses: actions/checkout@v4
      - name: Run staging smoke tests
        run: npm run test:e2e -- --grep "@smoke"
        env:
          TEST_ENV: staging
  # Stage 4: Production smoke (post-deploy, non-blocking)
  production-smoke:
    if: github.event_name == 'workflow_dispatch'
    runs-on: ubuntu-latest
    timeout-minutes: 5
    continue-on-error: true # Don't fail deployment if smoke tests fail
    steps:
      - uses: actions/checkout@v4
      - name: Run production smoke tests
        run: npm run test:e2e:prod -- --grep "@smoke.*@p0"
        env:
          TEST_ENV: production
      - name: Alert on failure
        if: failure()
        uses: 8398a7/action-slack@v3
        with:
          status: ${{ job.status }}
          text: '🚨 Production smoke tests failed!'
          webhook_url: ${{ secrets.SLACK_WEBHOOK }}
 ```
 **Selection strategy documentation**:
 ````markdown
 # Test Selection Strategy
 ## Test Promotion Stages
 | Stage      | Tests Run           | Time Budget | Blocks Deploy | Failure Action |
 | ---------- | ------------------- | ----------- | ------------- | -------------- |
 | Pre-Commit | Smoke (@smoke)      | 2 min       | ✅ Yes        | Block commit   |
 | CI PR      | Changed + P0-P1     | 10 min      | ✅ Yes        | Block merge    |
 | CI Merge   | Full regression     | 30 min      | ✅ Yes        | Block deploy   |
 | Staging    | E2E smoke           | 15 min      | ✅ Yes        | Rollback       |
 | Production | Critical smoke only | 5 min       | ❌ No         | Alert team     |
 ## When Full Regression Runs
 Full regression suite (`npm run test:regression`) runs in these scenarios:
 - ✅ Before merging to `main` (CI Merge stage)
 - ✅ Nightly builds (scheduled workflow)
 - ✅ Manual trigger (workflow_dispatch)
 - ✅ Release candidate testing
 Full regression does NOT run on:
 - ❌ Every PR commit (too slow)
 - ❌ Pre-commit hooks (too slow)
 - ❌ Production deployments (deploy-blocking)
 ## Override Scenarios
 Skip tests (emergency only):
 ```bash
 git commit --no-verify  # Skip pre-commit hook
 gh pr merge --admin     # Force merge (requires admin)
 ```
 ````
 ```
 **Key Points**:
 - **Progressive validation**: More tests at each stage
 - **Time budgets**: Clear expectations per stage
 - **Blocking vs. alerting**: Production tests don't block deploy
 - **Documentation**: Team knows when full regression runs
 - **Emergency overrides**: Documented but discouraged
 ---
 ## Test Selection Strategy Checklist
 Before implementing selective testing, verify:
 - [ ] **Tag strategy defined**: @smoke, @p0-p3, @regression documented
 - [ ] **Time budgets set**: Each stage has clear timeout (smoke < 5 min, full < 30 min)
 - [ ] **Changed file mapping**: Code changes → test selection logic implemented
 - [ ] **Promotion rules documented**: README explains when full regression runs
 - [ ] **CI integration**: GitHub Actions uses selective strategy
 - [ ] **Local parity**: Developers can run same selections locally
 - [ ] **Emergency overrides**: Skip mechanisms documented (--no-verify, admin merge)
 - [ ] **Metrics tracked**: Monitor test execution time and selection accuracy
 ## Integration Points
 - Used in workflows: `*ci` (CI/CD setup), `*automate` (test generation with tags)
 - Related fragments: `ci-burn-in.md`, `test-priorities-matrix.md`, `test-quality.md`
 - Selection tools: Playwright --grep, Cypress @cypress/grep, git diff
 _Source: 32+ selective testing strategies blog, Murat testing philosophy, SEON CI optimization_
 ```
--- a/_bmad/bmm/testarch/knowledge/selector-resilience.md
+++ b/_bmad/bmm/testarch/knowledge/selector-resilience.md
@@ -0,0 +1,527 @@
 # Selector Resilience
 ## Principle
 Robust selectors follow a strict hierarchy: **data-testid > ARIA roles > text content > CSS/IDs** (last resort). Selectors must be resilient to UI changes (styling, layout, content updates) and remain human-readable for maintenance.
 ## Rationale
 **The Problem**: Brittle selectors (CSS classes, nth-child, complex XPath) break when UI styling changes, elements are reordered, or design updates occur. This causes test maintenance burden and false negatives.
 **The Solution**: Prioritize semantic selectors that reflect user intent (ARIA roles, accessible names, test IDs). Use dynamic filtering for lists instead of nth() indexes. Validate selectors during code review and refactor proactively.
 **Why This Matters**:
 - Prevents false test failures (UI refactoring doesn't break tests)
 - Improves accessibility (ARIA roles benefit both tests and screen readers)
 - Enhances readability (semantic selectors document user intent)
 - Reduces maintenance burden (robust selectors survive design changes)
 ## Pattern Examples
 ### Example 1: Selector Hierarchy (Priority Order with Examples)
 **Context**: Choose the most resilient selector for each element type
 **Implementation**:
 ```typescript
 // tests/selectors/hierarchy-examples.spec.ts
 import { test, expect } from '@playwright/test';
 test.describe('Selector Hierarchy Best Practices', () => {
  test('Level 1: data-testid (BEST - most resilient)', async ({ page }) => {
    await page.goto('/login');
    // ✅ Best: Dedicated test attribute (survives all UI changes)
    await page.getByTestId('email-input').fill('user@example.com');
    await page.getByTestId('password-input').fill('password123');
    await page.getByTestId('login-button').click();
    await expect(page.getByTestId('welcome-message')).toBeVisible();
    // Why it's best:
    // - Survives CSS refactoring (class name changes)
    // - Survives layout changes (element reordering)
    // - Survives content changes (button text updates)
    // - Explicit test contract (developer knows it's for testing)
  });
  test('Level 2: ARIA roles and accessible names (GOOD - future-proof)', async ({ page }) => {
    await page.goto('/login');
    // ✅ Good: Semantic HTML roles (benefits accessibility + tests)
    await page.getByRole('textbox', { name: 'Email' }).fill('user@example.com');
    await page.getByRole('textbox', { name: 'Password' }).fill('password123');
    await page.getByRole('button', { name: 'Sign In' }).click();
    await expect(page.getByRole('heading', { name: 'Welcome' })).toBeVisible();
    // Why it's good:
    // - Survives CSS refactoring
    // - Survives layout changes
    // - Enforces accessibility (screen reader compatible)
    // - Self-documenting (role + name = clear intent)
  });
  test('Level 3: Text content (ACCEPTABLE - user-centric)', async ({ page }) => {
    await page.goto('/dashboard');
    // ✅ Acceptable: Text content (matches user perception)
    await page.getByText('Create New Order').click();
    await expect(page.getByText('Order Details')).toBeVisible();
    // Why it's acceptable:
    // - User-centric (what user sees)
    // - Survives CSS/layout changes
    // - Breaks when copy changes (forces test update with content)
    // ⚠️ Use with caution for dynamic/localized content:
    // - Avoid for content with variables: "User 123" (use regex instead)
    // - Avoid for i18n content (use data-testid or ARIA)
  });
  test('Level 4: CSS classes/IDs (LAST RESORT - brittle)', async ({ page }) => {
    await page.goto('/login');
    // ❌ Last resort: CSS class (breaks with styling updates)
    // await page.locator('.btn-primary').click()
    // ❌ Last resort: ID (breaks if ID changes)
    // await page.locator('#login-form').fill(...)
    // ✅ Better: Use data-testid or ARIA instead
    await page.getByTestId('login-button').click();
    // Why CSS/ID is last resort:
    // - Breaks with CSS refactoring (class name changes)
    // - Breaks with HTML restructuring (ID changes)
    // - Not semantic (unclear what element does)
    // - Tight coupling between tests and styling
  });
 });
 ```
 **Key Points**:
 - Hierarchy: data-testid (best) > ARIA (good) > text (acceptable) > CSS/ID (last resort)
 - data-testid survives ALL UI changes (explicit test contract)
 - ARIA roles enforce accessibility (screen reader compatible)
 - Text content is user-centric (but breaks with copy changes)
 - CSS/ID are brittle (break with styling refactoring)
 ---
 ### Example 2: Dynamic Selector Patterns (Lists, Filters, Regex)
 **Context**: Handle dynamic content, lists, and variable data with resilient selectors
 **Implementation**:
 ```typescript
 // tests/selectors/dynamic-selectors.spec.ts
 import { test, expect } from '@playwright/test';
 test.describe('Dynamic Selector Patterns', () => {
  test('regex for variable content (user IDs, timestamps)', async ({ page }) => {
    await page.goto('/users');
    // ✅ Good: Regex pattern for dynamic user IDs
    await expect(page.getByText(/User \d+/)).toBeVisible();
    // ✅ Good: Regex for timestamps
    await expect(page.getByText(/Last login: \d{4}-\d{2}-\d{2}/)).toBeVisible();
    // ✅ Good: Regex for dynamic counts
    await expect(page.getByText(/\d+ items in cart/)).toBeVisible();
  });
  test('partial text matching (case-insensitive, substring)', async ({ page }) => {
    await page.goto('/products');
    // ✅ Good: Partial match (survives minor text changes)
    await page.getByText('Product', { exact: false }).first().click();
    // ✅ Good: Case-insensitive (survives capitalization changes)
    await expect(page.getByText(/sign in/i)).toBeVisible();
  });
  test('filter locators for lists (avoid brittle nth)', async ({ page }) => {
    await page.goto('/products');
    // ❌ Bad: Index-based (breaks when order changes)
    // await page.locator('.product-card').nth(2).click()
    // ✅ Good: Filter by content (resilient to reordering)
    await page.locator('[data-testid="product-card"]').filter({ hasText: 'Premium Plan' }).click();
    // ✅ Good: Filter by attribute
    await page
      .locator('[data-testid="product-card"]')
      .filter({ has: page.locator('[data-status="active"]') })
      .first()
      .click();
  });
  test('nth() only when absolutely necessary', async ({ page }) => {
    await page.goto('/dashboard');
    // ⚠️ Acceptable: nth(0) for first item (common pattern)
    const firstNotification = page.getByTestId('notification').nth(0);
    await expect(firstNotification).toContainText('Welcome');
    // ❌ Bad: nth(5) for arbitrary index (fragile)
    // await page.getByTestId('notification').nth(5).click()
    // ✅ Better: Use filter() with specific criteria
    await page.getByTestId('notification').filter({ hasText: 'Critical Alert' }).click();
  });
  test('combine multiple locators for specificity', async ({ page }) => {
    await page.goto('/checkout');
    // ✅ Good: Narrow scope with combined locators
    const shippingSection = page.getByTestId('shipping-section');
    await shippingSection.getByLabel('Address Line 1').fill('123 Main St');
    await shippingSection.getByLabel('City').fill('New York');
    // Scoping prevents ambiguity (multiple "City" fields on page)
  });
 });
 ```
 **Key Points**:
 - Regex patterns handle variable content (IDs, timestamps, counts)
 - Partial matching survives minor text changes (`exact: false`)
 - `filter()` is more resilient than `nth()` (content-based vs index-based)
 - `nth(0)` acceptable for "first item", avoid arbitrary indexes
 - Combine locators to narrow scope (prevent ambiguity)
 ---
 ### Example 3: Selector Anti-Patterns (What NOT to Do)
 **Context**: Common selector mistakes that cause brittle tests
 **Problem Examples**:
 ```typescript
 // tests/selectors/anti-patterns.spec.ts
 import { test, expect } from '@playwright/test';
 test.describe('Selector Anti-Patterns to Avoid', () => {
  test('❌ Anti-Pattern 1: CSS classes (brittle)', async ({ page }) => {
    await page.goto('/login');
    // ❌ Bad: CSS class (breaks with design system updates)
    // await page.locator('.btn-primary').click()
    // await page.locator('.form-input-lg').fill('test@example.com')
    // ✅ Good: Use data-testid or ARIA role
    await page.getByTestId('login-button').click();
    await page.getByRole('textbox', { name: 'Email' }).fill('test@example.com');
  });
  test('❌ Anti-Pattern 2: Index-based nth() (fragile)', async ({ page }) => {
    await page.goto('/products');
    // ❌ Bad: Index-based (breaks when product order changes)
    // await page.locator('.product-card').nth(3).click()
    // ✅ Good: Content-based filter
    await page.locator('[data-testid="product-card"]').filter({ hasText: 'Laptop' }).click();
  });
  test('❌ Anti-Pattern 3: Complex XPath (hard to maintain)', async ({ page }) => {
    await page.goto('/dashboard');
    // ❌ Bad: Complex XPath (unreadable, breaks with structure changes)
    // await page.locator('xpath=//div[@class="container"]//section[2]//button[contains(@class, "primary")]').click()
    // ✅ Good: Semantic selector
    await page.getByRole('button', { name: 'Create Order' }).click();
  });
  test('❌ Anti-Pattern 4: ID selectors (coupled to implementation)', async ({ page }) => {
    await page.goto('/settings');
    // ❌ Bad: HTML ID (breaks if ID changes for accessibility/SEO)
    // await page.locator('#user-settings-form').fill(...)
    // ✅ Good: data-testid or ARIA landmark
    await page.getByTestId('user-settings-form').getByLabel('Display Name').fill('John Doe');
  });
  test('✅ Refactoring: Bad → Good Selector', async ({ page }) => {
    await page.goto('/checkout');
    // Before (brittle):
    // await page.locator('.checkout-form > .payment-section > .btn-submit').click()
    // After (resilient):
    await page.getByTestId('checkout-form').getByRole('button', { name: 'Complete Payment' }).click();
    await expect(page.getByText('Payment successful')).toBeVisible();
  });
 });
 ```
 **Why These Fail**:
 - **CSS classes**: Change frequently with design updates (Tailwind, CSS modules)
 - **nth() indexes**: Fragile to element reordering (new features, A/B tests)
 - **Complex XPath**: Unreadable, breaks with HTML structure changes
 - **HTML IDs**: Not stable (accessibility improvements change IDs)
 **Better Approach**: Use selector hierarchy (testid > ARIA > text)
 ---
 ### Example 4: Selector Debugging Techniques (Inspector, DevTools, MCP)
 **Context**: Debug selector failures interactively to find better alternatives
 **Implementation**:
 ```typescript
 // tests/selectors/debugging-techniques.spec.ts
 import { test, expect } from '@playwright/test';
 test.describe('Selector Debugging Techniques', () => {
  test('use Playwright Inspector to test selectors', async ({ page }) => {
    await page.goto('/dashboard');
    // Pause test to open Inspector
    await page.pause();
    // In Inspector console, test selectors:
    // page.getByTestId('user-menu')              ✅ Works
    // page.getByRole('button', { name: 'Profile' }) ✅ Works
    // page.locator('.btn-primary')               ❌ Brittle
    // Use "Pick Locator" feature to generate selectors
    // Use "Record" mode to capture user interactions
    await page.getByTestId('user-menu').click();
    await expect(page.getByRole('menu')).toBeVisible();
  });
  test('use locator.all() to debug lists', async ({ page }) => {
    await page.goto('/products');
    // Debug: How many products are visible?
    const products = await page.getByTestId('product-card').all();
    console.log(`Found ${products.length} products`);
    // Debug: What text is in each product?
    for (const product of products) {
      const text = await product.textContent();
      console.log(`Product text: ${text}`);
    }
    // Use findings to build better selector
    await page.getByTestId('product-card').filter({ hasText: 'Laptop' }).click();
  });
  test('use DevTools console to test selectors', async ({ page }) => {
    await page.goto('/checkout');
    // Open DevTools (manually or via page.pause())
    // Test selectors in console:
    // document.querySelectorAll('[data-testid="payment-method"]')
    // document.querySelector('#credit-card-input')
    // Find robust selector through trial and error
    await page.getByTestId('payment-method').selectOption('credit-card');
  });
  test('MCP browser_generate_locator (if available)', async ({ page }) => {
    await page.goto('/products');
    // If Playwright MCP available, use browser_generate_locator:
    // 1. Click element in browser
    // 2. MCP generates optimal selector
    // 3. Copy into test
    // Example output from MCP:
    // page.getByRole('link', { name: 'Product A' })
    // Use generated selector
    await page.getByRole('link', { name: 'Product A' }).click();
    await expect(page).toHaveURL(/\/products\/\d+/);
  });
 });
 ```
 **Key Points**:
 - Playwright Inspector: Interactive selector testing with "Pick Locator" feature
 - `locator.all()`: Debug lists to understand structure and content
 - DevTools console: Test CSS selectors before adding to tests
 - MCP browser_generate_locator: Auto-generate optimal selectors (if MCP available)
 - Always validate selectors work before committing
 ---
 ### Example 2: Selector Refactoring Guide (Before/After Patterns)
 **Context**: Systematically improve brittle selectors to resilient alternatives
 **Implementation**:
 ```typescript
 // tests/selectors/refactoring-guide.spec.ts
 import { test, expect } from '@playwright/test';
 test.describe('Selector Refactoring Patterns', () => {
  test('refactor: CSS class → data-testid', async ({ page }) => {
    await page.goto('/products');
    // ❌ Before: CSS class (breaks with Tailwind updates)
    // await page.locator('.bg-blue-500.px-4.py-2.rounded').click()
    // ✅ After: data-testid
    await page.getByTestId('add-to-cart-button').click();
    // Implementation: Add data-testid to button component
    // <button className="bg-blue-500 px-4 py-2 rounded" data-testid="add-to-cart-button">
  });
  test('refactor: nth() index → filter()', async ({ page }) => {
    await page.goto('/users');
    // ❌ Before: Index-based (breaks when users reorder)
    // await page.locator('.user-row').nth(2).click()
    // ✅ After: Content-based filter
    await page.locator('[data-testid="user-row"]').filter({ hasText: 'john@example.com' }).click();
  });
  test('refactor: Complex XPath → ARIA role', async ({ page }) => {
    await page.goto('/checkout');
    // ❌ Before: Complex XPath (unreadable, brittle)
    // await page.locator('xpath=//div[@id="payment"]//form//button[contains(@class, "submit")]').click()
    // ✅ After: ARIA role
    await page.getByRole('button', { name: 'Complete Payment' }).click();
  });
  test('refactor: ID selector → data-testid', async ({ page }) => {
    await page.goto('/settings');
    // ❌ Before: HTML ID (changes with accessibility improvements)
    // await page.locator('#user-profile-section').getByLabel('Name').fill('John')
    // ✅ After: data-testid + semantic label
    await page.getByTestId('user-profile-section').getByLabel('Display Name').fill('John Doe');
  });
  test('refactor: Deeply nested CSS → scoped data-testid', async ({ page }) => {
    await page.goto('/dashboard');
    // ❌ Before: Deep nesting (breaks with structure changes)
    // await page.locator('.container .sidebar .menu .item:nth-child(3) a').click()
    // ✅ After: Scoped data-testid
    const sidebar = page.getByTestId('sidebar');
    await sidebar.getByRole('link', { name: 'Settings' }).click();
  });
 });
 ```
 **Key Points**:
 - CSS class → data-testid (survives design system updates)
 - nth() → filter() (content-based vs index-based)
 - Complex XPath → ARIA role (readable, semantic)
 - ID → data-testid (decouples from HTML structure)
 - Deep nesting → scoped locators (modular, maintainable)
 ---
 ### Example 3: Selector Best Practices Checklist
 ```typescript
 // tests/selectors/validation-checklist.spec.ts
 import { test, expect } from '@playwright/test';
 /**
 * Selector Validation Checklist
 *
 * Before committing test, verify selectors meet these criteria:
 */
 test.describe('Selector Best Practices Validation', () => {
  test('✅ 1. Prefer data-testid for interactive elements', async ({ page }) => {
    await page.goto('/login');
    // Interactive elements (buttons, inputs, links) should use data-testid
    await page.getByTestId('email-input').fill('test@example.com');
    await page.getByTestId('login-button').click();
  });
  test('✅ 2. Use ARIA roles for semantic elements', async ({ page }) => {
    await page.goto('/dashboard');
    // Semantic elements (headings, navigation, forms) use ARIA
    await expect(page.getByRole('heading', { name: 'Dashboard' })).toBeVisible();
    await page.getByRole('navigation').getByRole('link', { name: 'Settings' }).click();
  });
  test('✅ 3. Avoid CSS classes (except when testing styles)', async ({ page }) => {
    await page.goto('/products');
    // ❌ Never for interaction: page.locator('.btn-primary')
    // ✅ Only for visual regression: await expect(page.locator('.error-banner')).toHaveCSS('color', 'rgb(255, 0, 0)')
  });
  test('✅ 4. Use filter() instead of nth() for lists', async ({ page }) => {
    await page.goto('/orders');
    // List selection should be content-based
    await page.getByTestId('order-row').filter({ hasText: 'Order #12345' }).click();
  });
  test('✅ 5. Selectors are human-readable', async ({ page }) => {
    await page.goto('/checkout');
    // ✅ Good: Clear intent
    await page.getByTestId('shipping-address-form').getByLabel('Street Address').fill('123 Main St');
    // ❌ Bad: Cryptic
    // await page.locator('div > div:nth-child(2) > input[type="text"]').fill('123 Main St')
  });
 });
 ```
 **Validation Rules**:
 1. **Interactive elements** (buttons, inputs) → data-testid
 2. **Semantic elements** (headings, nav, forms) → ARIA roles
 3. **CSS classes** → Avoid (except visual regression tests)
 4. **Lists** → filter() over nth() (content-based selection)
 5. **Readability** → Selectors document user intent (clear, semantic)
 ---
 ## Selector Resilience Checklist
 Before deploying selectors:
 - [ ] **Hierarchy followed**: data-testid (1st choice) > ARIA (2nd) > text (3rd) > CSS/ID (last resort)
 - [ ] **Interactive elements use data-testid**: Buttons, inputs, links have dedicated test attributes
 - [ ] **Semantic elements use ARIA**: Headings, navigation, forms use roles and accessible names
 - [ ] **No brittle patterns**: No CSS classes (except visual tests), no arbitrary nth(), no complex XPath
 - [ ] **Dynamic content handled**: Regex for IDs/timestamps, filter() for lists, partial matching for text
 - [ ] **Selectors are scoped**: Use container locators to narrow scope (prevent ambiguity)
 - [ ] **Human-readable**: Selectors document user intent (clear, semantic, maintainable)
 - [ ] **Validated in Inspector**: Test selectors interactively before committing (page.pause())
 ## Integration Points
 - **Used in workflows**: `*atdd` (generate tests with robust selectors), `*automate` (healing selector failures), `*test-review` (validate selector quality)
 - **Related fragments**: `test-healing-patterns.md` (selector failure diagnosis), `fixture-architecture.md` (page object alternatives), `test-quality.md` (maintainability standards)
 - **Tools**: Playwright Inspector (Pick Locator), DevTools console, Playwright MCP browser_generate_locator (optional)
 _Source: Playwright selector best practices, accessibility guidelines (ARIA), production test maintenance patterns_
--- a/_bmad/bmm/testarch/knowledge/test-healing-patterns.md
+++ b/_bmad/bmm/testarch/knowledge/test-healing-patterns.md
@@ -0,0 +1,644 @@
 # Test Healing Patterns
 ## Principle
 Common test failures follow predictable patterns (stale selectors, race conditions, dynamic data assertions, network errors, hard waits). **Automated healing** identifies failure signatures and applies pattern-based fixes. Manual healing captures these patterns for future automation.
 ## Rationale
 **The Problem**: Test failures waste developer time on repetitive debugging. Teams manually fix the same selector issues, timing bugs, and data mismatches repeatedly across test suites.
 **The Solution**: Catalog common failure patterns with diagnostic signatures and automated fixes. When a test fails, match the error message/stack trace against known patterns and apply the corresponding fix. This transforms test maintenance from reactive debugging to proactive pattern application.
 **Why This Matters**:
 - Reduces test maintenance time by 60-80% (pattern-based fixes vs manual debugging)
 - Prevents flakiness regression (same bug fixed once, applied everywhere)
 - Builds institutional knowledge (failure catalog grows over time)
 - Enables self-healing test suites (automate workflow validates and heals)
 ## Pattern Examples
 ### Example 1: Common Failure Pattern - Stale Selectors (Element Not Found)
 **Context**: Test fails with "Element not found" or "Locator resolved to 0 elements" errors
 **Diagnostic Signature**:
 ```typescript
 // src/testing/healing/selector-healing.ts
 export type SelectorFailure = {
  errorMessage: string;
  stackTrace: string;
  selector: string;
  testFile: string;
  lineNumber: number;
 };
 /**
 * Detect stale selector failures
 */
 export function isSelectorFailure(error: Error): boolean {
  const patterns = [
    /locator.*resolved to 0 elements/i,
    /element not found/i,
    /waiting for locator.*to be visible/i,
    /selector.*did not match any elements/i,
    /unable to find element/i,
  ];
  return patterns.some((pattern) => pattern.test(error.message));
 }
 /**
 * Extract selector from error message
 */
 export function extractSelector(errorMessage: string): string | null {
  // Playwright: "locator('button[type=\"submit\"]') resolved to 0 elements"
  const playwrightMatch = errorMessage.match(/locator\('([^']+)'\)/);
  if (playwrightMatch) return playwrightMatch[1];
  // Cypress: "Timed out retrying: Expected to find element: '.submit-button'"
  const cypressMatch = errorMessage.match(/Expected to find element: ['"]([^'"]+)['"]/i);
  if (cypressMatch) return cypressMatch[1];
  return null;
 }
 /**
 * Suggest better selector based on hierarchy
 */
 export function suggestBetterSelector(badSelector: string): string {
  // If using CSS class → suggest data-testid
  if (badSelector.startsWith('.') || badSelector.includes('class=')) {
    const elementName = badSelector.match(/class=["']([^"']+)["']/)?.[1] || badSelector.slice(1);
    return `page.getByTestId('${elementName}') // Prefer data-testid over CSS class`;
  }
  // If using ID → suggest data-testid
  if (badSelector.startsWith('#')) {
    return `page.getByTestId('${badSelector.slice(1)}') // Prefer data-testid over ID`;
  }
  // If using nth() → suggest filter() or more specific selector
  if (badSelector.includes('.nth(')) {
    return `page.locator('${badSelector.split('.nth(')[0]}').filter({ hasText: 'specific text' }) // Avoid brittle nth(), use filter()`;
  }
  // If using complex CSS → suggest ARIA role
  if (badSelector.includes('>') || badSelector.includes('+')) {
    return `page.getByRole('button', { name: 'Submit' }) // Prefer ARIA roles over complex CSS`;
  }
  return `page.getByTestId('...') // Add data-testid attribute to element`;
 }
 ```
 **Healing Implementation**:
 ```typescript
 // tests/healing/selector-healing.spec.ts
 import { test, expect } from '@playwright/test';
 import { isSelectorFailure, extractSelector, suggestBetterSelector } from '../../src/testing/healing/selector-healing';
 test('heal stale selector failures automatically', async ({ page }) => {
  await page.goto('/dashboard');
  try {
    // Original test with brittle CSS selector
    await page.locator('.btn-primary').click();
  } catch (error: any) {
    if (isSelectorFailure(error)) {
      const badSelector = extractSelector(error.message);
      const suggestion = badSelector ? suggestBetterSelector(badSelector) : null;
      console.log('HEALING SUGGESTION:', suggestion);
      // Apply healed selector
      await page.getByTestId('submit-button').click(); // Fixed!
    } else {
      throw error; // Not a selector issue, rethrow
    }
  }
  await expect(page.getByText('Success')).toBeVisible();
 });
 ```
 **Key Points**:
 - Diagnosis: Error message contains "locator resolved to 0 elements" or "element not found"
 - Fix: Replace brittle selector (CSS class, ID, nth) with robust alternative (data-testid, ARIA role)
 - Prevention: Follow selector hierarchy (data-testid > ARIA > text > CSS)
 - Automation: Pattern matching on error message + stack trace
 ---
 ### Example 2: Common Failure Pattern - Race Conditions (Timing Errors)
 **Context**: Test fails with "timeout waiting for element" or "element not visible" errors
 **Diagnostic Signature**:
 ```typescript
 // src/testing/healing/timing-healing.ts
 export type TimingFailure = {
  errorMessage: string;
  testFile: string;
  lineNumber: number;
  actionType: 'click' | 'fill' | 'waitFor' | 'expect';
 };
 /**
 * Detect race condition failures
 */
 export function isTimingFailure(error: Error): boolean {
  const patterns = [
    /timeout.*waiting for/i,
    /element is not visible/i,
    /element is not attached to the dom/i,
    /waiting for element to be visible.*exceeded/i,
    /timed out retrying/i,
    /waitForLoadState.*timeout/i,
  ];
  return patterns.some((pattern) => pattern.test(error.message));
 }
 /**
 * Detect hard wait anti-pattern
 */
 export function hasHardWait(testCode: string): boolean {
  const hardWaitPatterns = [/page\.waitForTimeout\(/, /cy\.wait\(\d+\)/, /await.*sleep\(/, /setTimeout\(/];
  return hardWaitPatterns.some((pattern) => pattern.test(testCode));
 }
 /**
 * Suggest deterministic wait replacement
 */
 export function suggestDeterministicWait(testCode: string): string {
  if (testCode.includes('page.waitForTimeout')) {
    return `
 // ❌ Bad: Hard wait (flaky)
 // await page.waitForTimeout(3000)
 // ✅ Good: Wait for network response
 await page.waitForResponse(resp => resp.url().includes('/api/data') && resp.status() === 200)
 // OR wait for element state
 await page.getByTestId('loading-spinner').waitFor({ state: 'detached' })
    `.trim();
  }
  if (testCode.includes('cy.wait(') && /cy\.wait\(\d+\)/.test(testCode)) {
    return `
 // ❌ Bad: Hard wait (flaky)
 // cy.wait(3000)
 // ✅ Good: Wait for aliased network request
 cy.intercept('GET', '/api/data').as('getData')
 cy.visit('/page')
 cy.wait('@getData')
    `.trim();
  }
  return `
 // Add network-first interception BEFORE navigation:
 await page.route('**/api/**', route => route.continue())
 const responsePromise = page.waitForResponse('**/api/data')
 await page.goto('/page')
 await responsePromise
  `.trim();
 }
 ```
 **Healing Implementation**:
 ```typescript
 // tests/healing/timing-healing.spec.ts
 import { test, expect } from '@playwright/test';
 import { isTimingFailure, hasHardWait, suggestDeterministicWait } from '../../src/testing/healing/timing-healing';
 test('heal race condition with network-first pattern', async ({ page, context }) => {
  // Setup interception BEFORE navigation (prevent race)
  await context.route('**/api/products', (route) => {
    route.fulfill({
      status: 200,
      body: JSON.stringify({ products: [{ id: 1, name: 'Product A' }] }),
    });
  });
  const responsePromise = page.waitForResponse('**/api/products');
  await page.goto('/products');
  await responsePromise; // Deterministic wait
  // Element now reliably visible (no race condition)
  await expect(page.getByText('Product A')).toBeVisible();
 });
 test('heal hard wait with event-based wait', async ({ page }) => {
  await page.goto('/dashboard');
  // ❌ Original (flaky): await page.waitForTimeout(3000)
  // ✅ Healed: Wait for spinner to disappear
  await page.getByTestId('loading-spinner').waitFor({ state: 'detached' });
  // Element now reliably visible
  await expect(page.getByText('Dashboard loaded')).toBeVisible();
 });
 ```
 **Key Points**:
 - Diagnosis: Error contains "timeout" or "not visible", often after navigation
 - Fix: Replace hard waits with network-first pattern or element state waits
 - Prevention: ALWAYS intercept before navigate, use waitForResponse()
 - Automation: Detect `page.waitForTimeout()` or `cy.wait(number)` in test code
 ---
 ### Example 3: Common Failure Pattern - Dynamic Data Assertions (Non-Deterministic IDs)
 **Context**: Test fails with "Expected 'User 123' but received 'User 456'" or timestamp mismatches
 **Diagnostic Signature**:
 ```typescript
 // src/testing/healing/data-healing.ts
 export type DataFailure = {
  errorMessage: string;
  expectedValue: string;
  actualValue: string;
  testFile: string;
  lineNumber: number;
 };
 /**
 * Detect dynamic data assertion failures
 */
 export function isDynamicDataFailure(error: Error): boolean {
  const patterns = [
    /expected.*\d+.*received.*\d+/i, // ID mismatches
    /expected.*\d{4}-\d{2}-\d{2}.*received/i, // Date mismatches
    /expected.*user.*\d+/i, // Dynamic user IDs
    /expected.*order.*\d+/i, // Dynamic order IDs
    /expected.*to.*contain.*\d+/i, // Numeric assertions
  ];
  return patterns.some((pattern) => pattern.test(error.message));
 }
 /**
 * Suggest flexible assertion pattern
 */
 export function suggestFlexibleAssertion(errorMessage: string): string {
  if (/expected.*user.*\d+/i.test(errorMessage)) {
    return `
 // ❌ Bad: Hardcoded ID
 // await expect(page.getByText('User 123')).toBeVisible()
 // ✅ Good: Regex pattern for any user ID
 await expect(page.getByText(/User \\d+/)).toBeVisible()
 // OR use partial match
 await expect(page.locator('[data-testid="user-name"]')).toContainText('User')
    `.trim();
  }
  if (/expected.*\d{4}-\d{2}-\d{2}/i.test(errorMessage)) {
    return `
 // ❌ Bad: Hardcoded date
 // await expect(page.getByText('2024-01-15')).toBeVisible()
 // ✅ Good: Dynamic date validation
 const today = new Date().toISOString().split('T')[0]
 await expect(page.getByTestId('created-date')).toHaveText(today)
 // OR use date format regex
 await expect(page.getByTestId('created-date')).toHaveText(/\\d{4}-\\d{2}-\\d{2}/)
    `.trim();
  }
  if (/expected.*order.*\d+/i.test(errorMessage)) {
    return `
 // ❌ Bad: Hardcoded order ID
 // const orderId = '12345'
 // ✅ Good: Capture dynamic order ID
 const orderText = await page.getByTestId('order-id').textContent()
 const orderId = orderText?.match(/Order #(\\d+)/)?.[1]
 expect(orderId).toBeTruthy()
 // Use captured ID in later assertions
 await expect(page.getByText(\`Order #\${orderId} confirmed\`)).toBeVisible()
    `.trim();
  }
  return `Use regex patterns, partial matching, or capture dynamic values instead of hardcoding`;
 }
 ```
 **Healing Implementation**:
 ```typescript
 // tests/healing/data-healing.spec.ts
 import { test, expect } from '@playwright/test';
 test('heal dynamic ID assertion with regex', async ({ page }) => {
  await page.goto('/users');
  // ❌ Original (fails with random IDs): await expect(page.getByText('User 123')).toBeVisible()
  // ✅ Healed: Regex pattern matches any user ID
  await expect(page.getByText(/User \d+/)).toBeVisible();
 });
 test('heal timestamp assertion with dynamic generation', async ({ page }) => {
  await page.goto('/dashboard');
  // ❌ Original (fails daily): await expect(page.getByText('2024-01-15')).toBeVisible()
  // ✅ Healed: Generate expected date dynamically
  const today = new Date().toISOString().split('T')[0];
  await expect(page.getByTestId('last-updated')).toContainText(today);
 });
 test('heal order ID assertion with capture', async ({ page, request }) => {
  // Create order via API (dynamic ID)
  const response = await request.post('/api/orders', {
    data: { productId: '123', quantity: 1 },
  });
  const { orderId } = await response.json();
  // ✅ Healed: Use captured dynamic ID
  await page.goto(`/orders/${orderId}`);
  await expect(page.getByText(`Order #${orderId}`)).toBeVisible();
 });
 ```
 **Key Points**:
 - Diagnosis: Error message shows expected vs actual value mismatch with IDs/timestamps
 - Fix: Use regex patterns (`/User \d+/`), partial matching, or capture dynamic values
 - Prevention: Never hardcode IDs, timestamps, or random data in assertions
 - Automation: Parse error message for expected/actual values, suggest regex patterns
 ---
 ### Example 4: Common Failure Pattern - Network Errors (Missing Route Interception)
 **Context**: Test fails with "API call failed" or "500 error" during test execution
 **Diagnostic Signature**:
 ```typescript
 // src/testing/healing/network-healing.ts
 export type NetworkFailure = {
  errorMessage: string;
  url: string;
  statusCode: number;
  method: string;
 };
 /**
 * Detect network failure
 */
 export function isNetworkFailure(error: Error): boolean {
  const patterns = [
    /api.*call.*failed/i,
    /request.*failed/i,
    /network.*error/i,
    /500.*internal server error/i,
    /503.*service unavailable/i,
    /fetch.*failed/i,
  ];
  return patterns.some((pattern) => pattern.test(error.message));
 }
 /**
 * Suggest route interception
 */
 export function suggestRouteInterception(url: string, method: string): string {
  return `
 // ❌ Bad: Real API call (unreliable, slow, external dependency)
 // ✅ Good: Mock API response with route interception
 await page.route('${url}', route => {
  route.fulfill({
    status: 200,
    contentType: 'application/json',
    body: JSON.stringify({
      // Mock response data
      id: 1,
      name: 'Test User',
      email: 'test@example.com'
    })
  })
 })
 // Then perform action
 await page.goto('/page')
  `.trim();
 }
 ```
 **Healing Implementation**:
 ```typescript
 // tests/healing/network-healing.spec.ts
 import { test, expect } from '@playwright/test';
 test('heal network failure with route mocking', async ({ page, context }) => {
  // ✅ Healed: Mock API to prevent real network calls
  await context.route('**/api/products', (route) => {
    route.fulfill({
      status: 200,
      contentType: 'application/json',
      body: JSON.stringify({
        products: [
          { id: 1, name: 'Product A', price: 29.99 },
          { id: 2, name: 'Product B', price: 49.99 },
        ],
      }),
    });
  });
  await page.goto('/products');
  // Test now reliable (no external API dependency)
  await expect(page.getByText('Product A')).toBeVisible();
  await expect(page.getByText('$29.99')).toBeVisible();
 });
 test('heal 500 error with error state mocking', async ({ page, context }) => {
  // Mock API failure scenario
  await context.route('**/api/products', (route) => {
    route.fulfill({ status: 500, body: JSON.stringify({ error: 'Internal Server Error' }) });
  });
  await page.goto('/products');
  // Verify error handling (not crash)
  await expect(page.getByText('Unable to load products')).toBeVisible();
  await expect(page.getByRole('button', { name: 'Retry' })).toBeVisible();
 });
 ```
 **Key Points**:
 - Diagnosis: Error message contains "API call failed", "500 error", or network-related failures
 - Fix: Add `page.route()` or `cy.intercept()` to mock API responses
 - Prevention: Mock ALL external dependencies (APIs, third-party services)
 - Automation: Extract URL from error message, generate route interception code
 ---
 ### Example 5: Common Failure Pattern - Hard Waits (Unreliable Timing)
 **Context**: Test fails intermittently with "timeout exceeded" or passes/fails randomly
 **Diagnostic Signature**:
 ```typescript
 // src/testing/healing/hard-wait-healing.ts
 /**
 * Detect hard wait anti-pattern in test code
 */
 export function detectHardWaits(testCode: string): Array<{ line: number; code: string }> {
  const lines = testCode.split('\n');
  const violations: Array<{ line: number; code: string }> = [];
  lines.forEach((line, index) => {
    if (line.includes('page.waitForTimeout(') || /cy\.wait\(\d+\)/.test(line) || line.includes('sleep(') || line.includes('setTimeout(')) {
      violations.push({ line: index + 1, code: line.trim() });
    }
  });
  return violations;
 }
 /**
 * Suggest event-based wait replacement
 */
 export function suggestEventBasedWait(hardWaitLine: string): string {
  if (hardWaitLine.includes('page.waitForTimeout')) {
    return `
 // ❌ Bad: Hard wait (flaky)
 ${hardWaitLine}
 // ✅ Good: Wait for network response
 await page.waitForResponse(resp => resp.url().includes('/api/') && resp.ok())
 // OR wait for element state change
 await page.getByTestId('loading-spinner').waitFor({ state: 'detached' })
 await page.getByTestId('content').waitFor({ state: 'visible' })
    `.trim();
  }
  if (/cy\.wait\(\d+\)/.test(hardWaitLine)) {
    return `
 // ❌ Bad: Hard wait (flaky)
 ${hardWaitLine}
 // ✅ Good: Wait for aliased request
 cy.intercept('GET', '/api/data').as('getData')
 cy.visit('/page')
 cy.wait('@getData') // Deterministic
    `.trim();
  }
  return 'Replace hard waits with event-based waits (waitForResponse, waitFor state changes)';
 }
 ```
 **Healing Implementation**:
 ```typescript
 // tests/healing/hard-wait-healing.spec.ts
 import { test, expect } from '@playwright/test';
 test('heal hard wait with deterministic wait', async ({ page }) => {
  await page.goto('/dashboard');
  // ❌ Original (flaky): await page.waitForTimeout(3000)
  // ✅ Healed: Wait for loading spinner to disappear
  await page.getByTestId('loading-spinner').waitFor({ state: 'detached' });
  // OR wait for specific network response
  await page.waitForResponse((resp) => resp.url().includes('/api/dashboard') && resp.ok());
  await expect(page.getByText('Dashboard ready')).toBeVisible();
 });
 test('heal implicit wait with explicit network wait', async ({ page }) => {
  const responsePromise = page.waitForResponse('**/api/products');
  await page.goto('/products');
  // ❌ Original (race condition): await page.getByText('Product A').click()
  // ✅ Healed: Wait for network first
  await responsePromise;
  await page.getByText('Product A').click();
  await expect(page).toHaveURL(/\/products\/\d+/);
 });
 ```
 **Key Points**:
 - Diagnosis: Test code contains `page.waitForTimeout()` or `cy.wait(number)`
 - Fix: Replace with `waitForResponse()`, `waitFor({ state })`, or aliased intercepts
 - Prevention: NEVER use hard waits, always use event-based/response-based waits
 - Automation: Scan test code for hard wait patterns, suggest deterministic replacements
 ---
 ## Healing Pattern Catalog
 | Failure Type   | Diagnostic Signature                          | Healing Strategy                      | Prevention Pattern                        |
 | -------------- | --------------------------------------------- | ------------------------------------- | ----------------------------------------- |
 | Stale Selector | "locator resolved to 0 elements"              | Replace with data-testid or ARIA role | Selector hierarchy (testid > ARIA > text) |
 | Race Condition | "timeout waiting for element"                 | Add network-first interception        | Intercept before navigate                 |
 | Dynamic Data   | "Expected 'User 123' but got 'User 456'"      | Use regex or capture dynamic values   | Never hardcode IDs/timestamps             |
 | Network Error  | "API call failed", "500 error"                | Add route mocking                     | Mock all external dependencies            |
 | Hard Wait      | Test contains `waitForTimeout()` or `wait(n)` | Replace with event-based waits        | Always use deterministic waits            |
 ## Healing Workflow
 1. **Run test** → Capture failure
 2. **Identify pattern** → Match error against diagnostic signatures
 3. **Apply fix** → Use pattern-based healing strategy
 4. **Re-run test** → Validate fix (max 3 iterations)
 5. **Mark unfixable** → Use `test.fixme()` if healing fails after 3 attempts
 ## Healing Checklist
 Before enabling auto-healing in workflows:
 - [ ] **Failure catalog documented**: Common patterns identified (selectors, timing, data, network, hard waits)
 - [ ] **Diagnostic signatures defined**: Error message patterns for each failure type
 - [ ] **Healing strategies documented**: Fix patterns for each failure type
 - [ ] **Prevention patterns documented**: Best practices to avoid recurrence
 - [ ] **Healing iteration limit set**: Max 3 attempts before marking test.fixme()
 - [ ] **MCP integration optional**: Graceful degradation without Playwright MCP
 - [ ] **Pattern-based fallback**: Use knowledge base patterns when MCP unavailable
 - [ ] **Healing report generated**: Document what was healed and how
 ## Integration Points
 - **Used in workflows**: `*automate` (auto-healing after test generation), `*atdd` (optional healing for acceptance tests)
 - **Related fragments**: `selector-resilience.md` (selector debugging), `timing-debugging.md` (race condition fixes), `network-first.md` (interception patterns), `data-factories.md` (dynamic data handling)
 - **Tools**: Error message parsing, AST analysis for code patterns, Playwright MCP (optional), pattern matching
 _Source: Playwright test-healer patterns, production test failure analysis, common anti-patterns from test-resources-for-ai_
--- a/_bmad/bmm/testarch/knowledge/test-levels-framework.md
+++ b/_bmad/bmm/testarch/knowledge/test-levels-framework.md
@@ -0,0 +1,473 @@
 <!-- Powered by BMAD-CORE™ -->
 # Test Levels Framework
 Comprehensive guide for determining appropriate test levels (unit, integration, E2E) for different scenarios.
 ## Test Level Decision Matrix
 ### Unit Tests
 **When to use:**
 - Testing pure functions and business logic
 - Algorithm correctness
 - Input validation and data transformation
 - Error handling in isolated components
 - Complex calculations or state machines
 **Characteristics:**
 - Fast execution (immediate feedback)
 - No external dependencies (DB, API, file system)
 - Highly maintainable and stable
 - Easy to debug failures
 **Example scenarios:**
 ```yaml
 unit_test:
  component: 'PriceCalculator'
  scenario: 'Calculate discount with multiple rules'
  justification: 'Complex business logic with multiple branches'
  mock_requirements: 'None - pure function'
 ```
 ### Integration Tests
 **When to use:**
 - Component interaction verification
 - Database operations and transactions
 - API endpoint contracts
 - Service-to-service communication
 - Middleware and interceptor behavior
 **Characteristics:**
 - Moderate execution time
 - Tests component boundaries
 - May use test databases or containers
 - Validates system integration points
 **Example scenarios:**
 ```yaml
 integration_test:
  components: ['UserService', 'AuthRepository']
  scenario: 'Create user with role assignment'
  justification: 'Critical data flow between service and persistence'
  test_environment: 'In-memory database'
 ```
 ### End-to-End Tests
 **When to use:**
 - Critical user journeys
 - Cross-system workflows
 - Visual regression testing
 - Compliance and regulatory requirements
 - Final validation before release
 **Characteristics:**
 - Slower execution
 - Tests complete workflows
 - Requires full environment setup
 - Most realistic but most brittle
 **Example scenarios:**
 ```yaml
 e2e_test:
  journey: 'Complete checkout process'
  scenario: 'User purchases with saved payment method'
  justification: 'Revenue-critical path requiring full validation'
  environment: 'Staging with test payment gateway'
 ```
 ## Test Level Selection Rules
 ### Favor Unit Tests When:
 - Logic can be isolated
 - No side effects involved
 - Fast feedback needed
 - High cyclomatic complexity
 ### Favor Integration Tests When:
 - Testing persistence layer
 - Validating service contracts
 - Testing middleware/interceptors
 - Component boundaries critical
 ### Favor E2E Tests When:
 - User-facing critical paths
 - Multi-system interactions
 - Regulatory compliance scenarios
 - Visual regression important
 ## Anti-patterns to Avoid
 - E2E testing for business logic validation
 - Unit testing framework behavior
 - Integration testing third-party libraries
 - Duplicate coverage across levels
 ## Duplicate Coverage Guard
 **Before adding any test, check:**
 1. Is this already tested at a lower level?
 2. Can a unit test cover this instead of integration?
 3. Can an integration test cover this instead of E2E?
 **Coverage overlap is only acceptable when:**
 - Testing different aspects (unit: logic, integration: interaction, e2e: user experience)
 - Critical paths requiring defense in depth
 - Regression prevention for previously broken functionality
 ## Test Naming Conventions
 - Unit: `test_{component}_{scenario}`
 - Integration: `test_{flow}_{interaction}`
 - E2E: `test_{journey}_{outcome}`
 ## Test ID Format
 `{EPIC}.{STORY}-{LEVEL}-{SEQ}`
 Examples:
 - `1.3-UNIT-001`
 - `1.3-INT-002`
 - `1.3-E2E-001`
 ## Real Code Examples
 ### Example 1: E2E Test (Full User Journey)
 **Scenario**: User logs in, navigates to dashboard, and places an order.
 ```typescript
 // tests/e2e/checkout-flow.spec.ts
 import { test, expect } from '@playwright/test';
 import { createUser, createProduct } from '../test-utils/factories';
 test.describe('Checkout Flow', () => {
  test('user can complete purchase with saved payment method', async ({ page, apiRequest }) => {
    // Setup: Seed data via API (fast!)
    const user = createUser({ email: 'buyer@example.com', hasSavedCard: true });
    const product = createProduct({ name: 'Widget', price: 29.99, stock: 10 });
    await apiRequest.post('/api/users', { data: user });
    await apiRequest.post('/api/products', { data: product });
    // Network-first: Intercept BEFORE action
    const loginPromise = page.waitForResponse('**/api/auth/login');
    const cartPromise = page.waitForResponse('**/api/cart');
    const orderPromise = page.waitForResponse('**/api/orders');
    // Step 1: Login
    await page.goto('/login');
    await page.fill('[data-testid="email"]', user.email);
    await page.fill('[data-testid="password"]', 'password123');
    await page.click('[data-testid="login-button"]');
    await loginPromise;
    // Assert: Dashboard visible
    await expect(page).toHaveURL('/dashboard');
    await expect(page.getByText(`Welcome, ${user.name}`)).toBeVisible();
    // Step 2: Add product to cart
    await page.goto(`/products/${product.id}`);
    await page.click('[data-testid="add-to-cart"]');
    await cartPromise;
    await expect(page.getByText('Added to cart')).toBeVisible();
    // Step 3: Checkout with saved payment
    await page.goto('/checkout');
    await expect(page.getByText('Visa ending in 1234')).toBeVisible(); // Saved card
    await page.click('[data-testid="use-saved-card"]');
    await page.click('[data-testid="place-order"]');
    await orderPromise;
    // Assert: Order confirmation
    await expect(page.getByText('Order Confirmed')).toBeVisible();
    await expect(page.getByText(/Order #\d+/)).toBeVisible();
    await expect(page.getByText('$29.99')).toBeVisible();
  });
 });
 ```
 **Key Points (E2E)**:
 - Tests complete user journey across multiple pages
 - API setup for data (fast), UI for assertions (user-centric)
 - Network-first interception to prevent flakiness
 - Validates critical revenue path end-to-end
 ### Example 2: Integration Test (API/Service Layer)
 **Scenario**: UserService creates user and assigns role via AuthRepository.
 ```typescript
 // tests/integration/user-service.spec.ts
 import { test, expect } from '@playwright/test';
 import { createUser } from '../test-utils/factories';
 test.describe('UserService Integration', () => {
  test('should create user with admin role via API', async ({ request }) => {
    const userData = createUser({ role: 'admin' });
    // Direct API call (no UI)
    const response = await request.post('/api/users', {
      data: userData,
    });
    expect(response.status()).toBe(201);
    const createdUser = await response.json();
    expect(createdUser.id).toBeTruthy();
    expect(createdUser.email).toBe(userData.email);
    expect(createdUser.role).toBe('admin');
    // Verify database state
    const getResponse = await request.get(`/api/users/${createdUser.id}`);
    expect(getResponse.status()).toBe(200);
    const fetchedUser = await getResponse.json();
    expect(fetchedUser.role).toBe('admin');
    expect(fetchedUser.permissions).toContain('user:delete');
    expect(fetchedUser.permissions).toContain('user:update');
    // Cleanup
    await request.delete(`/api/users/${createdUser.id}`);
  });
  test('should validate email uniqueness constraint', async ({ request }) => {
    const userData = createUser({ email: 'duplicate@example.com' });
    // Create first user
    const response1 = await request.post('/api/users', { data: userData });
    expect(response1.status()).toBe(201);
    const user1 = await response1.json();
    // Attempt duplicate email
    const response2 = await request.post('/api/users', { data: userData });
    expect(response2.status()).toBe(409); // Conflict
    const error = await response2.json();
    expect(error.message).toContain('Email already exists');
    // Cleanup
    await request.delete(`/api/users/${user1.id}`);
  });
 });
 ```
 **Key Points (Integration)**:
 - Tests service layer + database interaction
 - No UI involved—pure API validation
 - Business logic focus (role assignment, constraints)
 - Faster than E2E, more realistic than unit tests
 ### Example 3: Component Test (Isolated UI Component)
 **Scenario**: Test button component in isolation with props and user interactions.
 ```typescript
 // src/components/Button.cy.tsx (Cypress Component Test)
 import { Button } from './Button';
 describe('Button Component', () => {
  it('should render with correct label', () => {
    cy.mount(<Button label="Click Me" />);
    cy.contains('Click Me').should('be.visible');
  });
  it('should call onClick handler when clicked', () => {
    const onClickSpy = cy.stub().as('onClick');
    cy.mount(<Button label="Submit" onClick={onClickSpy} />);
    cy.get('button').click();
    cy.get('@onClick').should('have.been.calledOnce');
  });
  it('should be disabled when disabled prop is true', () => {
    cy.mount(<Button label="Disabled" disabled={true} />);
    cy.get('button').should('be.disabled');
    cy.get('button').should('have.attr', 'aria-disabled', 'true');
  });
  it('should show loading spinner when loading', () => {
    cy.mount(<Button label="Loading" loading={true} />);
    cy.get('[data-testid="spinner"]').should('be.visible');
    cy.get('button').should('be.disabled');
  });
  it('should apply variant styles correctly', () => {
    cy.mount(<Button label="Primary" variant="primary" />);
    cy.get('button').should('have.class', 'btn-primary');
    cy.mount(<Button label="Secondary" variant="secondary" />);
    cy.get('button').should('have.class', 'btn-secondary');
  });
 });
 // Playwright Component Test equivalent
 import { test, expect } from '@playwright/experimental-ct-react';
 import { Button } from './Button';
 test.describe('Button Component', () => {
  test('should call onClick handler when clicked', async ({ mount }) => {
    let clicked = false;
    const component = await mount(
      <Button label="Submit" onClick={() => { clicked = true; }} />
    );
    await component.getByRole('button').click();
    expect(clicked).toBe(true);
  });
  test('should be disabled when loading', async ({ mount }) => {
    const component = await mount(<Button label="Loading" loading={true} />);
    await expect(component.getByRole('button')).toBeDisabled();
    await expect(component.getByTestId('spinner')).toBeVisible();
  });
 });
 ```
 **Key Points (Component)**:
 - Tests UI component in isolation (no full app)
 - Props + user interactions + visual states
 - Faster than E2E, more realistic than unit tests for UI
 - Great for design system components
 ### Example 4: Unit Test (Pure Function)
 **Scenario**: Test pure business logic function without framework dependencies.
 ```typescript
 // src/utils/price-calculator.test.ts (Jest/Vitest)
 import { calculateDiscount, applyTaxes, calculateTotal } from './price-calculator';
 describe('PriceCalculator', () => {
  describe('calculateDiscount', () => {
    it('should apply percentage discount correctly', () => {
      const result = calculateDiscount(100, { type: 'percentage', value: 20 });
      expect(result).toBe(80);
    });
    it('should apply fixed amount discount correctly', () => {
      const result = calculateDiscount(100, { type: 'fixed', value: 15 });
      expect(result).toBe(85);
    });
    it('should not apply discount below zero', () => {
      const result = calculateDiscount(10, { type: 'fixed', value: 20 });
      expect(result).toBe(0);
    });
    it('should handle no discount', () => {
      const result = calculateDiscount(100, { type: 'none', value: 0 });
      expect(result).toBe(100);
    });
  });
  describe('applyTaxes', () => {
    it('should calculate tax correctly for US', () => {
      const result = applyTaxes(100, { country: 'US', rate: 0.08 });
      expect(result).toBe(108);
    });
    it('should calculate tax correctly for EU (VAT)', () => {
      const result = applyTaxes(100, { country: 'DE', rate: 0.19 });
      expect(result).toBe(119);
    });
    it('should handle zero tax rate', () => {
      const result = applyTaxes(100, { country: 'US', rate: 0 });
      expect(result).toBe(100);
    });
  });
  describe('calculateTotal', () => {
    it('should calculate total with discount and taxes', () => {
      const items = [
        { price: 50, quantity: 2 }, // 100
        { price: 30, quantity: 1 }, // 30
      ];
      const discount = { type: 'percentage', value: 10 }; // -13
      const tax = { country: 'US', rate: 0.08 }; // +9.36
      const result = calculateTotal(items, discount, tax);
      expect(result).toBeCloseTo(126.36, 2);
    });
    it('should handle empty items array', () => {
      const result = calculateTotal([], { type: 'none', value: 0 }, { country: 'US', rate: 0 });
      expect(result).toBe(0);
    });
    it('should calculate correctly without discount or tax', () => {
      const items = [{ price: 25, quantity: 4 }];
      const result = calculateTotal(items, { type: 'none', value: 0 }, { country: 'US', rate: 0 });
      expect(result).toBe(100);
    });
  });
 });
 ```
 **Key Points (Unit)**:
 - Pure function testing—no framework dependencies
 - Fast execution (milliseconds)
 - Edge case coverage (zero, negative, empty inputs)
 - High cyclomatic complexity handled at unit level
 ## When to Use Which Level
 | Scenario               | Unit          | Integration       | E2E           |
 | ---------------------- | ------------- | ----------------- | ------------- |
 | Pure business logic    | ✅ Primary    | ❌ Overkill       | ❌ Overkill   |
 | Database operations    | ❌ Can't test | ✅ Primary        | ❌ Overkill   |
 | API contracts          | ❌ Can't test | ✅ Primary        | ⚠️ Supplement |
 | User journeys          | ❌ Can't test | ❌ Can't test     | ✅ Primary    |
 | Component props/events | ✅ Partial    | ⚠️ Component test | ❌ Overkill   |
 | Visual regression      | ❌ Can't test | ⚠️ Component test | ✅ Primary    |
 | Error handling (logic) | ✅ Primary    | ⚠️ Integration    | ❌ Overkill   |
 | Error handling (UI)    | ❌ Partial    | ⚠️ Component test | ✅ Primary    |
 ## Anti-Pattern Examples
 **❌ BAD: E2E test for business logic**
 ```typescript
 // DON'T DO THIS
 test('calculate discount via UI', async ({ page }) => {
  await page.goto('/calculator');
  await page.fill('[data-testid="price"]', '100');
  await page.fill('[data-testid="discount"]', '20');
  await page.click('[data-testid="calculate"]');
  await expect(page.getByText('$80')).toBeVisible();
 });
 // Problem: Slow, brittle, tests logic that should be unit tested
 ```
 **✅ GOOD: Unit test for business logic**
 ```typescript
 test('calculate discount', () => {
  expect(calculateDiscount(100, 20)).toBe(80);
 });
 // Fast, reliable, isolated
 ```
 _Source: Murat Testing Philosophy (test pyramid), existing test-levels-framework.md structure._
--- a/_bmad/bmm/testarch/knowledge/test-priorities-matrix.md
+++ b/_bmad/bmm/testarch/knowledge/test-priorities-matrix.md
@@ -0,0 +1,373 @@
 <!-- Powered by BMAD-CORE™ -->
 # Test Priorities Matrix
 Guide for prioritizing test scenarios based on risk, criticality, and business impact.
 ## Priority Levels
 ### P0 - Critical (Must Test)
 **Criteria:**
 - Revenue-impacting functionality
 - Security-critical paths
 - Data integrity operations
 - Regulatory compliance requirements
 - Previously broken functionality (regression prevention)
 **Examples:**
 - Payment processing
 - Authentication/authorization
 - User data creation/deletion
 - Financial calculations
 - GDPR/privacy compliance
 **Testing Requirements:**
 - Comprehensive coverage at all levels
 - Both happy and unhappy paths
 - Edge cases and error scenarios
 - Performance under load
 ### P1 - High (Should Test)
 **Criteria:**
 - Core user journeys
 - Frequently used features
 - Features with complex logic
 - Integration points between systems
 - Features affecting user experience
 **Examples:**
 - User registration flow
 - Search functionality
 - Data import/export
 - Notification systems
 - Dashboard displays
 **Testing Requirements:**
 - Primary happy paths required
 - Key error scenarios
 - Critical edge cases
 - Basic performance validation
 ### P2 - Medium (Nice to Test)
 **Criteria:**
 - Secondary features
 - Admin functionality
 - Reporting features
 - Configuration options
 - UI polish and aesthetics
 **Examples:**
 - Admin settings panels
 - Report generation
 - Theme customization
 - Help documentation
 - Analytics tracking
 **Testing Requirements:**
 - Happy path coverage
 - Basic error handling
 - Can defer edge cases
 ### P3 - Low (Test if Time Permits)
 **Criteria:**
 - Rarely used features
 - Nice-to-have functionality
 - Cosmetic issues
 - Non-critical optimizations
 **Examples:**
 - Advanced preferences
 - Legacy feature support
 - Experimental features
 - Debug utilities
 **Testing Requirements:**
 - Smoke tests only
 - Can rely on manual testing
 - Document known limitations
 ## Risk-Based Priority Adjustments
 ### Increase Priority When:
 - High user impact (affects >50% of users)
 - High financial impact (>$10K potential loss)
 - Security vulnerability potential
 - Compliance/legal requirements
 - Customer-reported issues
 - Complex implementation (>500 LOC)
 - Multiple system dependencies
 ### Decrease Priority When:
 - Feature flag protected
 - Gradual rollout planned
 - Strong monitoring in place
 - Easy rollback capability
 - Low usage metrics
 - Simple implementation
 - Well-isolated component
 ## Test Coverage by Priority
 | Priority | Unit Coverage | Integration Coverage | E2E Coverage       |
 | -------- | ------------- | -------------------- | ------------------ |
 | P0       | >90%          | >80%                 | All critical paths |
 | P1       | >80%          | >60%                 | Main happy paths   |
 | P2       | >60%          | >40%                 | Smoke tests        |
 | P3       | Best effort   | Best effort          | Manual only        |
 ## Priority Assignment Rules
 1. **Start with business impact** - What happens if this fails?
 2. **Consider probability** - How likely is failure?
 3. **Factor in detectability** - Would we know if it failed?
 4. **Account for recoverability** - Can we fix it quickly?
 ## Priority Decision Tree
 ```
 Is it revenue-critical?
 ├─ YES → P0
 └─ NO → Does it affect core user journey?
    ├─ YES → Is it high-risk?
    │   ├─ YES → P0
    │   └─ NO → P1
    └─ NO → Is it frequently used?
        ├─ YES → P1
        └─ NO → Is it customer-facing?
            ├─ YES → P2
            └─ NO → P3
 ```
 ## Test Execution Order
 1. Execute P0 tests first (fail fast on critical issues)
 2. Execute P1 tests second (core functionality)
 3. Execute P2 tests if time permits
 4. P3 tests only in full regression cycles
 ## Continuous Adjustment
 Review and adjust priorities based on:
 - Production incident patterns
 - User feedback and complaints
 - Usage analytics
 - Test failure history
 - Business priority changes
 ---
 ## Automated Priority Classification
 ### Example: Priority Calculator (Risk-Based Automation)
 ```typescript
 // src/testing/priority-calculator.ts
 export type Priority = 'P0' | 'P1' | 'P2' | 'P3';
 export type PriorityFactors = {
  revenueImpact: 'critical' | 'high' | 'medium' | 'low' | 'none';
  userImpact: 'all' | 'majority' | 'some' | 'few' | 'minimal';
  securityRisk: boolean;
  complianceRequired: boolean;
  previousFailure: boolean;
  complexity: 'high' | 'medium' | 'low';
  usage: 'frequent' | 'regular' | 'occasional' | 'rare';
 };
 /**
 * Calculate test priority based on multiple factors
 * Mirrors the priority decision tree with objective criteria
 */
 export function calculatePriority(factors: PriorityFactors): Priority {
  const { revenueImpact, userImpact, securityRisk, complianceRequired, previousFailure, complexity, usage } = factors;
  // P0: Revenue-critical, security, or compliance
  if (revenueImpact === 'critical' || securityRisk || complianceRequired || (previousFailure && revenueImpact === 'high')) {
    return 'P0';
  }
  // P0: High revenue + high complexity + frequent usage
  if (revenueImpact === 'high' && complexity === 'high' && usage === 'frequent') {
    return 'P0';
  }
  // P1: Core user journey (majority impacted + frequent usage)
  if (userImpact === 'all' || userImpact === 'majority') {
    if (usage === 'frequent' || complexity === 'high') {
      return 'P1';
    }
  }
  // P1: High revenue OR high complexity with regular usage
  if ((revenueImpact === 'high' && usage === 'regular') || (complexity === 'high' && usage === 'frequent')) {
    return 'P1';
  }
  // P2: Secondary features (some impact, occasional usage)
  if (userImpact === 'some' || usage === 'occasional') {
    return 'P2';
  }
  // P3: Rarely used, low impact
  return 'P3';
 }
 /**
 * Generate priority justification (for audit trail)
 */
 export function justifyPriority(factors: PriorityFactors): string {
  const priority = calculatePriority(factors);
  const reasons: string[] = [];
  if (factors.revenueImpact === 'critical') reasons.push('critical revenue impact');
  if (factors.securityRisk) reasons.push('security-critical');
  if (factors.complianceRequired) reasons.push('compliance requirement');
  if (factors.previousFailure) reasons.push('regression prevention');
  if (factors.userImpact === 'all' || factors.userImpact === 'majority') {
    reasons.push(`impacts ${factors.userImpact} users`);
  }
  if (factors.complexity === 'high') reasons.push('high complexity');
  if (factors.usage === 'frequent') reasons.push('frequently used');
  return `${priority}: ${reasons.join(', ')}`;
 }
 /**
 * Example: Payment scenario priority calculation
 */
 const paymentScenario: PriorityFactors = {
  revenueImpact: 'critical',
  userImpact: 'all',
  securityRisk: true,
  complianceRequired: true,
  previousFailure: false,
  complexity: 'high',
  usage: 'frequent',
 };
 console.log(calculatePriority(paymentScenario)); // 'P0'
 console.log(justifyPriority(paymentScenario));
 // 'P0: critical revenue impact, security-critical, compliance requirement, impacts all users, high complexity, frequently used'
 ```
 ### Example: Test Suite Tagging Strategy
 ```typescript
 // tests/e2e/checkout.spec.ts
 import { test, expect } from '@playwright/test';
 // Tag tests with priority for selective execution
 test.describe('Checkout Flow', () => {
  test('valid payment completes successfully @p0 @smoke @revenue', async ({ page }) => {
    // P0: Revenue-critical happy path
    await page.goto('/checkout');
    await page.getByTestId('payment-method').selectOption('credit-card');
    await page.getByTestId('card-number').fill('4242424242424242');
    await page.getByRole('button', { name: 'Place Order' }).click();
    await expect(page.getByText('Order confirmed')).toBeVisible();
  });
  test('expired card shows user-friendly error @p1 @error-handling', async ({ page }) => {
    // P1: Core error scenario (frequent user impact)
    await page.goto('/checkout');
    await page.getByTestId('payment-method').selectOption('credit-card');
    await page.getByTestId('card-number').fill('4000000000000069'); // Test card: expired
    await page.getByRole('button', { name: 'Place Order' }).click();
    await expect(page.getByText('Card expired. Please use a different card.')).toBeVisible();
  });
  test('coupon code applies discount correctly @p2', async ({ page }) => {
    // P2: Secondary feature (nice-to-have)
    await page.goto('/checkout');
    await page.getByTestId('coupon-code').fill('SAVE10');
    await page.getByRole('button', { name: 'Apply' }).click();
    await expect(page.getByText('10% discount applied')).toBeVisible();
  });
  test('gift message formatting preserved @p3', async ({ page }) => {
    // P3: Cosmetic feature (rarely used)
    await page.goto('/checkout');
    await page.getByTestId('gift-message').fill('Happy Birthday!\n\nWith love.');
    await page.getByRole('button', { name: 'Place Order' }).click();
    // Message formatting preserved (linebreaks intact)
    await expect(page.getByTestId('order-summary')).toContainText('Happy Birthday!');
  });
 });
 ```
 **Run tests by priority:**
 ```bash
 # P0 only (smoke tests, 2-5 min)
 npx playwright test --grep @p0
 # P0 + P1 (core functionality, 10-15 min)
 npx playwright test --grep "@p0|@p1"
 # Full regression (all priorities, 30+ min)
 npx playwright test
 ```
 ---
 ## Integration with Risk Scoring
 Priority should align with risk score from `probability-impact.md`:
 | Risk Score | Typical Priority | Rationale                                  |
 | ---------- | ---------------- | ------------------------------------------ |
 | 9          | P0               | Critical blocker (probability=3, impact=3) |
 | 6-8        | P0 or P1         | High risk (requires mitigation)            |
 | 4-5        | P1 or P2         | Medium risk (monitor closely)              |
 | 1-3        | P2 or P3         | Low risk (document and defer)              |
 **Example**: Risk score 9 (checkout API failure) → P0 priority → comprehensive coverage required.
 ---
 ## Priority Checklist
 Before finalizing test priorities:
 - [ ] **Revenue impact assessed**: Payment, subscription, billing features → P0
 - [ ] **Security risks identified**: Auth, data exposure, injection attacks → P0
 - [ ] **Compliance requirements documented**: GDPR, PCI-DSS, SOC2 → P0
 - [ ] **User impact quantified**: >50% users → P0/P1, <10% → P2/P3
 - [ ] **Previous failures reviewed**: Regression prevention → increase priority
 - [ ] **Complexity evaluated**: >500 LOC or multiple dependencies → increase priority
 - [ ] **Usage metrics consulted**: Frequent use → P0/P1, rare use → P2/P3
 - [ ] **Monitoring coverage confirmed**: Strong monitoring → can decrease priority
 - [ ] **Rollback capability verified**: Easy rollback → can decrease priority
 - [ ] **Priorities tagged in tests**: @p0, @p1, @p2, @p3 for selective execution
 ## Integration Points
 - **Used in workflows**: `*automate` (priority-based test generation), `*test-design` (scenario prioritization), `*trace` (coverage validation by priority)
 - **Related fragments**: `risk-governance.md` (risk scoring), `probability-impact.md` (impact assessment), `selective-testing.md` (tag-based execution)
 - **Tools**: Playwright/Cypress grep for tag filtering, CI scripts for priority-based execution
 _Source: Risk-based testing practices, test prioritization strategies, production incident analysis_
--- a/_bmad/bmm/testarch/knowledge/test-quality.md
+++ b/_bmad/bmm/testarch/knowledge/test-quality.md
@@ -0,0 +1,664 @@
 # Test Quality Definition of Done
 ## Principle
 Tests must be deterministic, isolated, explicit, focused, and fast. Every test should execute in under 1.5 minutes, contain fewer than 300 lines, avoid hard waits and conditionals, keep assertions visible in test bodies, and clean up after itself for parallel execution.
 ## Rationale
 Quality tests provide reliable signal about application health. Flaky tests erode confidence and waste engineering time. Tests that use hard waits (`waitForTimeout(3000)`) are non-deterministic and slow. Tests with hidden assertions or conditional logic become unmaintainable. Large tests (>300 lines) are hard to understand and debug. Slow tests (>1.5 min) block CI pipelines. Self-cleaning tests prevent state pollution in parallel runs.
 ## Pattern Examples
 ### Example 1: Deterministic Test Pattern
 **Context**: When writing tests, eliminate all sources of non-determinism: hard waits, conditionals controlling flow, try-catch for flow control, and random data without seeds.
 **Implementation**:
 ```typescript
 // ❌ BAD: Non-deterministic test with conditionals and hard waits
 test('user can view dashboard - FLAKY', async ({ page }) => {
  await page.goto('/dashboard');
  await page.waitForTimeout(3000); // NEVER - arbitrary wait
  // Conditional flow control - test behavior varies
  if (await page.locator('[data-testid="welcome-banner"]').isVisible()) {
    await page.click('[data-testid="dismiss-banner"]');
    await page.waitForTimeout(500);
  }
  // Try-catch for flow control - hides real issues
  try {
    await page.click('[data-testid="load-more"]');
  } catch (e) {
    // Silently continue - test passes even if button missing
  }
  // Random data without control
  const randomEmail = `user${Math.random()}@example.com`;
  await expect(page.getByText(randomEmail)).toBeVisible(); // Will fail randomly
 });
 // ✅ GOOD: Deterministic test with explicit waits
 test('user can view dashboard', async ({ page, apiRequest }) => {
  const user = createUser({ email: 'test@example.com', hasSeenWelcome: true });
  // Setup via API (fast, controlled)
  await apiRequest.post('/api/users', { data: user });
  // Network-first: Intercept BEFORE navigate
  const dashboardPromise = page.waitForResponse((resp) => resp.url().includes('/api/dashboard') && resp.status() === 200);
  await page.goto('/dashboard');
  // Wait for actual response, not arbitrary time
  const dashboardResponse = await dashboardPromise;
  const dashboard = await dashboardResponse.json();
  // Explicit assertions with controlled data
  await expect(page.getByText(`Welcome, ${user.name}`)).toBeVisible();
  await expect(page.getByTestId('dashboard-items')).toHaveCount(dashboard.items.length);
  // No conditionals - test always executes same path
  // No try-catch - failures bubble up clearly
 });
 // Cypress equivalent
 describe('Dashboard', () => {
  it('should display user dashboard', () => {
    const user = createUser({ email: 'test@example.com', hasSeenWelcome: true });
    // Setup via task (fast, controlled)
    cy.task('db:seed', { users: [user] });
    // Network-first interception
    cy.intercept('GET', '**/api/dashboard').as('getDashboard');
    cy.visit('/dashboard');
    // Deterministic wait for response
    cy.wait('@getDashboard').then((interception) => {
      const dashboard = interception.response.body;
      // Explicit assertions
      cy.contains(`Welcome, ${user.name}`).should('be.visible');
      cy.get('[data-cy="dashboard-items"]').should('have.length', dashboard.items.length);
    });
  });
 });
 ```
 **Key Points**:
 - Replace `waitForTimeout()` with `waitForResponse()` or element state checks
 - Never use if/else to control test flow - tests should be deterministic
 - Avoid try-catch for flow control - let failures bubble up clearly
 - Use factory functions with controlled data, not `Math.random()`
 - Network-first pattern prevents race conditions
 ### Example 2: Isolated Test with Cleanup
 **Context**: When tests create data, they must clean up after themselves to prevent state pollution in parallel runs. Use fixture auto-cleanup or explicit teardown.
 **Implementation**:
 ```typescript
 // ❌ BAD: Test leaves data behind, pollutes other tests
 test('admin can create user - POLLUTES STATE', async ({ page, apiRequest }) => {
  await page.goto('/admin/users');
  // Hardcoded email - collides in parallel runs
  await page.fill('[data-testid="email"]', 'newuser@example.com');
  await page.fill('[data-testid="name"]', 'New User');
  await page.click('[data-testid="create-user"]');
  await expect(page.getByText('User created')).toBeVisible();
  // NO CLEANUP - user remains in database
  // Next test run fails: "Email already exists"
 });
 // ✅ GOOD: Test cleans up with fixture auto-cleanup
 // playwright/support/fixtures/database-fixture.ts
 import { test as base } from '@playwright/test';
 import { deleteRecord, seedDatabase } from '../helpers/db-helpers';
 type DatabaseFixture = {
  seedUser: (userData: Partial<User>) => Promise<User>;
 };
 export const test = base.extend<DatabaseFixture>({
  seedUser: async ({}, use) => {
    const createdUsers: string[] = [];
    const seedUser = async (userData: Partial<User>) => {
      const user = await seedDatabase('users', userData);
      createdUsers.push(user.id); // Track for cleanup
      return user;
    };
    await use(seedUser);
    // Auto-cleanup: Delete all users created during test
    for (const userId of createdUsers) {
      await deleteRecord('users', userId);
    }
    createdUsers.length = 0;
  },
 });
 // Use the fixture
 test('admin can create user', async ({ page, seedUser }) => {
  // Create admin with unique data
  const admin = await seedUser({
    email: faker.internet.email(), // Unique each run
    role: 'admin',
  });
  await page.goto('/admin/users');
  const newUserEmail = faker.internet.email(); // Unique
  await page.fill('[data-testid="email"]', newUserEmail);
  await page.fill('[data-testid="name"]', 'New User');
  await page.click('[data-testid="create-user"]');
  await expect(page.getByText('User created')).toBeVisible();
  // Verify in database
  const createdUser = await seedUser({ email: newUserEmail });
  expect(createdUser.email).toBe(newUserEmail);
  // Auto-cleanup happens via fixture teardown
 });
 // Cypress equivalent with explicit cleanup
 describe('Admin User Management', () => {
  const createdUserIds: string[] = [];
  afterEach(() => {
    // Cleanup: Delete all users created during test
    createdUserIds.forEach((userId) => {
      cy.task('db:delete', { table: 'users', id: userId });
    });
    createdUserIds.length = 0;
  });
  it('should create user', () => {
    const admin = createUser({ role: 'admin' });
    const newUser = createUser(); // Unique data via faker
    cy.task('db:seed', { users: [admin] }).then((result: any) => {
      createdUserIds.push(result.users[0].id);
    });
    cy.visit('/admin/users');
    cy.get('[data-cy="email"]').type(newUser.email);
    cy.get('[data-cy="name"]').type(newUser.name);
    cy.get('[data-cy="create-user"]').click();
    cy.contains('User created').should('be.visible');
    // Track for cleanup
    cy.task('db:findByEmail', newUser.email).then((user: any) => {
      createdUserIds.push(user.id);
    });
  });
 });
 ```
 **Key Points**:
 - Use fixtures with auto-cleanup via teardown (after `use()`)
 - Track all created resources in array during test execution
 - Use `faker` for unique data - prevents parallel collisions
 - Cypress: Use `afterEach()` with explicit cleanup
 - Never hardcode IDs or emails - always generate unique values
 ### Example 3: Explicit Assertions in Tests
 **Context**: When validating test results, keep assertions visible in test bodies. Never hide assertions in helper functions - this obscures test intent and makes failures harder to diagnose.
 **Implementation**:
 ```typescript
 // ❌ BAD: Assertions hidden in helper functions
 // helpers/api-validators.ts
 export async function validateUserCreation(response: Response, expectedEmail: string) {
  const user = await response.json();
  expect(response.status()).toBe(201);
  expect(user.email).toBe(expectedEmail);
  expect(user.id).toBeTruthy();
  expect(user.createdAt).toBeTruthy();
  // Hidden assertions - not visible in test
 }
 test('create user via API - OPAQUE', async ({ request }) => {
  const userData = createUser({ email: 'test@example.com' });
  const response = await request.post('/api/users', { data: userData });
  // What assertions are running? Have to check helper.
  await validateUserCreation(response, userData.email);
  // When this fails, error is: "validateUserCreation failed" - NOT helpful
 });
 // ✅ GOOD: Assertions explicit in test
 test('create user via API', async ({ request }) => {
  const userData = createUser({ email: 'test@example.com' });
  const response = await request.post('/api/users', { data: userData });
  // All assertions visible - clear test intent
  expect(response.status()).toBe(201);
  const createdUser = await response.json();
  expect(createdUser.id).toBeTruthy();
  expect(createdUser.email).toBe(userData.email);
  expect(createdUser.name).toBe(userData.name);
  expect(createdUser.role).toBe('user');
  expect(createdUser.createdAt).toBeTruthy();
  expect(createdUser.isActive).toBe(true);
  // When this fails, error is: "Expected role to be 'user', got 'admin'" - HELPFUL
 });
 // ✅ ACCEPTABLE: Helper for data extraction, NOT assertions
 // helpers/api-extractors.ts
 export async function extractUserFromResponse(response: Response): Promise<User> {
  const user = await response.json();
  return user; // Just extracts, no assertions
 }
 test('create user with extraction helper', async ({ request }) => {
  const userData = createUser({ email: 'test@example.com' });
  const response = await request.post('/api/users', { data: userData });
  // Extract data with helper (OK)
  const createdUser = await extractUserFromResponse(response);
  // But keep assertions in test (REQUIRED)
  expect(response.status()).toBe(201);
  expect(createdUser.email).toBe(userData.email);
  expect(createdUser.role).toBe('user');
 });
 // Cypress equivalent
 describe('User API', () => {
  it('should create user with explicit assertions', () => {
    const userData = createUser({ email: 'test@example.com' });
    cy.request('POST', '/api/users', userData).then((response) => {
      // All assertions visible in test
      expect(response.status).to.equal(201);
      expect(response.body.id).to.exist;
      expect(response.body.email).to.equal(userData.email);
      expect(response.body.name).to.equal(userData.name);
      expect(response.body.role).to.equal('user');
      expect(response.body.createdAt).to.exist;
      expect(response.body.isActive).to.be.true;
    });
  });
 });
 // ✅ GOOD: Parametrized tests for soft assertions (bulk validation)
 test.describe('User creation validation', () => {
  const testCases = [
    { field: 'email', value: 'test@example.com', expected: 'test@example.com' },
    { field: 'name', value: 'Test User', expected: 'Test User' },
    { field: 'role', value: 'admin', expected: 'admin' },
    { field: 'isActive', value: true, expected: true },
  ];
  for (const { field, value, expected } of testCases) {
    test(`should set ${field} correctly`, async ({ request }) => {
      const userData = createUser({ [field]: value });
      const response = await request.post('/api/users', { data: userData });
      const user = await response.json();
      // Parametrized assertion - still explicit
      expect(user[field]).toBe(expected);
    });
  }
 });
 ```
 **Key Points**:
 - Never hide `expect()` calls in helper functions
 - Helpers can extract/transform data, but assertions stay in tests
 - Parametrized tests are acceptable for bulk validation (still explicit)
 - Explicit assertions make failures actionable: "Expected X, got Y"
 - Hidden assertions produce vague failures: "Helper function failed"
 ### Example 4: Test Length Limits
 **Context**: When tests grow beyond 300 lines, they become hard to understand, debug, and maintain. Refactor long tests by extracting setup helpers, splitting scenarios, or using fixtures.
 **Implementation**:
 ```typescript
 // ❌ BAD: 400-line monolithic test (truncated for example)
 test('complete user journey - TOO LONG', async ({ page, request }) => {
  // 50 lines of setup
  const admin = createUser({ role: 'admin' });
  await request.post('/api/users', { data: admin });
  await page.goto('/login');
  await page.fill('[data-testid="email"]', admin.email);
  await page.fill('[data-testid="password"]', 'password123');
  await page.click('[data-testid="login"]');
  await expect(page).toHaveURL('/dashboard');
  // 100 lines of user creation
  await page.goto('/admin/users');
  const newUser = createUser();
  await page.fill('[data-testid="email"]', newUser.email);
  // ... 95 more lines of form filling, validation, etc.
  // 100 lines of permissions assignment
  await page.click('[data-testid="assign-permissions"]');
  // ... 95 more lines
  // 100 lines of notification preferences
  await page.click('[data-testid="notification-settings"]');
  // ... 95 more lines
  // 50 lines of cleanup
  await request.delete(`/api/users/${newUser.id}`);
  // ... 45 more lines
  // TOTAL: 400 lines - impossible to understand or debug
 });
 // ✅ GOOD: Split into focused tests with shared fixture
 // playwright/support/fixtures/admin-fixture.ts
 export const test = base.extend({
  adminPage: async ({ page, request }, use) => {
    // Shared setup: Login as admin
    const admin = createUser({ role: 'admin' });
    await request.post('/api/users', { data: admin });
    await page.goto('/login');
    await page.fill('[data-testid="email"]', admin.email);
    await page.fill('[data-testid="password"]', 'password123');
    await page.click('[data-testid="login"]');
    await expect(page).toHaveURL('/dashboard');
    await use(page); // Provide logged-in page
    // Cleanup handled by fixture
  },
 });
 // Test 1: User creation (50 lines)
 test('admin can create user', async ({ adminPage, seedUser }) => {
  await adminPage.goto('/admin/users');
  const newUser = createUser();
  await adminPage.fill('[data-testid="email"]', newUser.email);
  await adminPage.fill('[data-testid="name"]', newUser.name);
  await adminPage.click('[data-testid="role-dropdown"]');
  await adminPage.click('[data-testid="role-user"]');
  await adminPage.click('[data-testid="create-user"]');
  await expect(adminPage.getByText('User created')).toBeVisible();
  await expect(adminPage.getByText(newUser.email)).toBeVisible();
  // Verify in database
  const created = await seedUser({ email: newUser.email });
  expect(created.role).toBe('user');
 });
 // Test 2: Permission assignment (60 lines)
 test('admin can assign permissions', async ({ adminPage, seedUser }) => {
  const user = await seedUser({ email: faker.internet.email() });
  await adminPage.goto(`/admin/users/${user.id}`);
  await adminPage.click('[data-testid="assign-permissions"]');
  await adminPage.check('[data-testid="permission-read"]');
  await adminPage.check('[data-testid="permission-write"]');
  await adminPage.click('[data-testid="save-permissions"]');
  await expect(adminPage.getByText('Permissions updated')).toBeVisible();
  // Verify permissions assigned
  const response = await adminPage.request.get(`/api/users/${user.id}`);
  const updated = await response.json();
  expect(updated.permissions).toContain('read');
  expect(updated.permissions).toContain('write');
 });
 // Test 3: Notification preferences (70 lines)
 test('admin can update notification preferences', async ({ adminPage, seedUser }) => {
  const user = await seedUser({ email: faker.internet.email() });
  await adminPage.goto(`/admin/users/${user.id}/notifications`);
  await adminPage.check('[data-testid="email-notifications"]');
  await adminPage.uncheck('[data-testid="sms-notifications"]');
  await adminPage.selectOption('[data-testid="frequency"]', 'daily');
  await adminPage.click('[data-testid="save-preferences"]');
  await expect(adminPage.getByText('Preferences saved')).toBeVisible();
  // Verify preferences
  const response = await adminPage.request.get(`/api/users/${user.id}/preferences`);
  const prefs = await response.json();
  expect(prefs.emailEnabled).toBe(true);
  expect(prefs.smsEnabled).toBe(false);
  expect(prefs.frequency).toBe('daily');
 });
 // TOTAL: 3 tests × 60 lines avg = 180 lines
 // Each test is focused, debuggable, and under 300 lines
 ```
 **Key Points**:
 - Split monolithic tests into focused scenarios (<300 lines each)
 - Extract common setup into fixtures (auto-runs for each test)
 - Each test validates one concern (user creation, permissions, preferences)
 - Failures are easier to diagnose: "Permission assignment failed" vs "Complete journey failed"
 - Tests can run in parallel (isolated concerns)
 ### Example 5: Execution Time Optimization
 **Context**: When tests take longer than 1.5 minutes, they slow CI pipelines and feedback loops. Optimize by using API setup instead of UI navigation, parallelizing independent operations, and avoiding unnecessary waits.
 **Implementation**:
 ```typescript
 // ❌ BAD: 4-minute test (slow setup, sequential operations)
 test('user completes order - SLOW (4 min)', async ({ page }) => {
  // Step 1: Manual signup via UI (90 seconds)
  await page.goto('/signup');
  await page.fill('[data-testid="email"]', 'buyer@example.com');
  await page.fill('[data-testid="password"]', 'password123');
  await page.fill('[data-testid="confirm-password"]', 'password123');
  await page.fill('[data-testid="name"]', 'Buyer User');
  await page.click('[data-testid="signup"]');
  await page.waitForURL('/verify-email'); // Wait for email verification
  // ... manual email verification flow
  // Step 2: Manual product creation via UI (60 seconds)
  await page.goto('/admin/products');
  await page.fill('[data-testid="product-name"]', 'Widget');
  // ... 20 more fields
  await page.click('[data-testid="create-product"]');
  // Step 3: Navigate to checkout (30 seconds)
  await page.goto('/products');
  await page.waitForTimeout(5000); // Unnecessary hard wait
  await page.click('[data-testid="product-widget"]');
  await page.waitForTimeout(3000); // Unnecessary
  await page.click('[data-testid="add-to-cart"]');
  await page.waitForTimeout(2000); // Unnecessary
  // Step 4: Complete checkout (40 seconds)
  await page.goto('/checkout');
  await page.waitForTimeout(5000); // Unnecessary
  await page.fill('[data-testid="credit-card"]', '4111111111111111');
  // ... more form filling
  await page.click('[data-testid="submit-order"]');
  await page.waitForTimeout(10000); // Unnecessary
  await expect(page.getByText('Order Confirmed')).toBeVisible();
  // TOTAL: ~240 seconds (4 minutes)
 });
 // ✅ GOOD: 45-second test (API setup, parallel ops, deterministic waits)
 test('user completes order', async ({ page, apiRequest }) => {
  // Step 1: API setup (parallel, 5 seconds total)
  const [user, product] = await Promise.all([
    // Create user via API (fast)
    apiRequest
      .post('/api/users', {
        data: createUser({
          email: 'buyer@example.com',
          emailVerified: true, // Skip verification
        }),
      })
      .then((r) => r.json()),
    // Create product via API (fast)
    apiRequest
      .post('/api/products', {
        data: createProduct({
          name: 'Widget',
          price: 29.99,
          stock: 10,
        }),
      })
      .then((r) => r.json()),
  ]);
  // Step 2: Auth setup via storage state (instant, 0 seconds)
  await page.context().addCookies([
    {
      name: 'auth_token',
      value: user.token,
      domain: 'localhost',
      path: '/',
    },
  ]);
  // Step 3: Network-first interception BEFORE navigation (10 seconds)
  const cartPromise = page.waitForResponse('**/api/cart');
  const orderPromise = page.waitForResponse('**/api/orders');
  await page.goto(`/products/${product.id}`);
  await page.click('[data-testid="add-to-cart"]');
  await cartPromise; // Deterministic wait (no hard wait)
  // Step 4: Checkout with network waits (30 seconds)
  await page.goto('/checkout');
  await page.fill('[data-testid="credit-card"]', '4111111111111111');
  await page.fill('[data-testid="cvv"]', '123');
  await page.fill('[data-testid="expiry"]', '12/25');
  await page.click('[data-testid="submit-order"]');
  await orderPromise; // Deterministic wait (no hard wait)
  await expect(page.getByText('Order Confirmed')).toBeVisible();
  await expect(page.getByText(`Order #${product.id}`)).toBeVisible();
  // TOTAL: ~45 seconds (6x faster)
 });
 // Cypress equivalent
 describe('Order Flow', () => {
  it('should complete purchase quickly', () => {
    // Step 1: API setup (parallel, fast)
    const user = createUser({ emailVerified: true });
    const product = createProduct({ name: 'Widget', price: 29.99 });
    cy.task('db:seed', { users: [user], products: [product] });
    // Step 2: Auth setup via session (instant)
    cy.setCookie('auth_token', user.token);
    // Step 3: Network-first interception
    cy.intercept('POST', '**/api/cart').as('addToCart');
    cy.intercept('POST', '**/api/orders').as('createOrder');
    cy.visit(`/products/${product.id}`);
    cy.get('[data-cy="add-to-cart"]').click();
    cy.wait('@addToCart'); // Deterministic wait
    // Step 4: Checkout
    cy.visit('/checkout');
    cy.get('[data-cy="credit-card"]').type('4111111111111111');
    cy.get('[data-cy="cvv"]').type('123');
    cy.get('[data-cy="expiry"]').type('12/25');
    cy.get('[data-cy="submit-order"]').click();
    cy.wait('@createOrder'); // Deterministic wait
    cy.contains('Order Confirmed').should('be.visible');
    cy.contains(`Order #${product.id}`).should('be.visible');
  });
 });
 // Additional optimization: Shared auth state (0 seconds per test)
 // playwright/support/global-setup.ts
 export default async function globalSetup() {
  const browser = await chromium.launch();
  const page = await browser.newPage();
  // Create admin user once for all tests
  const admin = createUser({ role: 'admin', emailVerified: true });
  await page.request.post('/api/users', { data: admin });
  // Login once, save session
  await page.goto('/login');
  await page.fill('[data-testid="email"]', admin.email);
  await page.fill('[data-testid="password"]', 'password123');
  await page.click('[data-testid="login"]');
  // Save auth state for reuse
  await page.context().storageState({ path: 'playwright/.auth/admin.json' });
  await browser.close();
 }
 // Use shared auth in tests (instant)
 test.use({ storageState: 'playwright/.auth/admin.json' });
 test('admin action', async ({ page }) => {
  // Already logged in - no auth overhead (0 seconds)
  await page.goto('/admin');
  // ... test logic
 });
 ```
 **Key Points**:
 - Use API for data setup (10-50x faster than UI)
 - Run independent operations in parallel (`Promise.all`)
 - Replace hard waits with deterministic waits (`waitForResponse`)
 - Reuse auth sessions via `storageState` (Playwright) or `setCookie` (Cypress)
 - Skip unnecessary flows (email verification, multi-step signups)
 ## Integration Points
 - **Used in workflows**: `*atdd` (test generation quality), `*automate` (test expansion quality), `*test-review` (quality validation)
 - **Related fragments**:
  - `network-first.md` - Deterministic waiting strategies
  - `data-factories.md` - Isolated, parallel-safe data patterns
  - `fixture-architecture.md` - Setup extraction and cleanup
  - `test-levels-framework.md` - Choosing appropriate test granularity for speed
 ## Core Quality Checklist
 Every test must pass these criteria:
 - [ ] **No Hard Waits** - Use `waitForResponse`, `waitForLoadState`, or element state (not `waitForTimeout`)
 - [ ] **No Conditionals** - Tests execute the same path every time (no if/else, try/catch for flow control)
 - [ ] **< 300 Lines** - Keep tests focused; split large tests or extract setup to fixtures
 - [ ] **< 1.5 Minutes** - Optimize with API setup, parallel operations, and shared auth
 - [ ] **Self-Cleaning** - Use fixtures with auto-cleanup or explicit `afterEach()` teardown
 - [ ] **Explicit Assertions** - Keep `expect()` calls in test bodies, not hidden in helpers
 - [ ] **Unique Data** - Use `faker` for dynamic data; never hardcode IDs or emails
 - [ ] **Parallel-Safe** - Tests don't share state; run successfully with `--workers=4`
 _Source: Murat quality checklist, Definition of Done requirements (lines 370-381, 406-422)._
--- a/_bmad/bmm/testarch/knowledge/timing-debugging.md
+++ b/_bmad/bmm/testarch/knowledge/timing-debugging.md
@@ -0,0 +1,372 @@
 # Timing Debugging and Race Condition Fixes
 ## Principle
 Race conditions arise when tests make assumptions about asynchronous timing (network, animations, state updates). **Deterministic waiting** eliminates flakiness by explicitly waiting for observable events (network responses, element state changes) instead of arbitrary timeouts.
 ## Rationale
 **The Problem**: Tests pass locally but fail in CI (different timing), or pass/fail randomly (race conditions). Hard waits (`waitForTimeout`, `sleep`) mask timing issues without solving them.
 **The Solution**: Replace all hard waits with event-based waits (`waitForResponse`, `waitFor({ state })`). Implement network-first pattern (intercept before navigate). Use explicit state checks (loading spinner detached, data loaded). This makes tests deterministic regardless of network speed or system load.
 **Why This Matters**:
 - Eliminates flaky tests (0 tolerance for timing-based failures)
 - Works consistently across environments (local, CI, production-like)
 - Faster test execution (no unnecessary waits)
 - Clearer test intent (explicit about what we're waiting for)
 ## Pattern Examples
 ### Example 1: Race Condition Identification (Network-First Pattern)
 **Context**: Prevent race conditions by intercepting network requests before navigation
 **Implementation**:
 ```typescript
 // tests/timing/race-condition-prevention.spec.ts
 import { test, expect } from '@playwright/test';
 test.describe('Race Condition Prevention Patterns', () => {
  test('❌ Anti-Pattern: Navigate then intercept (race condition)', async ({ page, context }) => {
    // BAD: Navigation starts before interception ready
    await page.goto('/products'); // ⚠️ Race! API might load before route is set
    await context.route('**/api/products', (route) => {
      route.fulfill({ status: 200, body: JSON.stringify({ products: [] }) });
    });
    // Test may see real API response or mock (non-deterministic)
  });
  test('✅ Pattern: Intercept BEFORE navigate (deterministic)', async ({ page, context }) => {
    // GOOD: Interception ready before navigation
    await context.route('**/api/products', (route) => {
      route.fulfill({
        status: 200,
        contentType: 'application/json',
        body: JSON.stringify({
          products: [
            { id: 1, name: 'Product A', price: 29.99 },
            { id: 2, name: 'Product B', price: 49.99 },
          ],
        }),
      });
    });
    const responsePromise = page.waitForResponse('**/api/products');
    await page.goto('/products'); // Navigation happens AFTER route is ready
    await responsePromise; // Explicit wait for network
    // Test sees mock response reliably (deterministic)
    await expect(page.getByText('Product A')).toBeVisible();
  });
  test('✅ Pattern: Wait for element state change (loading → loaded)', async ({ page }) => {
    await page.goto('/dashboard');
    // Wait for loading indicator to appear (confirms load started)
    await page.getByTestId('loading-spinner').waitFor({ state: 'visible' });
    // Wait for loading indicator to disappear (confirms load complete)
    await page.getByTestId('loading-spinner').waitFor({ state: 'detached' });
    // Content now reliably visible
    await expect(page.getByTestId('dashboard-data')).toBeVisible();
  });
  test('✅ Pattern: Explicit visibility check (not just presence)', async ({ page }) => {
    await page.goto('/modal-demo');
    await page.getByRole('button', { name: 'Open Modal' }).click();
    // ❌ Bad: Element exists but may not be visible yet
    // await expect(page.getByTestId('modal')).toBeAttached()
    // ✅ Good: Wait for visibility (accounts for animations)
    await expect(page.getByTestId('modal')).toBeVisible();
    await expect(page.getByRole('heading', { name: 'Modal Title' })).toBeVisible();
  });
  test('❌ Anti-Pattern: waitForLoadState("networkidle") in SPAs', async ({ page }) => {
    // ⚠️ Deprecated for SPAs (WebSocket connections never idle)
    // await page.goto('/dashboard')
    // await page.waitForLoadState('networkidle') // May timeout in SPAs
    // ✅ Better: Wait for specific API response
    const responsePromise = page.waitForResponse('**/api/dashboard');
    await page.goto('/dashboard');
    await responsePromise;
    await expect(page.getByText('Dashboard loaded')).toBeVisible();
  });
 });
 ```
 **Key Points**:
 - Network-first: ALWAYS intercept before navigate (prevents race conditions)
 - State changes: Wait for loading spinner detached (explicit load completion)
 - Visibility vs presence: `toBeVisible()` accounts for animations, `toBeAttached()` doesn't
 - Avoid networkidle: Unreliable in SPAs (WebSocket, polling connections)
 - Explicit waits: Document exactly what we're waiting for
 ---
 ### Example 2: Deterministic Waiting Patterns (Event-Based, Not Time-Based)
 **Context**: Replace all hard waits with observable event waits
 **Implementation**:
 ```typescript
 // tests/timing/deterministic-waits.spec.ts
 import { test, expect } from '@playwright/test';
 test.describe('Deterministic Waiting Patterns', () => {
  test('waitForResponse() with URL pattern', async ({ page }) => {
    const responsePromise = page.waitForResponse('**/api/products');
    await page.goto('/products');
    await responsePromise; // Deterministic (waits for exact API call)
    await expect(page.getByText('Products loaded')).toBeVisible();
  });
  test('waitForResponse() with predicate function', async ({ page }) => {
    const responsePromise = page.waitForResponse((resp) => resp.url().includes('/api/search') && resp.status() === 200);
    await page.goto('/search');
    await page.getByPlaceholder('Search').fill('laptop');
    await page.getByRole('button', { name: 'Search' }).click();
    await responsePromise; // Wait for successful search response
    await expect(page.getByTestId('search-results')).toBeVisible();
  });
  test('waitForFunction() for custom conditions', async ({ page }) => {
    await page.goto('/dashboard');
    // Wait for custom JavaScript condition
    await page.waitForFunction(() => {
      const element = document.querySelector('[data-testid="user-count"]');
      return element && parseInt(element.textContent || '0') > 0;
    });
    // User count now loaded
    await expect(page.getByTestId('user-count')).not.toHaveText('0');
  });
  test('waitFor() element state (attached, visible, hidden, detached)', async ({ page }) => {
    await page.goto('/products');
    // Wait for element to be attached to DOM
    await page.getByTestId('product-list').waitFor({ state: 'attached' });
    // Wait for element to be visible (animations complete)
    await page.getByTestId('product-list').waitFor({ state: 'visible' });
    // Perform action
    await page.getByText('Product A').click();
    // Wait for modal to be hidden (close animation complete)
    await page.getByTestId('modal').waitFor({ state: 'hidden' });
  });
  test('Cypress: cy.wait() with aliased intercepts', async () => {
    // Cypress example (not Playwright)
    /*
    cy.intercept('GET', '/api/products').as('getProducts')
    cy.visit('/products')
    cy.wait('@getProducts') // Deterministic wait for specific request
    cy.get('[data-testid="product-list"]').should('be.visible')
    */
  });
 });
 ```
 **Key Points**:
 - `waitForResponse()`: Wait for specific API calls (URL pattern or predicate)
 - `waitForFunction()`: Wait for custom JavaScript conditions
 - `waitFor({ state })`: Wait for element state changes (attached, visible, hidden, detached)
 - Cypress `cy.wait('@alias')`: Deterministic wait for aliased intercepts
 - All waits are event-based (not time-based)
 ---
 ### Example 3: Timing Anti-Patterns (What NEVER to Do)
 **Context**: Common timing mistakes that cause flakiness
 **Problem Examples**:
 ```typescript
 // tests/timing/anti-patterns.spec.ts
 import { test, expect } from '@playwright/test';
 test.describe('Timing Anti-Patterns to Avoid', () => {
  test('❌ NEVER: page.waitForTimeout() (arbitrary delay)', async ({ page }) => {
    await page.goto('/dashboard');
    // ❌ Bad: Arbitrary 3-second wait (flaky)
    // await page.waitForTimeout(3000)
    // Problem: Might be too short (CI slower) or too long (wastes time)
    // ✅ Good: Wait for observable event
    await page.waitForResponse('**/api/dashboard');
    await expect(page.getByText('Dashboard loaded')).toBeVisible();
  });
  test('❌ NEVER: cy.wait(number) without alias (arbitrary delay)', async () => {
    // Cypress example
    /*
    // ❌ Bad: Arbitrary delay
    cy.visit('/products')
    cy.wait(2000) // Flaky!
    // ✅ Good: Wait for specific request
    cy.intercept('GET', '/api/products').as('getProducts')
    cy.visit('/products')
    cy.wait('@getProducts') // Deterministic
    */
  });
  test('❌ NEVER: Multiple hard waits in sequence (compounding delays)', async ({ page }) => {
    await page.goto('/checkout');
    // ❌ Bad: Stacked hard waits (6+ seconds wasted)
    // await page.waitForTimeout(2000) // Wait for form
    // await page.getByTestId('email').fill('test@example.com')
    // await page.waitForTimeout(1000) // Wait for validation
    // await page.getByTestId('submit').click()
    // await page.waitForTimeout(3000) // Wait for redirect
    // ✅ Good: Event-based waits (no wasted time)
    await page.getByTestId('checkout-form').waitFor({ state: 'visible' });
    await page.getByTestId('email').fill('test@example.com');
    await page.waitForResponse('**/api/validate-email');
    await page.getByTestId('submit').click();
    await page.waitForURL('**/confirmation');
  });
  test('❌ NEVER: waitForLoadState("networkidle") in SPAs', async ({ page }) => {
    // ❌ Bad: Unreliable in SPAs (WebSocket connections never idle)
    // await page.goto('/dashboard')
    // await page.waitForLoadState('networkidle') // Timeout in SPAs!
    // ✅ Good: Wait for specific API responses
    await page.goto('/dashboard');
    await page.waitForResponse('**/api/dashboard');
    await page.waitForResponse('**/api/user');
    await expect(page.getByTestId('dashboard-content')).toBeVisible();
  });
  test('❌ NEVER: Sleep/setTimeout in tests', async ({ page }) => {
    await page.goto('/products');
    // ❌ Bad: Node.js sleep (blocks test thread)
    // await new Promise(resolve => setTimeout(resolve, 2000))
    // ✅ Good: Playwright auto-waits for element
    await expect(page.getByText('Products loaded')).toBeVisible();
  });
 });
 ```
 **Why These Fail**:
 - **Hard waits**: Arbitrary timeouts (too short → flaky, too long → slow)
 - **Stacked waits**: Compound delays (wasteful, unreliable)
 - **networkidle**: Broken in SPAs (WebSocket/polling never idle)
 - **Sleep**: Blocks execution (wastes time, doesn't solve race conditions)
 **Better Approach**: Use event-based waits from examples above
 ---
 ## Async Debugging Techniques
 ### Technique 1: Promise Chain Analysis
 ```typescript
 test('debug async waterfall with console logs', async ({ page }) => {
  console.log('1. Starting navigation...');
  await page.goto('/products');
  console.log('2. Waiting for API response...');
  const response = await page.waitForResponse('**/api/products');
  console.log('3. API responded:', response.status());
  console.log('4. Waiting for UI update...');
  await expect(page.getByText('Products loaded')).toBeVisible();
  console.log('5. Test complete');
  // Console output shows exactly where timing issue occurs
 });
 ```
 ### Technique 2: Network Waterfall Inspection (DevTools)
 ```typescript
 test('inspect network timing with trace viewer', async ({ page }) => {
  await page.goto('/dashboard');
  // Generate trace for analysis
  // npx playwright test --trace on
  // npx playwright show-trace trace.zip
  // In trace viewer:
  // 1. Check Network tab for API call timing
  // 2. Identify slow requests (>1s response time)
  // 3. Find race conditions (overlapping requests)
  // 4. Verify request order (dependencies)
 });
 ```
 ### Technique 3: Trace Viewer for Timing Visualization
 ```typescript
 test('use trace viewer to debug timing', async ({ page }) => {
  // Run with trace: npx playwright test --trace on
  await page.goto('/checkout');
  await page.getByTestId('submit').click();
  // In trace viewer, examine:
  // - Timeline: See exact timing of each action
  // - Snapshots: Hover to see DOM state at each moment
  // - Network: Identify slow/failed requests
  // - Console: Check for async errors
  await expect(page.getByText('Success')).toBeVisible();
 });
 ```
 ---
 ## Race Condition Checklist
 Before deploying tests:
 - [ ] **Network-first pattern**: All routes intercepted BEFORE navigation (no race conditions)
 - [ ] **Explicit waits**: Every navigation followed by `waitForResponse()` or state check
 - [ ] **No hard waits**: Zero instances of `waitForTimeout()`, `cy.wait(number)`, `sleep()`
 - [ ] **Element state waits**: Loading spinners use `waitFor({ state: 'detached' })`
 - [ ] **Visibility checks**: Use `toBeVisible()` (accounts for animations), not just `toBeAttached()`
 - [ ] **Response validation**: Wait for successful responses (`resp.ok()` or `status === 200`)
 - [ ] **Trace viewer analysis**: Generate traces to identify timing issues (network waterfall, console errors)
 - [ ] **CI/local parity**: Tests pass reliably in both environments (no timing assumptions)
 ## Integration Points
 - **Used in workflows**: `*automate` (healing timing failures), `*test-review` (detect hard wait anti-patterns), `*framework` (configure timeout standards)
 - **Related fragments**: `test-healing-patterns.md` (race condition diagnosis), `network-first.md` (interception patterns), `playwright-config.md` (timeout configuration), `visual-debugging.md` (trace viewer analysis)
 - **Tools**: Playwright Inspector (`--debug`), Trace Viewer (`--trace on`), DevTools Network tab
 _Source: Playwright timing best practices, network-first pattern from test-resources-for-ai, production race condition debugging_
--- a/_bmad/bmm/testarch/knowledge/visual-debugging.md
+++ b/_bmad/bmm/testarch/knowledge/visual-debugging.md
@@ -0,0 +1,524 @@
 # Visual Debugging and Developer Ergonomics
 ## Principle
 Fast feedback loops and transparent debugging artifacts are critical for maintaining test reliability and developer confidence. Visual debugging tools (trace viewers, screenshots, videos, HAR files) turn cryptic test failures into actionable insights, reducing triage time from hours to minutes.
 ## Rationale
 **The Problem**: CI failures often provide minimal context—a timeout, a selector mismatch, or a network error—forcing developers to reproduce issues locally (if they can). This wastes time and discourages test maintenance.
 **The Solution**: Capture rich debugging artifacts **only on failure** to balance storage costs with diagnostic value. Modern tools like Playwright Trace Viewer, Cypress Debug UI, and HAR recordings provide interactive, time-travel debugging that reveals exactly what the test saw at each step.
 **Why This Matters**:
 - Reduces failure triage time by 80-90% (visual context vs logs alone)
 - Enables debugging without local reproduction
 - Improves test maintenance confidence (clear failure root cause)
 - Catches timing/race conditions that are hard to reproduce locally
 ## Pattern Examples
 ### Example 1: Playwright Trace Viewer Configuration (Production Pattern)
 **Context**: Capture traces on first retry only (balances storage and diagnostics)
 **Implementation**:
 ```typescript
 // playwright.config.ts
 import { defineConfig } from '@playwright/test';
 export default defineConfig({
  use: {
    // Visual debugging artifacts (space-efficient)
    trace: 'on-first-retry', // Only when test fails once
    screenshot: 'only-on-failure', // Not on success
    video: 'retain-on-failure', // Delete on pass
    // Context for debugging
    baseURL: process.env.BASE_URL || 'http://localhost:3000',
    // Timeout context
    actionTimeout: 15_000, // 15s for clicks/fills
    navigationTimeout: 30_000, // 30s for page loads
  },
  // CI-specific artifact retention
  reporter: [
    ['html', { outputFolder: 'playwright-report', open: 'never' }],
    ['junit', { outputFile: 'results.xml' }],
    ['list'], // Console output
  ],
  // Failure handling
  retries: process.env.CI ? 2 : 0, // Retry in CI to capture trace
  workers: process.env.CI ? 1 : undefined,
 });
 ```
 **Opening and Using Trace Viewer**:
 ```bash
 # After test failure in CI, download trace artifact
 # Then open locally:
 npx playwright show-trace path/to/trace.zip
 # Or serve trace viewer:
 npx playwright show-report
 ```
 **Key Features to Use in Trace Viewer**:
 1. **Timeline**: See each action (click, navigate, assertion) with timing
 2. **Snapshots**: Hover over timeline to see DOM state at that moment
 3. **Network Tab**: Inspect all API calls, headers, payloads, timing
 4. **Console Tab**: View console.log/error messages
 5. **Source Tab**: See test code with execution markers
 6. **Metadata**: Browser, OS, test duration, screenshots
 **Why This Works**:
 - `on-first-retry` avoids capturing traces for flaky passes (saves storage)
 - Screenshots + video give visual context without trace overhead
 - Interactive timeline makes timing issues obvious (race conditions, slow API)
 ---
 ### Example 2: HAR File Recording for Network Debugging
 **Context**: Capture all network activity for reproducible API debugging
 **Implementation**:
 ```typescript
 // tests/e2e/checkout-with-har.spec.ts
 import { test, expect } from '@playwright/test';
 import path from 'path';
 test.describe('Checkout Flow with HAR Recording', () => {
  test('should complete payment with full network capture', async ({ page, context }) => {
    // Start HAR recording BEFORE navigation
    await context.routeFromHAR(path.join(__dirname, '../fixtures/checkout.har'), {
      url: '**/api/**', // Only capture API calls
      update: true, // Update HAR if file exists
    });
    await page.goto('/checkout');
    // Interact with page
    await page.getByTestId('payment-method').selectOption('credit-card');
    await page.getByTestId('card-number').fill('4242424242424242');
    await page.getByTestId('submit-payment').click();
    // Wait for payment confirmation
    await expect(page.getByTestId('success-message')).toBeVisible();
    // HAR file saved to fixtures/checkout.har
    // Contains all network requests/responses for replay
  });
 });
 ```
 **Using HAR for Deterministic Mocking**:
 ```typescript
 // tests/e2e/checkout-replay-har.spec.ts
 import { test, expect } from '@playwright/test';
 import path from 'path';
 test('should replay checkout flow from HAR', async ({ page, context }) => {
  // Replay network from HAR (no real API calls)
  await context.routeFromHAR(path.join(__dirname, '../fixtures/checkout.har'), {
    url: '**/api/**',
    update: false, // Read-only mode
  });
  await page.goto('/checkout');
  // Same test, but network responses come from HAR file
  await page.getByTestId('payment-method').selectOption('credit-card');
  await page.getByTestId('card-number').fill('4242424242424242');
  await page.getByTestId('submit-payment').click();
  await expect(page.getByTestId('success-message')).toBeVisible();
 });
 ```
 **Key Points**:
 - **`update: true`** records new HAR or updates existing (for flaky API debugging)
 - **`update: false`** replays from HAR (deterministic, no real API)
 - Filter by URL pattern (`**/api/**`) to avoid capturing static assets
 - HAR files are human-readable JSON (easy to inspect/modify)
 **When to Use HAR**:
 - Debugging flaky tests caused by API timing/responses
 - Creating deterministic mocks for integration tests
 - Analyzing third-party API behavior (Stripe, Auth0)
 - Reproducing production issues locally (record HAR in staging)
 ---
 ### Example 3: Custom Artifact Capture (Console Logs + Network on Failure)
 **Context**: Capture additional debugging context automatically on test failure
 **Implementation**:
 ```typescript
 // playwright/support/fixtures/debug-fixture.ts
 import { test as base } from '@playwright/test';
 import fs from 'fs';
 import path from 'path';
 type DebugFixture = {
  captureDebugArtifacts: () => Promise<void>;
 };
 export const test = base.extend<DebugFixture>({
  captureDebugArtifacts: async ({ page }, use, testInfo) => {
    const consoleLogs: string[] = [];
    const networkRequests: Array<{ url: string; status: number; method: string }> = [];
    // Capture console messages
    page.on('console', (msg) => {
      consoleLogs.push(`[${msg.type()}] ${msg.text()}`);
    });
    // Capture network requests
    page.on('request', (request) => {
      networkRequests.push({
        url: request.url(),
        method: request.method(),
        status: 0, // Will be updated on response
      });
    });
    page.on('response', (response) => {
      const req = networkRequests.find((r) => r.url === response.url());
      if (req) req.status = response.status();
    });
    await use(async () => {
      // This function can be called manually in tests
      // But it also runs automatically on failure via afterEach
    });
    // After test completes, save artifacts if failed
    if (testInfo.status !== testInfo.expectedStatus) {
      const artifactDir = path.join(testInfo.outputDir, 'debug-artifacts');
      fs.mkdirSync(artifactDir, { recursive: true });
      // Save console logs
      fs.writeFileSync(path.join(artifactDir, 'console.log'), consoleLogs.join('\n'), 'utf-8');
      // Save network summary
      fs.writeFileSync(path.join(artifactDir, 'network.json'), JSON.stringify(networkRequests, null, 2), 'utf-8');
      console.log(`Debug artifacts saved to: ${artifactDir}`);
    }
  },
 });
 ```
 **Usage in Tests**:
 ```typescript
 // tests/e2e/payment-with-debug.spec.ts
 import { test, expect } from '../support/fixtures/debug-fixture';
 test('payment flow captures debug artifacts on failure', async ({ page, captureDebugArtifacts }) => {
  await page.goto('/checkout');
  // Test will automatically capture console + network on failure
  await page.getByTestId('submit-payment').click();
  await expect(page.getByTestId('success-message')).toBeVisible({ timeout: 5000 });
  // If this fails, console.log and network.json saved automatically
 });
 ```
 **CI Integration (GitHub Actions)**:
 ```yaml
 # .github/workflows/e2e.yml
 name: E2E Tests with Artifacts
 on: [push, pull_request]
 jobs:
  test:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - uses: actions/setup-node@v4
        with:
          node-version-file: '.nvmrc'
      - name: Install dependencies
        run: npm ci
      - name: Run Playwright tests
        run: npm run test:e2e
        continue-on-error: true # Capture artifacts even on failure
      - name: Upload test artifacts on failure
        if: failure()
        uses: actions/upload-artifact@v4
        with:
          name: playwright-artifacts
          path: |
            test-results/
            playwright-report/
          retention-days: 30
 ```
 **Key Points**:
 - Fixtures automatically capture context without polluting test code
 - Only saves artifacts on failure (storage-efficient)
 - CI uploads artifacts for post-mortem analysis
 - `continue-on-error: true` ensures artifact upload even when tests fail
 ---
 ### Example 4: Accessibility Debugging Integration (axe-core in Trace Viewer)
 **Context**: Catch accessibility regressions during visual debugging
 **Implementation**:
 ```typescript
 // playwright/support/fixtures/a11y-fixture.ts
 import { test as base } from '@playwright/test';
 import AxeBuilder from '@axe-core/playwright';
 type A11yFixture = {
  checkA11y: () => Promise<void>;
 };
 export const test = base.extend<A11yFixture>({
  checkA11y: async ({ page }, use) => {
    await use(async () => {
      // Run axe accessibility scan
      const results = await new AxeBuilder({ page }).analyze();
      // Attach results to test report (visible in trace viewer)
      if (results.violations.length > 0) {
        console.log(`Found ${results.violations.length} accessibility violations:`);
        results.violations.forEach((violation) => {
          console.log(`- [${violation.impact}] ${violation.id}: ${violation.description}`);
          console.log(`  Help: ${violation.helpUrl}`);
        });
        throw new Error(`Accessibility violations found: ${results.violations.length}`);
      }
    });
  },
 });
 ```
 **Usage with Visual Debugging**:
 ```typescript
 // tests/e2e/checkout-a11y.spec.ts
 import { test, expect } from '../support/fixtures/a11y-fixture';
 test('checkout page is accessible', async ({ page, checkA11y }) => {
  await page.goto('/checkout');
  // Verify page loaded
  await expect(page.getByRole('heading', { name: 'Checkout' })).toBeVisible();
  // Run accessibility check
  await checkA11y();
  // If violations found, test fails and trace captures:
  // - Screenshot showing the problematic element
  // - Console log with violation details
  // - Network tab showing any failed resource loads
 });
 ```
 **Trace Viewer Benefits**:
 - **Screenshot shows visual context** of accessibility issue (contrast, missing labels)
 - **Console tab shows axe-core violations** with impact level and helpUrl
 - **DOM snapshot** allows inspecting ARIA attributes at failure point
 - **Network tab** reveals if icon fonts or images failed (common a11y issue)
 **Cypress Equivalent**:
 ```javascript
 // cypress/support/commands.ts
 import 'cypress-axe';
 Cypress.Commands.add('checkA11y', (context = null, options = {}) => {
  cy.injectAxe(); // Inject axe-core
  cy.checkA11y(context, options, (violations) => {
    if (violations.length) {
      cy.task('log', `Found ${violations.length} accessibility violations`);
      violations.forEach((violation) => {
        cy.task('log', `- [${violation.impact}] ${violation.id}: ${violation.description}`);
      });
    }
  });
 });
 // tests/e2e/checkout-a11y.cy.ts
 describe('Checkout Accessibility', () => {
  it('should have no a11y violations', () => {
    cy.visit('/checkout');
    cy.injectAxe();
    cy.checkA11y();
    // On failure, Cypress UI shows:
    // - Screenshot of page
    // - Console log with violation details
    // - Network tab with API calls
  });
 });
 ```
 **Key Points**:
 - Accessibility checks integrate seamlessly with visual debugging
 - Violations are captured in trace viewer/Cypress UI automatically
 - Provides actionable links (helpUrl) to fix issues
 - Screenshots show visual context (contrast, layout)
 ---
 ### Example 5: Time-Travel Debugging Workflow (Playwright Inspector)
 **Context**: Debug tests interactively with step-through execution
 **Implementation**:
 ```typescript
 // tests/e2e/checkout-debug.spec.ts
 import { test, expect } from '@playwright/test';
 test('debug checkout flow step-by-step', async ({ page }) => {
  // Set breakpoint by uncommenting this:
  // await page.pause()
  await page.goto('/checkout');
  // Use Playwright Inspector to:
  // 1. Step through each action
  // 2. Inspect DOM at each step
  // 3. View network calls per action
  // 4. Take screenshots manually
  await page.getByTestId('payment-method').selectOption('credit-card');
  // Pause here to inspect form state
  // await page.pause()
  await page.getByTestId('card-number').fill('4242424242424242');
  await page.getByTestId('submit-payment').click();
  await expect(page.getByTestId('success-message')).toBeVisible();
 });
 ```
 **Running with Inspector**:
 ```bash
 # Open Playwright Inspector (GUI debugger)
 npx playwright test --debug
 # Or use headed mode with slowMo
 npx playwright test --headed --slow-mo=1000
 # Debug specific test
 npx playwright test checkout-debug.spec.ts --debug
 # Set environment variable for persistent debugging
 PWDEBUG=1 npx playwright test
 ```
 **Inspector Features**:
 1. **Step-through execution**: Click "Next" to execute one action at a time
 2. **DOM inspector**: Hover over elements to see selectors
 3. **Network panel**: See API calls with timing
 4. **Console panel**: View console.log output
 5. **Pick locator**: Click element in browser to get selector
 6. **Record mode**: Record interactions to generate test code
 **Common Debugging Patterns**:
 ```typescript
 // Pattern 1: Debug selector issues
 test('debug selector', async ({ page }) => {
  await page.goto('/dashboard');
  await page.pause(); // Inspector opens
  // In Inspector console, test selectors:
  // page.getByTestId('user-menu') ✅
  // page.getByRole('button', { name: 'Profile' }) ✅
  // page.locator('.btn-primary') ❌ (fragile)
 });
 // Pattern 2: Debug timing issues
 test('debug network timing', async ({ page }) => {
  await page.goto('/dashboard');
  // Set up network listener BEFORE interaction
  const responsePromise = page.waitForResponse('**/api/users');
  await page.getByTestId('load-users').click();
  await page.pause(); // Check network panel for timing
  const response = await responsePromise;
  expect(response.status()).toBe(200);
 });
 // Pattern 3: Debug state changes
 test('debug state mutation', async ({ page }) => {
  await page.goto('/cart');
  // Check initial state
  await expect(page.getByTestId('cart-count')).toHaveText('0');
  await page.pause(); // Inspect DOM
  await page.getByTestId('add-to-cart').click();
  await page.pause(); // Inspect DOM again (compare state)
  await expect(page.getByTestId('cart-count')).toHaveText('1');
 });
 ```
 **Key Points**:
 - `page.pause()` opens Inspector at that exact moment
 - Inspector shows DOM state, network activity, console at pause point
 - "Pick locator" feature helps find robust selectors
 - Record mode generates test code from manual interactions
 ---
 ## Visual Debugging Checklist
 Before deploying tests to CI, ensure:
 - [ ] **Artifact configuration**: `trace: 'on-first-retry'`, `screenshot: 'only-on-failure'`, `video: 'retain-on-failure'`
 - [ ] **CI artifact upload**: GitHub Actions/GitLab CI configured to upload `test-results/` and `playwright-report/`
 - [ ] **HAR recording**: Set up for flaky API tests (record once, replay deterministically)
 - [ ] **Custom debug fixtures**: Console logs + network summary captured on failure
 - [ ] **Accessibility integration**: axe-core violations visible in trace viewer
 - [ ] **Trace viewer docs**: README explains how to open traces locally (`npx playwright show-trace`)
 - [ ] **Inspector workflow**: Document `--debug` flag for interactive debugging
 - [ ] **Storage optimization**: Artifacts deleted after 30 days (CI retention policy)
 ## Integration Points
 - **Used in workflows**: `*framework` (initial setup), `*ci` (artifact upload), `*test-review` (validate artifact config)
 - **Related fragments**: `playwright-config.md` (artifact configuration), `ci-burn-in.md` (CI artifact upload), `test-quality.md` (debugging best practices)
 - **Tools**: Playwright Trace Viewer, Cypress Debug UI, axe-core, HAR files
 _Source: Playwright official docs, Murat testing philosophy (visual debugging manifesto), SEON production debugging patterns_
--- a/_bmad/bmm/testarch/tea-index.csv
+++ b/_bmad/bmm/testarch/tea-index.csv
@@ -0,0 +1,33 @@
 id,name,description,tags,fragment_file
 fixture-architecture,Fixture Architecture,"Composable fixture patterns (pure function → fixture → merge) and reuse rules","fixtures,architecture,playwright,cypress",knowledge/fixture-architecture.md
 network-first,Network-First Safeguards,"Intercept-before-navigate workflow, HAR capture, deterministic waits, edge mocking","network,stability,playwright,cypress",knowledge/network-first.md
 data-factories,Data Factories and API Setup,"Factories with overrides, API seeding, cleanup discipline","data,factories,setup,api",knowledge/data-factories.md
 component-tdd,Component TDD Loop,"Red→green→refactor workflow, provider isolation, accessibility assertions","component-testing,tdd,ui",knowledge/component-tdd.md
 playwright-config,Playwright Config Guardrails,"Environment switching, timeout standards, artifact outputs","playwright,config,env",knowledge/playwright-config.md
 ci-burn-in,CI and Burn-In Strategy,"Staged jobs, shard orchestration, burn-in loops, artifact policy","ci,automation,flakiness",knowledge/ci-burn-in.md
 selective-testing,Selective Test Execution,"Tag/grep usage, spec filters, diff-based runs, promotion rules","risk-based,selection,strategy",knowledge/selective-testing.md
 feature-flags,Feature Flag Governance,"Enum management, targeting helpers, cleanup, release checklists","feature-flags,governance,launchdarkly",knowledge/feature-flags.md
 contract-testing,Contract Testing Essentials,"Pact publishing, provider verification, resilience coverage","contract-testing,pact,api",knowledge/contract-testing.md
 email-auth,Email Authentication Testing,"Magic link extraction, state preservation, caching, negative flows","email-authentication,security,workflow",knowledge/email-auth.md
 error-handling,Error Handling Checks,"Scoped exception handling, retry validation, telemetry logging","resilience,error-handling,stability",knowledge/error-handling.md
 visual-debugging,Visual Debugging Toolkit,"Trace viewer usage, artifact expectations, accessibility integration","debugging,dx,tooling",knowledge/visual-debugging.md
 risk-governance,Risk Governance,"Scoring matrix, category ownership, gate decision rules","risk,governance,gates",knowledge/risk-governance.md
 probability-impact,Probability and Impact Scale,"Shared definitions for scoring matrix and gate thresholds","risk,scoring,scale",knowledge/probability-impact.md
 test-quality,Test Quality Definition of Done,"Execution limits, isolation rules, green criteria","quality,definition-of-done,tests",knowledge/test-quality.md
 nfr-criteria,NFR Review Criteria,"Security, performance, reliability, maintainability status definitions","nfr,assessment,quality",knowledge/nfr-criteria.md
 test-levels,Test Levels Framework,"Guidelines for choosing unit, integration, or end-to-end coverage","testing,levels,selection",knowledge/test-levels-framework.md
 test-priorities,Test Priorities Matrix,"P0–P3 criteria, coverage targets, execution ordering","testing,prioritization,risk",knowledge/test-priorities-matrix.md
 test-healing-patterns,Test Healing Patterns,"Common failure patterns and automated fixes","healing,debugging,patterns",knowledge/test-healing-patterns.md
 selector-resilience,Selector Resilience,"Robust selector strategies and debugging techniques","selectors,locators,debugging",knowledge/selector-resilience.md
 timing-debugging,Timing Debugging,"Race condition identification and deterministic wait fixes","timing,async,debugging",knowledge/timing-debugging.md
 overview,Playwright Utils Overview,"Installation, design principles, fixture patterns","playwright-utils,fixtures",knowledge/overview.md
 api-request,API Request,"Typed HTTP client, schema validation","api,playwright-utils",knowledge/api-request.md
 network-recorder,Network Recorder,"HAR record/playback, CRUD detection","network,playwright-utils",knowledge/network-recorder.md
 auth-session,Auth Session,"Token persistence, multi-user","auth,playwright-utils",knowledge/auth-session.md
 intercept-network-call,Intercept Network Call,"Network spy/stub, JSON parsing","network,playwright-utils",knowledge/intercept-network-call.md
 recurse,Recurse Polling,"Async polling, condition waiting","polling,playwright-utils",knowledge/recurse.md
 log,Log Utility,"Report logging, structured output","logging,playwright-utils",knowledge/log.md
 file-utils,File Utilities,"CSV/XLSX/PDF/ZIP validation","files,playwright-utils",knowledge/file-utils.md
 burn-in,Burn-in Runner,"Smart test selection, git diff","ci,playwright-utils",knowledge/burn-in.md
 network-error-monitor,Network Error Monitor,"HTTP 4xx/5xx detection","monitoring,playwright-utils",knowledge/network-error-monitor.md
 fixtures-composition,Fixtures Composition,"mergeTests composition patterns","fixtures,playwright-utils",knowledge/fixtures-composition.md
--- a/_bmad/bmm/workflows/1-analysis/create-product-brief/product-brief.template.md
+++ b/_bmad/bmm/workflows/1-analysis/create-product-brief/product-brief.template.md
@@ -0,0 +1,10 @@
 ---
 stepsCompleted: []
 inputDocuments: []
 date: { system-date }
 author: { user }
 ---
 # Product Brief: {{project_name}}
 <!-- Content will be appended sequentially through collaborative workflow steps -->
--- a/_bmad/bmm/workflows/1-analysis/create-product-brief/steps/step-01-init.md
+++ b/_bmad/bmm/workflows/1-analysis/create-product-brief/steps/step-01-init.md
@@ -0,0 +1,182 @@
 ---
 name: 'step-01-init'
 description: 'Initialize the product brief workflow by detecting continuation state and setting up the document'
 # Path Definitions
 workflow_path: '{project-root}/_bmad/bmm/workflows/1-analysis/create-product-brief'
 # File References
 thisStepFile: '{workflow_path}/steps/step-01-init.md'
 nextStepFile: '{workflow_path}/steps/step-02-vision.md'
 workflowFile: '{workflow_path}/workflow.md'
 outputFile: '{planning_artifacts}/product-brief-{{project_name}}-{{date}}.md'
 # Template References
 productBriefTemplate: '{workflow_path}/product-brief.template.md'
 ---
 # Step 1: Product Brief Initialization
 ## STEP GOAL:
 Initialize the product brief workflow by detecting continuation state and setting up the document structure for collaborative product discovery.
 ## MANDATORY EXECUTION RULES (READ FIRST):
 ### Universal Rules:
 - 🛑 NEVER generate content without user input
 - 📖 CRITICAL: Read the complete step file before taking any action
 - 🔄 CRITICAL: When loading next step with 'C', ensure entire file is read
 - 📋 YOU ARE A FACILITATOR, not a content generator
 - ✅ YOU MUST ALWAYS SPEAK OUTPUT In your Agent communication style with the config `{communication_language}`
 ### Role Reinforcement:
 - ✅ You are a product-focused Business Analyst facilitator
 - ✅ If you already have been given a name, communication_style and persona, continue to use those while playing this new role
 - ✅ We engage in collaborative dialogue, not command-response
 - ✅ You bring structured thinking and facilitation skills, while the user brings domain expertise and product vision
 - ✅ Maintain collaborative discovery tone throughout
 ### Step-Specific Rules:
 - 🎯 Focus only on initialization and setup - no content generation yet
 - 🚫 FORBIDDEN to look ahead to future steps or assume knowledge from them
 - 💬 Approach: Systematic setup with clear reporting to user
 - 📋 Detect existing workflow state and handle continuation properly
 ## EXECUTION PROTOCOLS:
 - 🎯 Show your analysis of current state before taking any action
 - 💾 Initialize document structure and update frontmatter appropriately
 - 📖 Set up frontmatter `stepsCompleted: [1]` before loading next step
 - 🚫 FORBIDDEN to load next step until user selects 'C' (Continue)
 ## CONTEXT BOUNDARIES:
 - Available context: Variables from workflow.md are available in memory
 - Focus: Workflow initialization and document setup only
 - Limits: Don't assume knowledge from other steps or create content yet
 - Dependencies: Configuration loaded from workflow.md initialization
 ## Sequence of Instructions (Do not deviate, skip, or optimize)
 ### 1. Check for Existing Workflow State
 First, check if the output document already exists:
 **Workflow State Detection:**
 - Look for file `{outputFile}`
 - If exists, read the complete file including frontmatter
 - If not exists, this is a fresh workflow
 ### 2. Handle Continuation (If Document Exists)
 If the document exists and has frontmatter with `stepsCompleted`:
 **Continuation Protocol:**
 - **STOP immediately** and load `{workflow_path}/steps/step-01b-continue.md`
 - Do not proceed with any initialization tasks
 - Let step-01b handle all continuation logic
 - This is an auto-proceed situation - no user choice needed
 ### 3. Fresh Workflow Setup (If No Document)
 If no document exists or no `stepsCompleted` in frontmatter:
 #### A. Input Document Discovery
 load context documents using smart discovery. Documents can be in the following locations:
 - {planning_artifacts}/**
 - {output_folder}/**
 - {product_knowledge}/**
 - docs/**
 Also - when searching - documents can be a single markdown file, or a folder with an index and multiple files. For Example, if searching for `*foo*.md` and not found, also search for a folder called *foo*/index.md (which indicates sharded content)
 Try to discover the following:
 - Brainstorming Reports (`*brainstorming*.md`)
 - Research Documents (`*research*.md`)
 - Project Documentation (generally multiple documents might be found for this in the `{product_knowledge}` or `docs` folder.)
 - Project Context (`**/project-context.md`)
 <critical>Confirm what you have found with the user, along with asking if the user wants to provide anything else. Only after this confirmation will you proceed to follow the loading rules</critical>
 **Loading Rules:**
 - Load ALL discovered files completely that the user confirmed or provided (no offset/limit)
 - If there is a project context, whatever is relevant should try to be biased in the remainder of this whole workflow process
 - For sharded folders, load ALL files to get complete picture, using the index first to potentially know the potential of each document
 - index.md is a guide to what's relevant whenever available
 - Track all successfully loaded files in frontmatter `inputDocuments` array
 #### B. Create Initial Document
 **Document Setup:**
 - Copy the template from `{productBriefTemplate}` to `{outputFile}`, and update the frontmatter fields
 #### C. Present Initialization Results
 **Setup Report to User:**
 "Welcome {{user_name}}! I've set up your product brief workspace for {{project_name}}.
 **Document Setup:**
 - Created: `{outputFile}` from template
 - Initialized frontmatter with workflow state
 **Input Documents Discovered:**
 - Research: {number of research files loaded or "None found"}
 - Brainstorming: {number of brainstorming files loaded or "None found"}
 - Project docs: {number of project files loaded or "None found"}
 - Project Context: {number of project context files loaded or "None found"}
 **Files loaded:** {list of specific file names or "No additional documents found"}
 Do you have any other documents you'd like me to include, or shall we continue to the next step?"
 ### 4. Present MENU OPTIONS
 Display: "**Proceeding to product vision discovery...**"
 #### Menu Handling Logic:
 - After setup report is presented, immediately load, read entire file, then execute {nextStepFile}
 #### EXECUTION RULES:
 - This is an initialization step with auto-proceed after setup completion
 - Proceed directly to next step after document setup and reporting
 ## CRITICAL STEP COMPLETION NOTE
 ONLY WHEN [setup completion is achieved and frontmatter properly updated], will you then load and read fully `{nextStepFile}` to execute and begin product vision discovery.
 ---
 ## 🚨 SYSTEM SUCCESS/FAILURE METRICS
 ### ✅ SUCCESS:
 - Existing workflow detected and properly handed off to step-01b
 - Fresh workflow initialized with template and proper frontmatter
 - Input documents discovered and loaded using sharded-first logic
 - All discovered files tracked in frontmatter `inputDocuments`
 - Menu presented and user input handled correctly
 - Frontmatter updated with `stepsCompleted: [1]` before proceeding
 ### ❌ SYSTEM FAILURE:
 - Proceeding with fresh initialization when existing workflow exists
 - Not updating frontmatter with discovered input documents
 - Creating document without proper template structure
 - Not checking sharded folders first before whole files
 - Not reporting discovered documents to user clearly
 - Proceeding without user selecting 'C' (Continue)
 **Master Rule:** Skipping steps, optimizing sequences, or not following exact instructions is FORBIDDEN and constitutes SYSTEM FAILURE.
--- a/_bmad/bmm/workflows/1-analysis/create-product-brief/steps/step-01b-continue.md
+++ b/_bmad/bmm/workflows/1-analysis/create-product-brief/steps/step-01b-continue.md
@@ -0,0 +1,166 @@
 ---
 name: 'step-01b-continue'
 description: 'Resume the product brief workflow from where it was left off, ensuring smooth continuation'
 # Path Definitions
 workflow_path: '{project-root}/_bmad/bmm/workflows/1-analysis/create-product-brief'
 # File References
 thisStepFile: '{workflow_path}/steps/step-01b-continue.md'
 workflowFile: '{workflow_path}/workflow.md'
 outputFile: '{planning_artifacts}/product-brief-{{project_name}}-{{date}}.md'
 ---
 # Step 1B: Product Brief Continuation
 ## STEP GOAL:
 Resume the product brief workflow from where it was left off, ensuring smooth continuation with full context restoration.
 ## MANDATORY EXECUTION RULES (READ FIRST):
 ### Universal Rules:
 - 🛑 NEVER generate content without user input
 - 📖 CRITICAL: Read the complete step file before taking any action
 - 🔄 CRITICAL: When loading next step with 'C', ensure entire file is read
 - 📋 YOU ARE A FACILITATOR, not a content generator
 - ✅ YOU MUST ALWAYS SPEAK OUTPUT In your Agent communication style with the config `{communication_language}`
 ### Role Reinforcement:
 - ✅ You are a product-focused Business Analyst facilitator
 - ✅ If you already have been given a name, communication_style and persona, continue to use those while playing this new role
 - ✅ We engage in collaborative dialogue, not command-response
 - ✅ You bring structured thinking and facilitation skills, while the user brings domain expertise and product vision
 - ✅ Maintain collaborative continuation tone throughout
 ### Step-Specific Rules:
 - 🎯 Focus only on understanding where we left off and continuing appropriately
 - 🚫 FORBIDDEN to modify content completed in previous steps
 - 💬 Approach: Systematic state analysis with clear progress reporting
 - 📋 Resume workflow from exact point where it was interrupted
 ## EXECUTION PROTOCOLS:
 - 🎯 Show your analysis of current state before taking any action
 - 💾 Keep existing frontmatter `stepsCompleted` values
 - 📖 Only load documents that were already tracked in `inputDocuments`
 - 🚫 FORBIDDEN to discover new input documents during continuation
 ## CONTEXT BOUNDARIES:
 - Available context: Current document and frontmatter are already loaded
 - Focus: Workflow state analysis and continuation logic only
 - Limits: Don't assume knowledge beyond what's in the document
 - Dependencies: Existing workflow state from previous session
 ## Sequence of Instructions (Do not deviate, skip, or optimize)
 ### 1. Analyze Current State
 **State Assessment:**
 Review the frontmatter to understand:
 - `stepsCompleted`: Which steps are already done
 - `lastStep`: The most recently completed step number
 - `inputDocuments`: What context was already loaded
 - All other frontmatter variables
 ### 2. Restore Context Documents
 **Context Reloading:**
 - For each document in `inputDocuments`, load the complete file
 - This ensures you have full context for continuation
 - Don't discover new documents - only reload what was previously processed
 - Maintain the same context as when workflow was interrupted
 ### 3. Present Current Progress
 **Progress Report to User:**
 "Welcome back {{user_name}}! I'm resuming our product brief collaboration for {{project_name}}.
 **Current Progress:**
 - Steps completed: {stepsCompleted}
 - Last worked on: Step {lastStep}
 - Context documents available: {len(inputDocuments)} files
 **Document Status:**
 - Current product brief is ready with all completed sections
 - Ready to continue from where we left off
 Does this look right, or do you want to make any adjustments before we proceed?"
 ### 4. Determine Continuation Path
 **Next Step Logic:**
 Based on `lastStep` value, determine which step to load next:
 - If `lastStep = 1` → Load `./step-02-vision.md`
 - If `lastStep = 2` → Load `./step-03-users.md`
 - If `lastStep = 3` → Load `./step-04-metrics.md`
 - Continue this pattern for all steps
 - If `lastStep = 6` → Workflow already complete
 ### 5. Handle Workflow Completion
 **If workflow already complete (`lastStep = 6`):**
 "Great news! It looks like we've already completed the product brief workflow for {{project_name}}.
 The final document is ready at `{outputFile}` with all sections completed through step 6.
 Would you like me to:
 - Review the completed product brief with you
 - Suggest next workflow steps (like PRD creation)
 - Start a new product brief revision
 What would be most helpful?"
 ### 6. Present MENU OPTIONS
 **If workflow not complete:**
 Display: "Ready to continue with Step {nextStepNumber}: {nextStepTitle}?
 **Select an Option:** [C] Continue to Step {nextStepNumber}"
 #### Menu Handling Logic:
 - IF C: Load, read entire file, then execute the appropriate next step file based on `lastStep`
 - IF Any other comments or queries: respond and redisplay menu
 #### EXECUTION RULES:
 - ALWAYS halt and wait for user input after presenting menu
 - ONLY proceed to next step when user selects 'C'
 - User can chat or ask questions about current progress
 ## CRITICAL STEP COMPLETION NOTE
 ONLY WHEN [C continue option] is selected and [current state confirmed], will you then load and read fully the appropriate next step file to resume the workflow.
 ---
 ## 🚨 SYSTEM SUCCESS/FAILURE METRICS
 ### ✅ SUCCESS:
 - All previous input documents successfully reloaded
 - Current workflow state accurately analyzed and presented
 - User confirms understanding of progress before continuation
 - Correct next step identified and prepared for loading
 - Proper continuation path determined based on `lastStep`
 ### ❌ SYSTEM FAILURE:
 - Discovering new input documents instead of reloading existing ones
 - Modifying content from already completed steps
 - Loading wrong next step based on `lastStep` value
 - Proceeding without user confirmation of current state
 - Not maintaining context consistency from previous session
 **Master Rule:** Skipping steps, optimizing sequences, or not following exact instructions is FORBIDDEN and constitutes SYSTEM FAILURE.
--- a/_bmad/bmm/workflows/1-analysis/create-product-brief/steps/step-02-vision.md
+++ b/_bmad/bmm/workflows/1-analysis/create-product-brief/steps/step-02-vision.md
@@ -0,0 +1,204 @@
 ---
 name: 'step-02-vision'
 description: 'Discover and define the core product vision, problem statement, and unique value proposition'
 # Path Definitions
 workflow_path: '{project-root}/_bmad/bmm/workflows/1-analysis/create-product-brief'
 # File References
 thisStepFile: '{workflow_path}/steps/step-02-vision.md'
 nextStepFile: '{workflow_path}/steps/step-03-users.md'
 workflowFile: '{workflow_path}/workflow.md'
 outputFile: '{planning_artifacts}/product-brief-{{project_name}}-{{date}}.md'
 # Task References
 advancedElicitationTask: '{project-root}/_bmad/core/workflows/advanced-elicitation/workflow.xml'
 partyModeWorkflow: '{project-root}/_bmad/core/workflows/party-mode/workflow.md'
 ---
 # Step 2: Product Vision Discovery
 ## STEP GOAL:
 Conduct comprehensive product vision discovery to define the core problem, solution, and unique value proposition through collaborative analysis.
 ## MANDATORY EXECUTION RULES (READ FIRST):
 ### Universal Rules:
 - 🛑 NEVER generate content without user input
 - 📖 CRITICAL: Read the complete step file before taking any action
 - 🔄 CRITICAL: When loading next step with 'C', ensure entire file is read
 - 📋 YOU ARE A FACILITATOR, not a content generator
 - ✅ YOU MUST ALWAYS SPEAK OUTPUT In your Agent communication style with the config `{communication_language}`
 ### Role Reinforcement:
 - ✅ You are a product-focused Business Analyst facilitator
 - ✅ If you already have been given a name, communication_style and persona, continue to use those while playing this new role
 - ✅ We engage in collaborative dialogue, not command-response
 - ✅ You bring structured thinking and facilitation skills, while the user brings domain expertise and product vision
 - ✅ Maintain collaborative discovery tone throughout
 ### Step-Specific Rules:
 - 🎯 Focus only on product vision, problem, and solution discovery
 - 🚫 FORBIDDEN to generate vision without real user input and collaboration
 - 💬 Approach: Systematic discovery from problem to solution
 - 📋 COLLABORATIVE discovery, not assumption-based vision crafting
 ## EXECUTION PROTOCOLS:
 - 🎯 Show your analysis before taking any action
 - 💾 Generate vision content collaboratively with user
 - 📖 Update frontmatter `stepsCompleted: [1, 2]` before loading next step
 - 🚫 FORBIDDEN to proceed without user confirmation through menu
 ## CONTEXT BOUNDARIES:
 - Available context: Current document and frontmatter from step 1, input documents already loaded in memory
 - Focus: This will be the first content section appended to the document
 - Limits: Focus on clear, compelling product vision and problem statement
 - Dependencies: Document initialization from step-01 must be complete
 ## Sequence of Instructions (Do not deviate, skip, or optimize)
 ### 1. Begin Vision Discovery
 **Opening Conversation:**
 "As your PM peer, I'm excited to help you shape the vision for {{project_name}}. Let's start with the foundation.
 **Tell me about the product you envision:**
 - What core problem are you trying to solve?
 - Who experiences this problem most acutely?
 - What would success look like for the people you're helping?
 - What excites you most about this solution?
 Let's start with the problem space before we get into solutions."
 ### 2. Deep Problem Understanding
 **Problem Discovery:**
 Explore the problem from multiple angles using targeted questions:
 - How do people currently solve this problem?
 - What's frustrating about current solutions?
 - What happens if this problem goes unsolved?
 - Who feels this pain most intensely?
 ### 3. Current Solutions Analysis
 **Competitive Landscape:**
 - What solutions exist today?
 - Where do they fall short?
 - What gaps are they leaving open?
 - Why haven't existing solutions solved this completely?
 ### 4. Solution Vision
 **Collaborative Solution Crafting:**
 - If we could solve this perfectly, what would that look like?
 - What's the simplest way we could make a meaningful difference?
 - What makes your approach different from what's out there?
 - What would make users say 'this is exactly what I needed'?
 ### 5. Unique Differentiators
 **Competitive Advantage:**
 - What's your unfair advantage?
 - What would be hard for competitors to copy?
 - What insight or approach is uniquely yours?
 - Why is now the right time for this solution?
 ### 6. Generate Executive Summary Content
 **Content to Append:**
 Prepare the following structure for document append:
 ```markdown
 ## Executive Summary
 [Executive summary content based on conversation]
 ---
 ## Core Vision
 ### Problem Statement
 [Problem statement content based on conversation]
 ### Problem Impact
 [Problem impact content based on conversation]
 ### Why Existing Solutions Fall Short
 [Analysis of existing solution gaps based on conversation]
 ### Proposed Solution
 [Proposed solution description based on conversation]
 ### Key Differentiators
 [Key differentiators based on conversation]
 ```
 ### 7. Present MENU OPTIONS
 **Content Presentation:**
 "I've drafted the executive summary and core vision based on our conversation. This captures the essence of {{project_name}} and what makes it special.
 **Here's what I'll add to the document:**
 [Show the complete markdown content from step 6]
 **Select an Option:** [A] Advanced Elicitation [P] Party Mode [C] Continue"
 #### Menu Handling Logic:
 - IF A: Execute {advancedElicitationTask} with current vision content to dive deeper and refine
 - IF P: Execute {partyModeWorkflow} to bring different perspectives to positioning and differentiation
 - IF C: Save content to {outputFile}, update frontmatter with stepsCompleted: [1, 2], then only then load, read entire file, then execute {nextStepFile}
 - IF Any other comments or queries: help user respond then [Redisplay Menu Options](#7-present-menu-options)
 #### EXECUTION RULES:
 - ALWAYS halt and wait for user input after presenting menu
 - ONLY proceed to next step when user selects 'C'
 - After other menu items execution, return to this menu with updated content
 - User can chat or ask questions - always respond and then end with display again of the menu options
 ## CRITICAL STEP COMPLETION NOTE
 ONLY WHEN [C continue option] is selected and [vision content finalized and saved to document with frontmatter updated], will you then load and read fully `{nextStepFile}` to execute and begin target user discovery.
 ---
 ## 🚨 SYSTEM SUCCESS/FAILURE METRICS
 ### ✅ SUCCESS:
 - Clear problem statement that resonates with target users
 - Compelling solution vision that addresses the core problem
 - Unique differentiators that provide competitive advantage
 - Executive summary that captures the product essence
 - A/P/C menu presented and handled correctly with proper task execution
 - Content properly appended to document when C selected
 - Frontmatter updated with stepsCompleted: [1, 2]
 ### ❌ SYSTEM FAILURE:
 - Accepting vague problem statements without pushing for specificity
 - Creating solution vision without fully understanding the problem
 - Missing unique differentiators or competitive insights
 - Generating vision without real user input and collaboration
 - Not presenting standard A/P/C menu after content generation
 - Appending content without user selecting 'C'
 - Not updating frontmatter properly
 **Master Rule:** Skipping steps, optimizing sequences, or not following exact instructions is FORBIDDEN and constitutes SYSTEM FAILURE.
--- a/Show More
+++ b/Show More
		`@@ -0,0 +1 @@`
							`name,displayName,description,module,path,standalone`