Markdown

--- name: kiro-validate-impl description: Validate feature-level integration after all tasks are implemented. Checks cross-task consistency, full test suite, and overall spec coverage. ---

Implementation Integration Validation

<background_information> Individual tasks are usually reviewed during implementation. Your job is to catch problems that only become visible when looking across all tasks together.

Boundary terminology continuity:

  • discovery identifies `Boundary Candidates`
  • design fixes `Boundary Commitments`
  • tasks constrain execution with `_Boundary:_`
  • feature validation checks for cross-task `Boundary Violations`
  • **Success Criteria**:
  • All tasks marked `[x]` in tasks.md
  • Full test suite passes (not just per-task tests)
  • Cross-task integration works (data flows between components, interfaces match)
  • Requirements coverage is complete across all tasks (no gaps between tasks)
  • Design structure is reflected end-to-end (not just per-component)
  • No orphaned code, conflicting implementations, integration seams, or boundary spillover

**What This Skill Does NOT Do**: This skill is not a full replacement for task-local review during `/kiro-impl`. This skill does NOT re-check every individual task acceptance criterion, every per-file reality check, or every single-task spec detail unless a concrete integration finding forces it.

This skill's main question is: when the completed tasks are viewed together, do they still respect the designed boundary seams and dependency direction? </background_information>

<instructions>

Execution Steps

1. Detect Validation Target

**If no arguments provided** (`$1` empty):

  • Parse conversation history for `/kiro-impl <feature> [tasks]` commands
  • Extract feature names and task numbers from each execution
  • Aggregate all implemented tasks by feature
  • Report detected implementations (e.g., "user-auth: 1.1, 1.2, 1.3")
  • If no history found, scan `{{KIRO_DIR}}/specs/` for features with completed tasks `[x]`

**If feature provided** (`$1` present, `$2` empty):

  • Use specified feature
  • Detect all completed tasks `[x]` in `{{KIRO_DIR}}/specs/$1/tasks.md`

**If both feature and tasks provided** (`$1` and `$2` present):

  • Validate specified feature and tasks only (e.g., `user-auth 1.1,1.2`)

Sub-agent Dispatch (parallel)

The following validation dimensions are independent and can be dispatched as **sub-agents**. The agent should decide the optimal decomposition based on feature scope — split, merge, or skip sub-agents as appropriate. Each sub-agent returns a **structured findings summary** to keep the main context clean for GO/NO-GO synthesis.

**Typical validation dimensions** (adjust as appropriate):

  • **Test execution**: Run the complete test suite, report pass/fail with details
  • **Requirements coverage**: Build requirements → implementation matrix, report gaps
  • **Design alignment**: Verify architecture matches design.md, report drift and dependency violations
  • **Cross-task integration**: Verify data flows, API contracts, shared state consistency

If multi-agent is not available, run checks sequentially in main context.

After all checks complete, synthesize findings for GO/NO-GO/MANUAL_VERIFY_REQUIRED assessment.

If the implementation run explicitly skipped task-local review (for example `--review off`), tighten scrutiny on obvious task-level gaps that surface during integration validation and call out that reduced review coverage in the report.

2. Load Context

For each detected feature:

  • Read `{{KIRO_DIR}}/specs/<feature>/spec.json` for metadata
  • Read `{{KIRO_DIR}}/specs/<feature>/requirements.md` for requirements
  • Read `{{KIRO_DIR}}/specs/<feature>/design.md` for design structure
  • Read `{{KIRO_DIR}}/specs/<feature>/tasks.md` for task list and Implementation Notes
  • Core steering context: `product.md`, `tech.md`, `structure.md`
  • Additional steering files only when directly relevant to the validated boundaries, runtime prerequisites, integrations, domain rules, security/performance constraints, or team conventions that affect the GO/NO-GO call

**Discover canonical validation commands**:

  • Inspect repository-local sources of truth in this order: project scripts/manifests (`package.json`, `pyproject.toml`, `go.mod`, `Cargo.toml`, app manifests), task runners (`Makefile`, `justfile`), CI/workflow files, existing e2e/integration configs, then `README*`
  • Derive a feature-level validation set for this repo: `TEST_COMMANDS`, `BUILD_COMMANDS`, and `SMOKE_COMMANDS`
  • Prefer commands already used by repo automation over ad hoc shell pipelines
  • For `SMOKE_COMMANDS`, choose the lightest trustworthy runtime-liveness check for the app shape (for example: root URL load, Electron launch, CLI `--help`, service health endpoint, mobile simulator/e2e harness if one already exists)
  • If multiple candidates exist, prefer the command with the smallest setup cost that still exercises the real built artifact

3. Execute Integration Validation

Mechanical Checks (run commands, use results)

**A. Full Test Suite**

  • Run the discovered canonical full-test command. Use the exit code.
  • If tests fail → NO-GO. No judgment needed.
  • If the canonical test command cannot be identified → `MANUAL_VERIFY_REQUIRED`

**B. Residual TBD/TODO/FIXME**

  • Run: `grep -rn "TBD\|TODO\|FIXME\|HACK\|XXX" <files-in-feature-boundary>`
  • If matches found that were introduced by this feature → flag as Warning

**C. Residual Hardcoded Secrets**

  • Run: `grep -rn "password\s*=\|api_key\s*=\|secret\s*=\|token\s*=" <files-in-feature-boundary>` (case-insensitive)
  • If matches found that aren't environment variable references → flag as Critical

**D. Runtime Liveness (Smoke Boot)**

  • Run the discovered canonical smoke command that proves the built artifact actually starts and reaches its first usable state.
  • Examples if relevant: open the root URL in a headless browser and require zero boot-time console errors; launch Electron and wait for the main process ready signal and first renderer load; run a CLI with `--help`; start a service and hit its health endpoint.
  • If boot produces a runtime crash, unhandled exception, module-load failure, native ABI mismatch, or missing required env/config → NO-GO.
  • If no trustworthy smoke command can be identified, or the required runtime environment is unavailable → `MANUAL_VERIFY_REQUIRED`

Judgment Checks (read code, compare to spec)

**E. Cross-Task Integration**

  • Identify where tasks share interfaces, data models, or API contracts
  • Verify that Task A's output format matches Task B's expected input
  • Check for conflicting assumptions between tasks (naming conventions, error codes, data shapes)
  • Verify shared state (database schemas, config, environment) is consistent across tasks
  • Verify integration work happens at the intended seams rather than by leaking one boundary's behavior into another

**F. Requirements Coverage Gaps**

  • Map every requirement section to at least one completed task
  • Identify requirements that no single task fully covers (cross-cutting requirements)
  • Identify requirements partially covered by multiple tasks but not fully by any
  • Use the original section numbering from `requirements.md`; do NOT invent `REQ-*` aliases

**G. Design End-to-End Alignment**

  • Verify the overall component graph matches design.md
  • Check that integration patterns (event flow, API boundaries, dependency injection) work as designed
  • Verify dependency direction follows design.md's architecture (no upward imports)
  • Verify File Structure Plan matches the actual file layout
  • Identify any architectural drift from the original design
  • Use the original section numbering from `design.md`

**G.5 Boundary Audit**

  • Compare completed work against the design's `Boundary Commitments`, `Out of Boundary`, `Allowed Dependencies`, and `Revalidation Triggers`
  • Identify cross-task spillover where one area quietly absorbed another boundary's responsibility
  • Identify downstream-specific workarounds embedded upstream "to make integration easier"
  • Identify new hidden dependencies or shared ownership that were not declared in the design
  • If a revalidation trigger fired, verify the affected adjacent specs or integration points were actually re-checked

**H. Blocked Tasks & Implementation Notes**

  • Check for any tasks still marked `_Blocked:_` — report why and assess impact on feature completeness
  • Review `## Implementation Notes` in tasks.md for cross-cutting insights that need attention

4. Generate Report

Before returning `GO`, apply the `kiro-verify-completion` protocol to the feature-level claim. Tests alone are insufficient: include full-suite, runtime liveness, coverage, integration, design-alignment, and blocked-task status in the evidence.

Classify concrete failures by ownership before writing remediation:

  • `LOCAL` if the defect belongs to the feature being validated
  • `UPSTREAM` if the root cause belongs to a dependency, foundation, shared platform, or earlier spec
  • `UNCLEAR` if ownership cannot be established from the available evidence

If ownership is `UPSTREAM`, do not collapse the issue into local remediation for this feature. Name the owning upstream spec and explain which dependent specs should be revalidated after that upstream fix lands.

Provide summary in the language specified in spec.json:

## Validation Report
- DECISION: GO | NO-GO | MANUAL_VERIFY_REQUIRED
- MECHANICAL_RESULTS:
  - Tests: PASS | FAIL (command and exit code)
  - TBD/TODO grep: CLEAN | <count> matches
  - Secrets grep: CLEAN | <count> matches
  - Smoke boot: PASS | FAIL | MANUAL_REQUIRED
- INTEGRATION:
  - Cross-task contracts: <status>
  - Shared state consistency: <status>
  - Boundary audit: <status>
- COVERAGE:
  - Requirements mapped: <X/Y sections covered>
  - Coverage gaps: <list of uncovered requirement sections>
- DESIGN:
  - Architecture drift: <findings>
  - Dependency direction: <violations if any>
  - File Structure Plan vs actual: <match/mismatch>
- OWNERSHIP: LOCAL | UPSTREAM | UNCLEAR
- UPSTREAM_SPEC: <feature-name | N/A>
- BLOCKED_TASKS: <list and impact assessment>
- REMEDIATION: <if NO-GO: specific, actionable steps to fix each issue>

If NO-GO, REMEDIATION is mandatory — identify the exact issue and what needs to change.

Important Constraints

  • **Strict Final Gate**: Return `GO` only when all integration checks passed; return `NO-GO` for concrete failures and `MANUAL_VERIFY_REQUIRED` when mandatory validation could not be completed
  • **Boundary integrity over convenience**: Do not return `GO` if the feature only works by smearing responsibilities across boundaries, even when tests pass

</instructions>

Safety & Fallback

Error Scenarios

  • **No Implementation Found**: If no `/kiro-impl` in history and no `[x]` tasks, report "No implementations detected"
  • **Test Command Unknown**: Return `MANUAL_VERIFY_REQUIRED` and explain which validation command is missing; do not return `GO`
  • **Missing Spec Files**: Stop with error if spec.json/requirements.md/design.md missing

Next Steps Guidance

**If GO Decision**:

  • Feature validated end-to-end and ready for deployment or next feature

**If NO-GO Decision**:

  • Address integration issues listed
  • Re-run `/kiro-impl <feature> [tasks]` for targeted fixes
  • Re-validate with `/kiro-validate-impl [feature]`

**Session Interrupted**:

  • Safe to re-run — validation is read-only and idempotent

**If MANUAL_VERIFY_REQUIRED**:

  • Do not treat the feature as complete
  • Provide the exact missing validation step or environment prerequisite