
Testing Guide

Quick Start

npm test              # Run unit tests
npm run test:watch    # Run tests in watch mode
npm run test:integ    # Run integration tests
npm run test:tui      # Run TUI integration tests (builds first)
npm run test:browser  # Run browser tests (requires AWS creds, uv, agentcore)
npm run test:all      # Run all tests (unit + integ)

Test Organization

Unit Tests

Unit tests are co-located with source files in __tests__/ directories:

src/cli/commands/add/
├── action.ts
├── command.ts
└── __tests__/
    └── add.test.ts

Integration Tests

Integration tests live in integ-tests/:

integ-tests/
├── create-no-agent.test.ts
├── create-with-agent.test.ts
├── deploy.test.ts
└── ...

See integ-tests/README.md for integration test details.

E2E Tests

E2E tests live in e2e-tests/ and verify the full user journey across the AWS boundary — deploy, invoke, status, logs, traces, and control plane API calls.

e2e-tests/
├── e2e-helper.ts           # Shared utilities and createE2ESuite() factory
├── strands-bedrock.test.ts
├── langgraph-openai.test.ts
└── ...

See e2e-tests/README.md for e2e test details.

Writing Tests

Imports

Use vitest for all test utilities:

import { afterEach, beforeEach, describe, expect, it, vi } from 'vitest';

Assertions

Use expect assertions:

// Equality
expect(result).toBe('expected');
expect(obj).toEqual({ key: 'value' });

// Truthiness
expect(value).toBeTruthy();
expect(value).toBeFalsy();

// Errors
expect(() => fn()).toThrow();
expect(() => fn()).toThrow('message');

Mocking

Use vi for mocks:

// Mock functions
const mockFn = vi.fn();
mockFn.mockReturnValue('value');
mockFn.mockResolvedValue('async value');

// Spies
vi.spyOn(module, 'method');

// Module mocks
vi.mock('./module');

Test Utilities

CLI Runner

src/test-utils/cli-runner.ts runs CLI commands in tests:

import { runCLI } from '../src/test-utils/cli-runner';

const result = await runCLI(['create', '--name', 'test'], tempDir);
expect(result.exitCode).toBe(0);

Snapshot Tests

The src/assets/ directory contains template files vended to users when they create projects. Snapshot tests ensure these templates don't change unexpectedly.

Running Snapshot Tests

Snapshot tests run as part of unit tests:

npm test           # Runs all unit tests including snapshots
npm run test:unit  # Same as above

Updating Snapshots

When you intentionally modify asset files (templates, configs, etc.), update snapshots:

npm run test:update-snapshots

Review the changes in src/assets/__tests__/__snapshots__/ before committing.

What's Tested

  • File structure of src/assets/
  • Contents of all template files (CDK, Python frameworks, MCP, static assets)
  • Any file addition or removal

TUI Integration Tests

TUI integration tests run the full CLI binary inside a pseudo-terminal (PTY) and verify screen output, keyboard navigation, and end-to-end wizard flows.

Note: TUI tests require node-pty (native addon). If node-pty is not installed, TUI tests are automatically skipped.

Running TUI Tests

npm run test:tui              # Builds first, then runs TUI tests
npx vitest run --project tui  # Skip build (use when build is fresh)

Test Organization

integ-tests/tui/
├── setup.ts              # Global setup: availability check, afterAll cleanup
├── helpers.ts            # createMinimalProjectDir, common test setup
├── harness.test.ts       # TuiSession self-tests (spawn, send, read)
├── navigation.test.ts    # Screen navigation flows
├── create-flow.test.ts   # Create wizard end-to-end
├── add-flow.test.ts      # Add resource flows
└── deploy-screen.test.ts # Deploy screen rendering

Writing a TUI Flow Test

Below is a complete example showing the typical pattern for a TUI flow test:

import { isAvailable, TuiSession } from '../../src/test-utils/tui-harness/index.js';
import { createMinimalProjectDir } from './helpers.js';
import { afterEach, describe, expect, it } from 'vitest';

describe.skipIf(!isAvailable)('my TUI flow', () => {
  let session: TuiSession;

  afterEach(async () => {
    await session?.close();
  });

  it('navigates to the add screen', async () => {
    // createMinimalProjectDir makes a temp dir with agentcore config (~10ms)
    const { dir, cleanup } = await createMinimalProjectDir({ hasAgents: true });

    try {
      // Launch the CLI TUI in the project directory
      session = await TuiSession.launch({
        command: 'node',
        args: ['../../dist/cli/index.mjs'],
        cwd: dir,
      });

      // Wait for the HelpScreen to render
      await session.waitFor('Commands');

      // Navigate: type 'add' to filter, then Enter
      await session.sendKeys('add');
      await session.sendSpecialKey('enter');

      // Verify we reached the AddScreen
      await session.waitFor('agent');
      const screen = session.readScreen();
      expect(screen.lines.join('\n')).toContain('agent');
    } finally {
      await cleanup();
    }
  });
});

Key points:

  • describe.skipIf(!isAvailable) -- gracefully skips when node-pty is missing.
  • afterEach with session?.close() -- always clean up PTY processes.
  • createMinimalProjectDir -- fast temp directory setup (no npm install).
  • try/finally with cleanup() -- always remove temp directories.

TuiSession API Quick Reference

Method | Returns | Description
------ | ------- | -----------
TuiSession.launch(options) | Promise&lt;TuiSession&gt; | Spawn CLI in PTY. Throws LaunchError if the process exits during startup.
session.sendKeys(text, waitMs?) | Promise&lt;ScreenState&gt; | Type text, wait for the screen to settle, return the screen.
session.sendSpecialKey(key, waitMs?) | Promise&lt;ScreenState&gt; | Send a special key (enter, tab, escape, etc.), wait, return the screen.
session.readScreen(options?) | ScreenState | Read the current screen (synchronous). Options: { includeScrollback?, numbered? }.
session.waitFor(pattern, timeoutMs?) | Promise&lt;ScreenState&gt; | Wait for text/regex on screen. Throws WaitForTimeoutError on timeout (default 5000ms).
session.close(signal?) | Promise&lt;CloseResult&gt; | Close the session. Returns exit code, signal, and final screen.
session.info | SessionInfo | Session metadata: sessionId, pid, dimensions, alive status.
session.alive | boolean | Whether the PTY process is still running.

ScreenState Shape

interface ScreenState {
  lines: string[]; // Each line of terminal text
  cursor: { x: number; y: number }; // Cursor position
  dimensions: { cols: number; rows: number }; // Terminal size
  bufferType: 'normal' | 'alternate'; // Active buffer
}

Special Keys

The following special keys can be passed to session.sendSpecialKey():

enter, tab, escape, backspace, delete, space, up, down, left, right, home, end, pageup, pagedown, ctrl+c, ctrl+d, ctrl+q, ctrl+g, ctrl+a, ctrl+e, ctrl+w, ctrl+u, ctrl+k, f1 through f12.

Key Concepts

waitFor vs Settling

  • Settling (automatic after sendKeys/sendSpecialKey): Waits for screen text to stop changing. Good for most screens. Fails on spinner/animation screens because text changes continuously.
  • waitFor: Polls for a specific text pattern. Use for: (a) async operations with spinners, (b) confirming you reached the right screen, (c) any case where you need a specific pattern before proceeding.
  • Rule of thumb: Use waitFor when waiting for an async result (project creation, deployment). Use sendKeys/sendSpecialKey (which auto-settle) for navigating between static screens.
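The distinction can be pictured with a small polling loop. The sketch below is NOT the harness implementation; it only illustrates why a waitFor-style poll succeeds on spinner screens where settling (waiting for text to stop changing) cannot:

```typescript
// Minimal sketch of the polling idea behind waitFor (not the real harness code).
async function pollFor(
  read: () => string, // reads the current screen text
  pattern: string | RegExp,
  timeoutMs = 5000,
  intervalMs = 50,
): Promise<string> {
  const deadline = Date.now() + timeoutMs;
  for (;;) {
    const screen = read();
    const found =
      typeof pattern === 'string' ? screen.includes(pattern) : pattern.test(screen);
    // A spinner redraws constantly, so "text stopped changing" never happens --
    // but the target pattern still appears eventually, and that is all we check.
    if (found) return screen;
    if (Date.now() >= deadline) {
      throw new Error(`pattern ${String(pattern)} not found within ${timeoutMs}ms`);
    }
    await new Promise((resolve) => setTimeout(resolve, intervalMs));
  }
}
```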

waitFor Throws on Timeout

waitFor() throws WaitForTimeoutError when the pattern is not found within the timeout. The error includes:

  • The pattern that was not found
  • How long it waited
  • The full screen content at timeout

This means tests fail fast with useful diagnostics. You do not need to check a found boolean.

WaitForTimeoutError Output

When waitFor() times out, the thrown WaitForTimeoutError produces a message like this:

WaitForTimeoutError: waitFor("created successfully") timed out after 5000ms.
Screen content:
AgentCore Create

Creating project...
⠋ Installing dependencies

The error message includes the full non-blank screen content at the time of the timeout. This makes it straightforward to diagnose why the expected pattern was not found -- was the screen still loading? Did the test land on the wrong screen? Was there a typo in the pattern?

If you need to inspect the error properties programmatically (for example, to log additional context or make assertions on the screen state), you can catch the error directly:

import { WaitForTimeoutError } from '../../src/test-utils/tui-harness/index.js';

try {
  await session.waitFor('expected text', 3000);
} catch (err) {
  if (err instanceof WaitForTimeoutError) {
    console.log(err.pattern); // 'expected text'
    console.log(err.elapsed); // ~3000
    console.log(err.screen); // ScreenState with full content
  }
  throw err;
}

createMinimalProjectDir

Creates a temp directory that AgentCore recognizes as a project in ~10ms (no npm install). Use it when your test needs a project context:

const { dir, cleanup } = await createMinimalProjectDir({
  projectName: 'mytest', // optional, defaults to 'testproject'
  hasAgents: true, // optional, adds a sample agent
});

Always call cleanup() when done (in finally or afterEach).

LaunchError

TuiSession.launch() throws LaunchError when the spawned process exits before the screen settles. Common causes include a missing binary, a crash on startup, or an invalid working directory.

The error includes the following diagnostic properties:

  • command -- the executable that was launched
  • args -- the arguments passed to the command
  • cwd -- the working directory used for the spawned process
  • exitCode -- the process exit code (or null if terminated by signal)
  • screen -- the ScreenState captured at the time of exit

You can assert that a launch fails with LaunchError:

import { LaunchError, TuiSession } from '../../src/test-utils/tui-harness/index.js';

it('throws LaunchError for missing binary', async () => {
  await expect(TuiSession.launch({ command: 'nonexistent-binary' })).rejects.toThrow(LaunchError);
});

// Or if you need to inspect the error:
it('provides diagnostics in LaunchError', async () => {
  try {
    await TuiSession.launch({ command: 'node', args: ['missing-file.js'] });
  } catch (err) {
    if (err instanceof LaunchError) {
      console.log(err.command); // 'node'
      console.log(err.exitCode); // 1
      console.log(err.screen); // ScreenState at time of crash
    }
    throw err;
  }
});

Browser Tests

Browser tests use Playwright to test the web UI (agent inspector) served by agentcore dev.

Prerequisites

  • AWS credentials configured (aws sts get-caller-identity must succeed)
  • uv on PATH
  • Local build (npm run build)
  • Playwright browsers installed: npx playwright install chromium

Running

npm run test:browser

Test results and the HTML report are written to browser-tests/test-results/ and browser-tests/playwright-report/ respectively. To view the report:

npx playwright show-report browser-tests/playwright-report

By default, tests run against the @aws/agent-inspector package from npm (in node_modules).

Testing against a local agent-inspector build

To test with a local checkout of the agent-inspector (e.g. when developing new UI features or adding test IDs):

  1. Clone agent-inspector as a sibling directory and build it.
  2. Run the tests with AGENT_INSPECTOR_PATH pointing at the local build:

AGENT_INSPECTOR_PATH=../agent-inspector/dist-assets npm run test:browser

Test structure

browser-tests/
├── playwright.config.ts  # Playwright configuration
├── global-setup.ts       # Creates test project, starts agentcore dev
├── global-teardown.ts    # Stops dev server, cleans up temp files
├── constants.ts          # Shared constants (env file path)
├── fixtures.ts           # Custom test fixtures (testEnv with port, project path)
└── tests/                # Test files
    ├── chat-invocation.test.ts
    ├── inspector-loads.test.ts
    ├── resources.test.ts
    ├── start-agent.test.ts
    └── traces.test.ts

The global setup creates a temporary project via agentcore create, starts agentcore dev, and writes connection details to an env file. Tests read the env file via the testEnv fixture.
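The env-file handoff can be pictured with a small parser like the one below. This is a hypothetical sketch: the actual key names and the fixtures.ts implementation may differ.

```typescript
// Hypothetical sketch of reading the env file the global setup writes.
// Key names (PORT, PROJECT_PATH) are assumptions, not the real contract.
function parseEnvFile(contents: string): Record<string, string> {
  const env: Record<string, string> = {};
  for (const line of contents.split('\n')) {
    const trimmed = line.trim();
    if (!trimmed || trimmed.startsWith('#')) continue; // skip blanks and comments
    const eq = trimmed.indexOf('=');
    if (eq === -1) continue; // ignore malformed lines
    env[trimmed.slice(0, eq)] = trimmed.slice(eq + 1);
  }
  return env;
}

// A testEnv fixture could then expose values such as:
// { port: Number(env.PORT), projectPath: env.PROJECT_PATH }
```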

Configuration

Test configuration is in vitest.config.ts using Vitest projects:

  • unit project: src/**/*.test.ts (includes snapshot tests)
  • integ project: integ-tests/**/*.test.ts
  • tui project: integ-tests/tui/**/*.test.ts (TUI integration tests)
  • Test timeout: 120 seconds
  • Hook timeout: 120 seconds
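The bullets above correspond roughly to a vitest projects layout like the following. This is a hedged sketch, not the repo's actual vitest.config.ts, and assumes a vitest version that supports the test.projects field:

```typescript
// Hypothetical sketch of a projects-based vitest config consistent
// with the bullets above; the real vitest.config.ts may differ.
import { defineConfig } from 'vitest/config';

export default defineConfig({
  test: {
    testTimeout: 120_000, // 120s test timeout
    hookTimeout: 120_000, // 120s hook timeout
    projects: [
      { test: { name: 'unit', include: ['src/**/*.test.ts'] } },
      { test: { name: 'integ', include: ['integ-tests/**/*.test.ts'] } },
      { test: { name: 'tui', include: ['integ-tests/tui/**/*.test.ts'] } },
    ],
  },
});
```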

Troubleshooting

Cannot find module '@playwright/test'

Playwright is not installed. Run:

npm install

browserType.launch: Executable doesn't exist (Playwright browsers)

Playwright browsers need to be downloaded after install. Run:

npx playwright install chromium

Integration Tests

Integration tests require no AWS credentials. They run the real CLI binary and assert on local files and stdout only.

Run integration tests:

npm run test:integ

See integ-tests/README.md for full details.