An intelligent QA assistant that automatically plans, generates, and executes Playwright E2E tests from PR diffs and specifications. Similar to how Strix works for penetration testing, QAAI provides AI-powered test coverage with human-in-the-loop approval.
- AI Test Planning: Automatically analyzes PR diffs and generates comprehensive test plans
- Intelligent Test Generation: Creates Playwright tests from natural language specifications
- Multi-LLM Support: Works with OpenAI, Anthropic Claude, and local Ollama models
- Deterministic Execution: Stable test runs with retries, fixtures, and seeded data
- Rich Artifacts: Captures traces, videos, screenshots, and HAR files
- GitHub Integration: Seamless PR workflow with GitHub Checks and webhooks
- Flake Detection: Automatically identifies and tracks flaky tests
- Coverage Tracking: Route and API coverage visualization
- Multi-tenant: Organization-based access control with RLS
QAAI consists of three main components:
- Next.js Web App - Dashboard, test management, and artifact viewer
- Runner Service - Background workers for planning, generation, and execution
- Supabase Backend - PostgreSQL database, authentication, and storage
┌─────────────┐ ┌──────────────┐ ┌─────────────┐
│ Next.js │────▶│ Supabase │◀────│ Runner │
│ Web App │ │ Backend │ │ Service │
└─────────────┘ └──────────────┘ └─────────────┘
│ │ │
│ │ │
▼ ▼ ▼
Dashboard PostgreSQL Playwright
Artifacts Storage AI Workers
API Routes Auth/RLS Job Queue
- USER_GUIDE.md - Complete user guide and tutorials
- ARCHITECTURE.md - Detailed system architecture
- IMPLEMENTATION_GUIDE.md - Implementation instructions
- GITHUB_INTEGRATION.md - GitHub integration guide
- PHASE5_COMPLETE.md - Analytics & coverage features
- PHASE6_COMPLETE.md - Production hardening features
- Node.js 20+
- pnpm
- Supabase account
- Railway account (for deployment)
- OpenAI/Anthropic API key OR Ollama
# Clone the repository
git clone https://github.com/yourusername/qaai.git
cd qaai
# Install dependencies
pnpm install
# Set up environment variables
cp .env.example .env
# Edit .env with your credentials
# Run database migrations
pnpm db:migrate
# Start development servers
pnpm dev:web # Next.js on :3000
pnpm dev:runner # Runner serviceqaai/
├── apps/
│ └── web/ # Next.js application
│ ├── app/ # App Router pages
│ ├── components/ # React components
│ └── lib/ # Utilities and clients
├── services/
│ └── runner/ # Background worker service
│ ├── workers/ # Planner, Generator, Runner
│ ├── lib/ # LLM client, job queue
│ └── Dockerfile
├── packages/
│ └── playwright-tests/ # Generated test files
├── infra/
│ ├── supabase/ # Database schema and policies
│ ├── railway/ # Deployment configs
│ └── github/ # CI/CD workflows
└── docs/ # Additional documentation
- Next.js 15 (App Router)
- Tailwind CSS
- Supabase Auth
- Supabase (PostgreSQL + Storage + Auth)
- Next.js API Routes
- Row Level Security (RLS)
- Node.js
- Playwright
- Docker
- Railway
- OpenAI (GPT-4o-mini)
- Anthropic (Claude Sonnet)
- Ollama (Local models)
- Project structure and monorepo setup
- Next.js app with Tailwind
- Supabase integration
- Database schema and RLS
- Authentication flow
- Base UI components
- Job polling system
- Playwright configuration
- Test execution engine
- Artifact upload (traces, videos, screenshots)
- Result recording and storage
- Multi-LLM client (OpenAI, Anthropic, Ollama)
- Planner worker
- Generator worker
- Plan approval UI
- Test file management
- GitHub App webhook handler
- PR diff analysis
- GitHub Checks reporter
- Automatic issue creation
- Issue templates and tracking
- Statistical flake detection algorithm
- Flake heatmap dashboard
- Route/API coverage tracking
- Coverage matrix visualization
- Analytics API endpoints
- Test data seeding strategy
- Retry and timeout configuration
- Settings management UI
- CI/CD pipeline (GitHub Actions)
- Comprehensive user documentation
All 41 tasks completed! QAAI is production-ready with:
- ✅ AI-powered test generation
- ✅ Automated test execution
- ✅ GitHub integration
- ✅ Flake detection & analytics
- ✅ Coverage tracking
- ✅ Test data management
- ✅ CI/CD pipeline
- ✅ Complete documentation
- Supabase Auth with JWT tokens
- Row Level Security (RLS) for multi-tenancy
- Private storage buckets with signed URLs
- Service role key only on server-side
- GitHub App with minimal permissions
# Deploy web app
railway up --service web
# Deploy runner service
railway up --service runnerSee IMPLEMENTATION_GUIDE.md for detailed deployment instructions.
- Job queue metrics
- Test execution duration
- Flake rate tracking
- API response times
- Storage usage
Contributions are welcome! Please read our contributing guidelines before submitting PRs.
MIT License - see LICENSE file for details
Inspired by Strix for penetration testing, adapted for QA automation.
- Documentation: docs/
- Issues: GitHub Issues
- Discussions: GitHub Discussions
Built with ❤️ for better software quality