- 2 agent versions × 3 prompt versions × 2 configs = 12 variants
- All running in production at the same time
- Each user gets a consistent experience
- Data-driven decisions on what actually works
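The variant count and the "consistent experience" property above can be sketched in a few lines. This is a minimal illustration, not the platform's actual mechanism: the version labels, the `assign_variant` helper, and the `experiment` salt are all hypothetical, and hash-based bucketing is just one common way to keep a user pinned to the same variant.

```python
import hashlib
from itertools import product

# Hypothetical version labels; the text only gives the counts (2 x 3 x 2).
AGENT_VERSIONS = ["v1.0.0", "v2.0.0"]
PROMPT_VERSIONS = ["v1.0.0", "v1.1.0", "v2.0.0"]
CONFIG_VERSIONS = ["v1.0.0", "v2.0.0"]

# Full factorial: every (agent, prompt, config) combination, 12 in total.
VARIANTS = list(product(AGENT_VERSIONS, PROMPT_VERSIONS, CONFIG_VERSIONS))


def assign_variant(user_id: str, experiment: str = "exp-1") -> tuple:
    """Deterministically map a user to one variant (assumed scheme)."""
    # Hashing the experiment name together with the user id means the same
    # user always lands in the same bucket for a given experiment.
    digest = hashlib.sha256(f"{experiment}:{user_id}".encode("utf-8")).digest()
    bucket = int.from_bytes(digest[:8], "big") % len(VARIANTS)
    return VARIANTS[bucket]
```

Because the assignment depends only on the user id and the experiment name, no per-user state has to be stored to keep the experience consistent across requests.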
A/B Testing Basics
Simple A/B Test
Test two prompt versions.
Test Results
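One way to compare two prompt versions is a two-proportion z-test on a success rate. The sketch below assumes each variant is summarized as "successes out of total requests"; that metric and the function name `two_proportion_z_test` are assumptions for illustration, not something the original test specifies.

```python
import math


def two_proportion_z_test(success_a, total_a, success_b, total_b):
    """Return (z, two_sided_p_value) for the difference B minus A."""
    p_a = success_a / total_a
    p_b = success_b / total_b
    # Pooled rate under the null hypothesis that both variants perform equally.
    pooled = (success_a + success_b) / (total_a + total_b)
    se = math.sqrt(pooled * (1 - pooled) * (1 / total_a + 1 / total_b))
    if se == 0:
        raise ValueError("standard error is zero; rates are all 0 or all 1")
    z = (p_b - p_a) / se
    # Two-sided p-value from the standard normal distribution.
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return z, p_value
```

A positive `z` means variant B had the higher success rate; a small p-value suggests the gap is unlikely to be noise, provided the sample sizes are large enough for the normal approximation.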
Multivariate Testing
Test multiple variables simultaneously.
2×2 Test: Prompt × Model
3×3 Test: Agent × Prompt × Config
In this test, the best-performing combination is analyzer v2.0.0 + prompt v1.0.0 + config v2.0.0.
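Picking a winning combination from a multivariate test usually comes down to aggregating a score per variant and taking the maximum. The sketch below assumes each observation is a `(variant, score)` pair where a higher score is better; the `best_combination` helper and the `min_samples` guard are hypothetical and not part of the original tooling.

```python
from collections import defaultdict
from statistics import mean


def best_combination(observations, min_samples=1):
    """Return (variant, mean_score) for the variant with the highest mean."""
    scores = defaultdict(list)
    for variant, score in observations:
        scores[variant].append(score)
    # Ignore variants with too few samples to be trusted.
    eligible = {v: mean(s) for v, s in scores.items() if len(s) >= min_samples}
    if not eligible:
        raise ValueError("no variant has enough samples")
    return max(eligible.items(), key=lambda item: item[1])
```

Raising `min_samples` keeps a variant with only a handful of lucky requests from being declared the winner.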

