Evaluations
Run experiments and continuous testing on logs
LLMs are inherently unpredictable, which can make feature-development challenging. With Velvet Evaluations, you can feel confident that your LLM-powered features work the way you expect them to.
Test your inputs against models, settings, and metrics.
Set up an evaluation
- Continuous monitoring: Run ongoing tests on samples of logs
- Experiments: Run a one-time test to review variants
Video tutorials
Updated 3 days ago