Evaluations

Run experiments and continuous tests on your logs

LLMs are inherently unpredictable, which can make feature development challenging. With Velvet Evaluations, you can test your LLM-powered features and confirm they work the way you expect.

Test your inputs against models, settings, and metrics.
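
To make the idea concrete, here is a minimal sketch of what testing inputs against models, settings, and metrics can look like in general. It is not Velvet's API: `fake_model`, `run_grid`, the model names "small" and "large", and both metric functions are hypothetical stand-ins chosen for illustration, and the model call is a deterministic placeholder rather than a real LLM request.

```python
from itertools import product
from typing import Callable, Dict, List, Tuple


def fake_model(name: str, temperature: float, prompt: str) -> str:
    """Hypothetical stand-in for an LLM call (assumption, not a real API).

    The temperature is accepted but ignored so the sketch stays deterministic.
    """
    answer = "Paris" if "France" in prompt else "unknown"
    # The placeholder "small" model answers in lowercase to create a difference.
    return answer.lower() if name == "small" else answer


def exact_match(output: str, expected: str) -> float:
    """Score 1.0 only when the output equals the expected text exactly."""
    return 1.0 if output == expected else 0.0


def case_insensitive_match(output: str, expected: str) -> float:
    """Score 1.0 when the output matches the expected text, ignoring case."""
    return 1.0 if output.lower() == expected.lower() else 0.0


def run_grid(
    inputs: List[Tuple[str, str]],
    models: List[str],
    temperatures: List[float],
    metrics: Dict[str, Callable[[str, str], float]],
) -> List[dict]:
    """Run every (input, model, temperature) combination and score each output."""
    results = []
    for (prompt, expected), model, temperature in product(inputs, models, temperatures):
        output = fake_model(model, temperature, prompt)
        scores = {name: metric(output, expected) for name, metric in metrics.items()}
        results.append(
            {
                "prompt": prompt,
                "model": model,
                "temperature": temperature,
                "output": output,
                "scores": scores,
            }
        )
    return results


results = run_grid(
    inputs=[("What is the capital of France?", "Paris")],
    models=["small", "large"],
    temperatures=[0.0, 0.7],
    metrics={"exact": exact_match, "case_insensitive": case_insensitive_match},
)
for row in results:
    print(row["model"], row["temperature"], row["scores"])
```

Each row pairs one input with one model and one setting, then records a score per metric, so a regression in any combination shows up as a changed score.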


Set up an evaluation


Video tutorials