typpo

promptfoo/promptfoo

Test your prompts, models, and RAGs. Catch regressions and improve prompt quality. LLM evals for OpenAI, Azure, Anthropic, Gemini, Mistral, Llama, Bedrock, Ollama, and other local & private models with CI/CD integration.

Stars总数3140

Forks总数206

本周Stars130

源码分类

更新时间(12月前)

扫码关注公众号获取最新文章,并可免费领取前端工程师必备学习资源

 
41querys in 1.169 seconds.