
promptfoo/promptfoo
Test your prompts, models, and RAGs. Catch regressions and improve prompt quality. LLM evals for OpenAI, Azure, Anthropic, Gemini, Mistral, Llama, Bedrock, Ollama, and other local & private models with CI/CD integration.
View project: promptfoo/promptfoo
