A modular pipeline for automated LLM chatbot evaluation, using DeepEval, Groq models, and Confident AI dashboard logging. Built for systematic, repeatable QA and portfolio-quality results in AI/QA engineering.
Stars: 0
Forks: 0
Watchers: 0
Open Issues: 0
10 commits
4a05ed4 feat: Implement a financial advisor chatbot evaluation framework, including a chatbot, batch evaluation script, golden test cases, and comprehensive documentation.
7579c86 feat: Implement a financial advisor chatbot evaluation framework, including a chatbot, batch evaluation script, golden test cases, and comprehensive documentation.
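The commit messages mention a chatbot, a batch evaluation script, and golden test cases. The repository's own code is not shown here, so the following is only a minimal sketch of how a batch run over golden cases might be structured. It does not use DeepEval or Groq: `GoldenCase`, `keyword_coverage`, `run_batch`, and `stub_chatbot` are hypothetical names, and the keyword-coverage score is a deliberately simple stand-in for whatever metrics the real pipeline applies.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class GoldenCase:
    """Hypothetical golden test case: a prompt plus keywords a good answer should mention."""
    prompt: str
    expected_keywords: list[str]


def keyword_coverage(answer: str, keywords: list[str]) -> float:
    """Fraction of expected keywords found in the answer (case-insensitive).

    A stand-in metric for illustration only, not a DeepEval metric.
    """
    if not keywords:
        return 1.0
    text = answer.lower()
    hits = sum(1 for kw in keywords if kw.lower() in text)
    return hits / len(keywords)


def run_batch(
    chatbot: Callable[[str], str],
    cases: list[GoldenCase],
    threshold: float = 0.5,
) -> list[dict]:
    """Send every golden prompt to the chatbot and score each answer."""
    results = []
    for case in cases:
        answer = chatbot(case.prompt)
        score = keyword_coverage(answer, case.expected_keywords)
        results.append(
            {"prompt": case.prompt, "score": score, "passed": score >= threshold}
        )
    return results


def stub_chatbot(prompt: str) -> str:
    """Assumed placeholder for the real model call; always returns the same text."""
    return "Diversification spreads risk across asset classes."


if __name__ == "__main__":
    golden = [
        GoldenCase("What is diversification?", ["diversification", "risk"]),
        GoldenCase("What is an index fund?", ["index", "fund"]),
    ]
    for row in run_batch(stub_chatbot, golden):
        print(row)
```

Passing the chatbot in as a plain callable keeps the batch loop independent of any particular model provider, so a real client could be substituted for `stub_chatbot` without touching the scoring code.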