Back to search
A scalable system for evaluating conversation turns on hundreds to thousands of linguistic, pragmatic, safety, and emotional facets using open-weight LLMs.
Stars
0
Forks
0
Watchers
0
Open Issues
0
Overall repository health assessment
No package.json found
This might not be a Node.js project
10
commits
Changed evaluator code to async code for better performance and scalability
d5ab1a2View on GitHubDid categorizing using embeddings based classification (cosine similarity), implemented and processed data is saved
7f3af6dView on GitHub