Home Explore Bookmarks Inbox Collections Profile

Recent Search

Command Palette

Search for a command to run...

Command Palette

Search for a command to run...

Home Explore Bookmarks

Inbox Collections Profile

Task-Specific LLM Evals that Do & Don’t Work — Blankdot

eugeneyan.com

4 months ago

Task-Specific LLM Evals that Do & Don’t Work

Task-Specific LLM Evals that Do & Don’t Work

Evals for classification, summarization, translation, copyright regurgitation, and toxicity.

No discussion yet. Be the first to share your thoughts!