Evaluating Language Models
Learn how Spice evaluates, tracks, compares, and improves language model performance for specific tasks
Learn how Spice evaluates, tracks, compares, and improves language model performance for specific tasks
Return all evals available to run in the runtime.