This guide explains how to perform evaluations using the Together AI UI.For a comprehensive guide with detailed parameter descriptions, see AI Evaluations.
labels – (list of strings) Categories for input classification. For each category, you can specify whether it’s considered ‘pass’ or ‘fail’ for statistics computation
model_to_evaluate – Configuration for the model being evaluated
Choose whether to evaluate existing data or generate new responses:
“No” – Data required for evaluation is already present in your dataset
“Yes” – Generate data using the model for evaluation
Option 1: Column Reference (String)
Use when evaluating pre-existing data from your dataset. Simply specify the column name containing the data to evaluate.Option 2: Model Object
Use when generating new responses for evaluation. The object requires:
model_name – (string) One of our supported models
system_template – (Jinja2 template) An instruction for generation, e.g., “You are a helpful assistant.” (see Understanding Templates)
input_template – (Jinja2 template) Input format, e.g., "{{prompt}}" (see Understanding Templates)
max_tokens – (integer) Maximum tokens for generation
temperature – (float) Temperature setting for generation