If you set the judges
argument to a non-NULL value in evaluate()
,
this function will be evoked automatically.
Usage
grade_queue(x, judges, type = c("pairwise", "score"))
Arguments
- judges
A
judges()
object orNULL
to grade the evals yourself.- type
One of
"pairwise"
or"score"
. See details for more.
Examples
library(ellmer)
ggplot2 <- evaluate(
"tests/evalthat/test-ggplot2-graded.R",
across = tibble(chat = c(
chat_openai(model = "gpt-4o-mini", echo = FALSE),
chat_claude(model = "claude-3-5-sonnet-latest", echo = FALSE),
chat_ollama(model = "qwen2.5-coder:14b", echo = FALSE))
),
repeats = 2
)
#> Error in tibble(chat = c(chat_openai(model = "gpt-4o-mini", echo = FALSE), chat_claude(model = "claude-3-5-sonnet-latest", echo = FALSE), chat_ollama(model = "qwen2.5-coder:14b", echo = FALSE))): could not find function "tibble"
ggplot2
#> Error: object 'ggplot2' not found
ggplot2_graded
#> Error: object 'ggplot2_graded' not found