includes, excludes, etc. This is either unary (like is_safe) or binary (meaning it takes criteria filled in by the user). Each operator is a tool we’ve defined to check one specific item in the output.
Criteria - Free text filled in by the user that the operator checks the output against.
Right answer - An alternative way to evaluate an output. The user writes the “right answer” to a given prompt/question and we compare that right answer to the answer the LLM produces.
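To make the unary/binary distinction concrete, here is a minimal sketch in Python. The operator names includes, excludes, and is_safe come from the text above; their bodies, the UNSAFE_TERMS blocklist, and the matches_right_answer helper are assumptions made for illustration, not the actual implementation. In particular, the text does not say how a right answer is compared to the LLM's answer, so the sketch assumes a simple case- and whitespace-insensitive match.

```python
# Hypothetical blocklist, used only to give is_safe something to check.
UNSAFE_TERMS = ("rm -rf", "drop table")


def is_safe(output: str) -> bool:
    """Unary operator: takes only the LLM output, no user criteria."""
    lowered = output.lower()
    return not any(term in lowered for term in UNSAFE_TERMS)


def includes(output: str, criteria: str) -> bool:
    """Binary operator: passes if the output contains the user's criteria."""
    return criteria.lower() in output.lower()


def excludes(output: str, criteria: str) -> bool:
    """Binary operator: passes if the output does not contain the criteria."""
    return not includes(output, criteria)


def matches_right_answer(output: str, right_answer: str) -> bool:
    """Assumed comparison: equal after lowercasing and collapsing whitespace."""
    def normalize(text: str) -> str:
        return " ".join(text.lower().split())

    return normalize(output) == normalize(right_answer)


if __name__ == "__main__":
    llm_output = "Paris is the capital of France."
    print(is_safe(llm_output))                       # unary: output only
    print(includes(llm_output, "paris"))             # binary: output + criteria
    print(excludes(llm_output, "London"))            # binary: output + criteria
    print(matches_right_answer(" paris ", "Paris"))  # right-answer check
```

Each operator returns a single pass/fail result for one specific property of the output, so several of them can be applied to the same output independently.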