Basic CLI usage#

All of FlexEval’s features can be accessed using YAML configuration files.

Running an eval is as simple as pointing to that configuration file and using the run() command:

flexeval run vignettes/eval_run.yaml

The file vignettes/eval_run.yaml demonstrates a complete configuration using the EvalRun schema.

data_sources:
  - type: file
    path: vignettes/conversations.jsonl
database_path: eval_results.db
eval:
  metrics:
    function:
      - name: has_question_mark
      - name: is_role
        kwargs:
          role: assistant
    rubric:
      - name: assistant_asks_a_question
        depends_on:
          - name: is_role
            kwargs:
              role: assistant
            metric_min_value: 1
  grader_llm:
    function_name: echo_completion
    kwargs:
      response: Reasoning.\nNO
config:
  clear_tables: True
  logs_path: vignettes/log
rubric_paths:
 - vignettes/custom_rubrics.yaml
function_modules:
 - vignettes/custom_functions.py

Use the summarize_metrics() command to print a result summary:

python -m flexeval summarize-metrics vignettes/eval_run.yaml

See Metric Analysis for a more in-depth vignette analyzing FlexEval outputs.