Welcome to Arrakis Viewer

A comprehensive platform for AI model evaluation, analysis, and quality assessment

Cosmos DB: Configured

Local Filesystem: Disabled

Available Features

Manage, import, and run evaluation test suites.

Access Evaluation Management

View and explore trace data from evaluation runs.

Access Traces

View evaluation results from all storage sources.

Access Evaluation Results

Manage spice definitions and tool configurations.

Access Spice Registry

Configure ensemble compositions and inheritance.

Access Spice Ensembles

Welcome to the Arrakis Viewer platform! This tool helps you test and evaluate AI model performance:

Check the configuration status above to see which features are available
For developers: Run evaluations with uv run python -m peanut_eval.bin.run_evals to generate test results in the eval_results/ directory
Access these results through the Local Results section if filesystem access is enabled
For stakeholders: Use the Evaluations and Runs sections in the hosted environment (requires Cosmos DB)

Local development workflow:

Hosted environment:

Contact your administrator if you need help configuring these options.