Query Expression Language

The W&B Query Expression Language lets you programmatically analyze and visualize your ML experiments directly in the W&B UI. Query expressions filter, transform, group, and aggregate run data, and the results render as charts or tables in a Query Panel.

Important: Where Query Expressions Run

Query Expressions are NOT local code! They are typed directly into the W&B web interface, not in your Python/JavaScript files.

Getting Started

Step 1: Log Data to W&B (Local Code)

First, you need data in W&B to query. This requires the W&B Python SDK:

pip install wandb  # Only installation needed - for logging data

# your_training_script.py - runs locally
import wandb

wandb.init(project="my-ml-project")
wandb.log({"loss": 0.5, "accuracy": 0.85})
wandb.finish()
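
The query examples later on this page read fields such as learning_rate, batch_size, best_accuracy, and final_loss, which the minimal script above does not record. The following is a sketch of a script that would record them, assuming that values passed as config are what runConfig(r) exposes and that run summary values are what runSummary(r) exposes. fake_history and summarize are hypothetical helpers with made-up numbers, not part of the SDK, and the wandb import is deferred so the helpers can be tried without the SDK installed.

```python
# log_sweep_run.py - runs locally (illustrative sketch, not an official template)

def fake_history(epochs):
    """Hypothetical stand-in for a training loop: one metrics row per epoch."""
    rows = []
    for epoch in range(1, epochs + 1):
        loss = 1.0 / (epoch + 1)  # made-up, steadily decreasing loss
        rows.append({"epoch": epoch, "loss": loss, "accuracy": 1.0 - loss})
    return rows


def summarize(history):
    """Derive the summary fields that the query examples read."""
    return {
        "best_accuracy": max(row["accuracy"] for row in history),
        "final_loss": history[-1]["loss"],
    }


def log_run(project, config, history):
    """Send config, per-epoch metrics, and summary values to W&B."""
    import wandb  # imported here so the helpers above work without the SDK

    run = wandb.init(project=project, config=config)
    for row in history:
        run.log(row)
    for key, value in summarize(history).items():
        run.summary[key] = value
    run.finish()


config = {"learning_rate": 0.01, "batch_size": 32, "total_epochs": 3}
history = fake_history(config["total_epochs"])
# log_run("my-ml-project", config, history)  # uncomment to log; requires `wandb login`
```

Logging one row per epoch also leaves the last logged value of each metric in the run summary, which is what a query on a field like epoch or loss would presumably read.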

Step 2: Query Your Data (W&B Web UI)

After logging runs, analyze them in the W&B web interface:

Open your browser and go to wandb.ai
Navigate to your project (e.g., wandb.ai/your-username/my-ml-project)
Click “+ Add Panel” → Select “Query Panel”

Type expressions in the web editor (NOT in your local code):

// This is typed into the wandb.ai interface
runs.map(r => runSummary(r).accuracy).avg()

See results instantly as charts or tables in your browser

Complete Example: Finding Your Best Model

Here’s what you would type in the W&B Query Panel editor to analyze a hyperparameter sweep:

// Remember: This is typed into the Query Panel at wandb.ai
// NOT in your local code files!

// Step 1: Keep only finished runs from one sweep (here, the sweep with ID "sweep_2024_01")
const validRuns = runs
  .filter(r => r.state === "finished")
  .filter(r => runConfig(r).sweep_id === "sweep_2024_01")

// Step 2: Extract key metrics and configurations
const runAnalysis = validRuns.map(r => ({
  name: r.name,
  accuracy: runSummary(r).best_accuracy,
  loss: runSummary(r).final_loss,
  learning_rate: runConfig(r).learning_rate,
  batch_size: runConfig(r).batch_size,
  training_time: r.duration
}))

// Step 3: Find the best run
const bestRun = validRuns
  .reduce((best, current) => 
    runSummary(current).best_accuracy > runSummary(best).best_accuracy 
      ? current 
      : best
  )

// Step 4: Calculate statistics across all runs
const stats = {
  avg_accuracy: validRuns.map(r => runSummary(r).best_accuracy).avg(),
  std_accuracy: validRuns.map(r => runSummary(r).best_accuracy).std(),
  total_compute_hours: validRuns.map(r => r.duration).sum() / 3600
}

// Step 5: Group by hyperparameter to find optimal values
const byLearningRate = validRuns
  .groupby(r => runConfig(r).learning_rate)
  .map(group => ({
    lr: group.key,
    avg_accuracy: group.values.map(r => runSummary(r).best_accuracy).avg(),
    num_runs: group.values.length
  }))
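
To make the intent of steps 3 to 5 concrete, here is a rough plain-Python analogue that runs locally on made-up records. It is not Query Expression code and does not call W&B; it only mirrors what each step computes, assuming each record flattens the fields the query reads through runSummary(r) and runConfig(r). The standard deviation from step 4 is left out because this page does not say whether .std() is a sample or population estimate.

```python
from collections import defaultdict
from statistics import mean

# Made-up records standing in for the finished runs selected in step 1.
valid_runs = [
    {"name": "run-a", "best_accuracy": 0.80, "learning_rate": 0.01, "duration": 3600},
    {"name": "run-b", "best_accuracy": 0.90, "learning_rate": 0.001, "duration": 7200},
    {"name": "run-c", "best_accuracy": 0.86, "learning_rate": 0.01, "duration": 1800},
]

# Step 3: the reduce keeps whichever run has the higher best_accuracy.
best_run = max(valid_runs, key=lambda r: r["best_accuracy"])

# Step 4: aggregate statistics across all runs (duration is in seconds).
stats = {
    "avg_accuracy": mean(r["best_accuracy"] for r in valid_runs),
    "total_compute_hours": sum(r["duration"] for r in valid_runs) / 3600,
}

# Step 5: bucket runs by learning rate, then summarize each bucket.
groups = defaultdict(list)
for r in valid_runs:
    groups[r["learning_rate"]].append(r)

by_learning_rate = [
    {"lr": lr, "avg_accuracy": mean(r["best_accuracy"] for r in rs), "num_runs": len(rs)}
    for lr, rs in groups.items()
]
```

With this sample data, best_run is run-b and the 0.01 learning-rate group contains two runs.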

Core Concepts

Chainable Operations

Operations can be chained, with each step consuming the output of the previous one:

runs
  .filter(/* select runs */)
  .map(/* transform data */)
  .groupby(/* organize results */)
  .sort(/* order output */)

Type Safety

The expression language is fully typed, so the editor offers autocomplete and validates queries as you write them.

Operations

Functions for querying and manipulating W&B data:

Run Operations - Query and manipulate runs
Artifact Operations - Work with artifacts

Data Types

Core type definitions:

Run - Experiment runs with metadata and metrics
Artifact - Versioned files and directories
ArtifactType - Artifact type definitions
ArtifactVersion - Specific artifact versions
ConfigDict - Configuration parameters
SummaryDict - Summary metrics from runs
Table - Tabular data structure
User - User account information
Project - Project metadata
Entity - Team or user organization

Common Patterns

The following examples show Query Expressions you would type in the W&B web UI:

Compare Model Architectures

// Type this in the Query Panel at wandb.ai
// Group runs by model type and compare average performance
runs
  .groupby(r => runConfig(r).model_type)
  .map(g => ({
    model: g.key,
    avg_accuracy: g.values.map(r => runSummary(r).accuracy).avg(),
    best_accuracy: g.values.map(r => runSummary(r).accuracy).max(),
    training_hours: g.values.map(r => r.duration).sum() / 3600
  }))
  .sort((a, b) => b.avg_accuracy - a.avg_accuracy)

Track Experiment Progress

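// Monitor ongoing experiments
// Note: the ETA divides by runSummary(r).epoch, so it is only meaningful
// once at least one epoch has been logged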
runs
  .filter(r => r.state === "running")
  .map(r => ({
    name: r.name,
    progress: runSummary(r).epoch / runConfig(r).total_epochs,
    current_loss: runSummary(r).loss,
    eta_minutes: (r.duration / runSummary(r).epoch) * 
                 (runConfig(r).total_epochs - runSummary(r).epoch) / 60
  }))
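
The ETA in the query above is a linear extrapolation: average seconds per completed epoch, times the epochs remaining, converted to minutes. The local Python sketch below works through that arithmetic with made-up numbers. It assumes duration is measured in seconds (consistent with the division by 3600 used for hours elsewhere on this page); eta_minutes is a hypothetical helper, and its guard for epoch 0 is an addition that the query itself does not have.

```python
def eta_minutes(duration_seconds, epoch, total_epochs):
    """Remaining minutes, assuming every epoch takes the average time so far."""
    if epoch <= 0:
        return None  # no completed epoch yet, so there is no rate to extrapolate
    seconds_per_epoch = duration_seconds / epoch
    remaining_epochs = total_epochs - epoch
    return seconds_per_epoch * remaining_epochs / 60


# A run that has spent 1200 seconds on 4 of 10 epochs:
progress = 4 / 10               # fraction of epochs completed
eta = eta_minutes(1200, 4, 10)  # 300 s/epoch * 6 epochs / 60 = 30.0 minutes
```

The estimate is only as good as the assumption that later epochs cost the same as earlier ones.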

Find Optimal Hyperparameters

// Identify the best-performing hyperparameter combinations
runs
  .filter(r => runSummary(r).val_accuracy > 0.85)
  .map(r => ({
    accuracy: runSummary(r).val_accuracy,
    lr: runConfig(r).learning_rate,
    batch_size: runConfig(r).batch_size,
    optimizer: runConfig(r).optimizer
  }))
  .sort((a, b) => b.accuracy - a.accuracy)
  .slice(0, 10)  // Top 10 configurations


Last modified September 18, 2025
