Interactive demo: live backend API

Concept Bottleneck LLM Intervention Console

Edit concept-linked neurons, run baseline vs intervened, and inspect evidence. This UI is a researcher-focused shell over live CB-LLM backend responses.

Concept Bottleneck Control Panel

Loading… No pending changes

Concept Browser

Search, sort, and select concepts for batch edits.

Concept Activation Intervened Status
Selected: 0 Intervention active: 0

Intervention Editor

Edit activation and weight interventions for selected concepts.

A'_j = A_j + Δ

Additive adds Δ, override sets a fixed value, scale multiplies the activation.

Zero removes the concept’s influence; scale dampens or amplifies its effect downstream.

Input Workspace

Loaded:
Decoding settings

Results & Evidence

Top Concepts

Latest Model Output
Run the model to view output.
No response yet.
Concept Activation Weight Contribution

Text classification task

Visualizes how concept activations flow into predicted classes.

Project Summary

We reproduce and extend Concept Bottleneck LLM work by building tooling to select concept-linked neurons and apply activation interventions (ablation / amplification) while measuring downstream performance and behavioral steering.

Team: Christian Guerra, Neil Dandekar