Using Downsizing with Hermes
Route Hermes Agent traffic through Downsizing using the CLI or manual configuration.
Prerequisites
Before linking Hermes, either attach a credit card or connect your Anthropic subscription to Downsizing. We also recommend installing the Downsizing CLI; see the CLI guide for installation steps.
CLI Setup (Recommended)
The easiest way to connect Hermes is with the Downsizing CLI. If you haven't installed it yet, grab it from the CLI guide.
Sign in, then link Hermes:
ds link hermes
The CLI writes the correct base_url and api_key to ~/.hermes/config.yaml. The command is idempotent, so it is safe to run multiple times.
Manual Setup
If you prefer to configure manually, edit ~/.hermes/config.yaml and set:
model:
  default: claude-opus-4-8
  provider: anthropic
  base_url: http://localhost:3001/YOUR_INFERENCE_ID/anthropic
providers:
  anthropic:
    api_key: dws_YOUR_API_KEY
Find your Inference ID and API key in the Downsizing dashboard. Replace YOUR_INFERENCE_ID with the workspace UUID and dws_YOUR_API_KEY with your Downsizing API key.
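A common mistake with manual setup is saving the file with a template placeholder still in it. The following is a minimal sketch of a pre-flight check, not part of Downsizing or Hermes. The helper names `leftover_placeholders` and `inference_id_from_base_url` are hypothetical, and the URL shape is assumed to match the `<host>/<inference id>/anthropic` pattern shown in the config above.

```python
import re
import uuid

# Placeholder strings taken from the manual-setup template above.
PLACEHOLDERS = ("YOUR_INFERENCE_ID", "dws_YOUR_API_KEY")


def leftover_placeholders(config_text: str) -> list[str]:
    """Return the template placeholders still present in the config text."""
    return [p for p in PLACEHOLDERS if p in config_text]


def inference_id_from_base_url(base_url: str) -> str:
    """Extract the inference ID from a base_url and check that it is a UUID.

    Assumes the shape <scheme>://<host>/<inference id>/anthropic.
    Raises ValueError if the shape or the ID is wrong.
    """
    match = re.fullmatch(r"https?://[^/]+/([^/]+)/anthropic/?", base_url)
    if match is None:
        raise ValueError(f"unexpected base_url shape: {base_url!r}")
    inference_id = match.group(1)
    uuid.UUID(inference_id)  # raises ValueError if not a well-formed UUID
    return inference_id
```

To use it, read ~/.hermes/config.yaml as text and pass the contents to `leftover_placeholders`; an empty list means both placeholders have been replaced.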
How it works
Hermes sends all Claude API requests to the Downsizing proxy using your Downsizing API key for authentication. The proxy validates the request, applies routing and reduction, then forwards the call to Anthropic using your stored Claude API key. Responses stream back to Hermes unchanged.
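The key swap described above can be pictured with a small conceptual model. This is an illustrative sketch only, not Downsizing's implementation: the function `forward_headers` is hypothetical, the routing and reduction step is deliberately left out, and it assumes the key travels in the `x-api-key` header that the Anthropic API uses.

```python
def forward_headers(incoming: dict, stored_claude_key: str,
                    valid_downsizing_keys: set) -> dict:
    """Conceptual model: authenticate with the Downsizing key, then swap in
    the stored Claude key before the request goes upstream.

    All names here are illustrative assumptions.
    """
    presented = incoming.get("x-api-key", "")
    if presented not in valid_downsizing_keys:
        # The proxy rejects requests that do not carry a known Downsizing key.
        raise PermissionError("unknown Downsizing API key")
    outgoing = dict(incoming)  # copy so the caller's headers are not mutated
    outgoing["x-api-key"] = stored_claude_key
    return outgoing
```

The point of the model is that Hermes only ever holds the Downsizing key; the Claude key stays on the proxy side and is attached just before forwarding.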
Verify
Run a quick one-shot query to confirm the proxy is working:
hermes chat -q "test"
If Hermes responds, the proxy is correctly configured. You can also check your savings dashboard to see the request appear in your usage metrics.
Unlinking
To remove Downsizing from Hermes:
ds unlink hermes