Downsizing logodownsizing
Downsizing logodownsizing
Documentation

Using Downsizing with Hermes

Route Hermes Agent traffic through Downsizing using the CLI or manual configuration.


Prerequisites

Before linking Hermes, either attach a credit card or connect your Anthropic subscription to Downsizing. We also recommend using the CLI — check the CLI guide to install.

CLI Setup (Recommended)

The easiest way to connect Hermes is with the Downsizing CLI. If you haven't installed it yet, grab it from the CLI guide.

Sign in, then link Hermes:

bash
ds link hermes

The CLI writes the correct base_url and api_key to ~/.hermes/config.yaml. The command is idempotent — safe to run multiple times.

Manual Setup

If you prefer to configure manually, edit ~/.hermes/config.yaml and set:

yaml
model:
  default: claude-opus-4-8
  provider: anthropic
  base_url: http://localhost:3001/YOUR_INFERENCE_ID/anthropic

providers:
  anthropic:
    api_key: dws_YOUR_API_KEY

Find your Inference ID and API key in the Downsizing dashboard. Replace YOUR_INFERENCE_ID with the workspace UUID and dws_YOUR_API_KEY with your Downsizing API key.

How it works

Hermes sends all Claude API requests to the Downsizing proxy using your Downsizing API key for authentication. The proxy validates the request, applies routing and reduction, then forwards the call to Anthropic using your stored Claude API key. Responses stream back to Hermes unchanged.

Verify

Run a quick one-shot query to confirm the proxy is working:

bash
hermes chat -q "test"

If Hermes responds, the proxy is correctly configured. You can also check your savings dashboard to see the request appear in your usage metrics.

Unlinking

To remove Downsizing from Hermes:

bash
ds unlink hermes