Prerequisites
- Claude Code installed
- A Fireworks API key (
fw_...) or a Fire Pass key (fpk_...) - The FireConnect CLI (see Install)
Enable Fireworks routing
fireconnect claude on, model select, or model reset, settings are updated immediately. To use a new model in the same session, run /model in Claude Code, start a new session, or /exit and resume with claude --resume <id>.
Using Fire Pass
Use yourfpk_... key instead of a standard fw_... key:
glm-latest.
Default model mapping
| Alias | Standard key (fw_...) | Fire Pass key (fpk_...) |
|---|---|---|
| main | glm-latest | glm-latest |
| opus | glm-latest | glm-latest |
| sonnet | glm-5p1 | glm-latest |
| haiku | deepseek-v4-flash | glm-latest |
| subagent | deepseek-v4-flash | glm-latest |
glm-latest are expanded to full Fireworks paths (for example, accounts/fireworks/routers/glm-latest[1m]). FireConnect appends the [1m] suffix on main and opus so Claude Code enables 1M context. The subagent slot is written without [1m] because Claude Code forwards that value verbatim to the provider API.
What gets written
FireConnect writes these settings to~/.claude/settings.json:
ANTHROPIC_API_KEY (preferred) and ANTHROPIC_AUTH_TOKEN (compatibility alias) with the same Fireworks key. It saves a backup of your previous provider settings to ~/.fireconnect/claude/ so fireconnect claude off can restore them.
Browsing and picking models
fireconnect claude model list
Lists serverless models from the Fireworks API (supports_serverless=true) and merges known public platform aliases (glm-latest, kimi-fast-latest, kimi-latest, kimi-k2p6-turbo, and kimi-k2p7-code-fast). Every row is tagged serverless.
--api-key, harness settings, ~/.fireconnect/config.json, or FIREWORKS_API_KEY. Fire Pass keys (fpk_...) show Fire Pass-supported models only.
fireconnect claude model select
Interactive picker. Requires a terminal and Fireworks to be enabled.
fireconnect claude status vs fireconnect claude model list
| Command | Shows |
|---|---|
fireconnect claude status | Your current provider, auth, configured alias mapping, and Fireworks serverless rates per slot |
fireconnect claude model list | Available serverless endpoints from the Fireworks API, with IN / OUT pricing where known |
Claude Code pricing estimates
Claude Code’s/model picker and session cost estimates use Anthropic list prices, not Fireworks serverless rates. For example, the default glm-latest mapping may show Opus-tier estimates around 25 per Mtok while Fireworks bills at model-specific serverless rates (often much lower).
FireConnect cannot override Claude Code’s price column. Use fireconnect claude status and fireconnect claude model list for Fireworks rates, and check the billing dashboard for actual spend.
CLI reference
fireconnect claude help for all options.
Switch models
Turn off Fireworks routing
~/.claude/settings.json from the backup saved in ~/.fireconnect/claude/.