- Direct Fireworks routing (default): requests go to the Fireworks gateway; use a Fireworks API key (`fw_...`) or Fire Pass key (`fpk_...`)
- Fireworks on Microsoft Foundry: the same models, deployed in your Azure subscription and billed through Azure; use `--provider azure` and an Azure API key (see Fireworks on Microsoft Foundry)
Claude Code
Anthropic-compatible routing with multi-slot model aliases
OpenCode
OpenAI-compatible adapter in `opencode.json`
Codex
OpenAI Codex CLI via the Responses API
Pi
Pi agent settings and auth
Cursor
OpenAI BYOK settings for Cursor IDE
VS Code
GitHub Copilot Chat custom endpoint
Prerequisites
- A Fireworks API key (`fw_...`) or a Fire Pass key (`fpk_...`) for direct Fireworks routing
- For Fireworks on Microsoft Foundry: a Microsoft Foundry resource, an Azure API key, and at least one model deployment (see Fireworks on Microsoft Foundry and the Microsoft Foundry integration guide)
- Node.js (the installer can install it via Homebrew or apt if it is missing)
- At least one supported harness installed locally
Install
Fire Pass users can supply an `fpk_...` key directly. FireConnect detects the key type and applies the correct defaults.
The installer:
- Clones the CLI to `~/.fireconnect/cli` and adds a `fireconnect` launcher to `~/.local/bin`
- Runs `fireconnect configure` to register harnesses and store your API key preference
- Uses Node.js to update harness settings (it does not install or update npm packages)
Upgrade
If you installed via the curl installer, pull the latest release in place.
Fireworks on Microsoft Foundry
Route OpenCode, Codex, and Pi through a Microsoft Foundry deployment of Fireworks models instead of the Fireworks gateway. Usage is billed through your Azure account and counts toward Azure commitments where applicable.
Microsoft Foundry integration guide
Portal setup, deployment modes (PayGo / PTU), billing, and troubleshooting. Start here if you have not deployed a model yet.
Terminology: In the CLI, the Foundry provider is `--provider azure` (or `on --azure`). Harness configs display the label Fireworks on Microsoft Foundry. Foundry uses the deployment names you chose at deploy time (for example, `FW-GLM-5.1`), not Fireworks serverless short IDs like `glm-latest`.
Claude Code, Cursor, and VS Code do not support Fireworks on Microsoft Foundry. Foundry exposes an OpenAI-compatible chat API, not the Anthropic Messages API that Claude Code expects, and IDE BYOK flows require direct Fireworks credentials.
Prerequisites
- A Microsoft Foundry resource with at least one Fireworks model deployment (for example, `FW-GLM-5.1` or `FW-MiniMax-M2.5`)
- Your Foundry resource endpoint and Azure API key from the Microsoft Foundry portal
- OpenCode, Codex, or Pi installed locally
Configure the Foundry provider
Set the active provider once with `configure`. Harness `on` commands then use that provider; there is no separate global `azure on` command.
FireConnect normalizes `--base-url` to the OpenAI-compatible surface at `.../openai/v1`. You can paste the resource root, a project endpoint (`.../api/projects/<name>`), or an already-correct `.../openai/v1` URL.
To keep secrets out of `~/.fireconnect/config.json`, export `AZURE_API_KEY` before running `configure` instead of passing `--api-key`. FireConnect then stores an environment reference rather than the key itself.
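The endpoint handling described above can be pictured with a small Python sketch. This is not FireConnect's actual code: `normalize_base_url` is a hypothetical helper, the host names are placeholders, and stripping the project suffix back to the resource root before appending `/openai/v1` is an assumption about how the three accepted URL forms converge.

```python
from urllib.parse import urlsplit, urlunsplit

OPENAI_SUFFIX = "/openai/v1"


def normalize_base_url(url: str) -> str:
    """Hypothetical helper: map a pasted Foundry endpoint to .../openai/v1."""
    parts = urlsplit(url.strip())
    path = parts.path.rstrip("/")
    if path.endswith(OPENAI_SUFFIX):
        # Already the OpenAI-compatible surface; keep it as is.
        new_path = path
    else:
        # Assumption: a project endpoint (/api/projects/<name>) is reduced
        # to the resource root before the suffix is appended.
        marker = path.find("/api/projects/")
        if marker != -1:
            path = path[:marker]
        new_path = path + OPENAI_SUFFIX
    # Query string and fragment are dropped in this sketch.
    return urlunsplit((parts.scheme, parts.netloc, new_path, "", ""))
```

Under these assumptions, a resource root, a project endpoint, and an already-correct URL on the same host all produce the same result.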
Enable harnesses
After configuring the Foundry provider, enable harnesses the same way as with direct Fireworks routing.
FireConnect reads the provider from `~/.fireconnect/config.json` and writes OpenAI-compatible adapter settings. In harness UIs, the provider appears as Fireworks on Microsoft Foundry.
Choose a deployment
Pass the deployment name you chose in the Azure portal with `--main`.
If you omit `--main`, FireConnect defaults to `FW-GLM-5.1`.
One-off Foundry routing
To route a single harness through Foundry without changing global config, pass `--azure` to `on`.
If a Foundry endpoint is already saved, `--azure` alone reuses it.
Switch between direct Fireworks and Foundry
Update global config, then re-enable each harness.
`off` restores the harness config snapshot from before FireConnect was enabled for that provider.
API key resolution (Foundry)
When the Foundry provider is active, FireConnect resolves credentials in this order:
- Explicit `--api-key` on the command
- Global `~/.fireconnect/config.json`
- `AZURE_API_KEY` environment variable
Verify routing
To undo routing for a harness, run `fireconnect <harness> off`. This restores your previous config from the snapshot in `~/.fireconnect/<harness>/`.
CLI design
FireConnect uses harness-first syntax: `fireconnect <harness> <command>`
Bare harness names run `on` (for example, `fireconnect claude` is the same as `fireconnect claude on`).
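The harness-first rule can be illustrated with a short Python sketch. It is not the CLI's real parser: `parse_command` is a hypothetical function, and treating any non-harness first word (such as `configure` or `help`) as a global command is an assumption made for the example.

```python
from typing import List, Optional, Tuple

# Harness names as they appear in this guide.
HARNESSES = {"claude", "opencode", "codex", "pi", "cursor", "vscode"}


def parse_command(argv: List[str]) -> Tuple[Optional[str], str, List[str]]:
    """Hypothetical parser: return (harness, command, remaining arguments)."""
    if not argv:
        raise ValueError("expected a harness name or a global command")
    head, rest = argv[0], argv[1:]
    if head not in HARNESSES:
        # Assumption: anything else is a global command (configure, help).
        return None, head, rest
    if not rest:
        # A bare harness name behaves like "<harness> on".
        return head, "on", []
    return head, rest[0], rest[1:]
```

With this sketch, `parse_command(["claude"])` and `parse_command(["claude", "on"])` yield the same tuple, mirroring the equivalence stated above.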
Global commands
Providers
FireConnect supports two inference backends. The table lists each provider by its `--provider` value.

| Provider flag | Where inference runs | API key | Supported harnesses |
|---|---|---|---|
| `fireworks` (default) | Fireworks gateway | `fw_...` or `fpk_...` | Claude Code, OpenCode, Codex, Pi, Cursor, VS Code |
| `azure` | Fireworks on Microsoft Foundry | Azure API key | OpenCode, Codex, Pi |

Select the provider with `fireconnect configure --provider fireworks` or `--provider azure`. For Foundry, also pass `--base-url` with your resource endpoint. Harness `on` commands use the configured provider unless you pass `--azure` or per-command `--base-url` / `--api-key` overrides.
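The provider and harness compatibility above can be captured as a lookup, which may be handy in a wrapper script that validates a combination before calling the CLI. The Python below is a sketch: `SUPPORTED_HARNESSES` and `is_supported` are hypothetical names, and the harness identifiers are the CLI names used elsewhere in this guide (`claude`, `opencode`, `codex`, `pi`, `cursor`, `vscode`).

```python
# Provider flag -> CLI harness names that can route through it,
# transcribed from the providers table in this guide.
SUPPORTED_HARNESSES = {
    "fireworks": {"claude", "opencode", "codex", "pi", "cursor", "vscode"},
    "azure": {"opencode", "codex", "pi"},
}


def is_supported(provider: str, harness: str) -> bool:
    """Hypothetical check: can this harness route through this provider?"""
    if provider not in SUPPORTED_HARNESSES:
        raise ValueError(f"unknown provider: {provider!r}")
    return harness in SUPPORTED_HARNESSES[provider]
```

For example, `is_supported("azure", "claude")` is false because Claude Code does not support Fireworks on Microsoft Foundry.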
Per-harness commands
Each CLI harness (`claude`, `opencode`, `codex`, `pi`) supports:
- `fireconnect <harness> on`: Route through the configured provider
- `fireconnect <harness> off`: Restore your previous config
- `fireconnect <harness> status`: Show provider, auth, and models
- `fireconnect <harness> model list`: Browse Fireworks serverless models (direct Fireworks routing only)
- `fireconnect <harness> model select`: Pick a serverless model interactively (direct Fireworks routing only)
- `fireconnect <harness> model reset`: Reset models to defaults
- `fireconnect <harness> help`: Harness-specific help
Each IDE harness (`cursor`, `vscode`) supports:
- `fireconnect <harness> on`: Configure direct Fireworks routing (not Foundry; quit the IDE first)
- `fireconnect <harness> off`: Restore your previous config (quit the IDE first)
- `fireconnect <harness> status`: Show provider, auth, and registered models (read-only; IDE can stay open)
- `fireconnect <harness> model list`: Browse Fireworks serverless models (read-only)
- `fireconnect <harness> model add <id>`: Add a Fireworks model to the IDE picker
- `fireconnect <harness> model select`: Pick a model interactively (Cursor: pass `--mode`)
- `fireconnect <harness> model reset`: Reset fireconnect-managed models to defaults
- `fireconnect <harness> help`: Harness-specific help
Cursor stores its settings in a local database (`state.vscdb`). Commands that write to that database (`on`, `off`, `model add`, `model select`, `model reset`) require Cursor to be fully quit first. VS Code stores the API key in `state.vscdb` as well, so `on` and `off` require quitting VS Code; `model add`, `model select`, and `model reset` only edit `chatLanguageModels.json`, which VS Code hot-reloads.
With Fireworks on Microsoft Foundry (`--provider azure`), pass your Foundry deployment name to `on` with `--main` instead of using `model list` or `model select`.
Run `fireconnect help` for the overview, or `fireconnect claude help` (and similarly for other harnesses) for harness-level options.
API key resolution
When a command needs credentials, FireConnect resolves them based on the active provider.
Direct Fireworks routing (`--provider fireworks`)
- Explicit `--api-key`
- Harness-local stored key (for example, in `~/.claude/settings.json`)
- Global `~/.fireconnect/config.json`
- `FIREWORKS_API_KEY` environment variable
Fireworks on Microsoft Foundry (`--provider azure`)
- Explicit `--api-key`
- Global `~/.fireconnect/config.json`
- `AZURE_API_KEY` environment variable
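The two lookup chains above can be sketched as a first-match search. The Python below is illustrative only: `resolve_api_key` and its parameters are hypothetical, it assumes both lists are priority-ordered from top to bottom, and it takes the stored values as plain arguments instead of reading any real config file.

```python
import os
from typing import Mapping, Optional

# Environment variable consulted for each provider, per the lists above.
ENV_VAR = {"fireworks": "FIREWORKS_API_KEY", "azure": "AZURE_API_KEY"}


def resolve_api_key(
    provider: str,
    explicit: Optional[str] = None,      # value of --api-key, if given
    harness_key: Optional[str] = None,   # harness-local stored key
    global_key: Optional[str] = None,    # key from the global config file
    env: Optional[Mapping[str, str]] = None,
) -> Optional[str]:
    """Hypothetical resolver: return the first credential that is set."""
    env = os.environ if env is None else env
    candidates = [explicit]
    if provider == "fireworks":
        # Only direct Fireworks routing lists a harness-local key.
        candidates.append(harness_key)
    candidates.append(global_key)
    candidates.append(env.get(ENV_VAR[provider]))
    for value in candidates:
        if value:
            return value
    return None
```

Passing `env` explicitly keeps the sketch easy to test; by default it falls back to the process environment.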
Recommended models
These serverless model short IDs apply to direct Fireworks routing and expand to full Fireworks paths automatically. With Fireworks on Microsoft Foundry, pass your deployment name instead (for example, `FW-GLM-5.1` or `FW-MiniMax-M2.5`) via `--main`.

| Short ID | Best for | Notes |
|---|---|---|
| `glm-latest` | All-around use, agentic tasks | Default for main and opus slots in Claude Code. Strong reasoning, 1M context. |
| `glm-5p1` | General use (lighter) | Default sonnet slot in Claude Code. Good balance of speed and quality. |
| `deepseek-v4-flash` | Background / fast tasks | Default haiku and subagent slots in Claude Code. Lowest latency. |
Fire Pass keys
Fire Pass keys (`fpk_...`) default all slots to `glm-latest`. The model browser shows Fire Pass-supported models: `glm-latest`, `kimi-fast-latest`, and `kimi-k2p7-code-fast`.
Fire Pass keys work with direct Fireworks routing only. Use an Azure API key with `--provider azure`.
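To make the key-dependent defaults concrete, here is a Python sketch that picks Claude Code slot defaults from the key prefix. The function names and the `"fire_pass"` / `"fireworks"` labels are hypothetical; the slot names and model IDs come from the recommended models table and the Fire Pass note above, and detecting the key type purely by prefix is an assumption.

```python
SLOTS = ("main", "opus", "sonnet", "haiku", "subagent")

# Claude Code defaults for standard Fireworks keys, per the table above.
STANDARD_DEFAULTS = {
    "main": "glm-latest",
    "opus": "glm-latest",
    "sonnet": "glm-5p1",
    "haiku": "deepseek-v4-flash",
    "subagent": "deepseek-v4-flash",
}


def key_type(api_key: str) -> str:
    """Hypothetical classifier: tell key types apart by their prefix."""
    if api_key.startswith("fpk_"):
        return "fire_pass"
    if api_key.startswith("fw_"):
        return "fireworks"
    raise ValueError("expected a key starting with fw_ or fpk_")


def default_slots(api_key: str) -> dict:
    """Return the default model short ID for each slot."""
    if key_type(api_key) == "fire_pass":
        # Fire Pass keys default every slot to glm-latest.
        return {slot: "glm-latest" for slot in SLOTS}
    return dict(STANDARD_DEFAULTS)
```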
Migration from earlier syntax
FireConnect uses harness-first commands. If you have older docs or scripts from pre-0.5.0 releases, update them:

| Before | After |
|---|---|
| `fireconnect on` | `fireconnect claude on` |
| `fireconnect off` | `fireconnect claude off` |
| `fireconnect status` | `fireconnect claude status` |
| `fireconnect list` | `fireconnect claude status` |
| `fireconnect set --main <id>` | `fireconnect claude on --main <id>` |
| `fireconnect reset` | `fireconnect claude model reset` |
| `fireconnect on --harness opencode` | `fireconnect opencode on` |
| `fireconnect model list` | `fireconnect <harness> model list` |
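If you have many old invocations to update, the table can be applied mechanically. The Python below is a rough sketch and not a tool shipped with FireConnect: `migrate` is a hypothetical helper, it assumes `--harness <name>` generalizes beyond the `opencode` row shown, and it leaves `fireconnect model list` untouched because the new form needs a harness that only you can choose.

```python
import re

# Old command prefix -> new prefix, transcribed from the migration table.
PREFIX_REWRITES = [
    ("fireconnect set --main", "fireconnect claude on --main"),
    ("fireconnect status", "fireconnect claude status"),
    ("fireconnect list", "fireconnect claude status"),
    ("fireconnect reset", "fireconnect claude model reset"),
    ("fireconnect off", "fireconnect claude off"),
    ("fireconnect on", "fireconnect claude on"),
]

# Assumption: any harness name may follow --harness, not only opencode.
HARNESS_FLAG = re.compile(r"^fireconnect on --harness (\S+)(.*)$")


def migrate(command: str) -> str:
    """Hypothetical helper: rewrite one pre-0.5.0 command line."""
    match = HARNESS_FLAG.match(command)
    if match:
        return f"fireconnect {match.group(1)} on{match.group(2)}"
    for old, new in PREFIX_REWRITES:
        if command == old or command.startswith(old + " "):
            return new + command[len(old):]
    # Already harness-first, or needs a manual decision; leave unchanged.
    return command
```

Commands that are already harness-first pass through unchanged, so the helper can be run over a script more than once.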