Getting Started
Getting Started
Tip: the Get Started with SideCar walkthrough auto-opens in VS Code’s Welcome editor the first time you install SideCar. It walks through the five essentials in under a minute and auto-checks off each step as you run the relevant command. Reopen it any time with
SideCar: Open Walkthroughfrom the Command Palette.
Install Ollama
Download and install Ollama for your platform. After installation, verify it’s working:
ollama --version
Pull a recommended model:
ollama pull qwen3-coder:30b
Install SideCar
Install from the VS Code Marketplace, or search for “SideCar” in the VS Code Extensions panel (Cmd+Shift+X / Ctrl+Shift+X).
Your first chat
- Click the SideCar icon in the activity bar (left sidebar) or press
⌘⇧I(Ctrl+Shift+Ion Windows/Linux) to toggle the chat panel. - The empty-state welcome card shows you the active model, three quick-action buttons, and four starter prompts you can click to pre-fill the input.
- Type a message in the chat input or click a starter prompt.
- SideCar auto-launches Ollama if it’s not already running.
SideCar will automatically include your active file and workspace context in the conversation.
Discover every action
Press ⌘⇧P (Ctrl+Shift+P on Windows/Linux) and type SideCar: — every user-facing action is in the palette with a consistent prefix and icon. Useful ones to remember:
SideCar: Toggle Chat— open or hide the chat sidebarSideCar: Inline Chat— inline edit prompt in the editor (⌘I)SideCar: Select Model— keyboard-first model pickerSideCar: Set / Refresh API Key— paste or rotate your keySideCar: Switch Backend— flip between Ollama / Anthropic / OpenAI / KickstandSideCar: Show Session Spend—$breakdown for the current session on paid backendsSideCar: Open Walkthrough— reopen the getting-started page
Code actions on diagnostics — when VS Code shows a red or yellow squiggle on a line, press ⌘. / Ctrl+. and you’ll see Fix with SideCar and Explain this error with SideCar alongside the built-in Quick Fix suggestions. Refactor with SideCar appears in the Refactor submenu on any selection.
Choosing a model
Use the model dropdown at the top of the chat panel to browse and switch models. Models are organized into two categories:
- Full Features (Tools) — models that support function calling (e.g.,
qwen3-coder,llama3.1,command-r) - Chat-Only — models that work for conversation but can’t use tools (e.g.,
gemma2,llama2,mistral)
You can also install new models directly from the dropdown — SideCar will pull them from the Ollama registry.
The default model is qwen3-coder:30b. For machines with less RAM, try qwen3-coder:8b or qwen2.5-coder:7b.
Switching backends
The fastest way to move between Ollama, Anthropic, OpenAI, and Kickstand is the ⚙ gear button in the chat header. It opens a settings menu with a Backend section — pick a profile and SideCar flips baseUrl, provider, and model in one click. Each profile keeps its own API key in VS Code’s SecretStorage, so switching doesn’t clobber keys you’ve already set. The same flow is available from the Command Palette as SideCar: Switch Backend.
The sections below describe each backend in detail and also cover the manual settings path if you prefer editing settings.json.
Using the Anthropic API
Recommended: click the ⚙ gear → Anthropic Claude. SideCar prompts for your API key on first switch, saves it to the Anthropic-specific SecretStorage slot, and sets baseUrl / provider / model for you.
Manual:
- Set
sidecar.baseUrltohttps://api.anthropic.com - Run
SideCar: Set / Refresh API Keyfrom the command palette and paste your Anthropic API key - Set
sidecar.modelto a Claude model (e.g.,claude-sonnet-4-6)
SideCar uses prompt caching with Anthropic, reducing input token costs by ~90% on cache hits.
Note: the Anthropic API is a separate paid service from Claude.ai subscriptions — your Max or Pro plan does not include API credits. Get a key at platform.claude.com.
Using Kickstand
Kickstand is a self-hosted local inference server that manages model loading, unloading, and GPU memory efficiently. CLI command: kick.
- Start the server with
kick start. - Recommended: click the ⚙ gear → Kickstand in the chat header.
Manual: set
sidecar.baseUrltohttp://localhost:11435(default Kickstand port). - Set
sidecar.modelto the model you want to use.
SideCar auto-detects Kickstand by the port number and reads the bearer token from ~/.config/kickstand/token automatically — no API key prompt or settings plumbing needed.
Using OpenAI-compatible servers
SideCar works with any server that exposes the OpenAI /v1/chat/completions endpoint — including LM Studio, vLLM, llama.cpp, text-generation-webui, and OpenRouter.
- Set
sidecar.baseUrlto your server’s URL (e.g.,http://localhost:1234) - Set
sidecar.apiKeyif required (optional for most local servers) - Set
sidecar.modelto the model name your server is running
SideCar auto-detects the provider from the URL. If auto-detection gets it wrong, set sidecar.provider explicitly:
"sidecar.provider": "openai"
Provider examples
| Server | Base URL | Notes |
|---|---|---|
| LM Studio | http://localhost:1234 |
Auto-detected as OpenAI |
| vLLM | http://localhost:8000 |
Auto-detected as OpenAI |
| llama.cpp | http://localhost:8080 |
Auto-detected as OpenAI |
| OpenRouter | https://openrouter.ai/api |
Set API key, access 400+ models |
| text-generation-webui | http://localhost:5000 |
Enable OpenAI extension in the UI |
| Kickstand | http://localhost:11435 |
Auto-detected as Kickstand; token read from ~/.config/kickstand/token |
| Ollama | http://localhost:11434 |
Auto-detected as Ollama (native API) |
| Anthropic | https://api.anthropic.com |
Auto-detected as Anthropic |
Tool support
Most OpenAI-compatible models support function calling (tool use), which enables SideCar’s full agentic capabilities — file editing, shell commands, git operations, etc. If a model doesn’t support tools, SideCar falls back to chat-only mode automatically.
Keyboard shortcuts
| Shortcut | Action |
|---|---|
Cmd+Shift+I / Ctrl+Shift+I |
Toggle SideCar chat panel |
Cmd+I / Ctrl+I |
Inline chat (edit code in place) |
Cmd+L / Ctrl+L |
Clear chat |
Cmd+Shift+U / Ctrl+Shift+U |
Undo all AI changes |
Cmd+Shift+E / Ctrl+Shift+E |
Export chat as Markdown |