Getting Started

Tip: the Get Started with SideCar walkthrough auto-opens in VS Code’s Welcome editor the first time you install SideCar. It walks through the five essentials in under a minute and auto-checks off each step as you run the relevant command. Reopen it any time with SideCar: Open Walkthrough from the Command Palette.

Install Ollama

Download and install Ollama for your platform. After installation, verify it’s working:

ollama --version

Pull the recommended model (SideCar’s default):

ollama pull gemma4:e4b

Install SideCar

Install from the VS Code Marketplace, or search for “SideCar” in the VS Code Extensions panel (Cmd+Shift+X / Ctrl+Shift+X).

Your first chat

Click the SideCar icon in the activity bar (left sidebar) or press ⌘⇧I (Ctrl+Shift+I on Windows/Linux) to toggle the chat panel.
The empty-state welcome card shows you the active model, three quick-action buttons, and four starter prompts you can click to pre-fill the input.
Type a message in the chat input or click a starter prompt.
SideCar auto-launches Ollama if it’s not already running.

SideCar will automatically include your active file and workspace context in the conversation.

Discover every action

Press ⌘⇧P (Ctrl+Shift+P on Windows/Linux) and type SideCar: — every user-facing action is in the palette with a consistent prefix and icon. Useful ones to remember:

SideCar: Toggle Chat — open or hide the chat sidebar
SideCar: Inline Chat — inline edit prompt in the editor (⌘I)
SideCar: Select Model — keyboard-first model picker
SideCar: Set / Refresh API Key — paste or rotate your key
SideCar: Switch Backend — flip between Ollama / Anthropic / OpenAI / Kickstand
SideCar: Show Session Spend — $ breakdown for the current session on paid backends
SideCar: Open Walkthrough — reopen the getting-started page

Code actions on diagnostics — when VS Code shows a red or yellow squiggle on a line, press ⌘. / Ctrl+. and you’ll see Fix with SideCar and Explain this error with SideCar alongside the built-in Quick Fix suggestions. Refactor with SideCar appears in the Refactor submenu on any selection.

Choosing a model

Use the model dropdown at the top of the chat panel to browse and switch models. Models are organized into two categories:

Full Features (Tools) — models that support function calling (e.g., qwen3-coder, llama3.1, command-r)
Chat-Only — models that work for conversation but can’t use tools (e.g., gemma2, llama2, mistral)

You can also install new models directly from the dropdown — SideCar will pull them from the Ollama registry.

The default model is gemma4:e4b (~10 GB VRAM). For lighter hardware, try ministral-3:latest (6 GB), gemma4:2b, or qwen2.5-coder:7b.

Switching backends

The fastest way to move between Ollama, Anthropic, OpenAI, and Kickstand is the ⚙ gear button in the chat header. It opens a settings menu with a Backend section — pick a profile and SideCar flips baseUrl, provider, and model in one click. Each profile keeps its own API key in VS Code’s SecretStorage, so switching doesn’t clobber keys you’ve already set. The same flow is available from the Command Palette as SideCar: Switch Backend.

The sections below describe each backend in detail and also cover the manual settings path if you prefer editing settings.json.

Using the Anthropic API

Recommended: click the ⚙ gear → Anthropic Claude. SideCar prompts for your API key on first switch, saves it to the Anthropic-specific SecretStorage slot, and sets baseUrl / provider / model for you.

Manual:

Set sidecar.baseUrl to https://api.anthropic.com
Run SideCar: Set / Refresh API Key from the command palette and paste your Anthropic API key
Set sidecar.model to a Claude model (e.g., claude-sonnet-4-6)

SideCar uses prompt caching with Anthropic, reducing input token costs by ~90% on cache hits.

Note: the Anthropic API is a separate paid service from Claude.ai subscriptions — your Max or Pro plan does not include API credits. Get a key at platform.claude.com.

Using Kickstand

Kickstand is a self-hosted local inference server that manages model loading, unloading, and GPU memory efficiently. CLI command: kick.

Start the server with kick start.
Recommended: click the ⚙ gear → Kickstand in the chat header. Manual: set sidecar.baseUrl to http://localhost:11435 (default Kickstand port).
Set sidecar.model to the model you want to use.

SideCar auto-detects Kickstand by the port number and reads the bearer token from ~/.config/kickstand/token automatically — no API key prompt or settings plumbing needed.

Using OpenAI-compatible servers

SideCar works with any server that exposes the OpenAI /v1/chat/completions endpoint — including LM Studio, vLLM, llama.cpp, text-generation-webui, and OpenRouter.

Set sidecar.baseUrl to your server’s URL (e.g., http://localhost:1234)
Set sidecar.apiKey if required (optional for most local servers)
Set sidecar.model to the model name your server is running

SideCar auto-detects the provider from the URL. If auto-detection gets it wrong, set sidecar.provider explicitly:

"sidecar.provider": "openai"

Provider examples

Server	Base URL	Notes
LM Studio	`http://localhost:1234`	Auto-detected as OpenAI
vLLM	`http://localhost:8000`	Auto-detected as OpenAI
llama.cpp	`http://localhost:8080`	Auto-detected as OpenAI
OpenRouter	`https://openrouter.ai/api`	Set API key, access 400+ models
text-generation-webui	`http://localhost:5000`	Enable OpenAI extension in the UI
Kickstand	`http://localhost:11435`	Auto-detected as Kickstand; token read from `~/.config/kickstand/token`
Ollama	`http://localhost:11434`	Auto-detected as Ollama (native API)
Anthropic	`https://api.anthropic.com`	Auto-detected as Anthropic

Tool support

Most OpenAI-compatible models support function calling (tool use), which enables SideCar’s full agentic capabilities — file editing, shell commands, git operations, etc. If a model doesn’t support tools, SideCar falls back to chat-only mode automatically.

Keyboard shortcuts

Shortcut	Action
`Cmd+Shift+I` / `Ctrl+Shift+I`	Toggle SideCar chat panel
`Cmd+I` / `Ctrl+I`	Inline chat (edit code in place)
`Cmd+L` / `Ctrl+L`	Clear chat
`Cmd+Shift+U` / `Ctrl+Shift+U`	Undo all AI changes
`Cmd+Shift+E` / `Ctrl+Shift+E`	Export chat as Markdown