Ollama
Ollama
Section titled “Ollama”Ollama is a local LLM runtime that makes it easy to run open-source models on your machine. CoderClaw integrates with Ollama’s native API (/api/chat), supporting streaming and tool calling, and can auto-discover tool-capable models when you opt in with OLLAMA_API_KEY (or an auth profile) and do not define an explicit models.providers.ollama entry.
Quick start
Section titled “Quick start”-
Install Ollama: https://ollama.ai
-
Pull a model:
ollama pull gpt-oss:20b# orollama pull llama3.3# orollama pull qwen2.5-coder:32b# orollama pull deepseek-r1:32b- Enable Ollama for CoderClaw (any value works; Ollama doesn’t require a real key):
# Set environment variableexport OLLAMA_API_KEY="ollama-local"
# Or configure in your config filecoderclaw config set models.providers.ollama.apiKey "ollama-local"- Use Ollama models:
{ agents: { defaults: { model: { primary: "ollama/gpt-oss:20b" }, }, },}Model discovery (implicit provider)
Section titled “Model discovery (implicit provider)”When you set OLLAMA_API_KEY (or an auth profile) and do not define models.providers.ollama, CoderClaw discovers models from the local Ollama instance at http://127.0.0.1:11434:
- Queries
/api/tagsand/api/show - Keeps only models that report
toolscapability - Marks
reasoningwhen the model reportsthinking - Reads
contextWindowfrommodel_info["<arch>.context_length"]when available - Sets
maxTokensto 10× the context window - Sets all costs to
0
This avoids manual model entries while keeping the catalog aligned with Ollama’s capabilities.
To see what models are available:
ollama listcoderclaw models listTo add a new model, simply pull it with Ollama:
ollama pull mistralThe new model will be automatically discovered and available to use.
If you set models.providers.ollama explicitly, auto-discovery is skipped and you must define models manually (see below).
Configuration
Section titled “Configuration”Basic setup (implicit discovery)
Section titled “Basic setup (implicit discovery)”The simplest way to enable Ollama is via environment variable:
export OLLAMA_API_KEY="ollama-local"Explicit setup (manual models)
Section titled “Explicit setup (manual models)”Use explicit config when:
- Ollama runs on another host/port.
- You want to force specific context windows or model lists.
- You want to include models that do not report tool support.
{ models: { providers: { ollama: { baseUrl: "http://ollama-host:11434", apiKey: "ollama-local", api: "ollama", models: [ { id: "gpt-oss:20b", name: "GPT-OSS 20B", reasoning: false, input: ["text"], cost: { input: 0, output: 0, cacheRead: 0, cacheWrite: 0 }, contextWindow: 8192, maxTokens: 8192 * 10 } ] } } }}If OLLAMA_API_KEY is set, you can omit apiKey in the provider entry and CoderClaw will fill it for availability checks.
Custom base URL (explicit config)
Section titled “Custom base URL (explicit config)”If Ollama is running on a different host or port (explicit config disables auto-discovery, so define models manually):
{ models: { providers: { ollama: { apiKey: "ollama-local", baseUrl: "http://ollama-host:11434", }, }, },}Model selection
Section titled “Model selection”Once configured, all your Ollama models are available:
{ agents: { defaults: { model: { primary: "ollama/gpt-oss:20b", fallbacks: ["ollama/llama3.3", "ollama/qwen2.5-coder:32b"], }, }, },}Advanced
Section titled “Advanced”Reasoning models
Section titled “Reasoning models”CoderClaw marks models as reasoning-capable when Ollama reports thinking in /api/show:
ollama pull deepseek-r1:32bModel Costs
Section titled “Model Costs”Ollama is free and runs locally, so all model costs are set to $0.
Streaming Configuration
Section titled “Streaming Configuration”CoderClaw’s Ollama integration uses the native Ollama API (/api/chat) by default, which fully supports streaming and tool calling simultaneously. No special configuration is needed.
Legacy OpenAI-Compatible Mode
Section titled “Legacy OpenAI-Compatible Mode”If you need to use the OpenAI-compatible endpoint instead (e.g., behind a proxy that only supports OpenAI format), set api: "openai-completions" explicitly:
{ models: { providers: { ollama: { baseUrl: "http://ollama-host:11434/v1", api: "openai-completions", apiKey: "ollama-local", models: [...] } } }}Note: The OpenAI-compatible endpoint may not support streaming + tool calling simultaneously. You may need to disable streaming with params: { streaming: false } in model config.
Context windows
Section titled “Context windows”For auto-discovered models, CoderClaw uses the context window reported by Ollama when available, otherwise it defaults to 8192. You can override contextWindow and maxTokens in explicit provider config.
Troubleshooting
Section titled “Troubleshooting”Ollama not detected
Section titled “Ollama not detected”Make sure Ollama is running and that you set OLLAMA_API_KEY (or an auth profile), and that you did not define an explicit models.providers.ollama entry:
ollama serveAnd that the API is accessible:
curl http://localhost:11434/api/tagsNo models available
Section titled “No models available”CoderClaw only auto-discovers models that report tool support. If your model isn’t listed, either:
- Pull a tool-capable model, or
- Define the model explicitly in
models.providers.ollama.
To add models:
ollama list # See what's installedollama pull gpt-oss:20b # Pull a tool-capable modelollama pull llama3.3 # Or another modelConnection refused
Section titled “Connection refused”Check that Ollama is running on the correct port:
# Check if Ollama is runningps aux | grep ollama
# Or restart Ollamaollama serveSee Also
Section titled “See Also”- Model Providers - Overview of all providers
- Model Selection - How to choose models
- Configuration - Full config reference