gpt-oss:120b-cloud
is an excellent general-purpose cloud model for most tasks.llama3.3
models are good for most basic use-cases.qwen
models perform specifically well with tool use.deepseek-r1
models have strong reasoning capabilities.phi4
models are powerful, while being really small in size.
Authentication (Ollama Cloud Only)
To use Ollama Cloud, set yourOLLAMA_API_KEY
environment variable. You can get an API key from Ollama Cloud.
https://ollama.com
. For local usage, no API key is required.
Set up a model
Local Usage
Install ollama and run a model:run model
pull model
Cloud Usage
For Ollama Cloud, no local Ollama server installation is required. Install the Ollama library, set up your API key as described in the Authentication section above, and access cloud-hosted models directly.Examples
Local Usage
Once the model is available locally, use theOllama
model class to access it:
Cloud Usage
When using Ollama Cloud with an API key, the host is automatically set to
https://ollama.com
. You can omit the host
parameter. View more examples here.
Params
Parameter | Type | Default | Description |
---|---|---|---|
id | str | "llama3.2" | The name of the Ollama model to use |
name | str | "Ollama" | The name of the model |
provider | str | "Ollama" | The provider of the model |
host | str | "http://localhost:11434" | The host URL for the Ollama server |
timeout | Optional[int] | None | Request timeout in seconds |
format | Optional[str] | None | The format to return the response in (e.g., “json”) |
options | Optional[Dict[str, Any]] | None | Additional model options (temperature, top_p, etc.) |
keep_alive | Optional[Union[float, str]] | None | How long to keep the model loaded (e.g., “5m”, 3600 seconds) |
template | Optional[str] | None | The prompt template to use |
system | Optional[str] | None | System message to use |
raw | Optional[bool] | None | Whether to return raw response without formatting |
stream | bool | True | Whether to stream the response |
Ollama
is a subclass of the Model class and has access to the same params.