Dense LLM from IBM supporting up to 128K context length, trained on 12T tokens. Suitable for general instructions following and can be used to build AI assistants.
ollama pull granite3.1:8b
State-of-the-art image + text input models from Google, built from the same research and tech used to create the Gemini models.
ollama pull gemma3:4b
The mid-sized option of the Gemma 2 model family. Built by Google, using from the same research and technology used to create the Gemini models.
ollama pull gemma2:9b
The large option of the Gemma 2 model family. Built by Google, using from the same research and technology used to create the Gemini models.
ollama pull gemma2:27b
Devstral by MistralAI is based on Mistral Small 3.1. Debuts as the #1 open source model on SWE-bench.
ollama pull devstral-small:2505
A 7B Vision Language Model (VLM) from the Qwen2.5 family.
ollama pull qwen2.5-vl:7b
Advanced open-weight reasoning model, finetuned from Phi-4 with additional reinforcement learning for higher accuracy.
ollama pull phi4:14b
Lightweight open model from the Phi-4 family.
ollama pull phi4-mini:3.8b
State-of-the-art image + text input models from Google, built from the same research and tech used to create the Gemini models.
ollama pull gemma3:27b
State-of-the-art image + text input models from Google, built from the same research and tech used to create the Gemini models. Smallest model in the Gemma 3 family — runs anywhere.
ollama pull gemma3:1b
Meta's lightweight multilingual model. Text-only, great for on-device and edge deployment. Supports 8 languages.
ollama pull llama3.2:3b
Meta's flagship 8B. Multilingual, strong reasoning, tool use. The most popular open model for general tasks.
ollama pull llama3.1:8b
Meta's most powerful open model. Near GPT-4 class on reasoning, coding, and instruction following.
ollama pull llama3.3:70b
Mistral AI's groundbreaking 7B. Outperforms larger models on reasoning. Excellent for fine-tuning.
ollama pull mistral:7b
Mistral's Mixture-of-Experts. 8 experts x 7B, activates 2 per token. 46B performance at 12B inference cost.
ollama pull mixtral:8x7b
DeepSeek's reasoning model with chain-of-thought. Shows thinking process. Strong on math, code, logic.
ollama pull deepseek-r1:8b
DeepSeek-R1 distilled to 32B (Qwen base). Strong reasoning with visible chain-of-thought. Near GPT-4o on math.
ollama pull deepseek-r1:32b
MoE code model from DeepSeek. 338 languages, 128K context. Strongest open code model for single-GPU.
ollama pull deepseek-coder-v2:16b
Alibaba's Qwen2.5 7B. 29 languages, 128K context. Great all-rounder for Asian and European languages.
ollama pull qwen2.5:7b
Alibaba's code-specialized model. Trained on 5.5T tokens of code. 92 programming languages.
ollama pull qwen2.5-coder:7b
Meta's code-specialized Llama. Python specialist with fill-in-the-middle. Code completion and generation.
ollama pull codellama:13b
Large Language and Vision Assistant. Llama 2 + vision encoder. Image understanding, OCR, visual QA.
ollama pull llava:13b
Uncensored Mixtral fine-tune. Removes refusal behaviour. Good for creative and unrestricted tasks.
ollama pull dolphin-mixtral:8x7b
HuggingFace's DPO-trained Mistral 7B. Punches above weight on chat benchmarks. Excellent conversationalist.
ollama pull zephyr:7b
C-RLFT trained Mistral 7B. Top performer on MT-Bench among 7B models. Natural conversation.
ollama pull openchat:7b
01.AI's 34B model. 3T training tokens. Strong bilingual (EN/ZH). Near GPT-3.5 performance.
ollama pull yi:34b
Cohere's flagship. RAG-optimised, tool-use native, 10 languages. Enterprise-grade reasoning.
ollama pull command-r-plus:104b
BigCode project. The Stack v2 (600+ languages). Fill-in-the-middle, code completion.
ollama pull starcoder2:15b
Defog.ai's SQL specialist. 20K+ SQL queries across diverse schemas. Text-to-SQL near GPT-4 accuracy.
ollama pull sqlcoder:7b
Microsoft's MoE model. Evolved training on complex instructions. Top reasoning, multilingual, coding.
ollama pull wizardlm2:8x22b
Nous Research fine-tune on 1M+ instructions. Structured output, function calling, JSON mode. ChatML format.
ollama pull nous-hermes2:mixtral
Databricks' MoE flagship. 16 experts, 4 active. Strong at structured data, SQL, Python, reasoning.
ollama pull dbrx:instruct
OpenBMB's vision model. Strong OCR, image understanding, bilingual (EN/ZH). Runs on edge devices.
ollama pull minicpm-v:8b
TII's latest Falcon. 14T training tokens. Strong multilingual (EN, FR, ES, PT). Function calling, code.
ollama pull falcon3:10b
Nomic AI's embedding model. #1 on MTEB among open models. 768-dim vectors. Perfect for RAG and search.
ollama pull nomic-embed-text
Mixedbread AI's embedding. 1024-dim vectors, MTEB leader. Best open embedding for retrieval tasks.
ollama pull mxbai-embed-large