what stackpulse tracks
Ollama releases from GitHub
StackPulse watches Ollama release notes and keeps the original source link close to every summary.
Get up and running with large language models locally. StackPulse turns upstream changelogs into scannable summaries with risky changes, deprecations, migration notes, and source links.
upgrade risk
Risky changes are separated from normal feature notes so you can scan upgrade impact before changing production dependencies.
migration notes
Migration steps and recommended actions are only shown when the upstream release notes support them.
This release adds automatic installation of Claude Code and OpenCode, improves Vulkan GPU classification on Windows hybrid graphics systems, adjusts speculative decoding in MLXRunner, and updates the documentation.
Users who rely on automatic installation of these tools or who run Windows hybrid graphics systems will benefit from these changes.
Update to take advantage of the new automatic installation and the improved Windows graphics handling.
This release focuses on improving auto-install capabilities, fixing GPU classification issues, and enhancing model performance with speculative decoding and memory management improvements.
Users relying on auto-install features, GPU classification, or memory management will benefit from these changes.
Update to this version if you use Claude Code or opencode, or if you are affected by the GPU classification issues.
This release focuses on improving launch capabilities, fixing Vulkan classification issues on Windows, and enhancing CUDA support. It also introduces auto-installation features for Claude Code and opencode, along with optimizations for speculative decoding and memory management.
Users relying on Windows hybrid graphics or requiring auto-installation of Claude Code and opencode will benefit from this release.
Added support for Command A and North family models on Apple Silicon using the MLX engine. Updated the underlying llama.cpp engine.
Users running Command A or North family models on Apple Silicon hardware are affected.
Update to take advantage of improved Apple Silicon support.
This release updates the underlying llama.cpp library to version b9637, which may include performance improvements or bug fixes.
Users relying on the llama.cpp library may benefit from potential improvements or fixes.
This release introduces support for the Cohere2Moe architecture and fixes several issues, including token output limitations and LFM2 parser/render improvements.
Users working with the Cohere2Moe architecture or experiencing token output limitations are affected.
Upgrade to v0.30.9 to benefit from the new architecture support and bug fixes.
This release introduces support for the Cohere2Moe architecture and fixes several issues, including token output limitations in coding agent use cases and LFM2 parser/render improvements.
Users of coding agents or assistants, and those working with the Cohere2Moe architecture, are most affected.
Update to the latest release to benefit from new features and fixes.
This release focuses on stability improvements, particularly in MLX inference and prompt caching, along with fixes for provider selection in `ollama launch`.
Users relying on MLX inference or recurrent models may benefit from improved stability and performance.
This release introduces Hermes Desktop, a native desktop interface for the Hermes agent, providing a visual interface for managing conversations, integrations, and messaging apps. It also includes updates to the OpenAI-compatible API models list and documentation improvements.
Users of the Hermes agent who want a visual desktop interface will benefit from this release.
Run `ollama launch hermes-desktop` to start using Hermes Desktop.
This release introduces Quantization-Aware Training (QAT) optimized Gemma 4 models for reduced memory usage and improved performance. It also enhances MLX embedding layers for better quantization on Apple Silicon and integrates with Oh My Pi for AI coding assistance.
Users leveraging Gemma 4 models or Apple Silicon devices will benefit from improved performance and memory efficiency.
Update to v0.30.6 to take advantage of the new QAT-optimized Gemma 4 models and enhanced quantization on Apple Silicon.
This release fixes a critical crash issue with `gemma4:12b` on multiple platforms and improves Hermes Desktop integration, including native Windows support.
Users running `gemma4:12b` on x86, CUDA, Linux, or Windows systems are affected by the crash fix.
Update to v0.30.5 to resolve the `gemma4:12b` crash issue.
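Several of the notes above recommend updating to a specific version (v0.30.5, v0.30.6, v0.30.9). As a minimal sketch of how to check whether an installed version predates one of those releases, the helper below compares version numbers as integer tuples rather than as strings. The function names `parse_version` and `needs_update` are hypothetical, and the example assumes the installed version string contains a plain `x.y.z` number; it does not call Ollama itself.

```python
import re


def parse_version(text):
    """Return the first x.y.z version found in text as a tuple of ints, or None."""
    match = re.search(r"(\d+)\.(\d+)\.(\d+)", text)
    if match is None:
        return None
    return tuple(int(part) for part in match.groups())


def needs_update(installed_text, fixed_in):
    """Return True if the installed version is older than the release named in fixed_in.

    Both arguments are free-form strings (assumed to contain an x.y.z number),
    for example "0.30.4" and "v0.30.5".
    """
    installed = parse_version(installed_text)
    target = parse_version(fixed_in)
    if installed is None or target is None:
        raise ValueError("could not find an x.y.z version in the input")
    # Tuple comparison is numeric per component, so 0.30.10 sorts after 0.30.9.
    return installed < target


if __name__ == "__main__":
    print(needs_update("0.30.4", "v0.30.5"))
    print(needs_update("0.30.10", "v0.30.9"))
```

Comparing tuples instead of raw strings avoids the common mistake where "0.30.10" sorts before "0.30.9" alphabetically.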