What's Changed
Bug fixes
- fix(distributed): cascade-clean stale node_models rows + filter routing by healthy status by @localai-bot in #9754
- fix(http): honor X-Forwarded-Prefix when proxy strips the prefix by @Dennisadira in #9614
- fix(agentpool): close truncate-then-read race in agent_jobs.json persistence by @localai-bot in #9811
- fix(middleware): parse OpenAI-spec tool_choice in /v1/chat/completions by @Anai-Guo in #9559
Exciting New Features
- feat: also parse VRAM budget/usage from vulkaninfo by @eglia in #9800
- feat(realtime): Add Liquid Audio s2s model and assistant mode on talk page by @richiejp in #9801
Other Changes
- chore: Update ggml-org/llama.cpp to
a9883db8ee021cf16783016a60996d41820b5195by @localai-bot in #9796 - chore: Update TheTom/llama-cpp-turboquant to
5aeb2fdbe26cd4c534c6fa15de73cb5749bd0403by @localai-bot in #9740 - docs: update docs version mudler/LocalAI by @localai-bot in #9805
- chore: Update antirez/ds4 to
0cba357ca1bc0e7510421cc26888e420ea942123by @localai-bot in #9806 - chore: Update ikawrakow/ik_llama.cpp to
949bb8f1d660fc1264c137a6f3dbd619375f6134by @localai-bot in #9807 - chore: Update ggml-org/whisper.cpp to
3e9b7d0fef3528ee2208da3cdb873a2c53d2ae2fby @localai-bot in #9808 - ci(image): publish missing :latest-* and :v-* singleton image tags by @localai-bot in #9812
Full Changelog: v4.2.3...v4.2.4


