Pinned
LM Studio
2,592 posts
Discover and run open models 👾 we are hiring lmstudio.ai/careers
- For WWDC, we worked with Apple to run Kimi K2.6, a 1T-parameter model, across a cluster of four Mac Studios using a preview version of LM Studio. We showcased secure remote access from a MacBook Neo and iPhone using LM Link. A glimpse of your own private, frontier-scale AI.
- LM Studio repostedLM Studio on stage for WWDC at the Steve Jobs Theater demonstrating MLX distributed on 4 Mac Studio! Coming later this year @lmstudio 👾
- LM Studio repostedWe made @lmstudio's MLX Engine a lot faster in the latest release. Read the technical deep dive from @ostensiblyneil. P.S. it's all open source!
- Gemma 4 QAT is here. Available for all sizes of Gemma 4, optimized with Quantization-Aware Training (QAT) to reduce memory requirements while preserving performance. Live now in LM Studio.
- Replying to @lmstudioLocally is now LM Studio’s mobile app. And today we're bringing LM Link to iPhone. Use your largest local models over a secure, end-to-end encrypted connection, anywhere you go. Download the app now:
- Gemma 4 model load issues fixed in engine version 2.20.1. lms runtime update --allGemma 4 12B is here! Dense, mid-sized Gemma that fits right on your laptop - released by @Google under Apache 2.0 Available now in LM Studio lmstudio.ai/models/google/…
- Make sure to update your runtime first! > lms runtime update --all Learn more about this model releaseMeet Gemma 4 12B! A unified, encoder-free multimodal model designed to bring high-performance intelligence directly to your laptop, and released under an Apache 2.0 license. Bridging the gap between edge efficiency and advanced reasoning. Here is what’s new with Gemma 4 12B: 👇
- MTP means Multi Token Prediction. It's a speculative decoding technique that can result in large inference speedups in many cases. 1. Update to LM Studio 0.4.14 2. Download a model that supports MTP like Qwen3.6-35B-A3B-MTP-GGUF or Qwen3.6-27B-MTP-GGUF 3. Enable it when loading
- LM Studio repostedSubagents running locally and simultaneously on MacBook Pro M5 with Codex CLI + @lmstudio to review code and find bugs using Qwen 3.6 Powered by the updated MLX engine with batching in beta in the app The batching speed boost is noticeable
00:00

















