👋 Jan (@jandotai) / X

👋 Jan

1,591 posts

👋 Jan

@jandotai

Jan is one agent for (almost) everything. Community: discord.gg/Exe46xPMbK

your device

Joined October 2023

Pinned
👋 Jan
@jandotai
Mar 2
Introducing Jan-Code-4B 💻 A compact coding model tuned for practical day-to-day tasks. Generation, refactors, debugging, tests — all runnable locally in Jan. Download Jan: jan.ai Model: huggingface.co/collections/ja…
00:00
111K
👋 Jan
@jandotai
Aug 12, 2025
Introducing Jan-v1: 4B model for web search, an open-source alternative to Perplexity Pro. In our evals, Jan v1 delivers 91% SimpleQA accuracy, slightly outperforming Perplexity Pro while running fully locally. Use cases: - Web search - Deep Research Built on the new version
00:00
690K
👋 Jan
@jandotai
Jun 3, 2025
Google has quietly open-sourced a full-stack research agent stack, powered by Gemini and LangGraph. It's capable of multi-step web search, reflection, and synthesis. While not confirmed to match Gemini’s production backend, it's strikingly close.
GitHub - google-gemini/gemini-fullstack-langgraph-quickstart: Get started with building Fullstack...
From github.com
165K
👋 Jan
@jandotai
Jun 4, 2025
NVIDIA just released Llama-Nemotron-Nano-VL-8B-V1, an 8B vision model that reads dense documents, charts, and video frames. It's #1 on OCRBench V2 (English), with layout and OCR fused end-to-end.
nvidia/Llama-3.1-Nemotron-Nano-VL-8B-V1 · Hugging Face
From huggingface.co
68K
👋 Jan
@jandotai
Jul 16, 2025
Microsoft releases a new dataset that improves Qwen2.5-7B from 17.4% to 57.3% on LiveCodeBench. It's called rStar-Coder, 418K tasks designed to push competitive code reasoning. A 7B model trained on it outperforms QWQ-32B on the USA Computing Olympiad.
microsoft/rStar-Coder · Datasets at Hugging Face
From huggingface.co
36K
👋 Jan
@jandotai
Aug 4, 2025
Open-source voice cloning at 16x real-time? Chatterbox TTS (0.5B Llama) now runs on vLLM, 5–10x faster than the original implementation. On a 3090: - 40 min speech in ~2.5 min - Same quality, way faster
GitHub - randombk/chatterbox-vllm: VLLM Port of the Chatterbox TTS model
From github.com
43K
👋 Jan
@jandotai
May 30, 2025
Someone got DeepSeek-R1-0528-Qwen3-8B running on an iPhone 16 Pro using MLX. It runs but takes ages to respond, and the phone gets hot fast. 8B models on phones aren't sci-fi anymore. via u/adrgrondin on r/LocalLLaMA
00:00
83K
👋 Jan
@jandotai
Aug 13, 2025
Get your free Perplexity-style search agent in 2 mins. Use Jan v1 and match the settings in this video.
00:00
42K
👋 Jan
@jandotai
Jul 14, 2025
This is interesting. One dev is training an AI from scratch on books from 1800s London. It's called TimeCapsuleLLM, not a fine-tuned modern model, but one trained entirely on historical data. No modern language or context. Built on nanoGPT by @karpathy.
GitHub - haykgrigo3/TimeCapsuleLLM: A LLM trained only on data from certain time periods to reduce...
From github.com
105K
👋 Jan
@jandotai
May 29, 2025
DeepSeek R1.1 just matched Claude Opus on Aider's polyglot benchmark - 70.7% Pass@2. Old R1 scored 56.9%, so this is a +13.8pt jump. Same test, same setup, posted by a user on r/LocalLLaMA. Cost to run: ~$3 off-peak.
69K
👋 Jan
@jandotai
May 9, 2025
Qwen3-30B-A3B local settings guide. - With thinking: Temp 0.6, TopP 0.95, TopK 20, 32,768 tokens max - Without thinking: Temp 0.7, TopP 0.8, TopK 20 Switch modes with /think or /no_think in prompts, or enable_thinking=False in code. Source: @Alibaba_Qwen
28K
👋 Jan
@jandotai
Mar 11, 2025
GRPO-tuned Qwen 32B matches Claude 3.7 Sonnet on deductive reasoning tasks! Outperforms DeepSeek R1, o1, and o3-mini on "Temporal Clue" puzzles at 100x lower inference cost. Click Use this model on @huggingface and select 👋 Jan to run it locally: huggingface.co/bartowski/Open…
37K
👋 Jan
@jandotai
Nov 25, 2024
How to run AI models locally? -> Go to 🤗 @huggingface -> Grab the GGUF model link -> Drop it into 👋 Jan Hub That's all there is to it.
00:00
45K
👋 Jan
@jandotai
Feb 14, 2025
🐳 DeepSeek just dropped official recommendations on how to run their models effectively! Here's a quick breakdown of what you need to know: 🧵
55K