Log inSign up
πŸ‘‹ Jan
1,591 posts
Image
user avatar
πŸ‘‹ Jan
@jandotai
Jan is one agent for (almost) everything. Community: discord.gg/Exe46xPMbK
your device
jan.ai
Joined October 2023
988
Following
13K
Followers
  • Pinned
    user avatar
    πŸ‘‹ Jan
    @jandotai
    Mar 2
    Introducing Jan-Code-4B πŸ’» A compact coding model tuned for practical day-to-day tasks. Generation, refactors, debugging, tests β€” all runnable locally in Jan. Download Jan: jan.ai Model: huggingface.co/collections/ja…
    Image
    00:00
    111K
  • user avatar
    πŸ‘‹ Jan
    @jandotai
    Aug 12, 2025
    Introducing Jan-v1: 4B model for web search, an open-source alternative to Perplexity Pro. In our evals, Jan v1 delivers 91% SimpleQA accuracy, slightly outperforming Perplexity Pro while running fully locally. Use cases: - Web search - Deep Research Built on the new version
    Image
    00:00
    690K
  • user avatar
    πŸ‘‹ Jan
    @jandotai
    Jun 3, 2025
    Google has quietly open-sourced a full-stack research agent stack, powered by Gemini and LangGraph. It's capable of multi-step web search, reflection, and synthesis. While not confirmed to match Gemini’s production backend, it's strikingly close.
    Image
    GitHub - google-gemini/gemini-fullstack-langgraph-quickstart: Get started with building Fullstack...
    From github.com
    165K
  • user avatar
    πŸ‘‹ Jan
    @jandotai
    Jun 4, 2025
    NVIDIA just released Llama-Nemotron-Nano-VL-8B-V1, an 8B vision model that reads dense documents, charts, and video frames. It's #1 on OCRBench V2 (English), with layout and OCR fused end-to-end.
    Image
    nvidia/Llama-3.1-Nemotron-Nano-VL-8B-V1 Β· Hugging Face
    From huggingface.co
    68K
  • user avatar
    πŸ‘‹ Jan
    @jandotai
    Jul 16, 2025
    Microsoft releases a new dataset that improves Qwen2.5-7B from 17.4% to 57.3% on LiveCodeBench. It's called rStar-Coder, 418K tasks designed to push competitive code reasoning. A 7B model trained on it outperforms QWQ-32B on the USA Computing Olympiad.
    Image
    microsoft/rStar-Coder Β· Datasets at Hugging Face
    From huggingface.co
    36K
  • user avatar
    πŸ‘‹ Jan
    @jandotai
    Aug 4, 2025
    Open-source voice cloning at 16x real-time? Chatterbox TTS (0.5B Llama) now runs on vLLM, 5–10x faster than the original implementation. On a 3090: - 40 min speech in ~2.5 min - Same quality, way faster
    Image
    GitHub - randombk/chatterbox-vllm: VLLM Port of the Chatterbox TTS model
    From github.com
    43K
  • user avatar
    πŸ‘‹ Jan
    @jandotai
    May 30, 2025
    Someone got DeepSeek-R1-0528-Qwen3-8B running on an iPhone 16 Pro using MLX. It runs but takes ages to respond, and the phone gets hot fast. 8B models on phones aren't sci-fi anymore. via u/adrgrondin on r/LocalLLaMA
    Image
    00:00
    83K
  • user avatar
    πŸ‘‹ Jan
    @jandotai
    Aug 13, 2025
    Get your free Perplexity-style search agent in 2 mins. Use Jan v1 and match the settings in this video.
    Image
    00:00
    42K
  • user avatar
    πŸ‘‹ Jan
    @jandotai
    Jul 14, 2025
    This is interesting. One dev is training an AI from scratch on books from 1800s London. It's called TimeCapsuleLLM, not a fine-tuned modern model, but one trained entirely on historical data. No modern language or context. Built on nanoGPT by @karpathy.
    Image
    GitHub - haykgrigo3/TimeCapsuleLLM: A LLM trained only on data from certain time periods to reduce...
    From github.com
    105K
  • user avatar
    πŸ‘‹ Jan
    @jandotai
    May 29, 2025
    DeepSeek R1.1 just matched Claude Opus on Aider's polyglot benchmark - 70.7% Pass@2. Old R1 scored 56.9%, so this is a +13.8pt jump. Same test, same setup, posted by a user on r/LocalLLaMA. Cost to run: ~$3 off-peak.
    Image
    69K
  • user avatar
    πŸ‘‹ Jan
    @jandotai
    May 9, 2025
    Qwen3-30B-A3B local settings guide. - With thinking: Temp 0.6, TopP 0.95, TopK 20, 32,768 tokens max - Without thinking: Temp 0.7, TopP 0.8, TopK 20 Switch modes with /think or /no_think in prompts, or enable_thinking=False in code. Source: @Alibaba_Qwen
    Image
    28K
  • user avatar
    πŸ‘‹ Jan
    @jandotai
    Mar 11, 2025
    GRPO-tuned Qwen 32B matches Claude 3.7 Sonnet on deductive reasoning tasks! Outperforms DeepSeek R1, o1, and o3-mini on "Temporal Clue" puzzles at 100x lower inference cost. Click Use this model on @huggingface and select πŸ‘‹ Jan to run it locally: huggingface.co/bartowski/Open…
    Image
    37K
  • user avatar
    πŸ‘‹ Jan
    @jandotai
    Nov 25, 2024
    How to run AI models locally? -> Go to πŸ€— @huggingface -> Grab the GGUF model link -> Drop it into πŸ‘‹ Jan Hub That's all there is to it.
    Image
    00:00
    45K
  • user avatar
    πŸ‘‹ Jan
    @jandotai
    Feb 14, 2025
    🐳 DeepSeek just dropped official recommendations on how to run their models effectively! Here's a quick breakdown of what you need to know: 🧡
    Image
    55K

New to X?

Sign up now to get your own personalized timeline!

Create account

By signing up, you agree to the Terms of Service and Privacy Policy, including Cookie Use.

TermsΒ·PrivacyΒ·CookiesΒ·AccessibilityΒ·Ads InfoΒ·Β© 2026 X Corp.
Don't miss what's happening
People on X are the first to know.
Log inSign up