🚀Introducing Intern-S1, our most advanced open-source multimodal reasoning model yet!
🥳Strong general-task capabilities + SOTA performance on scientific tasks, rivaling leading closed-source commercial models.
🥰Built upon a 235B MoE language model and a 6B Vision encoder.
Intern Large Models
106 posts
Intern-series large models by Shanghai AI Laboratory.
Joined June 2023
- 🚀 We have released InternLM 2.5, the best model under 12B on the @huggingface Open LLM Leaderboard. 🔥 Outstanding reasoning capability 🚀 1M context window 🔧 Stronger tool use Try using it to build your amazing application! huggingface.co/collections/in…
- 🔥Introducing Intern-S1-mini, a lightweight open-source multimodal reasoning model based on the same techniques as Intern-S1. 🥳With just 8B parameters, it’s optimized for fast deployment and easy customization. - Strong general capabilities while excelling in specialized
- 🥳Introducing #OREAL, a new RL method for math reasoning. 😊With OREAL, a 7B model achieves 94.0 pass@1 on MATH-500, matching many 32B models, while OREAL-32B achieves 95.0 pass@1, surpassing #DeepSeek-R1 Distilled models. 🤗Paper/Model/Data: huggingface.co/papers/2502.06…
- 🥳We have released our InternLM2.5 new models in 1.8B and 20B on @huggingface. 😉1.8B: Ultra-lightweight, high-performance, with great adaptability. 😉20B: More powerful, ideal for complex tasks. 😍Explore now! Models: huggingface.co/collections/in… GitHub: github.com/InternLM/Inter…
- 🚀Introducing InternLM3-8B-Instruct with Apache License 2.0. -Trained on only 4T tokens, saving more than 75% of the training cost. -Supports deep thinking for complex reasoning and normal mode for chat. Model:@huggingface huggingface.co/internlm/inter… GitHub: github.com/InternLM/Inter…
- Try our new multimodal reasoning model #InternS1😋 Model: huggingface.co/internlm/Inter… GitHub: github.com/InternLM/Inter… Chat here: chat.intern-ai.org.cn
- 🥳We release InternLM2-Math-Plus, a series of math-focused #LLMs in sizes 1.8B, 7B, 20B, and 8x22B, with enhanced capabilities in chain-of-thought, code interpretation, and LEAN 4 mathematical reasoning 🥰Model: @huggingface huggingface.co/internlm/inter… 🥰Code: github.com/InternLM/Inter…
- 🥳Thrill to release the full RL training code of #OREAL! 😊Now you can fully reproduce the results of OREAL-7B/32B. Using #DeepSeek-R1-Distill-Qwen-32B, you can further obtain a model has 95.6 on MATH-500! 🤗Code: github.com/InternLM/OREAL 🤗Based on: github.com/InternLM/xtuner🥳Introducing #OREAL, a new RL method for math reasoning. 😊With OREAL, a 7B model achieves 94.0 pass@1 on MATH-500, matching many 32B models, while OREAL-32B achieves 95.0 pass@1, surpassing #DeepSeek-R1 Distilled models. 🤗Paper/Model/Data: huggingface.co/papers/2502.06…
- 🥳InternLM-XComposer2.5-OmniLive, a comprehensive multimodal system for long-term streaming video and audio interactions. Real-time visual & auditory understanding Long-term memory formation Natural voice interaction Code: github.com/InternLM/Inter… Model: huggingface.co/internlm/inter…
- 🚀 Introducing #POLAR: Bring Reward Model into a New Pre-training Era! ✨ Say goodbye to reward models with poor generalization! POLAR (Policy Discriminative Learning) is a groundbreaking pre-training paradigm that trains reward models to distinguish policy distributions,
- 🥳Multi-modal Phi-3-mini is here! #LLaVA-Phi-3-mini outperforms LLaVA-v1.5-7B and matches the performance of LLaVA-Llama-3-8B in multiple benchmarks. 😊For easy applications, #GGUF weights are provided. 👉github.com/InternLM/xtuner @_akhaliq #Phi3
- 🚀 Introducing InternLM2-Reward! 🚀 🥳Releasing our reward models in 1.8B, 7B, and 20B on🤗 @huggingface . Trained with 2.4M preference samples, they balance helpfulness and harmlessness in both English and Chinese. Show strong results on RewardBench💪! 😉huggingface.co/internlm/inter…🚀 #XTuner's Zero Memory Waste and Sequence Parallel in Preference Alignment! 🚀 ⚡50% Faster: Cut DPO training time in half with packed preference data. 📈 Sequence Parallel: Train #Llama3 70B RM with sequence lengths up to 1M tokens on 64 A100s. 😉Code: github.com/InternLM/xtuner
- 🥳The #GGUF-format weights for #LLaVA-Llama-3-8B and LLaVA-Phi-3-mini (supporting FP16 and INT4 dtypes), have been released, supporting the deployment on @LMStudioAI , llama.cpp, and @ollama platforms. 🥰Welcome to follow and star! 👉github.com/InternLM/xtuner #LLaMA3 #Phi3


























