Log inSign up
Zengyi Qin
696 posts
Image
user avatar
Zengyi Qin
@qinzytech
Multi-modal Agent Research | MIT PhD
Bay Area, USA
qinzy.tech
Joined December 2023
538
Following
5,020
Followers
  • Pinned
    user avatar
    Zengyi Qin
    @qinzytech
    Dec 1, 2025
    Introducing Lux, the most powerful and fastest Computer Use model, built by OpenAGI Foundation @agiopen_org Lux outperforms Google Gemini CUA, OpenAI Operator and Anthropic Claude on benchmark with 300 real-world tasks. Try our developer-friendly SDK to build powerful,
    Image
    00:00
    103K
  • user avatar
    Zengyi Qin
    @qinzytech
    Jan 16, 2025
    Our MIT team just developed an internal Agent benchmark where GPT4-o and Claude has ~0% success rateā˜•ļøLet's see how much we can achieve by the NeurIPS deadline🐈 BTW - These are computer-use tasks that are not too hard for humans but difficult for LLMs due to long-horizon
    156K
  • user avatar
    Zengyi Qin
    @qinzytech
    Apr 4, 2024
    Training LLMs can be much cheaper than previously thought. 0.1 million USD is sufficient for training LLaMA2-level LLMs🤯 While @OpenAI and @Meta use billions of dollars to train theirs, you can also train yours with much less money. Introducing our open-source project JetMoE:
    Image
    247K
  • user avatar
    Zengyi Qin
    @qinzytech
    Jan 5, 2024
    OpenVoice: Instantly clone any voice and generate speech in any style and any language! We are trending #1 on Github! Website: research.myshell.ai/open-voice HuggingFace demo: huggingface.co/spaces/myshell… MyShell demo: app.myshell.ai/bot/z6Bvua/170… Source code: github.com/myshell-ai/Ope…
    Image
    00:00
    78K
  • user avatar
    Zengyi Qin
    @qinzytech
    Apr 24, 2024
    Introducing OpenVoice V2, our latest voice clone model · Clone Any Voice, Speak in Many Languages · Totally Free, Open-Sourced Now your voice goes global in multiple languages🤯 Joint work by @myshell_ai and @MIT_CSAIL
    Image
    00:00
    Image
    01:26
    user avatar
    MyShell.AI
    @myshell_ai
    Apr 24, 2024
    Introduce OpenVoice V2 - a Text-to-Speech model that can clone any voice and speak in any language. Developed by MyShell and @MIT_CSAIL researchers. 🌐 Imagine your voice going global in multiple languages. šŸ”Š OpenVoice V2 breaks the language barrier and redefines voice
    51K
  • user avatar
    Zengyi Qin
    @qinzytech
    Dec 27, 2024
    DeepSeek: 200 people + $5.5M cost beats Llama3.1-405B JetMoE: 4 people + $0.08M cost beats Llama2-7B and Llama-13B @deepseek_ai @AIatMeta @Alibaba_Qwen @xai @ZihangDai @tingchenai @Guodzh @TheGregYang @elonmusk give us JetMoE team $6M and see what we can do
    Image
    35K
  • user avatar
    Zengyi Qin
    @qinzytech
    Jan 24, 2025
    Do NOT overhype OpenAI Operator We show some failure modes that indicates it is almost surely below a college-level computer use. My guess: OpenAI devoted a lot to post-train this model but not sufficiently pre-train it. The model does not even know some basic skills for
    Image
    38K
  • user avatar
    Zengyi Qin
    @qinzytech
    Mar 7, 2024
    Super quality Text-to-Speech library #MeloTTS is open-sourced. Multi-lingual, multi-accent, CPU-realtime, and completely free. It supports English, Spanish, French, Chinese, Japanese and Korean. Take a trip through seminal moments in open source history that led to Linux,
    Image
    00:00
    25K
  • user avatar
    Zengyi Qin
    @qinzytech
    Mar 18, 2024
    MeloTTS is probably the best free & open-sourced text-to-speech library that supports multiple languages. Do you believe this voicešŸ‘‡is AI-Generated? 🤯 It has almost 3K stars on Github: github.com/myshell-ai/Mel… Completely open-sourced. Completely free. Supports English,
    Image
    00:00
    20K
  • user avatar
    Zengyi Qin
    @qinzytech
    Mar 6, 2025
    Many people ask me about Manus @manusai and here are my thoughts:
    Image
    69K
  • user avatar
    Zengyi Qin
    @qinzytech
    Apr 15, 2024
    JetMoE technical report is out! The key to train LLaMA2-level LLM with only $0.1M cost: Ā· 2-phase training strategy Ā· MoA + MoE Great work supported by @myshell_ai: Ā· Fully open-sourced Ā· No proprietary data / code needed
    Image
    00:00
    Image
    00:08
    user avatar
    MyShell.AI
    @myshell_ai
    Apr 15, 2024
    šŸ”¬ Since a lot of you are asking, here it comes: We reveal the technical report of JetMoE. The key to JetMoE is: Ā· 2-phase training strategy Ā· MoA + MoE In the 2-phase training, you need: Ā· Different data mixtures Ā· Different learning rate schedules In the MoA
    41K
  • user avatar
    Zengyi Qin
    @qinzytech
    Jan 16, 2025
    We will release and open-source a model that significantly outperforms o1 in computer-use agents and release the benchmark at the same time. Stay tuned
    6.4K
  • user avatar
    Zengyi Qin
    @qinzytech
    Dec 22, 2024
    Today, I spent two hours listening to @ZeyuanAllenZhu's talk, and it was undoubtedly one of the most insightful presentations that I have ever watched.
    Image
    9.3K
  • user avatar
    Zengyi Qin
    @qinzytech
    Apr 4, 2024
    Replying to @qinzytech
    JetMoE is fully open-sourced & academia-friendly because: 1. It only uses public datasets for training. No proprietary resource is needed. 2. It can be finetuned with a very limited computing budget (e.g., consumer-grade GPU). Github:
    Image
    GitHub - myshell-ai/JetMoE: Reaching LLaMA2 Performance with 0.1M Dollars
    From github.com
    8K

New to X?

Sign up now to get your own personalized timeline!

Create account

By signing up, you agree to the Terms of Service and Privacy Policy, including Cookie Use.

TermsĀ·PrivacyĀ·CookiesĀ·AccessibilityĀ·Ads InfoĀ·Ā© 2026 X Corp.
Don't miss what's happening
People on X are the first to know.
Log inSign up