I have a hot take that most people still underestimate how impactful AI will be.
Last month I gave two talks at Columbia and Harvard on the state of AI and how I slowly got AGI‑pilled over the last decade (yes, I was very skeptical about AGI 10 years ago).
Many friends who
Shuchao Bi
Research at Meta Superintelligence Labs, RL/post-training/agents; Previously Research at OpenAI on multimodal and RL; Opinions are my own.
Joined December 2009
- Today, we are rolling out Search in Advanced Voice mode. You can now get real-time information while speaking with ChatGPT. It has been a very fruitful collaboration between the Search and multimodal product research teams.
  Quoted post: Day 8: ChatGPT Search Day openai.com/12-days/?day=8
- Great work from the team. Please give it a try and let us know what you think.
  Quoted post: We launched an update to Advanced Voice to make it way more natural and effortless to talk to. Now available to all paid users in ChatGPT.
- Reinforcement fine-tuning is here after 5 months of amazing work. You can build your own reasoning model in your domain now. Would love your feedback and use cases!
  Quoted post: Day 2: Reinforcement Fine-Tuning openai.com/12-days/?day=2
- 10x cheaper realtime voice API. The internet went from text-only to multimodal over the last 25 years: blogs + Google → Instagram → short-form videos (YouTube Shorts, TikTok). Think about how many human hours are spent on writing or reading text vs. talking or watching videos.
  Quoted post: Day 9: DevDay Holiday Edition openai.com/12-days/?day=9
- Today we started rolling out video, screensharing, image uploads, and Santa Voice in ChatGPT Advanced Voice. This is an important step, driven by my team on the research side, toward building a truly multimodal interface to AI. ChatGPT can now see, hear, and speak in real time.
  Quoted post: Day 6: A gift for everyone who has been nice this year 🎅 openai.com/12-days/?day=6
- I gave this talk at Harvard in June, similar to the one at Columbia during our East Coast trip. I had a lot of FUD (fear, uncertainty, and doubt) about deep learning, and I shared my personal journey of resolving it and slowly getting AGI-pilled over the last decade.
  Advancing the Frontier of Silicon
- Replying to @shuchaobi: We built the internet, kernels, software, and the whole virtual world as an environment for virtual AI to learn.
- Replying to @_jasonwei: Could I take the compliments, given the number of times we ran into each other over the weekend, lol
- This is the opportunity of our generation. Things will be vastly different 5-10 years from now.
  Quoted post: What do you want to create next?
- We have been hearing about many cases like this; ChatGPT is literally saving many lives, and it will only do so more often. This is what keeps us up at night.
  Quoted post: Redditor says ChatGPT saved his wife's life by correcting a doctor's fatal misdiagnosis. Comments are filled with people sharing their own stories. I don't understand the AI haters at all. This technology saves lives.
- Great work from the team!
  Quoted post: Replying to @sama: next up: upgraded voice mode! much more natural and smarter. also, free users now can chat for hours, and plus users nearly unlimited. works well with study mode, and lots of other things.
- If you are curious about how to pass the speech Turing test, @pika7ma and I shared some high-level learnings here:
- A small step towards voice agents. Congrats Yi Shen, Liyu, @ChengxuZhuang, Damian, @junhuamao, Erik, Yu, @christinahkim
  Quoted post: Three new state-of-the-art audio models in the API: 🗣️ Two speech-to-text models, outperforming Whisper 💬 A new TTS model: you can instruct it *how* to speak 🤖 And the Agents SDK now supports audio, making it easy to build voice agents. Try TTS now at OpenAI.fm.