Log inSign up
Jiaming Song
874 posts
user avatar
Jiaming Song
@baaadas
I need a vacation
tsong.me
Joined November 2014
1,155
Following
10.4K
Followers

New to X?

Sign up now to get your own personalized timeline!

Create account

By signing up, you agree to the Terms of Service and Privacy Policy, including Cookie Use.

Terms·Privacy·Cookies·Accessibility·Ads Info·© 2026 X Corp.
Don't miss what's happening
People on X are the first to know.
Log inSign up
  • Pinned
    user avatar
    Jiaming Song
    @baaadas
    May 31
    time for some vacation stuff
    Image
    00:00
    user avatar
    Jiaming Song
    @baaadas
    May 31
    Yesterday was my last day at @LumaLabsAI. Over the last three years, I had the privilege of helping drive the company's transition from 3D AI to video generation and native multimodal foundation models. I am grateful to have worked alongside an extraordinary group of
    25K
  • user avatar
    Jiaming Song
    @baaadas
    Mar 11, 2025
    As one of the people who popularized the field of diffusion models, I am excited to share something that might be the “beginning of the end” of it. IMM has a single stable training stage, a single objective, and a single network — all are what make diffusion so popular today.
    156K
  • user avatar
    Jiaming Song
    @baaadas
    Mar 13, 2022
    "The paper is not novel because some arxiv paper in February of 2022 already did it" -- I recall that the #icml2022 submission deadline was on January of 2022? I am fine with the paper getting rejected, but not all of us have time machines 🤣 @icmlconf
    Image
  • user avatar
    Jiaming Song
    @baaadas
    Mar 31, 2025
    Found this figure while randomly reading @torchcompiled 's blog. It reminds me about something I had thought about around 4.5 years ago: adding noise is like drawing balls in persistent homology. Maybe we can use it to analyze the optimal way to sample timesteps during training?
    Image
    Image
    34K
  • user avatar
    Jiaming Song
    @baaadas
    Nov 22, 2022
    📢 We are looking for highly motivated #ML #AI Ph.D. students to work with us at NVIDIA Research as summer #interns next year. We encourage applicants with experience in generative modeling in one of these domains:
  • user avatar
    Jiaming Song
    @baaadas
    May 20, 2023
    @chenlin_meng, @ArashVahdat, and I are presenting the #diffusion model tutorial at #CVPR2023 on June 18 (…3-tutorial-diffusion-models.github.io). Since there are > 1300 papers on this topic, we cannot read all of them😭, and we need your help on uncovering all the "hidden gems"!
    Image
    Denoising Diffusion-based Generative Modeling: Foundations and Applications
    From cvpr2023-tutorial-diffusion-models.github.io
    56K
  • user avatar
    Jiaming Song
    @baaadas
    Jun 17, 2023
    📢 Our #CVPR2023 tutorial on "Denoising Diffusion Models: A Generative Learning Big Bang" w/ @chenlin_meng and @ArashVahdat is happening tomorrow morning! 9:00 to 12:30, West 202-204. …3-tutorial-diffusion-models.github.io This is the year of big bang for diffusion models in CVPR!
    Rough estimate of the percentage of papers of GAN and diffusion model papers on CVPR each year.
    41K
  • user avatar
    Jiaming Song
    @baaadas
    Jul 12, 2023
    After a wonderful year at NVIDIA, I am starting a new adventure @LumaLabsAI 🐻
    Image
    00:00
    49K
  • user avatar
    Jiaming Song
    @baaadas
    Jun 12, 2024
    Extremely proud to be working on this with many amazing people @LumaLabsAI! Generate a 5-second, 120 frames video in 120 seconds from text or images *now* on: lumalabs.ai/dream-machine Available to everyone. #LumaDreamMachine
    26K
  • user avatar
    Jiaming Song
    @baaadas
    Jun 15, 2021
    Introducing Diffusion-Denoising Models with Contrastive Representations (D2C), a non-adversarial image generative model for few-shot conditional generation (e.g. image manipulation). d2c-model.github.io arxiv.org/abs/2106.06819 w/ @a7b2_3 @chenlin_meng @StefanoErmon
    Image
  • user avatar
    Jiaming Song
    @baaadas
    Mar 14, 2025
    Came across this gem earlier. TLDR: perplexity is a flawed metric for diffusion language models due to model mis-specification, so other metrics like the "Sequence Error Rate" proposed here might be better.
    Image
    15K
  • user avatar
    Jiaming Song
    @baaadas
    Jul 13, 2025
    Based on developments on "flow-map / average velocity" type methods, such as consistency trajectory models, shortcut models, IMM, and mean flow, I believe that the community will develop a proper replacement to diffusion / flow matching in 6 - 12 months.
    user avatar
    Jiaming Song
    @baaadas
    Mar 11, 2025
    As one of the people who popularized the field of diffusion models, I am excited to share something that might be the “beginning of the end” of it. IMM has a single stable training stage, a single objective, and a single network — all are what make diffusion so popular today.
    20K
  • user avatar
    Jiaming Song
    @baaadas
    Dec 2, 2017
    Basically you can use all kinds of regularization to maximize MI between data and code - e.g. GAN, Stein and MMD. Our experiments on PixelCNN show that MMD works the best, and can be implemented in 10 lines.
    user avatar
    Stefano Ermon
    Inception
    @StefanoErmon
    Dec 2, 2017
    Check out our new blog post by @shengjia_zhao on InfoVAE: ermongroup.github.io/blog/a-tutoria…
    Image
  • user avatar
    Jiaming Song
    @baaadas
    Dec 8, 2020
    Can we make better use of negative samples in contrastive learning? In our #NeurIPS2020 paper, we show this is true by simply using a multi-label objective. Come to our oral presentation at 6:15 PT (neurips.cc/virtual/2020/p…) and poster at 9-11 for more details! @StefanoErmon
    Image