Log inSign up
Cameron R. Wolfe, Ph.D.
4,742 posts
Image
user avatar
Cameron R. Wolfe, Ph.D.
@cwolferesearch
Research @Netflix • Writer @ Deep (Learning) Focus • PhD @optimalab1
Austin, TX
cameronrwolfe.substack.com
Joined August 2021
810
Following
38.9K
Followers
  • Pinned
    user avatar
    Cameron R. Wolfe, Ph.D.
    @cwolferesearch
    Apr 16, 2025
    Reinforcement Learning (RL) is quickly becoming the most important skill for AI researchers. Here are the best resources for learning RL for LLMs… TL;DR: RL is more important now than it has ever been, but (probably due to its complexity) there aren’t a ton of great resources
    Image
    106K
  • user avatar
    Cameron R. Wolfe, Ph.D.
    @cwolferesearch
    Nov 23, 2023
    Q-Learning is *probably* not the secret to unlocking AGI. But, combining synthetic data generation (RLAIF, self-instruct, etc.) and data efficient reinforcement learning algorithms is likely the key to advancing the current paradigm of AI research… TL;DR: Finetuning with
    Image
    545K
  • user avatar
    Cameron R. Wolfe, Ph.D.
    @cwolferesearch
    Aug 20, 2023
    Replying to @MerabDvalishvil
    Image
    86K
  • user avatar
    Cameron R. Wolfe, Ph.D.
    @cwolferesearch
    Mar 27, 2023
    Large language models (LLMs) are fun to use, but understanding the fundamentals of how they work is also incredibly important. One major idea and building block of LLMs is their underlying architecture: the decoder-only transformer model. 🧵[1/6]
    Image
    393K
  • user avatar
    Cameron R. Wolfe, Ph.D.
    @cwolferesearch
    Nov 27, 2023
    Due to the recent surge in popularity of AI and language models, one of the most common questions I hear is: How can we train a specialized LLM over our own data? The answer is actually pretty simple… TL;DR: Training LLMs end-to-end is quite difficult due to the size of the
    Image
    411K
  • user avatar
    Cameron R. Wolfe, Ph.D.
    @cwolferesearch
    Mar 2, 2023
    The ChatGPT API was released yesterday and it costs 90% less than expected. Here’s five methods (and resources to learn about them) that are **probably** being used to enable this price reduction… 🧵[1/6]
    Image
    263K
  • user avatar
    Cameron R. Wolfe, Ph.D.
    @cwolferesearch
    Aug 25, 2023
    One of the best ways to reduce hallucinations with LLMs is by retrieving useful, factual information and injecting it into the LLM’s prompt as added context. Although this might sound complicated, it’s actually quite easy to implement with standard vector search functionality…
    Image
    357K
  • user avatar
    Cameron R. Wolfe, Ph.D.
    @cwolferesearch
    Feb 5, 2024
    RAG is one of the best (and easiest) ways to specialize an LLM over your own data, but successfully applying RAG in practice involves more than just stitching together pretrained models… What is RAG? At the highest level, RAG is a combination of a pretrained LLM with an
    Image
    184K
  • user avatar
    Cameron R. Wolfe, Ph.D.
    @cwolferesearch
    Jan 23, 2024
    The volume of LLM research being released is staggering. Although there are too many new papers for any one person to read, this work can be largely distilled into a much smaller set of overlapping themes. Recently, there are three trends in LLM research that have been especially
    Image
    191K
  • user avatar
    Cameron R. Wolfe, Ph.D.
    @cwolferesearch
    Mar 31, 2023
    Self-attention is the primary building block of large language models (LLMs) and transformers in general. But, how exactly does it work? 🧵 [1/8]
    Image
    328K
  • user avatar
    Cameron R. Wolfe, Ph.D.
    @cwolferesearch
    Sep 26, 2024
    I find it so interesting (and smart) that Meta / LLaMA is eliminating the dependence of their models on the HuggingFace stack. The LLaMA models now: - Have their own website to download weights. - Have one of the best LLM cookbooks that's available. - Provide extensive
    Image
    253K
  • user avatar
    Cameron R. Wolfe, Ph.D.
    @cwolferesearch
    Mar 14, 2023
    Although large language models (LLMs) are incredibly capable, they are pretty simple to understand. In fact, the core components of most LLMs can be distilled into five major components… 🧵[1/7]
    Image
    195K
  • user avatar
    Cameron R. Wolfe, Ph.D.
    @cwolferesearch
    Nov 4, 2024
    Prompt engineering requires a lot of manual effort. Here are four automatic prompt optimization algorithms that can help to improve your prompt with minimal effort… (1) Automatic Prompt Engineer (APE) [1] searches over a pool of prompts proposed by an LLM–usually ~32-64
    Image
    97K
  • user avatar
    Cameron R. Wolfe, Ph.D.
    @cwolferesearch
    Jan 12, 2024
    Generative large language models (LLMs) are based upon the decoder-only transformer architecture. Currently, these types of generative LLMs are incredibly popular. However, I use encoder-only architectures for 90% of use cases as a practitioner. Here’s why… History of
    Image
    261K

New to X?

Sign up now to get your own personalized timeline!

Create account

By signing up, you agree to the Terms of Service and Privacy Policy, including Cookie Use.

Terms of Service|Privacy Policy|Cookie Policy|Accessibility|Ads info|© 2026 X Corp.
Don't miss what's happening
People on X are the first to know.
Log inSign up