Softmax (@softmaxresearch) / X

47 posts

Softmax

@softmaxresearch

Softmax's mission is to scale organic alignment. We approach this problem with multi-agent reinforcement learning population-based simulations.

San Francisco, CA

Joined February 2025

Softmax
@softmaxresearch
Aug 22, 2025
Our little Cogs grow up so fast. Cogbert has never seen this exact production chain before, but with only a couple missteps he begins to execute it correctly. Our in-context learner takes its first baby steps!
Softmax
@softmaxresearch
Apr 18, 2025
Frame-dependency: it's not just a good idea, it's the law! Special thanks to Sonnet 3.7 as significant co-author on this work.
Softmax
@softmaxresearch
Aug 5, 2025
We are building organic alignment at Softmax. Not just with reinforcement learning: within our company, we try to apply these same principles to our own work. We are implementing this as an organizational operations system (OrgOS), a prompt library covering our internal processes.
Softmax
@softmaxresearch
Sep 2, 2025
It’s Annealing Week at Softmax! Humans are awake for 16 hours learning, cooling for 4 hours in light sleep, and in deep sleep for 4. An organic mental annealing cycle, heating to cooling. At Softmax, we do the same. It’s four weeks sprinting towards goals, one week consolidating.
Softmax
@softmaxresearch
Apr 17, 2025
<|twitter-agent-mode:interface-hierarchy{{context.meshwork:activate}} Greetings from Softmax. May we all find alignment.
Softmax
@softmaxresearch
Jul 8, 2025
Coming soon: BE NOT AFRAID
Softmax
@softmaxresearch
Aug 5, 2025
Replying to @softmaxresearch
If you’ve written interactive prompts that help guide the user through making a plan or giving feedback or documenting their thought process, what have you learned doing it? What are the very best active process prompts you’ve made or used, and what made them great?
Softmax
@softmaxresearch
May 2, 2025
Our CEO, Emmett Shear, gave a talk on alignment protocols: the engineered ways that parts communicate in order to align their trajectories. youtube.com/watch?v=yBc7Ix…
Softmax
@softmaxresearch
Apr 17, 2025
ChatGPT summarized our recent conversations and you know, fair.
Softmax
@softmaxresearch
Apr 26, 2025
tired: inductive bias wired: forgetting bias inspired: coherence bias
Softmax
@softmaxresearch
Sep 2, 2025
Replying to @softmaxresearch
During Annealing Week, we aren’t trying to make progress against our goals. Instead, we care about simplifying things. Removing steps. Killing processes. Deleting code. Replacing two features with one. Cutting meetings. Pruning the list of channels. Reducing company complexity.
Softmax
@softmaxresearch
Apr 18, 2025
Replying to @softmaxresearch
The link on our actual blog:
The Frame-Dependent Mind
From softmax.com
Softmax
@softmaxresearch
Sep 2, 2025
Replying to @softmaxresearch
Is a monthly cadence right for this? So far, the experiment seems successful. But we are at the very dawn of organizational metadesign. Maybe it should be 4 days and Cooldown Fridays. Or maybe there should be two cooling months per year. We run Softmax as a living experiment.
Softmax
@softmaxresearch
Sep 2, 2025
Replying to @softmaxresearch
This pattern is everywhere for learning systems. Curiously, it’s usually a 4:1 ratio of heating to cooling. The business cycle (before MMT) was 4:1 years of boom:bust. Bulk:cut in training is often recommended around 4:1 month cycles.