Modal (@modal) / X

Modal

1,529 posts

Modal

@modal

AI infrastructure that developers love 💚 Run inference, sandboxes, batch processing, training, and many other things on Modal

New York City

Joined July 2022

Pinned
Modal
@modal
May 21
Article
Modal's Series C: Raising $355M at a $4.65B valuation
We’ve raised $355 million after growing fivefold since September, surpassing $300 million in annualized revenue. Our valuation is $4.65B post-money in a round led by @generalcatalyst and @Redpoint,...
582K
Modal
@modal
19h
Our new Auto Endpoints feature is powered by a new Modal primitive: Modal Servers. In this blogpost, we walk through design principles and detailed architecture: @EnvoyProxy, @googlecloud Spanner config store, and a @Cloudflare Pingora-based custom proxy.
00:00
23K
Modal
@modal
19h
Routing for serverless servers with Pingora, Envoy, and Spanner | Modal Blog
From modal.com
1.3K
Modal reposted
Cyrus
@cyrusasg
22h
Still don't think people fully appreciate how big dflash can be for inference latency/throughput. Genuine game changer for latency-sensitive workloads.
Modal
@modal
Jun 24
Modal Auto Endpoints provide state-of-the-art open source inference perf with a click. Learn how we developed our low latency inference playbook with @DecagonAI, delivering responses 60ms faster than the best proprietary provider. modal.com/blog/achieve-s…
3K
Modal reposted
Charles 🎉 Frye
@charles_irl
Jun 24
The no-longer-secret ingredient is DFlash by @zhijianliu_ and @jianchen1799. If you train a custom DFlash speculator on your data, you can get to lower latencies than any generic inference API can achieve. That's the benefit of owning your inference!
Modal
@modal
Jun 24
Modal Auto Endpoints provide state-of-the-art open source inference perf with a click. Learn how we developed our low latency inference playbook with @DecagonAI, delivering responses 60ms faster than the best proprietary provider. modal.com/blog/achieve-s…
10K
Modal
@modal
Jun 24
Modal Auto Endpoints provide state-of-the-art open source inference perf with a click. Learn how we developed our low latency inference playbook with @DecagonAI, delivering responses 60ms faster than the best proprietary provider. modal.com/blog/achieve-s…
25K
Modal reposted
Akshat Bubna
@akshat_b
Jun 23
You no longer have to pick between the performance of a black box API and the flexibility and control of @modal. Auto Endpoints give you both. We're unlocking frontier performance for everyone without having to talk to sales or an FDE. More cooking here, stay tuned.
Modal
@modal
Jun 23
It is not too late to _actually_ own your inference. Introducing: Modal Auto Endpoints.
00:00
8.9K
Modal reposted
Erik Bernhardsson
@bernhardsson
Jun 23
Managed private LLM endpoints, now available for everyone in @modal. Deploy in a few clicks with the UI or a few keystrokes with our CLI. The coolest thing is that these are not black boxes – customers have full access to the code underneath.
Modal
@modal
Jun 23
It is not too late to _actually_ own your inference. Introducing: Modal Auto Endpoints.
00:00
26K
Modal
@modal
Jun 23
It is not too late to _actually_ own your inference. Introducing: Modal Auto Endpoints.
00:00
191K
Modal
@modal
Jun 23
Introducing Modal Auto Endpoints: Optimized inference you actually own | Modal Blog
From modal.com
9.3K
Modal
@modal
Jun 22
.wait_until_ready(), set, go Building performant sandbox systems goes way beyond the initial container boot. We're unpacking what that means, and breaking down some tools to help you manage the entire lifecycle.
19K
Modal
@modal
Jun 22
Read here:
Unpacking sandbox startup latency: why started ≠ ready | Modal Blog
From modal.com
1.5K
Modal reposted
Connor
@cnnradams
Jun 22
light work
18K
Modal
@modal
Jun 18
We're hosting an art show with @GrayAreaorg in San Francisco! 💚 Submissions are open till July 15:
Gray Area
@GrayAreaorg
Jun 18
📢 We're partnering with @modal to offer a new development and exhibition opportunity for artists with sustained engagements in artificial intelligence and the arts. This global open call seeks proposals for creative projects that demonstrate the intentional use of AI to further
00:00
Call for Submissions — Modal × Gray Area
From modal.art
7.1K
Modal
@modal
Jun 16
Sandbox startup latency and scaling can make or break your RL training run. Great post breaking this down, shown using Modal Sandboxes.
SemiAnalysis
@SemiAnalysis_
Jun 16
RL Systems Mind the Gap: Matching Trainer and Generator Throughput RL Training Infrastructure, GRPO, PipelineRL, Async RL, Policy Staleness, RL Sandbox Infra, CPU Requirements, TCO Analysis, Thinking Machines Tinker newsletter.semianalysis.com/p/rl-systems-m…
9.8K
Modal reposted
Erik Bernhardsson
@bernhardsson
Jun 16
Our sandbox team has been on a crusade against every millisecond of latency and it's paying off. More cool results coming very soon!
18K