TensorZero (@TensorZero) / X

TensorZero

273 posts

TensorZero

@TensorZero

NYC

github.com/tensorzero/ten…

Joined October 2023

Following

1,357

Followers

Pinned
TensorZero
@TensorZero
Mar 23
We’re building TensorZero Autopilot, an automated AI engineer that analyzes LLM observability data, optimizes prompts and models, sets up evals, and runs A/B tests. It dramatically improves the performance of LLM agents on every single benchmark we’ve tried. Read more below.
9.4K
TensorZero reposted
Kristian Ernst
@_kristianernst
Jun 7
Just wanted to give a huge shoutout to @TensorZero! you guys have really done an amazing job. De facto recommendation on every client project we work on right now.
940
TensorZero reposted
Michelle
@michellehui
Jun 3
it was a full house last night! (pun intended) inaugural @TensorZero ai poker night co-hosted with @BessemerVP
509
TensorZero reposted
Gabriel Bianconi
@gabrielbianconi
Jun 3
🃏 Great time last night at @TensorZero's inaugural AI Poker Night co-hosted with @BessemerVP!
392
TensorZero
@TensorZero
May 18
You might be overpaying 5.3x+ for Claude Opus 4.7! Our CEO @gabrielbianconi found out that on tool-heavy workloads, you're paying 5.3x more for Claude Opus 4.7 than GPT 5.4. The common metric is to compare cost per million tokens. But different providers use different
00:00
1.8K
TensorZero
@TensorZero
May 18
Stop comparing price per million tokens: the hidden LLM API costs · TensorZero
From tensorzero.com
198
TensorZero
@TensorZero
May 12
LLM evaluators are often noisy and weakly correlated with real-world outcomes. Noisy evaluators have limited value for production decisions that hinge on judging a single output (e.g. guardrails). However, even (very) noisy evaluators can reliably tell you which agent is better
639
TensorZero
@TensorZero
May 12
Even (very) noisy LLM evaluators are useful for improving AI agents · TensorZero
From tensorzero.com
147
TensorZero
@TensorZero
May 11
Michelle Hui is joining us with a focus on developer relations. She recently graduated from Cornell with BS & MS degrees in computer science, during which she organized large tech events, conducted ML research, and held product internships (Alphabet / Wing, UN). Welcome to the
2.7K
TensorZero
@TensorZero
May 5
"If your security relies on your code being obfuscated, you're telling yourself a fake story." Our CTO Viraj Mehta (@thebigmehtaphor) chats about AI scaling, open source, and being a technical founder following his PhD in Reinforcement Learning at CMU. In our in-depth
00:00
1K
TensorZero
@TensorZero
May 5
192
TensorZero
@TensorZero
Apr 28
TensorZero is committed to open source. We sit down with our CTO Viraj @thebigmehtaphor. Takeaways: 1. Thinking closed-source code == security is a LIE. You still need to build from first principles and fundamentally secure code. 2. Open-source keeps more eyes on your code,
00:00
264
TensorZero
@TensorZero
Apr 16
Article
Stop comparing price per million tokens: the hidden LLM API costs
Summary Token pricing is misleading: the same input produces 2.65x+ more tokens depending on the model. We got wildly different token counts from identical content using OpenAI, Anthropic, and...
1.3K
TensorZero
@TensorZero
Mar 25
Can an automated AI engineer autonomously debug and optimize an LLM pipeline in 5 minutes? Last night, ours did: it cut errors in ~half during its first live demo. TensorZero Autopilot (our automated AI engineer) analyzed hundreds of historical LLM traces to identify failure
3.3K
TensorZero
@TensorZero
Mar 25
Learn more: tensorzero.com/blog/automated…
236
TensorZero
@TensorZero
Mar 23
Replying to @TensorZero
Read more about this work on the TensorZero Blog:
We're building an automated AI engineer, and it works · TensorZero
From tensorzero.com
237
TensorZero
@TensorZero
Mar 23
TensorZero Autopilot is powered by our open-source LLMOps platform that unifies an LLM gateway, observability, optimization, evaluation, and experimentation. The open-source project is used by companies ranging from frontier AI startups to the Fortune 10 and powers ~1% of the
228