Log inSign up
Comet
3,493 posts
Image
user avatar
Comet
@Cometml
Comet provides an end-to-end model evaluation platform for AI developers, with best in class LLM evaluations, experiment tracking, and production monitoring
New York, NY
comet.com
Joined October 2017
880
Following
15K
Followers
  • Comet reposted
    user avatar
    Abby
    @anmorgan2414
    Jun 25
    🧵 The @aiDotEngineer World's Fair schedule just dropped. 600+ sessions, 29 tracks. I'll be there next week (June 29 - July 2 in SF). Here are the 8 talks and tracks I'm planning my days around:
    Image
    1.1K
  • Comet reposted
    user avatar
    Akshay 🚀
    @akshay_pachaar
    Jun 24
    Andrej Karpathy: "Remove yourself as the bottleneck. Maximize your leverage. Put in very few tokens, and a huge amount of stuff happens on your behalf." loop engineering is the exact thing that gets you there. in a hand-run session you do two things. you decide what the agent
    Image
    GIF
    user avatar
    Akshay 🚀
    @akshay_pachaar
    Jun 22
    Article cover image
    Article
    Loop Engineering Clearly Explained
    Half your feed is suddenly saying the same thing. Stop prompting your agents, start engineering loops. Boris Cherny, the person who built Claude Code, said it plainly: "I don't prompt Claude anymore....
    425K
  • Comet reposted
    user avatar
    Akshay 🚀
    @akshay_pachaar
    Jun 22
    Article cover image
    Article
    Loop Engineering Clearly Explained
    Half your feed is suddenly saying the same thing. Stop prompting your agents, start engineering loops. Boris Cherny, the person who built Claude Code, said it plainly: "I don't prompt Claude anymore....
    1.1M
  • user avatar
    Comet
    @Cometml
    Jun 24
    Did you know Opik integrates with #Openclaw? With the Opik/opik-openclaw plugin, every LLM call, tool invocation, and agent run is automatically traced and visible in your Opik dashboard. Three commands to get started: install, configure, restart. Setup guide →
    Image
    360
  • user avatar
    Comet
    @Cometml
    Jun 22
    We just published a public examples repo for Opik: integrations, use cases, and utility scripts you can clone and run. Community contributions welcome. github.com/comet-ml/opik-…
    Image
    523
  • Comet reposted
    user avatar
    Paul Iusztin
    @pauliusztin_
    Jun 22
    I used to think evals were something you added after building the AI system. But the more AI agents I ship, the more backward that seems... The scariest AI failures are the silent ones. You change something and everything still runs. But did the system get better? Did you
    Image
    505
  • user avatar
    Comet
    @Cometml
    Jun 9
    You're spending ~30% of your coding agent tokens on misconfiguration. Bloated context, unused skills, idle MCPs. We just launched Cost Intelligence in Opik — cuts that waste 20-30% with one click. Native to Claude Code + Codex 🔗globenewswire.com/news-release/2…
    Image
    240
  • Comet reposted
    user avatar
    Rajesh M
    @Rajesh7113
    May 21
    AI agent debugging is a COMPLETE mess right now. You fix one issue… and another workflow randomly breaks. You change a prompt. Tool calls start behaving differently. You improve latency. Accuracy drops somewhere else. Most teams are basically duct taping evals, traces,
    Image
    895
  • user avatar
    Comet
    @Cometml
    May 12
    Our Head of Research Doug Blank headed to Boston for his 3rd annual talk at @MITDeepLearning. He took Asimov's laws of robotics & applied them to agentic AI -- proposing his own three laws of AI and sharing how we're thinking about AI safety at Comet.
    500
  • user avatar
    Comet
    @Cometml
    May 8
    We're hiring across the team 🎉 If you know any rockstars (or are one yourself), we'd love to chat with you! 🔗 comet.com/site/about-us/…
    Image
    269
  • Comet reposted
    user avatar
    Paul Iusztin
    @pauliusztin_
    May 6
    I just interviewed the former CTO at IBM and Chairperson of NodeJS. Here's what I learned: Michael @maximilien spent 12 months shipping production RAG to multiple customers. In our discussion, he told me that nothing on a leaderboard can predict what works until you evaluate
    Image
    700
  • user avatar
    Comet
    @Cometml
    May 2
    "Until you evaluate on your data, nothing else matters."
    user avatar
    Paul Iusztin
    @pauliusztin_
    May 1
    I’ve spent the last week interviewing @maximilien, former CTO at IBM and Chairperson of NodeJS Foundation, who has shipped production RAG to multiple customers over the past year. The lesson he kept circling back to is that until you evaluate on your customer’s data, nothing else
    Image
    690
  • Comet reposted
    user avatar
    Gideon M
    @gidim
    Apr 23
    As your agent matures, something shifts. You stop writing code, and start editing prompts, tweaking params, trying new tools, etc. The tooling for this phase sucks. Today, we’re fixing that. Announcing Agent Configuration + Agent Playground in Opik. 🧵
    Image
    29K

New to X?

Sign up now to get your own personalized timeline!

Create account

By signing up, you agree to the Terms of Service and Privacy Policy, including Cookie Use.

Terms·Privacy·Cookies·Accessibility·Ads Info·© 2026 X Corp.
Don't miss what's happening
People on X are the first to know.
Log inSign up