Kilo
4,086 posts
Kilo
@kilocode
Kilo is the all-in-one agentic engineering platform. 3M+ Kilo Coders. Open source since day one!
VS Code, JetBrains, CLI, Cloud
kilo.ai
Joined March 2025
221 Following
26.4K Followers
  • Pinned
    Kilo
    @kilocode
    12h
    Article
    Grok Build 0.1 Beat Every Frontier Model in a Kilo Code Reviews Test
    TL;DR: In a Kilo Code Reviews test, Grok Build 0.1 caught all 10 planted bugs in a React/TypeScript app, beating every frontier model on coverage and cost ($0.29 vs $0.45 for the next-best on...
    25K
  • Kilo
    @kilocode
    1h
    We gave GLM-5.2 and Kimi K2.7 Code the same backend task: plan a feature flag service, then build it. What separated them was decision-making, not code quality. GLM's plan scored 9.0. Kimi's scored 8.1.
    Line chart titled “Agentic Coding Performance by Effort Level.” The chart compares four AI models across different effort settings using average benchmark scores versus average output tokens per task. The x-axis shows average output tokens (10k–90k), and the y-axis shows score percentages (45–85%). GLM-5.2 (blue) improves from about 63% at 35k tokens in non-thinking mode to roughly 72% at 44k tokens in high effort mode and nearly 75% at 86k tokens in max effort mode. GLM-5.1 (green) rises from about 53% at 33k tokens to 58% at 46k tokens. Claude Opus 4.8 (dark gray) starts around 71% at 23k tokens, reaches about 78% at 40k tokens, and remains flat through 88k tokens. Claude Opus 4.7 (light gray) increases from about 61% at 15k tokens to 68% at 30k tokens and 71% at 50k tokens. The chart suggests higher effort levels generally improve performance, with Claude Opus 4.8 achieving the highest scores overall and GLM-5.2 showing strong gains as effort increases.
    975
    Kilo
    @kilocode
    1h
    Replying to @kilocode
    Then we handed GLM's winning plan to both models and had each build it from a blank folder. Same 200 users, flag at 35% rollout. Both services turned it on for the exact same 77 users, down to the individual ID. GLM passed all 15 live checks. Kimi passed 14.
    Kilo Code CLI terminal session showing GLM-5.2 building the feature flag service from the plan, with output from the build run.
    224
    Kilo
    @kilocode
    1h
    The takeaway: once a plan pins down the hard decisions, the model doing the build matters a lot less. The strongest planner writes less and decides more, leaving fewer open questions for whoever builds from it. Full breakdown from @TheBinaryNeuron:
    GLM-5.2 vs Kimi K2.7 Code: Which Model Is Better at Planning vs Building?
    From blog.kilo.ai
    190
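The thread above reports that two independently built services enabled the flag for exactly the same 77 of 200 users at a 35% rollout. That outcome is what you would expect when a plan pins down a deterministic bucketing rule. The posts do not say which rule the plan specified, so the following is only a minimal sketch under assumptions: the SHA-256 hash, the `flag_key:user_id` input format, the 100-bucket scheme, and the names `in_rollout` and `enabled_users` are all hypothetical.

```python
import hashlib


def in_rollout(flag_key: str, user_id: str, percent: int) -> bool:
    """Return True if this user falls inside the rollout percentage.

    Assumed scheme (not taken from the source): hash the flag key and
    user ID together, map the hash to one of 100 buckets, and enable the
    flag for buckets below the rollout percentage.
    """
    digest = hashlib.sha256(f"{flag_key}:{user_id}".encode("utf-8")).digest()
    bucket = int.from_bytes(digest[:8], "big") % 100  # stable bucket in 0..99
    return bucket < percent


def enabled_users(flag_key: str, user_ids: list[str], percent: int) -> list[str]:
    """Return the users for whom the flag is on, preserving input order."""
    return [u for u in user_ids if in_rollout(flag_key, u, percent)]


# Hypothetical population of 200 users, mirroring the size used in the thread.
users = [f"user-{i}" for i in range(1, 201)]
first_build = enabled_users("new-checkout", users, 35)
second_build = enabled_users("new-checkout", users, 35)
print(len(first_build), first_build == second_build)
```

Because the bucket depends only on the flag key and the user ID, any two implementations that follow the same rule select the same users. The share selected only approximates the configured percentage for a small population, which would be consistent with 77 of 200 landing near, but not exactly on, 35%.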
  • Kilo
    @kilocode
    13h
    A black hole simulator from a screenshot drop. @MiniMax_AI M3’s vision did the heavy lifting and the whole build ran $0.53.
    Brian Turcotte
    @coldopn
    13h
    Frontier doesn't only mean Anthropic and OpenAI anymore. I built this black hole simulator by simply dropping an illustration screenshot into Kilo Code, switching to @MiniMax_AI M3, and prompting it to "animate this screenshot into a working black hole simulator". M3's vision
    [Video]
    1.6K
  • Kilo
    @kilocode
    23h
    Did SpaceX buy Cursor because Le Chanton Cat was getting too close to AGI?
    8.4K
    Kilo
    @kilocode
    16h
    For anyone arriving from the cat: our actual read on what the $60B Cursor deal means for model choice, and why we think vertically integrated stacks are the wrong bet.
    SpaceX Just Bought Cursor for $60 Billion. Why the Deal Matters.
    From blog.kilo.ai
    714
  • Kilo
    @kilocode
    16h
    Three real sites. Three prompts. $0. This is Step 3.7 Flash, free in Kilo right now, and @coldopn is the one putting it through the work. He's been shipping builds like this constantly lately. Follow him if you want the receipts. P.S. We're also giving away $500 to two people
    Brian Turcotte
    @coldopn
    17h
    Why aren't more people talking about @StepFun_ai? Their latest model Step 3.7 Flash just made me 3 beautiful websites in less than 10 minutes:
    1. Interactive Resume/CV
    2. Landing Page for a Car Wash business
    3. Custom Wedding Site
    Total Price: $0
    Total # of Prompts: 3
    Step
    [Video]
    1.4K
  • Kilo
    @kilocode
    17h
    25K stars is close! When we hit it, we're sending $500 in Kilo credits each to two stargazers as a thank-you.
    Black promotional graphic for a Kilo Code GitHub giveaway celebrating an upcoming milestone. Large white and yellow text reads “25K stars, $1,000.” Supporting text explains that when the Kilo Code GitHub repository reaches 25,000 stars, two stargazers will each receive $500 in Kilo credits. The lower section reiterates the prize details and includes the repository URL, github.com/Kilo-Org/kilocode, in yellow text. A dark callout box on the right highlights the goal: “25K stars.” The Kilo logo appears in the top-right corner.
    2.1K
    Kilo
    @kilocode
    17h
    See where the count's at!
    github.com
    GitHub - Kilo-Org/kilocode: Kilo is the all-in-one agentic engineering platform. Build, ship, and...
    Kilo is the all-in-one agentic engineering platform. Build, ship, and iterate faster with the most popular open source coding agent. - Kilo-Org/kilocode
    595
  • Kilo
    @kilocode
    18h
    Next: SpaceX trying to acquire Mistral's cat?
    [Image, labeled "Made with AI"]
    707
  • Kilo
    @kilocode
    21h
    EU teams: you can now run top open-weight models with data that never leaves Europe. We've partnered with @inceptron for sovereign, EU-hosted inference in the Kilo Gateway. Kimi K2.6, GLM 5.1, MiniMax M2.5. GDPR and ISO compliant. MiniMax M2.5 starts at $0.15/1M input tokens.
    Graphic on a light gray background featuring the Inceptron logo at the top, consisting of a geometric blue, purple, and pink icon next to the word “INCEPTRON” in black uppercase letters. A large black “X” appears in the center, indicating a collaboration. Below it is the Kilo Code logo, showing a black-and-yellow rectangular banner with the Kilo icon on the left and the words “Kilo Code” in a pixel-style yellow font on a black background.
    1.4K
    Kilo
    @kilocode
    21h
    More on what @inceptron brings:
    > Data residency you can lock to EU-only
    > ISO-aligned infrastructure
    > Open-weight models devs are already running daily
    Full breakdown from @arimesser:
    Kilo Partners with Inceptron for High-Performance EU Inference
    From blog.kilo.ai
    448
  • Kilo
    @kilocode
    Jun 17
    $15 on Opus. $1.70 on the same task with a cheaper model. Most of what an agent does all day doesn't need a frontier model. You're just paying like it does. 4 levers that cut your AI bill:
    [Video]
    2.2K
    Kilo
    @kilocode
    Jun 17
    The full model-cost math, broken down by @coldopn:
    4 Levers to Take Control of Your AI Spend
    From blog.kilo.ai
    497