PinnedInVector Database for AIbyMilvus·Dec 5, 2022What is a Vector Database?An introduction to the concepts related to vector database.A response icon2A response icon2
Milvus·1d agoBeyond the TurboQuant-RaBitQ Debate: Why Vector Quantization Matters for AI Infrastructure CostsGoogle’s TurboQuant paper (ICLR 2026) reported 6x KV cache compression with near-zero accuracy loss — results striking enough to wipe $90…
Milvus·1d agoIs MCP Dead? What We Learned Building with MCP, CLI, and Agent SkillsWhen Perplexity’s CTO Denis Yarats said at ASK 2026 that the company was deprioritizing MCP internally, it set off the usual cycle. YC CEO…
Milvus·2d agoHarness Engineering: The Execution Layer AI Agents Actually NeedMitchell Hashimoto built HashiCorp and co-created Terraform. In February 2026, he published a blog post describing a habit he’d developed…
Milvus·2d agoInterview with RaBitQ Authors: The TurboQuant Dispute and Why the Storage Selloff Was a False AlarmGoogle’s TurboQuant paper claimed 6x compression, 8x speedup, and near-zero accuracy loss for vector representations. After it was…
Milvus·3d agoWe Built Graph RAG Without the Graph DatabaseTL;DR: Do you actually need a graph database for Graph RAG? No. Put entities, relations, and passages into Milvus. Use subgraph expansion…
Milvus·3d agoHow to Fix Hermes Agent’s Learning Loop with Milvus 2.6 Hybrid SearchHermes Agent has been everywhere lately. Built by Nous Research, Hermes is a self-hosted personal AI agent that runs on your own hardware…
Milvus·4d agoClaude Context: Reduce Claude Code Token Usage with Milvus-Powered Code RetrievalLarge context windows make AI coding agents feel limitless, right up until they start reading half your repository to answer one question…
Milvus·4d agoHow to Add Long-Term Memory to Anthropic’s Managed Agents with MilvusAnthropic’s Managed Agents make agent infrastructure resilient. A 200-step task now survives a harness crash, a sandbox timeout, or a…
Milvus·Jun 11DeepSeek V4 vs GPT-5.5 vs Qwen3.6: Which Model Should You Use?New model releases are moving faster than production teams can evaluate them. DeepSeek V4, GPT-5.5, and Qwen3.6–35B-A3B all look strong on…