turbovec Crushes Millions of Vectors Into a Fraction of the Memory and Still Beats FAISS

turbovec is a new open source vector index that squeezes a 10 million document corpus from 31 GB of memory down to roughly 4 GB while searching it faster than FAISS, the long standing industry standard from Meta. It is written in Rust with Python bindings, it has already pulled in more than 10,000 stars on GitHub, and it runs entirely on your own machine with no data leaving your hardware. For anyone building AI systems that lean on vector search, those numbers are hard to ignore.

What problem does turbovec solve?

Modern AI applications turn text, images, and audio into long lists of numbers called vectors, then search across millions of them to find the closest matches. That is how semantic search, recommendations, and retrieval for chatbots all work under the hood. The trouble is that storing those vectors at full precision eats enormous amounts of memory, and memory has gotten expensive this year. turbovec attacks the cost directly by compressing each vector down to between two and four bits per dimension, shrinking a typical 1,536 dimension vector by about 16 times, from thousands of bytes to a few hundred.

How does turbovec beat FAISS?

The engine is built on a Google Research algorithm called TurboQuant, presented at ICLR 2026, which is described as data oblivious. In plain terms that means it does not need a training step or a learned codebook before it can compress your data. You add new vectors at any time and they are indexed immediately, with no rebuilds as the collection grows. On top of that, the author hand wrote low level search routines tuned for both ARM and x86 processors, and benchmarks show them beating the equivalent FAISS routine by 12 to 20 percent on ARM while staying competitive on x86. You can also filter at search time, restricting results to a candidate set without taking a recall hit.

Why should non experts care about vector quantization?

This sounds deeply technical, and it is, but the takeaway is simple. The cost of running AI is increasingly about memory, not just raw computing power, and tools like turbovec push that cost down sharply while keeping everything local and private. That combination, cheaper and on your own machine, is exactly what smaller teams and privacy conscious builders have been waiting for.

It also says something about where open source AI infrastructure is heading. A single developer can now take a fresh research paper, implement it in Rust, hand tune the performance, and ship something that outperforms a tool maintained by one of the largest companies in the world. That is the quiet story underneath the star count, and it is worth paying attention to.

Check out what else is trending at GitHub