Hacker News Viewer

The future of code search is not regex – 100x faster than ripgrep

by neogoose on 4/2/2026, 3:45:22 AM

https://fff.dmtrkovalenko.dev/

Comments

by: pjmlp

It has never been ripgrep for decades for those of us on IDEs.

4/2/2026, 6:58:17 AM


by: kristopolous

I ran across this fascinating tool a few days ago researching embedding models on hugging face.<p>Advertised as &quot;ColGREP Semantic code search for your terminal and your coding agents&quot;,<p>I haven&#x27;t put it in any harness yet but I probably should.<p><a href="https:&#x2F;&#x2F;github.com&#x2F;lightonai&#x2F;next-plaid&#x2F;tree&#x2F;main&#x2F;colgrep" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;lightonai&#x2F;next-plaid&#x2F;tree&#x2F;main&#x2F;colgrep</a><p>I&#x27;ve also tried astgrep (also known as sg) but llms really mess up on them. I think you&#x27;d need to fine tune.<p>If anyone has cracked that case I&#x27;d love to hear about it

4/2/2026, 5:04:14 AM


by: genewitch

considering that ripgrep has marginal overhead over just reading the files to &#x2F;dev&#x2F;null, how exactly does this achieve 100x speedup?<p>I have a lot of use for something that can search ~1GB of text &quot;instantly&quot;, but so far nothing beats rg&#x2F;ag after the data has been moved into RAM.

4/2/2026, 6:14:45 AM


by: swiftcoder

Is there a write up of the underlying approach? The summary on the repo mentioned SIMD, but not a whole lot else.

4/2/2026, 6:40:59 AM


by: neogoose

I have open sourced the fastest code search implementation. Comprehensive SDK for both file finder and grep file search that is over 100x faster than ripgrep

4/2/2026, 3:45:22 AM


by: dig1

ctags, GNU Global and even &quot;ugrep -Q&quot; would like to have a few words with you ;)

4/2/2026, 6:37:08 AM


by: globular-toast

Why is it &quot;for neovim&quot;? Surely such a thing would be useful in many applications?

4/2/2026, 6:05:58 AM


by: asdfadsfaf

I don&#x27;t get it how can I search anything but the file name?

4/2/2026, 6:33:31 AM


by: schrodinger

How&#x27;s it work? Embed tokens and use euclidean distance or something?

4/2/2026, 5:24:07 AM