I tried TQ for vector search and my findings is not good, it is not worth it if you cannot use GPU, ...

netdur • yesterday at 10:27 PM • 2 replies • view on HN

I tried TQ for vector search and my findings is not good, it is not worth it if you cannot use GPU, however I got same quality of search as 32f using 8bit quant

I wrote ann ext for sqlite, using tq, I do save a lot on space but 32f is still faster despite everything I have tried

code here https://github.com/netdur/munind/tree/main/src/tq

Replies

ninja3925 • today at 1:05 AM

So i assumed it would get crushed by OPQ (which requires training)

teamchong • today at 12:16 AM

you’re right that 32f is faster on raw query time, quantization adds extra step. main benefit on download size since gzip won’t help much, which matters most in browser contexts

alt Hacker News

Replies