SVE vector optimization for halfvectors dot product calculation #536

Open
pashkinelfe wants to merge 2 commits into master

Conversation

pashkinelfe
Contributor

Hi, @ankane and @jkatz!

I optimized the dot product calculation for half vectors using the SVE extension, which is available on many of the ARM machines suitable for vector search.
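
To give an idea of the approach, below is a minimal sketch of a predicated SVE half-precision inner product loop. The function and variable names are illustrative, not the exact code in the patch, and a production version might widen the accumulator to float32 for precision:

    #include <arm_sve.h>

    /* Illustrative SVE half-precision inner product. Accumulates in
     * float16 for brevity; a production version might widen to float32. */
    static float
    HalfvecInnerProductSVE(const float16_t *ax, const float16_t *bx, int dim)
    {
        svfloat16_t sum = svdup_f16(0);
        int64_t     i = 0;
        svbool_t    pg = svwhilelt_b16(i, (int64_t) dim);

        while (svptest_any(svptrue_b16(), pg))
        {
            svfloat16_t a = svld1_f16(pg, ax + i);
            svfloat16_t b = svld1_f16(pg, bx + i);

            /* sum += a * b for active lanes; inactive lanes keep sum */
            sum = svmla_f16_m(pg, sum, a, b);

            i += svcnth();      /* number of 16-bit lanes per SVE vector */
            pg = svwhilelt_b16(i, (int64_t) dim);
        }

        /* Horizontal add across all lanes, widened to float on return */
        return (float) svaddv_f16(svptrue_b16(), sum);
    }

The predicate generated by svwhilelt_b16 handles the tail elements, so no separate scalar cleanup loop is needed and the same code works for any SVE vector length.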

  1. Testing HNSW build time (same test as in "Can I help get tinyint or half branches released?" #326 (comment) and "Parallel index builds for HNSW" #409 (comment)):

[image: HNSW build time benchmark results]

Results show that on the same machine (Graviton3):

  • index build time for half vectors using the default inner product function is better than for float32 vectors only at a high number of cores, possibly due to fewer locks, memory accesses, and disk I/O
  • index build time with the SVE inner product function for half vectors is better than with the default inner product function at any number of cores, even for a serial build
  2. I tried to add an SVE optimization for converting float32 <-> float16 but found no performance gain in
    insert into emb_f16_3 (vector) select vector::halfvec(1536) from emb;

Possibly this is because SVE intrinsics pay off only when many float16 numbers are processed together, as in the dot product calculation, so the conversion change is not included in the patch.
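
For context, one way such a float32 -> float16 conversion could be written with SVE intrinsics is sketched below; this is illustrative only, not the exact code from my experiment:

    #include <arm_sve.h>
    #include <stdint.h>

    /* Illustrative SVE float32 -> float16 conversion; dst receives the raw
     * binary16 bits. svcvt_f16_f32 places each result in the low 16 bits of
     * its 32-bit container, so svst1h stores exactly the converted values. */
    static void
    Float4ToHalfSVE(const float *src, uint16_t *dst, int count)
    {
        int64_t     i = 0;
        svbool_t    pg = svwhilelt_b32(i, (int64_t) count);

        while (svptest_any(svptrue_b32(), pg))
        {
            svfloat32_t v = svld1_f32(pg, src + i);
            svfloat16_t h = svcvt_f16_f32_x(pg, v);

            /* Narrowing store: keep the low 16 bits of each active lane */
            svst1h_u32(pg, dst + i, svreinterpret_u32_f16(h));

            i += svcntw();      /* number of 32-bit lanes per SVE vector */
            pg = svwhilelt_b32(i, (int64_t) count);
        }
    }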

I suggest that if the performance gain for (1) is good enough, we could add architecture checks to the patch. I'm not convinced the way I did it is optimal, so if anyone more experienced with these checks can suggest improvements, I would appreciate it very much.
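
As a starting point, the kind of runtime check I have in mind is sketched below; it is Linux/AArch64-specific and the function name is illustrative. The SVE code itself would additionally need a compile-time __ARM_FEATURE_SVE guard (or a separate compilation unit built with an -march value that enables SVE):

    #include <stdbool.h>

    #if defined(__linux__) && defined(__aarch64__)
    #include <sys/auxv.h>       /* getauxval, AT_HWCAP */
    #include <asm/hwcap.h>      /* HWCAP_SVE */
    #endif

    /* Illustrative runtime check for SVE support (Linux/AArch64 only);
     * other platforms would need their own detection or simply fall back
     * to the default scalar code. */
    static bool
    SupportsSVE(void)
    {
    #if defined(__linux__) && defined(__aarch64__) && defined(HWCAP_SVE)
        return (getauxval(AT_HWCAP) & HWCAP_SVE) != 0;
    #else
        return false;
    #endif
    }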

It may also be worth adding SVE-optimized cosine and L2 distance functions and testing those cases separately.
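
For reference, an SVE L2 distance would follow the same structure as the inner product sketch above, accumulating squared differences instead of products; again the names are illustrative and the caller would take the square root of the result:

    #include <arm_sve.h>

    /* Illustrative SVE squared L2 distance for half vectors. */
    static float
    HalfvecL2SquaredDistanceSVE(const float16_t *ax, const float16_t *bx, int dim)
    {
        svfloat16_t sum = svdup_f16(0);
        int64_t     i = 0;
        svbool_t    pg = svwhilelt_b16(i, (int64_t) dim);

        while (svptest_any(svptrue_b16(), pg))
        {
            svfloat16_t diff = svsub_f16_x(pg, svld1_f16(pg, ax + i),
                                           svld1_f16(pg, bx + i));

            /* sum += diff * diff for active lanes */
            sum = svmla_f16_m(pg, sum, diff, diff);

            i += svcnth();
            pg = svwhilelt_b16(i, (int64_t) dim);
        }

        return (float) svaddv_f16(svptrue_b16(), sum);
    }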
