Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Ideas #27

Open
33 tasks
ankane opened this issue May 25, 2022 · 0 comments
Open
33 tasks

Ideas #27

ankane opened this issue May 25, 2022 · 0 comments

Comments

@ankane
Copy link
Member

ankane commented May 25, 2022

Please create a new issue to discuss any ideas or share your own.

HNSW

  • Add support for inline filtering
    • Scan only - hnsw-filtering branch
    • HQANN - hqann branch (WIP)
  • Use scan->kill_prior_tuple - kill-prior-tuple branch (WIP)
  • Add support for index-only scans - hnsw-index-only-v2 branch
  • Add support for INCLUDE clause with CREATE INDEX (not more performant since still accesses heap)
  • Improve numIndexTuples in cost estimation

IVFFlat

  • Add support for product quantization
  • Add support for another coarse quantizer
  • Add support for parallel k-means
  • Update parallel workers calculation - ivfflat-parallel-workers branch
  • Increase default value of ivfflat.probes
  • Add support for setting default probes on index level
  • Scan additional lists if number of entries is low after scanning probes
  • Use pairing heap for index scan for performance - stages and pairingheap branches
  • Use tuplesort_set_bound for performance - bound branch (not needed w/ pairing heap)
  • Add support for parallel index scans (planner gets cost estimate but doesn't use) - parallel-index-scan3 branch
  • Use Lloyd's or MO-Elkan's k-means when not enough memory for Elkan's
  • Use FB-Elkan's k-means for faster index creation
  • Use mini-batch k-means for index creation for reduced memory - minibatch branch
  • Add support for index-only scans - ivfflat-index-only branch
  • Add support for INCLUDE clause with CREATE INDEX
  • Prevent compression for index tuples - ivfflat-storage branch

Indexes

  • Include vectors with zero norm in indexes for cosine distance - index-zero-norm branch
  • Update cost estimation to not use index if LIMIT + OFFSET > expected tuples - index-limit/index-limit2 branch
  • Update cost estimation to not use index if large % of rows will be filtered by WHERE condition - hnsw-filtering-cost branch

Types

  • Add support for int8 vectors - intvec branch

Functions

Installation

@pgvector pgvector locked and limited conversation to collaborators May 25, 2022
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Development

No branches or pull requests

1 participant