Skip to content

Sudden high search latencies with 50% used QueryNodes #33293

Discussion options

You must be logged in to vote

Hey !
It seems we finally stabilize this situation and understand a lot of stuff. Here is the summary of everything we did to conclude this issue:

  • Add 2 replicas to each collection: Better spread of search on all query nodes
  • Fix failed requests from querynode logs
    • "ef(100) should be larger than k(110)"
    • It was an issue on the client app side.
  • Set queryNode.scheduler.maxReadConcurrentRatio from 1 to 2: better use of the querynode's CPU
  • Increase dataCoord.segment.maxsize from 512 Mo to 2Gb as we have queryNode with 32GB Ram : Divide by 4 the number of segments > Improve the search latency
  • Define queryNode pod AntiAffinity to enforce their placement on different Kubernetes worker nodes.
  • I…

Replies: 8 comments 13 replies

Comment options

You must be logged in to vote
3 replies
@flsworld
Comment options

@yhmo
Comment options

@Archalbc
Comment options

Comment options

You must be logged in to vote
1 reply
@flsworld
Comment options

Comment options

You must be logged in to vote
7 replies
@flsworld
Comment options

@flsworld
Comment options

@xiaofan-luan
Comment options

@xiaofan-luan
Comment options

@flsworld
Comment options

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
1 reply
@xiaofan-luan
Comment options

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
1 reply
@Archalbc
Comment options

Comment options

You must be logged in to vote
0 replies
Answer selected by Archalbc
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
4 participants