A wrong answer from Cache record #388

Closed

SimFG opened this issue May 25, 2023 · Discussed in #385 · 3 comments

Comments

@SimFG
Collaborator

SimFG commented May 25, 2023

Discussed in #385

Originally posted by terryweijian May 25, 2023
Hi

My script contains the following questions, and it runs them in several loops within one session:
1. 'what is TV ?'
2. 'can you explain what function of TV is ?'
3. 'can you tell me more about TV ?'
4. 'what is the function of money ?'

Questions 1-3 are all about TV, so in the first loop their cache answers are all linked to the answer of the first question. From the second loop onward, however, the second question gets linked to the fourth question, "what is the function of money", I guess because both contain the keyword "function". Is there a parameter that can control the weight of different keywords in the vector calculation?

The first loop:
Question: what is TV ?
local answer: TV is a television channel that broadcasts live television.
Local Time Spent = 0.2
Cache answer: TV is a television channel that broadcasts live television.
Cache Hit Time Spent = 0.39


Question: can you explain what function of TV is ?
local answer: a television channel
Local Time Spent = 0.09
Cache answer: TV is a television channel that broadcasts live television. # the answer is reasonable
Cache Hit Time Spent = 0.04


Question: can you tell me more about TV ?
local answer: a tv show
Local Time Spent = 0.11
Cache answer: TV is a television channel that broadcasts live television.
Cache Hit Time Spent = 0.03


Question: what is the function of money ?
local answer: money is a currency
Local Time Spent = 0.1
Cache answer: money is a currency
Cache Hit Time Spent = 0.11


The second loop:

Question: what is TV ?
local answer: TV is a television channel that broadcasts live television.
Local Time Spent = 0.17
Cache answer: TV is a television channel that broadcasts live television.
Cache Hit Time Spent = 0.03


Question: can you explain what function of TV is ?
local answer: a television channel
Local Time Spent = 0.1
Cache answer: money is a currency # The cache answer is incorrect: it links to question 4 just because both questions contain the keyword "function"
Cache Hit Time Spent = 0.04


Question: can you tell me more about TV ?
local answer: a tv show
Local Time Spent = 0.12
Cache answer: TV is a television channel that broadcasts live television.
Cache Hit Time Spent = 0.03


Question: what is the function of money ?
local answer: money is a currency
Local Time Spent = 0.11
Cache answer: money is a currency
Cache Hit Time Spent = 0.03


@SimFG
Collaborator Author

SimFG commented May 25, 2023

Answer:
There is currently no way to control the weight of different keywords in the vector calculation. What you can do is skip the cache search when you think the cached answer doesn't meet your requirements; the LLM result will still be saved to the cache for that call, so the next time you ask the same question you will get the accurate answer.

cache_skip param usage:

from gptcache.adapter import openai  # GPTCache's openai adapter, which understands cache_skip

openai.ChatCompletion.create(
    model="gpt-3.5-turbo",
    messages=[{"role": "user", "content": "what's github"}],
    cache_skip=True,  # skip the cache lookup; the LLM answer is still written to the cache
)
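
For completeness, here is a minimal sketch of the full flow, assuming the global cache has been initialized (the embedding function and data manager will depend on your own setup): skip the lookup once to overwrite a bad hit, then ask again normally and get the corrected answer from the cache.

from gptcache import cache
from gptcache.adapter import openai

cache.init()            # default init; plug in your own embedding_func / data_manager as needed
cache.set_openai_key()  # reads OPENAI_API_KEY from the environment

question = "can you explain what function of TV is ?"

# 1. The cached answer was wrong, so skip the search; the fresh LLM answer is still stored.
openai.ChatCompletion.create(
    model="gpt-3.5-turbo",
    messages=[{"role": "user", "content": question}],
    cache_skip=True,
)

# 2. Ask again without cache_skip: this time the lookup returns the corrected answer.
openai.ChatCompletion.create(
    model="gpt-3.5-turbo",
    messages=[{"role": "user", "content": question}],
)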

@ht0rohit

Hi @SimFG, can I only use the cache_skip parameter inside the openai create functions, or will it work with LangChain agents too? I would like to do something like the code below:

# Imports assumed for this snippet; CONF, model, tokenizer, temperature, question,
# get_content_func, cache_huggingface, data_manager and le() are defined elsewhere in the script.
from transformers import pipeline
from langchain.llms import HuggingFacePipeline
from gptcache import Cache
from gptcache.adapter.langchain_models import LangChainLLMs
from gptcache.similarity_evaluation.distance import SearchDistanceEvaluation

pipe = pipeline(
    CONF['MODEL']['pipeline'], model=model, tokenizer=tokenizer,
    max_length=CONF['MODEL']['max_length'], temperature=temperature,
    top_p=CONF['MODEL']['top_p'], num_beams=CONF['MODEL']['num_beams'],
    early_stopping=le(CONF['MODEL']['early_stopping'])
)
llm = HuggingFacePipeline(pipeline=pipe)
cached_llm = LangChainLLMs(llm=llm)

llm_cache = Cache()
llm_cache.init(
    pre_embedding_func=get_content_func,
    embedding_func=cache_huggingface.to_embeddings,
    data_manager=data_manager,
    similarity_evaluation=SearchDistanceEvaluation(
        max_distance=CONF['GPTCACHE']['SDE_max_distance'], positive=False
    )
)

response = cached_llm(question, cache_obj=llm_cache, cache_skip=True)

@SimFG
Collaborator Author

SimFG commented Jul 10, 2023

@ht0rohit Yes, that will work. Are you running into any problems?
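
For anyone landing here later, a small sketch of how cache_skip might be combined with the workaround above on the LangChain path. It builds on ht0rohit's snippet (cached_llm and llm_cache as defined there); looks_wrong is a hypothetical placeholder for whatever check you use to decide the cached answer is off.

def ask(question, force_refresh=False):
    # When force_refresh is True the cache search is skipped, but the fresh
    # LLM answer is still written back to the cache for future calls.
    return cached_llm(question, cache_obj=llm_cache, cache_skip=force_refresh)

answer = ask("can you explain what function of TV is ?")
if looks_wrong(answer):  # looks_wrong is your own validation, not a GPTCache API
    answer = ask("can you explain what function of TV is ?", force_refresh=True)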
