
Question about Classifier-free Guidance #343

Open

lucala opened this issue Apr 15, 2023 · 1 comment

Comments

lucala commented Apr 15, 2023

When embedding my text for conditioning, the trick for classifier-free guidance is to sometimes drop the embedding (usually 10% of the time).

My question is: what does "drop" mean? I have come across two variants: substituting a random tensor, or substituting a zero tensor.

GLIDE mentions in section 2.3 that "we sometimes replace text captions with an empty sequence". That would be a third option: using the embedding of the empty string.

I haven't been able to find an explanation of this anywhere; does anyone know?
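For reference, the three variants can be sketched as a training-time dropout step. This is a minimal, framework-agnostic sketch in numpy; the function name `drop_condition` and the tensor shapes are my own assumptions, not from any particular codebase:

```python
import numpy as np

def drop_condition(text_emb, null_emb, p_drop=0.1, mode="zero", rng=None):
    """Randomly replace per-sample conditioning embeddings for CFG training.

    text_emb: (batch, seq, dim) text embeddings
    null_emb: (seq, dim) embedding of the empty sequence (used when mode="empty")
    mode: "zero" | "random" | "empty" -- the three variants discussed above
    """
    if rng is None:
        rng = np.random.default_rng()
    # Per-sample coin flip: True means this sample's condition gets dropped.
    drop = rng.random(text_emb.shape[0]) < p_drop
    if mode == "zero":
        null = np.zeros_like(text_emb)
    elif mode == "random":
        null = rng.standard_normal(text_emb.shape)
    elif mode == "empty":
        null = np.broadcast_to(null_emb, text_emb.shape)
    else:
        raise ValueError(f"unknown mode: {mode}")
    # Broadcast the per-sample mask over (seq, dim).
    return np.where(drop[:, None, None], null, text_emb)
```

Whatever `mode` is chosen, the same replacement embedding has to be reused as the "unconditional" input at sampling time.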


fostiropoulos commented Jan 2, 2024

@varunponda that reads like a ChatGPT answer lol

@lucala I think it does not matter which method is used for the "dropped" embedding, as long as it is consistent between training and sampling. I do not have experiments on this myself, but it makes sense.
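To make the consistency point concrete: at sampling time, classifier-free guidance runs the model on both the real condition and the same null embedding used during training, then extrapolates between the two predictions. A minimal sketch (the `model` callable and `cfg_eps` name are hypothetical placeholders, and the formula is the standard CFG combination eps_uncond + w * (eps_cond - eps_uncond)):

```python
import numpy as np

def cfg_eps(model, x_t, t, cond_emb, null_emb, guidance_scale=3.0):
    """Classifier-free-guided noise prediction.

    null_emb must be the SAME replacement embedding used when dropping
    conditions during training (zeros, random, or empty-string embedding).
    """
    eps_cond = model(x_t, t, cond_emb)      # conditional prediction
    eps_uncond = model(x_t, t, null_emb)    # unconditional prediction
    return eps_uncond + guidance_scale * (eps_cond - eps_uncond)
```

If training drops to zeros but sampling passes, say, the empty-string embedding, `eps_uncond` no longer matches the unconditional distribution the model learned, which is why the two sides must agree.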
