zlacker

[parent] [thread] 0 comments
1. GaggiX+(OP)[view] [source] 2022-05-24 06:45:09
If the model takes text embeddings/tokens as an input, it can create a connection between the caption and the text on the image (sometimes they are really similar).
[go to top]