zlacker
[parent]
[thread]
0 comments
1. GaggiX+(OP)
[view]
[source]
2022-05-24 06:45:09
If the model takes text embeddings/tokens as an input, it can create a connection between the caption and the text on the image (sometimes they are really similar).
[go to top]