zlacker

1. TruthW+(OP) 2024-02-13 18:54:53
I think a RAG approach with a vector DB is more likely. Just like when you add a file to your prompt / custom GPTs.

Adding the entire file (or memory in this case) would take up too much of the context. So just query the DB, and if there's a match, add it to the prompt once the conversation has started.
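
Something like this, roughly (toy sketch, not necessarily how OpenAI actually does it; embed() here is just a stand-in for a real embedding model, and the threshold / top-k numbers are made up):

    import zlib
    import numpy as np

    def embed(text, dim=256):
        # Stand-in for a real embedding model: hashed bag-of-words,
        # normalized to unit length so dot product = cosine similarity.
        v = np.zeros(dim)
        for tok in text.lower().split():
            v[zlib.crc32(tok.strip(".,!?").encode()) % dim] += 1.0
        n = np.linalg.norm(v)
        return v / n if n else v

    # Stored "memories" with precomputed vectors (the vector DB part).
    memories = ["Prefers cats over dogs", "Works as a data engineer"]
    memory_vecs = np.stack([embed(m) for m in memories])

    def build_prompt(user_message, threshold=0.3, top_k=2):
        # Query the store; only matches above the threshold get injected.
        q = embed(user_message)
        sims = memory_vecs @ q
        hits = [memories[i] for i in np.argsort(-sims)[:top_k] if sims[i] >= threshold]
        prefix = "Known about the user:\n" + "\n".join(f"- {m}" for m in hits) + "\n\n" if hits else ""
        return prefix + "User: " + user_message

    print(build_prompt("Should I get cats or dogs?"))

Nothing gets added unless the current message actually looks related to a stored memory, so the prompt stays small.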

replies(1): >>lxgr+n
2. lxgr+n 2024-02-13 18:56:58
>>TruthW+(OP)
These "memories" seem rather short, much shorter than the average document in a knowledge base or FAQ, for example. Maybe they do get compressed to embedding vectors, though.

I could imagine that once there are too many, it would indeed make sense to treat them as a database, though: "Prefers cats over dogs" is probably not salient information in too many queries.
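
Using the toy embed() from the sketch in the parent comment (the cutoff is purely my guess at how a salience filter could work):

    memory_vec = embed("Prefers cats over dogs")
    for msg in ["Should I get cats or dogs?", "How do I merge two git branches?"]:
        sim = float(embed(msg) @ memory_vec)
        print(f"{sim:.2f}  include={sim >= 0.3}  {msg!r}")

The pet question shares the cat/dog tokens with the stored memory and clears the cutoff; the git question doesn't, so that memory would only reach the prompt when it's plausibly relevant. A real embedding model would catch the match semantically even without shared words.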
