zlacker
1 comments
1. howrar+(OP)
2023-11-18 16:01:38
Every token is already being generated with all previously generated tokens as inputs. There's nothing about the architecture that makes this hard. It just hasn't been trained on this kind of task.
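To make the first point concrete, here is a minimal sketch of an autoregressive decoding loop. `generate` and `toy_model` are hypothetical names, and `toy_model` is a stand-in for a real network (it just sums the prefix modulo 10); the only thing it is meant to show is that each step receives the whole sequence produced so far.

```python
from typing import Callable, List


def generate(next_token: Callable[[List[int]], int],
             prompt: List[int],
             steps: int) -> List[int]:
    """Autoregressive loop: every new token is computed from the full prefix."""
    tokens = list(prompt)
    for _ in range(steps):
        # The model is called on ALL tokens so far, including the ones
        # it generated itself in earlier iterations.
        tokens.append(next_token(tokens))
    return tokens


def toy_model(prefix: List[int]) -> int:
    # Hypothetical stand-in for a trained model: depends on the entire prefix.
    return sum(prefix) % 10


print(generate(toy_model, [1, 2], 3))  # [1, 2, 3, 6, 2]
```

Whether a trained model makes good use of its earlier output is a separate question from whether that output is available as input, which is the distinction the comment is drawing.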
replies(1):
>>peyton+DC1
2. peyton+DC1
2023-11-19 01:14:24
>>howrar+(OP)
Really? I don’t know of a positional encoding scheme that’ll handle this.