zlacker
1 comments
1. throwa+(OP)
2025-08-30 10:24:07
The training sets of most LLMs contain a copious amount of content from Libgen (or now Anna's Archive), much of it literary writing, where em dashes are used frequently.
2. nullc+6d1
2025-08-30 21:24:53
>>throwa+(OP)
Who the hell knows how the initial biases of LLMs broke.
My IRC name (gmaxwell) is a token in the GPT-3 tokenizer.