zlacker

[parent] [thread] 2 comments
1. knowit+(OP)[view] [source] 2025-08-28 03:52:37
if very few humans use it, how did AI learn to use it since it was trained on mostly human writing?
replies(2): >>kahirs+a5 >>saitho+r6
2. kahirs+a5[view] [source] 2025-08-28 04:53:08
>>knowit+(OP)
Professional writers and editors use it.
3. saitho+r6[view] [source] 2025-08-28 05:07:08
>>knowit+(OP)
The same way it learned to act like a personal assistant, even though very few humans are personal assistants.

The LLM is first trained as an extreneley large Markov model predicting text scraped from the entire Internet. Ideally, a well trained such Markov model would use em dashes approximately as frequently as they appear in real texts.

But that model is not the LLM you actually interact with. The LLM you interact with is trained by somethig called Reinforcement Learning from Human Feedback, which involves people reading, rating and editing its responses, biasing the outputs and giving the model a "persona".

That persona is the actual LLM you interact with. Since em dash usage was rated highly by the people providing the feedback, the persona learned to use it much more frequently.

[go to top]