zlacker

[parent] [thread] 1 comments
1. bshipp+(OP)[view] [source] 2023-06-14 02:29:19
Pushshift.io used to host a complete repository for Reddit. I think there were archives on the internet archive as well. There are numerous torrents with terabytes of text content for training AIs. Perhaps they might lock it down going forward, but language training horse fled and was eaten by coyotes a decade ago.
replies(1): >>rgavul+Ky1
2. rgavul+Ky1[view] [source] 2023-06-14 14:36:30
>>bshipp+(OP)
More (and more recent) content will be need for further training of the models for them to stay competitive and up to date.
[go to top]