zlacker

[parent] [thread] 3 comments
1. ASalaz+(OP)[view] [source] 2023-06-14 02:14:51
It seems like they realized the value of user content for AI training too late, and are trying to hamfistedly lock down the golden egg goose for their upcoming IPO.
replies(2): >>bshipp+P1 >>adrr+p2
2. bshipp+P1[view] [source] 2023-06-14 02:29:19
>>ASalaz+(OP)
Pushshift.io used to host a complete repository for Reddit. I think there were archives on the internet archive as well. There are numerous torrents with terabytes of text content for training AIs. Perhaps they might lock it down going forward, but language training horse fled and was eaten by coyotes a decade ago.
replies(1): >>rgavul+zA1
3. adrr+p2[view] [source] 2023-06-14 02:34:35
>>ASalaz+(OP)
Adding .json to any url still gives you everything in json. This has to do with 3rd party clients not showing ads and making up the revenue lost.
◧◩
4. rgavul+zA1[view] [source] [discussion] 2023-06-14 14:36:30
>>bshipp+P1
More (and more recent) content will be need for further training of the models for them to stay competitive and up to date.
[go to top]