zlacker

[return to "OpenAI negotiations to reinstate Altman hit snag over board role"]
1. agnost+yS 2023-11-20 01:27:50
>>himara+(OP)
Many are speculating that Sam Altman could just move on and create another OpenAI 2.0 because he could easily attract talent and investors.

What this misses is all the regulatory capture that he’s been campaigning for. All the platforms have now closed their gardens. Authors and artists are much more vigilant about copyright etc. So it’s now a totally different game compared to 3 years ago because the data is not just there up for grabs anymore.

◧◩
2. Shekel+nV 2023-11-20 01:47:30
>>agnost+yS
I don't think getting training data is that hard, still. The biggest platforms that locked down their public APIs still use those APIs for their mobile apps, and the apps can easily be reverse engineered to find keys or undocumented endpoints (or, in the case of reddit, an entirely different internal API with fewer limits and a lot more info leaks...)
◧◩◪
3. monoca+N21 2023-11-20 02:44:46
>>Shekel+nV
Easier than that would just be downloading the torrent of all of Reddit through Sept 2023.

https://academictorrents.com/details/89d24ff9d5fbc1efcdaf9d7...

◧◩◪◨
4. q7xvh9+w31 2023-11-20 02:49:59
>>monoca+N21
That's fascinating that the total size is so tiny — only 2.4 TB‽

I assume this must be only the text portion, and heavily compressed?

◧◩◪◨⬒
5. lxgr+t81 2023-11-20 03:30:37
>>q7xvh9+w31
Text really doesn't take up that much space, and in addition it compresses pretty well.

For example, the entire English-language Wikipedia is only around 60GB in a format that can be readily searched and randomly accessed (ZIM): https://kiwix.org/
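
To make the compression point concrete, here is a minimal sketch using Python's standard zlib module. The helper name compression_ratio and the sample string are made up for illustration, and the sample is a single sentence repeated many times, so it overstates what real prose would achieve; treat it as a way to measure your own text, not as a benchmark.

```python
import zlib


def compression_ratio(text: str, level: int = 9) -> float:
    """Return raw UTF-8 size divided by zlib-compressed size (hypothetical helper)."""
    raw = text.encode("utf-8")
    packed = zlib.compress(raw, level)
    return len(raw) / len(packed)


# Illustrative sample only: repeating one sentence makes the text far more
# redundant than real forum posts or articles would be.
sample = (
    "Text really does not take up that much space, "
    "and in addition it compresses pretty well. "
) * 200

raw_size = len(sample.encode("utf-8"))
print(f"{raw_size} bytes raw, about {compression_ratio(sample):.1f}x smaller compressed")
```

Swapping the sample for a real text dump (a saved article, a comment export) gives a more honest ratio, and dedicated archive formats may use different compressors than zlib.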

◧◩◪◨⬒⬓
6. lmm+p91 2023-11-20 03:39:12
>>lxgr+t81
Does Kiwix actually work? I see people hyping it here, but I could never get it to actually, y'know, download the file and display Wikipedia on my phone.
◧◩◪◨⬒⬓⬔
7. vatuei+Sa1 2023-11-20 03:58:28
>>lmm+p91
Kiwix worked for me. IIRC there may be difficulties opening an archive that was downloaded outside of the mobile app, but archives downloaded in-app were fine.

For the mobile app I used one of the smaller Wikipedia subsets, since I didn't want to take up too much space on my phone. The full offline Wikipedia download is saved to my laptop.
