zlacker

[return to "OpenAI's board has fired Sam Altman"]
1. water-+As[view] [source] 2023-11-17 22:28:12
>>davidb+(OP)
Someone probably already suggested this, but I haven’t seen it yet, so I’ll throw a wild speculation into the mix:

I saw a comment (that I can’t find now) wondering if Sam might have been fired for copyright reasons. Pretty much all the big corpuses that are used in LLM training contain copyrighted material, but that’s not a surprise and I really don’t think they’d kick him out over that. But what if he had a team of people deliberately adding a ton of copyrighted material - books, movies, etc - to the training data for ChatGPT? It feels like it might fit the shape of the situation.

◧◩
2. feralo+2C9[view] [source] 2023-11-20 11:36:38
>>water-+As
Also, it isn't uniquely attributable to Sam. They all do it, use copyrighted material, for training data. By "all", I mean all LLMs (to my knowledge). They don't do it intentionally, but it gets scooped up with everything else.

Hmmm, just thinking... Adam d'Angelo is one of the board members of OpenAI. He has the entire corpus of Quora content to use as training data, i.e. the rights to it are his. But I doubt that only Quora content was used by OpenAI during the past 8 years or so since it was founded! And the content on Quora isn't that great anyway...

[go to top]