zlacker

[return to "GitHub Copilot, with “public code” blocked, emits my copyrighted code"]
1. mjr00+n7[view] [source] 2022-10-16 20:36:20
>>davidg+(OP)
Same issue with Stable Diffusion/NovelAI and certain people's artwork (eg Greg Rutkowski) being obviously used as part of the training set. More noticeable in Copilot since the output needs to be a lot more precise.

Lawmakers need to jump on this stuff ASAP. Some say that it's no different from a person looking at existing code or art and recreating it from memory or using it as inspiration. But the law changes when technology gets involved already, anyway. There's no law against you and I having a conversation, but I may not be able to record it depending on the jurisdiction. Similarly, there's no law against you looking at artwork that I post online, but it's not out of question that a law could exist preventing you from using it as part of an ML training dataset.

◧◩
2. SrslyJ+jl[view] [source] 2022-10-16 22:43:06
>>mjr00+n7
> Some say that it's no different from a person looking at existing code or art and recreating it from memory or using it as inspiration.

Hah, no, the model encodes the code that it was trained on. This is not "recreating from memory", this is "making a copy of the code in a different format." (Modulo some variable renaming, which it's probably programmed to do to in order to obscure the source of the code.)

[go to top]