zlacker

[parent] [thread] 7 comments
1. naikro+(OP)[view] [source] 2022-10-16 20:50:48
who said they haven't.

for something to show up verbatim in the output of a textual AI model it needs to be an input many times.

I wonder if the problem is not copilot, but many people using this person's code without license or credit, and copilot being trained on those pieces of code as well. copilot may just be exposing a problem rather than creating one.

I don't know much about AI, and I don't use copilot.

replies(3): >>belorn+q2 >>make3+y4 >>akudha+Q4
2. belorn+q2[view] [source] 2022-10-16 21:10:32
>>naikro+(OP)
Microsoft have a public statement that they don't use proprietary code, only public code with public licenses. They have a lot of companies as customers who uses github, and they also use a lot third-party code in their own products.
replies(2): >>stefan+X4 >>pabs3+gz
3. make3+y4[view] [source] 2022-10-16 21:32:24
>>naikro+(OP)
there's exactly no way they have
replies(1): >>naikro+2p
4. akudha+Q4[view] [source] 2022-10-16 21:35:49
>>naikro+(OP)
With the amount of resources that Microsoft has, how hard can it be for them to exclude proprietary code that other people have stolen? I’d bet it is easy for them, but they won’t do it. Because they don’t care, because who is gonna take on them?

Will they “accidentally” include proprietary code from say, Oracle? Nope. They’ll make sure of it. But Joe Random? Sure

◧◩
5. stefan+X4[view] [source] [discussion] 2022-10-16 21:37:19
>>belorn+q2
Even BSD et. al. have attribution requirements - that must be a vanishingly small amount of code to be used. Me thinks the people who run GitHub (who have apparently decided to abandon the core business for the latest fun project) aren't being entirely upfront.
◧◩
6. naikro+2p[view] [source] [discussion] 2022-10-17 00:43:04
>>make3+y4
I'm curious how you could possibly know that for sure.
replies(1): >>make3+iG3
◧◩
7. pabs3+gz[view] [source] [discussion] 2022-10-17 02:25:05
>>belorn+q2
I thought they said all public repos without regard to the license they are under, which could be a proprietary EULA.
◧◩◪
8. make3+iG3[view] [source] [discussion] 2022-10-18 00:00:42
>>naikro+2p
because Microsoft is known to be extremely protective of their code. there is just no way they would expose their internal code to being straight up decoded from the model, while they can just train the model on the huge public data of GitHub
[go to top]