zlacker

[parent] [thread] 2 comments
1. threec+(OP)[view] [source] 2025-08-22 01:51:00
The fact that GitHub’s Copilot has an enterprise feature that matches model output against code having certain licenses - in order to prevent you from using it, with a notification - suggests the model outputs are at least potentially infringing.

If MS were compelled to reveal how these completions are generated, there’s at least a possibility that they directly use public repositories to source text chunks that their “model” suggested were relevant (quoted as it could be more than just a model, like vector or search databases or some other orchestration across multiple workloads).

replies(2): >>martin+lK >>ineeda+DU
2. martin+lK[view] [source] 2025-08-22 11:20:35
>>threec+(OP)
> directly use public repositories

I don't see why a company which has been waging a multi decade war against GPL and users' rights would stop at _public_ repositories.

3. ineeda+DU[view] [source] 2025-08-22 12:34:25
>>threec+(OP)
> suggests the model outputs are at least potentially infringing.

The only thing it suggests is that they recognize that a subset of users worry about it. Whether or not GitHub worries about it any further isn’t suggested.

Don’t think about it from an actual “rights” perspective. Think about the entire copyright issue as a “too big to fail” issue.

[go to top]