zlacker

[parent] [thread] 2 comments
1. andy99+(OP)[view] [source] 2023-12-27 22:07:59
> This copy is not a transitory copy in service of a fair use

Training is almost certainly fair use, so it's exactly a transitory copy in service of fair use. Training, other than the brief "transitory copy" you mention is not copying, it's making a minuscule algorithmic adjustment based on fleeting exposure to the data.

replies(2): >>twoodf+Vh >>edwint+ga1
2. twoodf+Vh[view] [source] 2023-12-28 00:20:22
>>andy99+(OP)
Why is training “almost certainly” fair use?

Congress took the circuit holding in MAI Systems seriously enough to carve out a new fair use exception for copying software—entirely within the memory system of a licensed user—in service of debugging it.

If it took an act of Congress to make “unlicensed” debugging a fair use copy…

3. edwint+ga1[view] [source] 2023-12-28 10:04:40
>>andy99+(OP)
If you overtrain the model may include verbatim copies of your training material, and may be able to produce verbatim copies of the original in its output.

If Microsoft truly believes that the trained output doesn't violate copyright then it should be forced to prove that by training it on all its internal source code, including Windows.

[go to top]