zlacker

[parent] [thread] 6 comments
1. matsem+(OP)[view] [source] 2023-05-10 13:37:10
Reading the title I thought you meant the opposite.

Aka, an ai.txt file that disallow ai to train or use your data similar to robots.txt (but for cases when you still want to be crawled, just not extrapolated)

replies(4): >>devd00+I >>revico+P6 >>rglove+TV >>splix+W81
2. devd00+I[view] [source] 2023-05-10 13:40:15
>>matsem+(OP)
I thought the exact same. Creating a new type of robots.txt but making it do the opposite does not make sense.
3. revico+P6[view] [source] 2023-05-10 14:08:09
>>matsem+(OP)
Feels like an enhancement to a sitemap.xml could be a better way to go here.

https://developers.google.com/search/docs/crawling-indexing/...

4. rglove+TV[view] [source] 2023-05-10 17:46:25
>>matsem+(OP)
I've been (slowly) writing a new type of OSS license around this exact concept so it's easier to (legally) stop LLMs hoovering up IP [1] (under "derivative works not permitted").

[1] https://github.com/cheatcode/joystick/blob/development/LICEN...

replies(1): >>remram+WZ1
5. splix+W81[view] [source] 2023-05-10 18:40:54
>>matsem+(OP)
I guess the good part that in ai.txt you can talk to AI. So if you want you can tell it to not crawl or make other agreements with it, just in plain english. What a time to be alive.
◧◩
6. remram+WZ1[view] [source] [discussion] 2023-05-10 23:16:19
>>rglove+TV
They've been ingesting "all rights reserved" content because they think copyright doesn't apply. Licenses won't help.
replies(1): >>rglove+Z84
◧◩◪
7. rglove+Z84[view] [source] [discussion] 2023-05-11 14:56:13
>>remram+WZ1
We'll see. I think courts will end up interpreting it in the same way that they do music sampling other music. In effect that's all it is: a remix of existing information.
[go to top]