zlacker

[return to "Tell HN: We should start to add “ai.txt” as we do for “robots.txt”"]
1. jeroen+mf[view] [source] 2023-05-10 13:42:39
>>Jeanne+(OP)
If AI needs explicit information and context, surely it should focus on improving its context recognition rather than trying to fix that by inserting even more training data.

Regardless, I do agree that something like a robots.txt for AI can be very useful. I'd like my website to be excluded from most AI projects and some kind of standardized way to communicate this preference would be nice, although I realize most AI projects don't exactly care about things like the wishes of authors, copyright, or ethical considerations. It's the idea that matters, really.

If I can use an ai.txt to convince the crawlers that my website contains illegal hardcore terrorist pornography to get it excluded from the datasets, that's another way to accomplish this I suppose.

◧◩
2. LawTal+rW[view] [source] 2023-05-10 16:45:51
>>jeroen+mf
> focus on improving its context recognition rather than trying to fix that by inserting even more training data.

That's how you improve its context recognition. You show it many contexts.

> most AI projects don't exactly care about things like the wishes of authors, copyright, or ethical considerations

Why is it 'ethical' that you get to add a bunch of restrictions to a pre-negotiated situation? You get copyright protections in trade for letting people use your work. There's a way to add restrictions - licensing - and you're looking to get the benefits of licensing, and to take away fair use right from other people, without paying the costs of doing so.

fwiw, I copy most pages I visit and store them. The website has given me the equivalent of a pamphlet and I store it instead of discarding it when I'm finished. This way I can go back and read it again later without having to track down the author and ask for another copy. It's not AI which has me doing this, I've been doing it for decades - it's censorship that has shown me the need.

[go to top]