zlacker

[return to "Be a property owner and not a renter on the internet"]
1. rpcope+9j[view] [source] 2025-01-03 04:07:47
>>dend+(OP)
> Exploiting user-generated content.

You know, if I've noticed anything in the past couple years, it's that even if you self-host your own site, it's still going to get hoovered up and used/exploited by things like AI training bots. I think between everyone's code getting trained on, even if it's AGPLv3 or something similarly restrictive, and generally everything public on the internet getting "trained" and "transformed" to basically launder it via "AI", I can absolutely see why someone rational would want to share a whole lot less, anywhere, in an open fashion, regardless of where it's hosted.

I'd honestly rather see and think more about how to segment communities locally, and go back to the "fragmented" way things once were. It's easier to want to share with other real people than inadvertently working for free to enrich companies.

◧◩
2. baxtr+bx[view] [source] 2025-01-03 06:43:20
>>rpcope+9j
Part of me viscerally agrees because large corporations have monetized UGC.

Another part of me though thinks differently. We are a species that builds knowledge from generation to generation. From one person to another. Over years, over centuries.

Philosophically this part tends to think that your thoughts and ideas belong to humanity and thus need to be shared with all of us.

◧◩◪
3. friend+pD[view] [source] 2025-01-03 07:53:37
>>baxtr+bx
If you recall high school history, rapid, exponential "progress" happened once the knowledge was 1) written down (printing press) 2) archived for the future (libraries) 3) systematized (textbook/encyclopaedia) 4) proactively shared (public education), all on a massive scale.

The fact that some knowledge exists and is even accessible does not really matter if takes a highly trained in a very narrow field scholar to find that piece of information. You need a well established knowledge creation and distribution funnel in operation for humanity as a whole to reap the benefits of knowledge.

There is undoubtedly a lot of useful knowledge on internet platforms, however, most of that knowledge remains unsystematized and largely undiscoverable, meaning that contribution to the totality of human knowledge by these platforms is infinitesimal, which is further drowned by cat and porn videos.

◧◩◪◨
4. TeMPOr+gN[view] [source] 2025-01-03 09:44:01
>>friend+pD
Now we have 5) aggregated and internalized as a whole by computational constructs such as LLMs, which are - 4) - proactively shared (open weights, but also freemium service and dirt-cheap API access to commercial SOTA models), still on a massive scale.

> There is undoubtedly a lot of useful knowledge on internet platforms, however, most of that knowledge remains unsystematized and largely undiscoverable, meaning that contribution to the totality of human knowledge by these platforms is infinitesimal, which is further drowned by cat and porn videos.

Precisely that. Which is why I often argue, that for 99%+ of the content in the training data, its marginal contribution to the training process - itself infinitesimal in isolation - is still by far the most value that content will ever bring to the world.

[go to top]