If the Internet Archive is already curated for content, then yeah, there is a 100% chance that there will be more curation of content.
Does anyone have any facts/citations on whether this is a myth/coping mechanism I created, or reality?
Not sure what the appeal of the public library is, when you can have your own.
This is not to disparage the tremendous work done, and still being done, by the IA. It's more me lamenting the trend in our society (and societies in general) to mentally babysit people lest their minds get exposed to something bad, with the implicit assumption that adult humans can't be trusted to see some stupid bs and react with "that was some stupid bs. I am moving it into the stupid bs bucket of things I know about".
The "bot" is wrong. Most of the crawl data used by the Internet Archive, particularly the Alexa crawls, isn't publicly accessible. (This is because some of it includes archived pages which have since been suppressed by the site owner - removing those pages from the archived crawl data isn't practical.)
https://archive.org/details/alexacrawls
Common Crawl data is public, but less comprehensive than IA - https://commoncrawl.org/