
[return to "Tell HN: We should start to add “ai.txt” as we do for “robots.txt”"]
1. qbasic+Ol[view] [source] 2023-05-10 14:11:34
>>Jeanne+(OP)
HTML already has semantic meta elements such as author and description, and you should be populating them with info like that: https://developer.mozilla.org/en-US/docs/Learn/HTML/Introduc...
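
To make that concrete, here is a minimal sketch of how a crawler might read those meta elements, using only Python's standard-library html.parser. The sample page, its author and description values, and the MetaCollector and read_meta names are all made up for illustration.

```python
from html.parser import HTMLParser

# Hypothetical sample page; the author and description values are invented.
SAMPLE_HTML = """
<!DOCTYPE html>
<html>
<head>
  <meta charset="utf-8">
  <meta name="author" content="Jane Example">
  <meta name="description" content="Notes on machine-readable site metadata.">
  <title>Example</title>
</head>
<body><p>Hello</p></body>
</html>
"""


class MetaCollector(HTMLParser):
    """Collect <meta name="..." content="..."> pairs into a dict."""

    def __init__(self):
        super().__init__()
        self.meta = {}

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attr_map = dict(attrs)
        name = attr_map.get("name")
        content = attr_map.get("content")
        # Skip meta elements without a name, such as <meta charset="...">.
        if name and content is not None:
            self.meta[name.lower()] = content


def read_meta(html):
    """Return the name/content meta pairs found in an HTML string."""
    collector = MetaCollector()
    collector.feed(html)
    return collector.meta


meta = read_meta(SAMPLE_HTML)
print(meta.get("author"))
print(meta.get("description"))
```

Anything that already parses HTML can pick these values up without a new file format, which is the point of using the existing elements.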
◧◩
2. techaq+7v[view] [source] 2023-05-10 14:50:26
>>qbasic+Ol
And also Open Graph meta tags: https://ogp.me/
◧◩◪
3. doodle+Hy[view] [source] 2023-05-10 15:06:26
>>techaq+7v
And also schema.org: https://schema.org/
◧◩◪◨
4. westur+4r1[view] [source] 2023-05-10 18:57:32
>>doodle+Hy
Thing > CreativeWork > WebSite: https://schema.org/WebSite ... scroll down to "Examples" and click the "JSON-LD" and/or "RDFa" tabs. (If there isn't an example, go to the schema.org/ URL of a superclass (per rdfs:subClassOf) of the rdfs:Class or rdfs:Property; there are many markup examples for CreativeWork and its subtypes.)
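
As a rough sketch of what such markup could look like for a whole site, the Python below builds a schema.org WebSite description and wraps it in a JSON-LD script element. The site name, URL, and license value are placeholder assumptions, and to_jsonld_script is a hypothetical helper; only the vocabulary terms ("WebSite", "name", "url", "license") come from schema.org.

```python
import json

# Hypothetical site details; schema.org defines the vocabulary terms,
# not these example values.
website = {
    "@context": "https://schema.org",
    "@type": "WebSite",
    "name": "Example Site",
    "url": "https://example.com/",
    "license": "https://creativecommons.org/licenses/by/4.0/",
}


def to_jsonld_script(data):
    """Serialize a dict as a JSON-LD <script> element for the page <head>."""
    payload = json.dumps(data, indent=2)
    # Escape "</" so a string value can never close the script element early;
    # "\/" is a valid JSON escape for "/".
    payload = payload.replace("</", "<\\/")
    return f'<script type="application/ld+json">\n{payload}\n</script>'


snippet = to_jsonld_script(website)
print(snippet)
```

The license property is defined on CreativeWork, so it applies to WebSite as a subtype.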

https://schema.org/license

Also: https://news.ycombinator.com/item?id=35891631

extruct is one way to parse linked data from HTML pages: https://github.com/scrapinghub/extruct
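
For just the JSON-LD case, the general idea can be sketched with the standard library alone. To be clear, this is not extruct's API or how extruct is implemented; it is a stdlib-only stand-in that handles a single syntax, and the sample page and the JsonLdCollector and extract_jsonld names are hypothetical.

```python
import json
from html.parser import HTMLParser

# Hypothetical page carrying one JSON-LD block; all values are invented.
SAMPLE_HTML = """
<html>
<head>
  <script type="application/ld+json">
  {
    "@context": "https://schema.org",
    "@type": "WebSite",
    "name": "Example Site",
    "license": "https://creativecommons.org/licenses/by/4.0/"
  }
  </script>
  <script>console.log("not linked data");</script>
</head>
<body></body>
</html>
"""


class JsonLdCollector(HTMLParser):
    """Collect the parsed contents of application/ld+json script elements."""

    def __init__(self):
        super().__init__()
        self._in_jsonld = False
        self._chunks = []
        self.items = []

    def handle_starttag(self, tag, attrs):
        if tag == "script" and dict(attrs).get("type") == "application/ld+json":
            self._in_jsonld = True
            self._chunks = []

    def handle_data(self, data):
        if self._in_jsonld:
            self._chunks.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._in_jsonld:
            self._in_jsonld = False
            try:
                self.items.append(json.loads("".join(self._chunks)))
            except json.JSONDecodeError:
                pass  # Skip malformed blocks instead of failing the whole page.


def extract_jsonld(html):
    """Return a list of the JSON-LD objects embedded in an HTML string."""
    collector = JsonLdCollector()
    collector.feed(html)
    return collector.items


items = extract_jsonld(SAMPLE_HTML)
print(items[0]["@type"], items[0]["license"])
```

A dedicated library is still the better choice in practice, since real pages mix several linked-data syntaxes and this sketch only looks at JSON-LD.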
