The unfortunate thing about these LLMs is they siphon all public data regardless of license. I agree with data owners one can’t Willy nilly use data that’s accessible but not licensed properly.
Obviously Wikipedia, data from most public institutions, etc., should be available, but not data that does not offer unrestricted use.
We had an entire book (400+ pages) which detailed every single specific stylistic rule we had to follow for our class. Had the same thing in high school newspaper.
I can only assume that NYT has an internal one as well.