Yep, so a few million ripped off articles is plausible.
So the earliest available copyrighted material would be all content published by anybody who died in the year 1953 or earlier.
If the author of an article published in 1950 still has a living author, the work is still copyrighted.