Isn't this exactly how language translator applications work, through ingesting a huge amount of training data?
Automatic machine language translation puts translators out of work and would not be possible without huge amounts of ostensibly unlicensed training data.