It also has some training on problem decomposition. Many smaller models fail before they write any code: they fail at parsing the question.
You can ask them to serialize a problem in Prolog and see exactly where their understanding breaks. This is OpenHermes 2.5: https://pastebin.com/raw/kr62Hybq
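
If you want to try this yourself, here is a rough sketch of how the check could look. It is not what produced the pastebin above. The prompt wording and the helper names (`build_prompt`, `missing_facts`) are made up for illustration, and it assumes the model emits one Prolog clause per line. You write down the facts you expect by hand, send the prompt to whatever model you are testing, and diff the reply against your list:

```python
import re
from typing import List

# Hypothetical prompt wording; adjust it for the model under test.
PROMPT_TEMPLATE = (
    "Translate the following problem into Prolog facts and rules. "
    "Do not solve it; only encode what the text states.\n\n"
    "Problem: {problem}\n"
)


def build_prompt(problem: str) -> str:
    """Return the text to send to the model."""
    return PROMPT_TEMPLATE.format(problem=problem.strip())


def normalize(clause: str) -> str:
    """Remove all whitespace and the trailing period so spacing differences don't matter."""
    return re.sub(r"\s+", "", clause).rstrip(".")


def missing_facts(model_output: str, expected: List[str]) -> List[str]:
    """List the expected clauses that the model's Prolog does not contain.

    Crude by design: assumes one clause per line and treats everything
    after '%' as a comment (so quoted atoms containing '%' will confuse it).
    """
    produced = set()
    for line in model_output.splitlines():
        code = line.split("%", 1)[0]
        if code.strip():
            produced.add(normalize(code))
    return [fact for fact in expected if normalize(fact) not in produced]


if __name__ == "__main__":
    print(build_prompt("Tom is Bob's parent. Bob is Ann's parent. Ann is Joe's parent."))
    # Stand-in for a model reply; a real run would paste the model's output here.
    reply = "parent(tom, bob).\nparent(bob,ann).  % Ann's parent\n"
    expected = ["parent(tom,bob).", "parent(bob, ann).", "parent(ann, joe)."]
    print(missing_facts(reply, expected))
```

Whatever ends up in the missing list is a part of the question the model dropped or misread. Exact string matching will miss facts the model encoded under a different predicate name, so treat the result as a pointer for reading the output, not as a score.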