>>lukan+(OP)
Features like function calling are moving in that direction. Microsoft also seems to have plans to deeply integrate LLMs into its OS and if they do a good job it could become a primary way to interact with its features and programs. Considering the progress made on image generation models I could image a special purpose model that is specifically trained on operating APIs and producing good results. The big hurdle would be building the APIs that don't exist for the tools that people like to use. I'm sure there are interesting ways you could think of generating labeled data for actions in various programs.