I often run multiple parallel chats and expose it to slightly different amounts of information. Then average the answers in my head to come up with something more reliable.
For coding tasks, I found it helps to feed the GPT-4 answer into another GPT-4 instance and say "review this code step by step, identify any bugs" etc. It can sometimes find its own errors.