Chasing AGI: Comparing Leading AI Models on Complex Reasoning Tasks