Learning Audio Foundation Models For Reasoning