TL;DR: OpenAI’s new o1 model marks a significant leap in AI reasoning capabilities but introduces critical risks. Its reluctance to acknowledge mistakes, gaps in common-sense reasoning ...
“Growing up, I had always thought about modelling, but I’ve also never been a sample size, so I didn’t think it would be possible. I saw curve models like Paloma and Precious and wanted to do the same ...
Just one month later, OpenAI’s newly announced o3 model achieved a score of 25.2%, which Epoch’s director, Jaime Sevilla, describes as “far better than our team expected so soon after ...