OpenAI’s New o1 Models: A Step Forward in AI Reasoning, But at a Cost

OpenAI recently unveiled its new o1 models, codenamed “Strawberry,” which have generated significant anticipation among AI enthusiasts. These models are designed to “think” before responding, a feature that sets them apart from previous iterations. However, the reception has been mixed, with some praising its advanced reasoning capabilities and others pointing out its limitations.

The o1 models excel in handling complex questions, making them suitable for tasks that require deep reasoning. This is a notable improvement over the GPT-4o models, which were already impressive in their own right. However, the o1 models come with a significant drawback: they are approximately four times more expensive to use than GPT-4o.

One of the key criticisms of the o1 models is their lack of tools and multimodal capabilities, which were strong points of the GPT-4o models. Additionally, the speed of the o1 models is not on par with their predecessors, making them less efficient for certain tasks. OpenAI itself acknowledges these limitations, stating that GPT-4o remains the best option for most prompts.

Ravid Shwartz Ziv, an NYU professor specializing in AI models, commented on the new release, saying, “It’s impressive, but I think the improvement is not very significant. It’s better at certain problems, but you don’t have this across-the-board improvement”. This sentiment reflects the general consensus that while the o1 models have their strengths, they are not a universal upgrade.

The o1 models are designed to be used for specific types of questions—those that require extensive reasoning and problem-solving. Most users today do not use generative AI for such tasks, as current models are not particularly adept at them. However, the o1 models represent a step towards making AI more capable in this area.

One of the standout features of the o1 models is their ability to break down complex problems into smaller steps, a process known as “multi-step reasoning.” This approach allows the model to identify and correct errors in its reasoning, leading to more accurate answers. While this concept is not entirely new, it has only recently become practical for widespread use.

Kian Katanforoosh, CEO of Workera and a Stanford adjunct lecturer, expressed excitement about the potential of the o1 models. He noted that combining reinforcement learning algorithms with language model techniques could enable step-by-step thinking, allowing the AI to work through complex problems more effectively.

Despite its advanced capabilities, the o1 model is notably expensive. Users are charged not only for input and output tokens but also for “reasoning tokens,” which account for the additional computational steps the model takes to arrive at an answer. This hidden cost makes it crucial for users to be mindful of how they use the o1 models to avoid excessive charges.

The concept of an AI model that can “walk backwards from big ideas” is compelling. In practice, the o1 models have shown promise in this area. For example, one user reported using the o1 model to plan a Thanksgiving dinner, a task that involved complex logistics and benefited from the model’s reasoning capabilities.

However, the high cost and slower speed of the o1 models mean they are not suitable for all tasks. Users need to carefully consider when to use these models to get the best value for their money. OpenAI’s transparency about these limitations is a positive step, helping users set realistic expectations.

In summary, the OpenAI o1 models represent a significant advancement in AI reasoning and problem-solving. While they are not a universal improvement over previous models, they excel in specific areas and offer unique capabilities. The high cost and slower speed are notable drawbacks, but for tasks that require deep reasoning, the o1 models are a valuable tool.

As AI technology continues to evolve, it will be interesting to see how models like o1 are integrated into various applications. The ability to “think” before responding could open up new possibilities for AI, making it more useful for complex problem-solving tasks. However, users will need to balance the benefits with the costs to make the most of these advanced models.

Overall, the release of the o1 models is a step forward for OpenAI, showcasing the potential of AI to handle more complex and nuanced tasks. While there are still areas for improvement, the o1 models offer a glimpse into the future of AI technology and its potential applications.