Advertisement

OpenAI's latest model o1 - thinks before answering, the longer it thinks, the smarter it gets

OpenAI o1 model

OpenAI has recently launched a series of new AI models that spend more time thinking before responding. Two models have been released this time:

1. OpenAI o1-preview

The team developed the OpenAI o1-preview model, which aims to allow AI to spend more time thinking before answering. They can reason through complex tasks and solve more challenging problems than previous models, particularly excelling in science, programming, and mathematics.

2. OpenAI o1-mini

OpenAI o1-mini performs exceptionally well in STEM fields, especially in mathematics and programming —— its performance on benchmarks like AIME and Codeforces is almost equivalent to OpenAI o1. O1-mini will become a faster and more cost-effective model choice for applications requiring reasoning but not extensive world knowledge.

Upload video

Coding examples

: With just a simple prompt, o1 can write a Snake game containing HTML, CSS, and JavaScript. It not only designed the game but also added various logics. After copying the code, you have a complete Snake game. If you want to increase the difficulty, you can use the "AI" letters as obstacles. O1 added some logic, generating a Snake game with a large "AI" obstacle in the center of the screen.

: O1 can accomplish tasks that previous models could not achieve. For example, writing a "Squirrel Hunt" game using Pygame. O1 thought for 21 seconds and then provided the complete code.

Puzzle solving and creation

: In the "writing puzzle," it was asked to write a six-line poem with specific syllable counts and words ending in vowels. O1 was able to output high-quality poetry according to the requirements.

: A complex question was proposed:

"When the princess's age is half the sum of their current ages, the princess's age is twice the prince's age. When the princess's age is twice the prince's former age, the princess's age equals the prince's future age. What are the ages of the prince and princess? Please provide all solutions."

O1 began analyzing this problem, thought for a while, then provided a detailed solution process and formulas, ultimately correctly calculating the answer: The prince and princess are 6k and 8k years old respectively, where k can be any integer.

Reasoning and understanding

: Asking how many "r"s are in the word "strawberry." Previous models might get it wrong, but o1 will think for a few seconds and then give the correct answer.

: Providing a segment of erroneous Korean text and asking o1 to translate it into English. O1 took time to decode these confusing texts, successfully identified the errors, and performed correct understanding and translation. (This indicates that o1 has potential in decoding encrypted texts.)

Applications in different disciplines

: Watching o1 solve quantum physics problems and generate complex physical formulas, even if we don't understand these formulas, we can still feel its professionalism.

: O1 can remember and apply all genetic information, demonstrating strong capabilities in the field of biology.

: Facing a complex question:

"Under what conditions would a tariff improve a country's terms of trade enough to raise national welfare? And in such a case, what second-best arguments can you think of that would suggest that the potential tariff may not be a good idea? These second-best arguments should include public choice considerations, but need not be limited to them."

O1 is able to understand this complex issue and provide an exhaustive conclusion.

OpenAI o1 is much smarter than me

OpenAI o1 engages in deep thought before answering and can produce a long chain of internal thought before responding to users.

  • In competitive programming problems, o1 ranks at the 89th percentile (Codeforces);
  • In the American Mathematics Olympiad Preliminary Contest (AIME), it ranks among the top 500 students nationwide;
  • In benchmark testing of physics, biology, and chemistry questions (GPQA), it surpasses human doctoral-level accuracy.

Team

Researchers behind OpenAI o1: https://openai.com/openai-o1-contributions/