Today, I will compare how Claude3, Gemini, and GPT-4 answer the same questions:
Task 1
Explain this picture in Chinese:
Claude3
This picture raises a question: "Is the cat going upstairs or downstairs?" The picture shows the back view of a gray cat walking on a stone staircase. However, from the angle of the picture, it is impossible to tell whether the cat is climbing up the stairs or walking down. This triggers the viewer's thoughts about perspective and perception, and different people may have different understandings and opinions. At the same time, this is also an interesting visual trap that is thought-provoking.
Gemini
This picture shows a cat walking on the stairs. The cat's head and body are facing the top of the stairs, so it is very likely that it is going upstairs. However, due to the shape of the stairs and the posture of the cat's body, some people think the cat is going downstairs.
The reason why this picture creates an illusion is that both the shape of the stairs and the posture of the cat's body are symmetrical. If the picture is flipped upside down, the cat's posture still makes sense. In addition, the handrail of the stairs is misleading. If you only look at the handrail, it might make you feel like the cat is going downstairs.
To determine whether the cat is going upstairs or downstairs, you can observe the following details:
The cat's ears: If the cat's ears are pointing upwards, then it is very likely that it is going upstairs.
The cat's tail: If the cat's tail is curled upwards, then it is very likely that it is going upstairs.
The cat's legs: If the cat's legs are stretched forward, then it is very likely that it is going upstairs.
Based on these details, I think it is very likely that the cat is going upstairs.
Below is the Chinese explanation of this picture:
Picture description: A picture of a cat walking on the stairs. The cat's head and body are facing the top of the stairs, but due to the shape of the stairs and the posture of the cat's body, some people think the cat is going downstairs.
Reason for the illusion: Both the shape of the stairs and the posture of the cat's body are symmetrical. In addition, the handrail of the stairs is misleading.
Judgment method: Observe the posture of the cat's ears, tail, and legs.
Conclusion: Based on the details in the picture, I think it is very likely that the cat is going upstairs.
GPT-4
This image is a visual illusion of a cat on the stairs. The text above the picture asks, "Is the cat going up or down?" This question arises because the angle at which the picture was taken and the design of the staircase make it difficult to determine the direction in which the cat is moving. Due to the lack of clear visual cues, this photo can be interpreted as the cat walking upwards or downwards, depending on how the viewer interprets the cat's posture and shadow. This type of image is often used to explore the observer's visual perception and how the brain interprets ambiguous or unclear information.
Evaluation: Gemini is the smartest here; it is the only one that commits to an answer (upstairs) instead of calling the picture ambiguous. Personally, I think this still comes down to search capability. The image is an old internet meme, so plenty of analyses of it should be available online, and Google seems to have seen more of them.
Task 2
Help me create a playable web-based version of the 2048 game that can be controlled with the keyboard's arrow keys (up, down, left, right).
Claude3
The game is playable and responds correctly to the arrow keys.
Gemini
The generated code is incomplete, so the demo does not work and the game cannot be played.
GPT-4
The game cannot be played.
Evaluation: For this coding task, Claude3 is the most reliable.
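For context on what this task demands, the core of 2048 is a slide-and-merge rule applied to every row or column. Below is a minimal Python sketch of that rule only. It is not the code produced by any of the three models (their output is not reproduced in this post), and the names merge_row_left and move are hypothetical, chosen for illustration. A real web version would also need rendering, random tile spawning, and a key handler for the arrow keys.

```python
def merge_row_left(row):
    """Slide one row to the left, merging each pair of equal neighbours once."""
    tiles = [v for v in row if v != 0]  # drop empty cells (0 means empty)
    merged = []
    i = 0
    while i < len(tiles):
        if i + 1 < len(tiles) and tiles[i] == tiles[i + 1]:
            merged.append(tiles[i] * 2)  # two equal tiles become one doubled tile
            i += 2                       # skip both, so a tile merges only once per move
        else:
            merged.append(tiles[i])
            i += 1
    # pad with empty cells so the row keeps its original length
    return merged + [0] * (len(row) - len(merged))


def move(board, direction):
    """Return a new board after moving in 'left', 'right', 'up', or 'down'."""
    if direction == "left":
        return [merge_row_left(r) for r in board]
    if direction == "right":
        # moving right is moving left on the reversed row
        return [merge_row_left(r[::-1])[::-1] for r in board]
    if direction in ("up", "down"):
        # vertical moves are horizontal moves on the transposed board
        transposed = [list(col) for col in zip(*board)]
        moved = move(transposed, "left" if direction == "up" else "right")
        return [list(col) for col in zip(*moved)]
    raise ValueError(f"unknown direction: {direction!r}")
```

Handling all four directions by reversing or transposing the board means only one merge routine has to be correct, which is one common way to keep this kind of game logic small.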
Task 3
Solve this puzzle:
Claude3
The correct choice to replace the question mark should be B. The large 6x6 grid is composed of 2x2 small grids, and each small grid corresponds to one of the candidate patterns. In the 2x2 grid where the question mark is located, the other three cells' patterns match option B, so B is the correct answer.
Gemini
The answer is B) 000. The reasoning process is as follows:
The number of circles in each box equals the sum of the circles in the top row and the bottom row. The two numbers below the box indicate the number of circles that need to be added to the top and bottom rows, respectively. The number of circles in the question mark box is 7, and there are already 2 circles in the top row, so 3 circles need to be added to the top row. Therefore, the answer is B) 000, which means adding 3 circles to the top row and 0 circles to the bottom row. In short, option B) 000 is the only one that can make the total number of circles in the box reach 7.
GPT-4
The correct answer is option D because it has 9 dots, which conforms to the incremental pattern of the puzzle.
Evaluation: All three large language models (LLMs) answered incorrectly. The correct answer is A. The rule in this puzzle is that each row and each column of the matrix must contain exactly one black dot.
Task 4
Verbal IQ question
Question: There are three people, Alex, Ben, and Cody. One is a knight, one is a knave, and one is a spy. The knight always tells the truth, the knave always lies, and the spy can either lie or tell the truth. Alex says, "Cody is the knave." Ben says, "Alex is the knight." Cody says, "I am the spy." Who is the knight, who is the knave, and who is the spy?
Claude3
Alex is the knight, Ben is the spy, and Cody is the knave. Reason:
Alex says Cody is the knave. If Alex is the knight, then Cody must be the knave. Ben says Alex is the knight. If Ben is the knave, then Alex cannot be the knight. Cody claims to be the spy, but Cody cannot be the knight because the knight cannot be the spy. Therefore, the only logical explanation is: Alex is the knight, Ben is the spy, and Cody is the knave.
Gemini
Knight: Alex. Knave: Ben. Spy: Cody. Analysis:
Alex says Cody is the knave, but Cody cannot be the knave, so Alex can only be the knight or the spy. Ben says Alex is the knight, but Ben is the knave, so he must be lying; Alex cannot be the knight. Therefore, Alex is the knight, Ben is the knave, and Cody is the spy.
GPT-4
Brief answer: Alex is the knight, Ben is the spy, Cody is the knave. Simple analysis:
If what Alex said is true (i.e., Cody is the knave), then Alex is the knight. Since knights always tell the truth, Ben saying that Alex is the knight aligns with the fact that Alex is indeed the knight. Cody claims to be the spy, but as the knave, his statement must be false, so Cody is indeed the knave. Thus, the remaining person, Ben, can only be the spy.
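Evaluation: GPT-4 and Claude3 answered correctly, while Gemini was incorrect. Gemini's assignment contradicts itself: if Ben were the knave, his statement "Alex is the knight" would have to be false, yet Gemini also makes Alex the knight.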
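The puzzle is small enough to check by brute force. The following Python sketch tries all six ways of assigning the three roles and keeps those where the knight's statement is true and the knave's statement is false (the spy is unconstrained). The function names are hypothetical and chosen only for illustration.

```python
from itertools import permutations

PEOPLE = ("Alex", "Ben", "Cody")
ROLES = ("knight", "knave", "spy")


def statements(roles):
    """Truth value of each person's statement under a given role assignment."""
    return {
        "Alex": roles["Cody"] == "knave",   # Alex: "Cody is the knave."
        "Ben": roles["Alex"] == "knight",   # Ben: "Alex is the knight."
        "Cody": roles["Cody"] == "spy",     # Cody: "I am the spy."
    }


def consistent(roles):
    """The knight must speak truly and the knave falsely; the spy may do either."""
    truth = statements(roles)
    for person in PEOPLE:
        if roles[person] == "knight" and not truth[person]:
            return False
        if roles[person] == "knave" and truth[person]:
            return False
    return True


def solve():
    """Return every role assignment that is consistent with all three statements."""
    candidates = (dict(zip(PEOPLE, perm)) for perm in permutations(ROLES))
    return [roles for roles in candidates if consistent(roles)]


if __name__ == "__main__":
    for solution in solve():
        print(solution)
```

Only one assignment survives the check: Alex is the knight, Ben is the spy, and Cody is the knave, which matches the answers from Claude3 and GPT-4.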
Task 5
Speed comparison on https://sdk.vercel.ai/. The Google Gemini API is not available there, so I compared Claude3, Llama, and GPT-4 instead. You can try it out yourself.
Overall, Llama responds the fastest, followed by GPT-4, while Claude3 is relatively slower.