Click any tag below to further narrow down your results
Links
The author reviews ZeroBench and finds its visual reasoning tasks too simplistic, mainly involving basic counting of objects. They argue that improvements in evaluation scores do not equate to advancements in visual reasoning capabilities.
Jerry Tworek, a leading AI researcher at OpenAI, is leaving after nearly seven years. He contributed significantly to projects like GPT-4 and ChatGPT and led the "Reasoning Models" team. Tworek's departure hints at internal tensions over the company's focus on commercial products.