During a recent press appearance, OpenAI CEO Sam Altman said that he’s observed the “IQ” of AI rapidly improve over the past several years. “Very roughly, it feels to me like — this is not scientifically accurate, this is just a vibe or spiritual answer — every year we move one standard deviation of IQ,”…
Category: ai benchmarks
AI, ai benchmarks, coding, Global IT News, Global Security News, programming
People are benchmarking AI by having it make balls bounce in rotating shapes
The list of informal, weird AI benchmarks keeps growing. Over the past few days, some in the AI community on X have become obsessed with a test of how different AI models, particularly so-called reasoning models, handle prompts like this: “Write a Python script for a bouncing yellow ball within a shape. Make the shape…
AI, ai benchmarks, benchmarking, controversy, epoch ai, frontiermath, generative ai, Global IT News, Global Security News, o3, openai
AI benchmarking organization criticized for waiting to disclose funding from OpenAI
An organization developing math benchmarks for AI didn’t disclose that it had received funding from OpenAI until relatively recently, drawing allegations of impropriety from some in the AI community. Epoch AI, a nonprofit primarily funded by Open Philanthropy, a research and grantmaking foundation, revealed on December 20 that OpenAI had supported the creation of FrontierMath.…