Creating the AIQ Test: Mathematical Foundations for AI Evaluations

Source: DARPA
July 2, 2024 | Originally published by DARPA on May 29, 2024

Relying on credible, concrete information is essential in high-stakes decision-making. So, how can society be sure generative artificial intelligence (AI) will be safe and effective for such applications?

Over the past century, one of humanity’s most significant innovations has been the ability to move people and things quickly and at large scale. Everything from bridges to jets and rockets relies on mathematical foundations that let engineers understand the physical world and build these systems and structures reliably.

Yet, as society catapults into an era of exploring and applying AI to quickly deliver information to people, methods for guaranteeing the capabilities (and limitations) of generative AI systems do not exist. Neither do insights into when and why those capabilities manifest.

