Research & Papers · cs.AI updates on arXiv.org · May 7, 2026

CreativityBench: Evaluating Agent Creative Reasoning via Affordance-Based Tool Repurposing

CreativityBench is a new benchmark that evaluates the creative problem-solving abilities of large language models (LLMs) through affordance-based tool repurposing. It shows that while LLMs can select plausible objects, they struggle to identify the correct parts, affordances, and underlying physical mechanisms needed for creative tool use. Creative reasoning of this kind therefore remains a significant challenge for current LLMs.

Author: Morein.ai Editorial
