Task Ability Decomposition and Difficulty Quantification of Visual Tasks for AGI Evaluation
First systematic exploration of task-ability space structure and its link to task difficulty. Proposed TADDL-V framework for quantifying visual task difficulty and released AGI-V70 benchmark for AGI evaluation.
Oct 27, 2025