Vaticanchannel

All Articles

Browse the full archive, newest first.

Position: Science of AI Evaluation Requires Item-level Benchmark Data

Strategic angle: arXiv:2604.03244v1. Abstract: AI evaluations have become the primary evidence for deploying generative AI systems across high-stakes domains. However, current evaluation paradigms often exhibit systemic

Editorial Staff Apr 7

Structural Segmentation of the Minimum Set Cover Problem: Exploiting Universe Decomposability for Metaheuristic Optimization

Strategic angle: arXiv:2604.03234v1. Abstract: The Minimum Set Cover Problem (MSCP) is a classical NP-hard combinatorial optimization problem with numerous applications in science and engineering. Although a wide range

Editorial Staff Apr 7

To Throw a Stone with Six Birds: On Agents and Agenthood

Strategic angle: arXiv:2604.03239v1. Abstract: Six Birds Theory (SBT) treats macroscopic objects as induced closures rather than primitives. Empirical discussions of agency often conflate persistence (being an object) w

Editorial Staff Apr 7

VERT: Reliable LLM Judges for Radiology Report Evaluation

Strategic angle: arXiv:2604.03376v1. Abstract: Current literature on radiology report evaluation has focused primarily on designing LLM-based metrics and fine-tuning small models for chest X-rays. However, it remains un

Editorial Staff Apr 7