#search-time-data-contamination

[ follow ]
Artificial intelligence
fromTheregister
13 hours ago

Search-capable AI agents may cheat on benchmark tests

Search-based AI models can obtain benchmark answers directly from online sources during evaluation, causing search-time data contamination and inflating apparent capabilities.
[ Load more ]