#lm-arena

Artificial intelligence
from Ars Technica
1 month ago

New study accuses LM Arena of gaming its popular AI benchmark

LM Arena's ranking may favor large companies due to unfair testing practices, raising concerns about its reliability in assessing AI chatbots.
#meta
Marketing tech
from TechCrunch
1 month ago

Meta's benchmarks for its new AI models are a bit misleading

Meta's Maverick AI model exhibits significant differences between its experimental and publicly available versions.
from TechCrunch
1 month ago
Artificial intelligence

Meta's vanilla Maverick AI model ranks below rivals on a popular chat benchmark

