Artificial intelligence
fromInfoQ
2 weeks agoHugging Face Introduces Community Evals for Transparent Model Benchmarking
Community Evals enables benchmark datasets on the Hugging Face Hub to host leaderboards, collect reproducible evaluation results via Git-based .eval_results YAML submissions, and display scores.