benchmark
Here are 4,434 public repositories matching this topic...
MixEval, a ground-truth-based dynamic benchmark derived from off-the-shelf benchmark mixtures, which evaluates LLMs with a highly capable model ranking (i.e., 0.96 correlation with Chatbot Arena) while running locally and quickly (6% the time and cost of running MMLU), with its queries being stably updated every month to avoid contamination.
-
Updated
Jun 1, 2024 - Python
MTEB: Massive Text Embedding Benchmark
-
Updated
Jun 1, 2024 - Python
System Analysis Software
-
Updated
Jun 1, 2024 - C#
ColdRec: A Comprehensive Benchmark for Cold-Start Recommendation.
-
Updated
Jun 1, 2024 - Python
A command-line benchmarking tool
-
Updated
Jun 1, 2024 - Rust
Repository for benchmarking different post-hoc xai explanation methods on image datasets
-
Updated
Jun 1, 2024 - Python
Benchmark results repository service
-
Updated
Jun 1, 2024 - Java
Powerful .NET library for benchmarking
-
Updated
Jun 1, 2024 - C#
JavaScript package managers performance comparison between NPM, Yarn, Yarn PnP, PnPM, and Bun.
-
Updated
Jun 1, 2024 - JavaScript
Ohayou(おはよう), HTTP load generator, inspired by rakyll/hey with tui animation.
-
Updated
Jun 1, 2024 - Rust
A performance-oriented prototyping harness for state of the art Molecular Dynamics algorithms
-
Updated
Jun 1, 2024 - C
[CVPR2024 Highlight] VBench - We Evaluate Video Generation
-
Updated
Jun 1, 2024 - Python
A PyTorch library for all things Reinforcement Learning (RL) for Combinatorial Optimization (CO)
-
Updated
Jun 1, 2024 - Python
Austin Benchmark Suites for Computational Electromagnetics
-
Updated
Jun 1, 2024
Improve this page
Add a description, image, and links to the benchmark topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the benchmark topic, visit your repo's landing page and select "manage topics."