Community-driven benchmarking of 20 NVIDIA NIM models β fully automated, zero infra cost, self-hostable in minutes.
π View Live Dashboard Β· π Docs Β· π€ Contribute Β· π¬ Discussions
NIMStats automatically benchmarks 20 NVIDIA NIM models every hour using GitHub Actions and publishes the results to a beautiful, interactive dashboard. No servers, no subscriptions β just fork, add your API key, and go.
| ποΈ Hourly Benchmarks | π Interactive Charts | π Zero Infrastructure | π Fully Open-Source |
|---|---|---|---|
| Automatic via GitHub Actions | Response time, throughput & trends | Static site + free CI/CD | Fork and self-host in minutes |
Get your own benchmarking dashboard running in under 5 minutes.
git clone https://github.com/MauroDruwel/NIMStats.git
cd NIMStatsVisit build.nvidia.com β Create a free account β Copy your API key.
In your forked repo: Settings β Secrets and variables β Actions β New repository secret
| Name | Value |
|---|---|
NIM_API_KEY |
Your NVIDIA NIM API key |
| Platform | Steps |
|---|---|
| Cloudflare Pages | Connect repo in Cloudflare Pages |
| GitHub Pages | Settings β Pages β Deploy from main |
| Netlify / Vercel | Connect repo for instant auto-deploy |
Actions β Benchmark NVIDIA NIM Models β Run workflow
That's it β your dashboard auto-refreshes every hour. β¨
| Tab | What you get |
|---|---|
| π Overview | 5 animated KPI cards Β· success trend charts Β· top-10 speed & throughput bars Β· model reliability pills |
| π Leaderboard | Composite score rankings Β· sortable columns Β· SVG sparklines Β· trend indicators (βββ) Β· provider chips |
| π¬ Explorer | Per-model deep dive Β· response time history chart Β· error breakdown donut Β· availability heatmap |
| β± Timeline | Filterable run history (All / 24h / 48h / 7d) Β· expandable run cards with full per-model detail |
| βοΈ Compare | Head-to-head overlay chart Β· win-rate stats Β· side-by-side metric comparison |
20 models across 11 providers β click to expand
| Provider | Model | Highlight |
|---|---|---|
| DeepSeek | deepseek-ai/deepseek-v4-flash |
Fast MoE, optimized for speed |
| DeepSeek | deepseek-ai/deepseek-v4-pro |
Professional-grade reasoning |
| DeepSeek | deepseek-ai/deepseek-v3.2 |
Latest with improved reasoning |
| Z-AI | z-ai/glm-5.1 |
Superior code understanding |
| Z-AI | z-ai/glm-4.7 |
Strong mathematical capabilities |
| MiniMax | minimaxai/minimax-m2.7 |
Efficient inference model |
| MiniMax | minimaxai/minimax-m2.5 |
Previous generation MiniMax |
| NVIDIA | nvidia/nemotron-3-super-120b-a12b |
NVIDIA's 120B flagship |
| NVIDIA | nvidia/nemotron-3-nano-omni-30b-a3b-reasoning |
Compact omni reasoning model |
| Moonshot | moonshotai/kimi-k2.6 |
Context-optimized model |
| Moonshot | moonshotai/kimi-k2-instruct |
Instruction-tuned Kimi |
| OpenAI | openai/gpt-oss-120b |
Open-source 120B |
google/gemma-4-31b-it |
Lightweight edge inference | |
| Qwen | qwen/qwen3-coder-480b-a35b-instruct |
Specialized coding (480B MoE) |
| Qwen | qwen/qwen2.5-coder-32b-instruct |
Lightweight Qwen coder |
| Qwen | qwen/qwen3.5-397b-a17b |
Flagship Qwen (397B) |
| Qwen | qwen/qwen3.5-122b-a10b |
Mid-range Qwen 3.5 MoE |
| Mistral | mistralai/devstral-2-123b-instruct-2512 |
Developer-focused (123B) |
| Mistral | mistralai/mistral-large-3-675b-instruct-2512 |
Largest Mistral (675B) |
| Mistral | mistralai/mistral-medium-3.5-128b |
Efficient medium-scale Mistral |
| Meta | meta/llama-3_3-70b-instruct |
Llama 3.3 70B |
| Meta | meta/llama-4-maverick-17b-128e-instruct |
Llama 4 Maverick (128 experts) |
| Meta | meta/llama-3.2-90b-vision-instruct |
Multimodal 90B vision model |
| StepFun | stepfun-ai/step-3.5-flash |
Ultra-fast flash model |
| StepFun | stepfun-ai/step-3.7-flash |
Latest high-performance flash |
βββββββββββββββββββββ GitHub Actions (every hour) βββββββββββββββββββββββ
β β
β βββββββββββββββββββββββ βββββββββββββββββββββββ β
β β Job 1 β Group A β β Job 2 β Group B β (run in parallel) β
β β 10 NIM models β β 10 NIM models β β
β ββββββββββββ¬βββββββββββ ββββββββββββ¬βββββββββββ β
β ββββββββββββββββ¬ββββββββββββββββ β
β ββββββββββΌβββββββββ β
β β Merge + commit β β history.db updated in repo β
β βββββββββββββββββββ β
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ β
β
ββββββββββββΌβββββββββββ
β Static Dashboard β rebuilds on each push
β (Pages / Netlify) β
βββββββββββββββββββββββ
Parallel jobs = ~50% faster benchmarks β‘
Change the benchmark prompt
Edit PROMPT in scripts/test_models.py:
PROMPT = "Your custom prompt here"Add or remove models
Edit ALL_MODELS in scripts/test_models.py:
ALL_MODELS = [
"your/custom-model",
# ...
]Change the schedule
Edit .github/workflows/benchmark.yml:
- cron: '0 */6 * * *' # Every 6 hours instead of every hourRun locally
# Serve the dashboard
python3 -m http.server 8000
# Open http://localhost:8000
# Run benchmarks manually (requires NIM_API_KEY env var)
export NIM_API_KEY=your_key_here
python3 scripts/test_models.pyhistory.db is a SQLite database persisted in the repo β the single source of truth. The browser loads it via sql.js (WebAssembly) and queries it entirely client-side. scripts/results.json is a temporary per-job artifact that is never committed.
Schema:
runs (id, timestamp, prompt, success_count, total_models, fastest_model, fastest_time)
model_results (run_id, model, success, error, response_time, tokens_generated, total_tokens, response)Benchmark parameters: temperature: 0.7 Β· top_p: 0.9 Β· max_tokens: 500 Β· OpenAI-compatible API
Contributions are what make the open-source community amazing. Any contribution you make is greatly appreciated!
- Fork the repository
- Create your feature branch:
git checkout -b feat/amazing-feature - Commit your changes:
git commit -m 'feat: add amazing feature' - Push to the branch:
git push origin feat/amazing-feature - Open a Pull Request
Ideas for contributions:
- π Add new NIM models to the benchmark list
- π New chart types or dashboard widgets
- π Internationalization / translations
- π Bug fixes and performance improvements
- π Improve documentation
Please read through open Issues before starting β someone might already be working on it!
- NVIDIA NIM API Documentation
- NVIDIA Model Catalog
- GitHub Actions Docs
- sql.js β SQLite in the browser
Distributed under the MIT License. See LICENSE for details.
Made with β€οΈ for the ML community Β· β Star this repo if you find it useful!