grigio/opencode-benchmark-dashboard

Repository files navigation

opencode-benchmark-dashboard

Benchmark system for testing opencode with various LLM models, measuring speed (latency) and correctness (accuracy).

Why?

  • The best tradeoff depends on your use case and your hardware
  • Accuracy vs. speed: reasoning effort, tok/s, and quantization all matter. Some small LLMs can fix themselves using tools, while a nominally fast LLM can end up slow because it wastes too many tokens on reasoning. Just test them in real-world scenarios.

Quick Start

```sh
# Install dependencies
bun install

# Fill prompts/ and prompt-answers/ with your test cases, e.g. CODING-my-single-test.txt
# Check that `~/.config/opencode/opencode.json` lists your OpenAI-compatible models.

# Generate the answers with a specific model. Use `opencode models` to list the available ones.
bun run answer -m "opencode/minimax-m2.5-free"
# bun run answer -m "opencode/minimax-m2.5-free" -t CODING-my-single-test

# Generate the evaluations with a specific model.
bun run evaluate -m "opencode/minimax-m2.5-free"
# bun run evaluate -m "opencode/minimax-m2.5-free" -t CODING-my-single-test

# Open the dashboard at http://localhost:3000
bun run dashboard
```
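The steps above can be sketched as a minimal test-case pair. Only the `prompts/` and `prompt-answers/` directories and the `CODING-my-single-test.txt` naming come from this README; the file contents below, and the assumption that `prompt-answers/` holds the reference answer the evaluator compares against, are illustrative:

```sh
# Hypothetical minimal test case: one prompt plus its reference answer,
# using the same base filename in both directories.
mkdir -p prompts prompt-answers

# The prompt given to the model under test
cat > prompts/CODING-my-single-test.txt <<'EOF'
Write a function that returns the sum of a list of integers.
EOF

# The reference answer the evaluation step scores against (assumed convention)
cat > prompt-answers/CODING-my-single-test.txt <<'EOF'
def sum_list(xs):
    return sum(xs)
EOF
```

With such a pair in place, the `-t CODING-my-single-test` flag shown above should restrict `bun run answer` and `bun run evaluate` to just this case.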

Requirements

  • Bun runtime
  • opencode CLI installed and in PATH
  • Models pre-configured in ~/.config/opencode/opencode.json
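As a rough illustration of the last requirement, an OpenAI-compatible provider entry in `~/.config/opencode/opencode.json` might look like the sketch below. The exact keys (`provider`, `npm`, `options.baseURL`, `models`) and the `local` provider name are assumptions here, not taken from this README; check the opencode configuration documentation for the real schema:

```json
{
  "$schema": "https://opencode.ai/config.json",
  "provider": {
    "local": {
      "npm": "@ai-sdk/openai-compatible",
      "options": {
        "baseURL": "http://localhost:8080/v1"
      },
      "models": {
        "minimax-m2.5-free": {}
      }
    }
  }
}
```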
