solc-bench

Benchmark tool for the Solidity compiler. Measures compile-time performance, memory usage, hardware counters (via perf stat), bytecode output size, and gas usage across real-world projects.

Install

pip

pip install -e .

Nix (flake)

nix run github:argotorg/solc-bench -- run --solc /path/to/solc

Requires Python 3.11+. Runtime tools: solc (required), perf (optional, for hardware counters), forge (optional, for input extraction and gas benchmarks).

Pipelines

Each benchmark is compiled under one or more codegen pipelines.

| Pipeline | Description | Standard-json settings |
| --- | --- | --- |
| evmasm | EVM assembly codegen | `"viaIR": false` |
| ir | IR-based codegen | `"viaIR": true` |
| ir-ssacfg | SSA-CFG experimental codegen | `"viaIR": true, "viaSSACFG": true` |

solc-bench run picks the pipelines listed in each benchmark's TOML entry (or all pipelines if unspecified). Use --pipeline to restrict to a single one, and --no-optimize to disable the optimizer (it's on by default).
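The per-pipeline settings overlays from the table above can be sketched as a small mapping (field names come from the table; this is an illustrative sketch, not the tool's actual source):

```python
# Standard-json "settings" overlays applied per pipeline, as listed
# in the pipeline table (illustrative sketch, not solc-bench source).
PIPELINE_SETTINGS = {
    "evmasm":    {"viaIR": False},                    # EVM assembly codegen
    "ir":        {"viaIR": True},                     # IR-based codegen
    "ir-ssacfg": {"viaIR": True, "viaSSACFG": True},  # experimental SSA-CFG
}
```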

Metrics

Every metric below is collected on every run, except deployment_gas and method_gas, which are opt-in per benchmark (see Gas benchmarks).

| Metric | Description | Unit | Source |
| --- | --- | --- | --- |
| instructions | Hardware instruction count | count | perf stat |
| cycles | CPU cycle count | count | perf stat |
| cpu_time | CPU time (user + system) | seconds | os.wait4() rusage |
| wall_time | Wall clock time | seconds | time.monotonic() |
| peak_rss | Peak resident set size | MiB | rusage.ru_maxrss |
| creation_size | Sum of creation bytecode size across all contracts | bytes | solc standard-json output |
| runtime_size | Sum of runtime bytecode size across all contracts | bytes | solc standard-json output |
| deployment_gas | Total deployment gas | gas | forge test --gas-report |
| method_gas | Total method-call gas (sum of mean * calls) | gas | forge test --gas-report |

instructions is the primary comparison metric (its variance is <0.1%, versus 3-5% for wall_time). When perf is unavailable, the tool falls back to cpu_time.

When gas benchmarking is enabled, the result JSON also stores the full forge per-function dict (calls, min, mean, median, max) under results.<name>.<pipeline>.functions, keyed by ContractName.signature. method_gas aggregates these to a single number. The per-function detail is what compare --per-function renders.
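The aggregation described above (sum over functions of mean * calls) can be sketched as follows; the function names and gas figures are made-up sample data, but the dict shape mirrors the calls/min/mean/median/max fields stored under functions:

```python
# Sketch: aggregate forge's per-function gas dict into a single
# method_gas number, as described above (sum of mean * calls).
# Function names and numbers below are hypothetical sample data.
functions = {
    "MyToken.transfer(address,uint256)": {
        "calls": 12, "min": 30000, "mean": 34000, "median": 33500, "max": 41000,
    },
    "MyToken.approve(address,uint256)": {
        "calls": 3, "min": 24000, "mean": 24500, "median": 24500, "max": 25000,
    },
}
method_gas = sum(f["mean"] * f["calls"] for f in functions.values())
print(method_gas)  # 12*34000 + 3*24500 = 481500
```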

Benchmarks

Standard suite

The bundled benchmarks.toml is shipped with the package. Run solc-bench list to see which projects are configured.

Each entry maps to a <name>.json standard-json input. These inputs ship in this repository's benchmark_data/ (not in the pip package). solc-bench run reads them from there by default, or from --benchmark-dir DIR.

Custom suite

To run benchmarks outside the bundled suite without modifying the package, place your own benchmarks.toml next to the input JSONs in a directory and point --benchmark-dir at it:

my-benchmarks/
  benchmarks.toml
  my-project.json

When --benchmark-dir contains a benchmarks.toml, it overrides the bundled file. Generate inputs with solc-bench extract (from a Forge project) or solc-bench extract-sourcify (from real-world mainnet contracts).

Adding a benchmark to the bundled suite

  1. Extract the standard-json input from a Forge project:

    solc-bench extract --solc ./solc --project /path/to/project --output-dir benchmark_data/
    
  2. Add an entry to src/solc_bench/benchmarks/benchmarks.toml:

    ["my-project-1.0.0"]
    source = "https://github.com/example/my-project"
    version = "v1.0.0"
    pipelines = ["evmasm", "ir"]
    gas = true   # optional: also collect gas metrics

  3. Open a PR.

Gas benchmarks

Set gas = true in a benchmark's TOML entry to also collect deployment_gas and method_gas. Requires forge in $PATH.

On the first run, solc-bench clones the project at the tag in version (must be a git tag, e.g. v5.6.1) into <benchmark-dir>/<entry-name>/ and invokes forge test --gas-report --json. Subsequent runs reuse that clone.

If you bump version, solc-bench raises RuntimeError and prints the stale clone's path. Delete that directory and re-run to clone the new version.

Sourcify extraction

solc-bench extract-sourcify pulls the top-N most-used mainnet contracts from Sourcify and assembles a ready-to-run benchmark suite (JSONs + a benchmarks.toml index) in a single directory. Each contract's pragma solidity ...; is rewritten to >=<min_version>; during extraction so the suite compiles against any newer solc. Proxy contracts are resolved transparently: the verified implementation's source is used and the proxy address is recorded as proxy_address in the TOML.
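The pragma rewrite described above can be approximated with a one-line regex substitution; this is an illustrative sketch, not the tool's actual implementation:

```python
import re

MIN_VERSION = "0.8.0"  # corresponds to the --min-version flag

def rewrite_pragma(source: str) -> str:
    # Replace any `pragma solidity <range>;` with a >= floor, so the
    # source compiles against any newer solc (sketch; solc-bench's
    # real extraction logic may differ).
    return re.sub(r"pragma\s+solidity\s+[^;]+;",
                  f"pragma solidity >={MIN_VERSION};",
                  source)

src = "pragma solidity ^0.7.6;\ncontract C {}"
print(rewrite_pragma(src))  # pragma solidity >=0.8.0;\ncontract C {}
```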

CLI

solc-bench fetch

| Flag | Argument | Default | Description |
| --- | --- | --- | --- |
| --output | FILE | ./solc-{ref} | Destination path |
| --force | | off | Overwrite destination if it exists |

Positional: ref (required: release tag like v0.8.35 or branch like develop).

solc-bench fetch v0.8.35                              # ./solc-v0.8.35
solc-bench fetch develop                              # ./solc-develop
solc-bench fetch develop --output ./solc --force

The source depends on the ref:

  • Release tag (e.g. v0.8.35) → downloaded from the matching argotorg/solidity GitHub release.
  • Branch (e.g. develop) → downloaded from the latest successful b_ubu_static artifact on CircleCI.

Linux x86_64 only. Set CIRCLECI_TOKEN to raise CircleCI rate limits, and GITHUB_TOKEN for GitHub.

solc-bench run

| Flag | Argument | Default | Description |
| --- | --- | --- | --- |
| --solc | PATH | required | Path to solc binary |
| --only | NAMES | (all) | Comma-separated benchmark names |
| --tags | TAGS | (none) | Comma-separated tags; AND'd with --only |
| --iterations | N | 3 | Number of iterations |
| --output-dir | DIR | current dir | Where to write results + logs |
| --stdout | | off | Also print results to stdout |
| --benchmark-dir | DIR | bundled | Custom suite directory (JSONs + optional benchmarks.toml) |
| --pipeline | evmasm/ir/ir-ssacfg | (all) | Single pipeline to run |
| --no-optimize | | off | Disable the optimizer |

Positional: input_file (optional .sol or .json, bypasses the suite and benchmarks the file directly).

Run the standard suite:

solc-bench run --solc ./solc
solc-bench run --solc ./solc --only openzeppelin-5.6.1
solc-bench run --solc ./solc --tags slow
solc-bench run --solc ./solc --iterations 10
solc-bench run --solc ./solc --pipeline ir
solc-bench run --solc ./solc --no-optimize
solc-bench run --solc ./solc --output-dir /tmp/bench-results
solc-bench run --solc ./solc --stdout

A custom suite via --benchmark-dir:

solc-bench run --solc ./solc --benchmark-dir my-benchmarks

Benchmark a single file (skip the suite):

solc-bench run --solc ./solc contract.sol
solc-bench run --solc ./solc contract.sol --pipeline ir
solc-bench run --solc ./solc input.json
solc-bench run --solc ./solc input.json --pipeline evmasm --no-optimize

For .sol files, the tool wraps the source in a standard-json input. For .json files, the tool overrides the compilation settings with the requested pipeline and optimizer configuration.
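The .sol wrapping step can be sketched as below; the exact settings solc-bench injects (in particular the output selection) are assumptions, but the overall standard-json shape follows the Solidity compiler's documented input format:

```python
import json

def wrap_sol(path: str, source: str, via_ir: bool, optimize: bool) -> str:
    # Wrap a single Solidity source in a standard-json input, roughly
    # as solc-bench does for .sol positional files (sketch; the actual
    # settings the tool injects may differ).
    return json.dumps({
        "language": "Solidity",
        "sources": {path: {"content": source}},
        "settings": {
            "viaIR": via_ir,
            "optimizer": {"enabled": optimize},
            "outputSelection": {
                "*": {"*": ["evm.bytecode", "evm.deployedBytecode"]},
            },
        },
    })

doc = json.loads(wrap_sol("contract.sol", "contract C {}", True, True))
```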

Results are written to bench-results.json in --output-dir (current directory by default).

solc-bench compare

| Flag | Argument | Default | Description |
| --- | --- | --- | --- |
| --pipelines | TARGET:REF | cross-version mode | Compare two pipelines in one file (e.g. ir:evmasm) |
| --format | table/json | table | Output format |
| --output | FILE | (none) | Write comparison JSON to file |
| --per-function | min/mean/median/max | median | Per-function gas deltas (cross-version only) |
| --plot | FILE | (none) | Write a boxplot (requires [plot] extra) |
| --plot-metric | METRIC[,...] | cpu_time | Metric(s) to plot (comma-separated) |

Positionals: bench-results.json (required; the baseline/reference) and target_bench-results.json (optional; enables cross-version mode).

Compare two result files (e.g. baseline vs PR branch):

solc-bench compare baseline/bench-results.json target/bench-results.json
solc-bench compare baseline/bench-results.json target/bench-results.json --format json
solc-bench compare baseline/bench-results.json target/bench-results.json --output comparison.json

When both files contain per-function gas data, --per-function adds a per-function delta table sorted by absolute median delta. Pass min, mean, or max to change the sort key:

solc-bench compare baseline/bench-results.json target/bench-results.json --per-function
solc-bench compare baseline/bench-results.json target/bench-results.json --per-function max

Or compare two pipelines within a single result file (e.g. how ir compares to evmasm):

solc-bench compare bench-results.json --pipelines ir:evmasm
solc-bench compare bench-results.json --pipelines ir-ssacfg:evmasm --format json

The output table shows each metric's signed percent delta (target - ref) / ref. Every metric is lower-is-better, so negative is an improvement and positive is a regression. The winner column names the side that wins per metric.
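The delta computation above is simple enough to state directly (the numbers in the example are hypothetical):

```python
def pct_delta(target: float, ref: float) -> float:
    # Signed percent delta as defined above: (target - ref) / ref.
    # Every metric is lower-is-better, so negative = improvement.
    return (target - ref) / ref * 100.0

print(pct_delta(75.0, 100.0))  # -25.0: target is a 25% improvement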

Plot per-iteration samples as boxplots:

solc-bench compare baseline/bench-results.json target/bench-results.json --plot diff.png
solc-bench compare bench-results.json --pipelines ir:evmasm --plot panels.png --plot-metric wall_time,instructions

solc-bench extract

| Flag | Argument | Default | Description |
| --- | --- | --- | --- |
| --solc | PATH | required | Path to solc binary |
| --project | DIR | required | Forge project directory |
| --output-dir | DIR | project parent | Where to write the standard-json |

solc-bench extract --solc ./solc --project /path/to/forge-project --output-dir benchmark_data/

Produces one .json file per project containing the sources and base settings. Pipeline and optimizer settings are applied at runtime by the run command.

solc-bench extract-sourcify

| Flag | Argument | Default | Description |
| --- | --- | --- | --- |
| --output-dir | DIR | required | Where to write JSONs and benchmarks.toml |
| --top-n | N | 100 | Number of mainnet contracts to extract |
| --min-version | X | 0.8.0 | solc version floor: filters older contracts and rewrites pragmas to >=X |
| --force | | off | Wipe --output-dir contents first |

solc-bench extract-sourcify --output-dir sourcify-bench/ --top-n 100
solc-bench extract-sourcify --output-dir sourcify-bench/ --top-n 50 --min-version 0.8.20 --force

See Sourcify extraction for the suite shape and proxy resolution behavior.

solc-bench list

| Flag | Argument | Default | Description |
| --- | --- | --- | --- |
| --metrics | | off | List available metrics instead of benchmarks |
| --tags | | off | List all tags defined across benchmarks |
| --benchmark-dir | DIR | bundled | Override benchmarks.toml source directory |

solc-bench list                              # list configured benchmarks
solc-bench list --metrics                    # list available metrics
solc-bench list --tags                       # list tags across benchmarks
solc-bench list --benchmark-dir my-suite     # list a custom suite
