Skip to content
View greynewell's full-sized avatar

Highlights

  • Pro

Block or report greynewell

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
greynewell/README.md

Grey Newell

I build eval infrastructure. MS CS (ML) at Georgia Tech. Ex-AWS, 12x certified.

MIST stack

MatchSpec · InferMux · SchemaFlux · TokenTrace

Go. Zero external deps. All six repos are pinned below.

Repo Purpose
matchspec Benchmark suites. Runs evals against any backend, produces structured reports.
infermux Inference routing. Abstracts providers, tracks tokens and cost per request.
schemaflux Data compiler. Pass pipeline, pluggable backends, no runtime allocs in hot path.
tokentrace Observability. Collects spans, computes latency percentiles, fires threshold alerts.
mist-go Shared core. Protocol, transport, metrics, circuit breakers, checkpointing.

Methodology: eval-driven development.

Research

ORCID DOI

Pinned Loading

  1. evaldriven.org evaldriven.org Public

    Ship evals before you ship features.

    Python 10 4

  2. mist-go mist-go Public

    Shared core for the MIST stack. Zero external deps.

    Go 1

  3. matchspec matchspec Public

    Eval framework. Define correct, test against it, get results.

    Go 21 8

  4. infermux infermux Public

    Route inference across LLM providers. Track cost per request.

    Go 89 7

  5. schemaflux schemaflux Public

    Structured data compiler. Pass pipeline, pluggable backends.

    Go 11 1

  6. tokentrace tokentrace Public

    Where did your tokens go? Spans, latency percentiles, alerts.

    Go 5