TALOS-V2

TALOS-V2 is an RTL implementation of Andrej Karpathy's microGPT language model, targeting the DE1-SoC / Cyclone V FPGA.

The project includes the FPGA RTL, generated model ROMs, simulation files, Python host utilities, and board-level scripts needed to build, program, and run inference over JTAG.

Features

  • SystemVerilog inference core
  • DE1-SoC top level with switch, LED, HEX display, and JTAG/MMIO control
  • Fixed-point model weights stored as ROM hex files
  • RTL sampler for hardware token generation
  • ModelSim testbench for deterministic core simulation
  • Python tools for JTAG inference, reference runs, and weight export

Repository Layout

rtl/src/           Synthesizable RTL
rtl/src/include/   RTL include fragments
rtl/generated/     Fixed-point model ROM hex files
rtl/microgpt/      Saved model weights and training dataset
rtl/python/        Python host and reference utilities
rtl/tcl/           System Console and Quartus TCL scripts
rtl/sim/           ModelSim testbenches
rtl/ip/            JTAG-to-Avalon bridge IP
rtl/docs/          Additional notes and archived writeups

Root-level batch files provide the main workflow commands.

Requirements

  • Intel Quartus Prime 18.1 Lite or compatible Cyclone V toolchain
  • ModelSim Intel FPGA Edition
  • Python 3
  • DE1-SoC board for hardware inference

If Quartus is not installed in a standard location, set QUARTUS_ROOTDIR before running the build or programming scripts.

Quick Start

Run commands from the repository root.

Simulate the RTL core:

run_core_sim.bat

Build the FPGA project:

compile_only.bat

Program the DE1-SoC:

program_fpga.bat

Build and program in one step:

run_de1soc.bat

Run inference over JTAG:

run_inference.bat --sampler rtl --steps 15 --temperature 0.5 --seed 2 --stream

Run the Python reference model:

reference_microgpt.bat --count 20 --temperature 0.5
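The --temperature and --seed flags above control how tokens are drawn. As a minimal sketch of what temperature-scaled sampling typically does (the function name here is illustrative, not this repo's actual API; see rtl/python/ for the real utilities):

```python
import math
import random

def sample_token(logits, temperature=0.5, seed=None):
    """Sample one token id from raw logits with temperature scaling.

    Lower temperature sharpens the distribution; a fixed seed makes the
    draw reproducible, which is the role of the --seed flag.
    """
    rng = random.Random(seed)
    scaled = [l / temperature for l in logits]
    m = max(scaled)                            # subtract max for numerical stability
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    probs = [e / total for e in exps]
    # inverse-CDF draw from the softmax distribution
    r = rng.random()
    cum = 0.0
    for i, p in enumerate(probs):
        cum += p
        if r < cum:
            return i
    return len(probs) - 1

# Same seed + same logits -> same token (deterministic).
assert sample_token([2.0, 0.5, -1.0], 0.5, seed=2) == sample_token([2.0, 0.5, -1.0], 0.5, seed=2)
```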

Board Controls

  • SW0: enable
  • SW1: reset (high holds the core in reset, low lets it run)

Status outputs:

  • LEDR0: ready
  • LEDR1: busy
  • LEDR2: generation done
  • LEDR3: JTAG activity
  • LEDR4: reset deasserted
  • LEDR5: enable state
  • LEDR6: busy blink
  • LEDR7..9: low bits of the last sampled token
  • HEX0..1: last sampled token id
  • HEX2..3: generated token count
  • HEX4: top-level state
  • HEX5: switch state

Regenerating Weights

The RTL uses Q4.12 fixed-point ROM files in rtl/generated/. To regenerate them from the saved model weights:

cd rtl
python python\export_weights.py --weights microgpt\weights_only.npy --outdir generated
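Q4.12 packs each weight into 16 bits: 4 integer bits (including sign) and 12 fractional bits, giving a resolution of 1/4096 over roughly [-8, 8). A sketch of the conversion, assuming standard two's-complement encoding (the helper names are hypothetical; python\export_weights.py is the actual exporter):

```python
def float_to_q4_12(x: float) -> int:
    """Encode a float as a 16-bit two's-complement Q4.12 word."""
    q = round(x * 4096)                  # scale by 2**12
    q = max(-32768, min(32767, q))       # saturate to the int16 range
    return q & 0xFFFF                    # two's-complement bit pattern

def q4_12_to_float(w: int) -> float:
    """Decode a 16-bit Q4.12 word back to a float."""
    if w & 0x8000:                       # sign-extend negative values
        w -= 0x10000
    return w / 4096.0

assert float_to_q4_12(1.0) == 0x1000     # 1.0 is 0x1000
assert float_to_q4_12(-0.5) == 0xF800    # -0.5 is 0xF800
assert abs(q4_12_to_float(float_to_q4_12(0.123)) - 0.123) < 1 / 4096
```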

Validation

The main local checks are:

run_core_sim.bat
cd rtl\python
python -m unittest test_reference_equivalence.py

For hardware validation, rebuild the Quartus project, program the board, and run the JTAG inference command.

Notes

The RTL follows the microGPT transformer structure, but it uses fixed-point hardware arithmetic and an RTL-friendly sampler. Outputs are deterministic for the same seed and configuration.
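To illustrate the fixed-point arithmetic: multiplying two Q4.12 values produces a Q8.24 intermediate, which is shifted right by 12 bits to return to Q4.12. A sketch on signed integers (the RTL's exact rounding and saturation behavior may differ):

```python
def q4_12_mul(a: int, b: int) -> int:
    """Multiply two signed Q4.12 integers, returning a signed Q4.12 result.

    The raw product is Q8.24; an arithmetic right shift by 12 rescales it,
    then the result is saturated to the 16-bit signed range.
    """
    prod = a * b                         # Q8.24 intermediate
    res = prod >> 12                     # rescale to Q4.12 (truncates toward -inf)
    return max(-32768, min(32767, res))  # saturate

ONE = 1 << 12    # 1.0 in Q4.12
HALF = 1 << 11   # 0.5 in Q4.12
assert q4_12_mul(ONE, HALF) == HALF      # 1.0 * 0.5 == 0.5
assert q4_12_mul(-ONE, HALF) == -HALF    # -1.0 * 0.5 == -0.5
```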

About

Hardware implementation of transformers running microGPT at 50k+ tokens per second.
