rmsnorm
Here are 10 public repositories matching this topic...
Efficient kernel for RMS normalization with fused operations; includes both forward and backward passes and is compatible with PyTorch.
Updated Jun 5, 2024 - Python
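As a rough reference for what these fused kernels compute, here is a minimal eager-mode PyTorch sketch of RMSNorm; the function name and `eps` value are illustrative and not taken from any repo above:

```python
import torch

def rms_norm(x: torch.Tensor, weight: torch.Tensor, eps: float = 1e-6) -> torch.Tensor:
    # RMSNorm: scale x by the reciprocal root-mean-square of its last
    # dimension, then apply a learned per-feature gain.
    # Unlike LayerNorm, there is no mean subtraction and no bias term.
    rms = torch.sqrt(x.pow(2).mean(dim=-1, keepdim=True) + eps)
    return x / rms * weight

# Usage: normalize a (batch, seq, hidden) activation tensor.
x = torch.randn(2, 4, 8)
w = torch.ones(8)          # learned gain, initialized to 1
y = rms_norm(x, w)
```

A fused kernel computes the same result in one pass over memory instead of materializing the intermediate `rms` tensor, which is where the speedup over this naive version comes from.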
Simple, easy-to-understand PyTorch implementation of Large Language Models (GPT and LLaMA) from scratch with detailed steps. Implements: Byte-Pair Encoding tokenizer, Rotary Position Embedding (RoPE), SwiGLU, RMSNorm, Mixture of Experts (MoE). Tested on a Taylor Swift song-lyrics dataset.
Updated Nov 18, 2024 - Python
Simple character level Transformer
Updated May 27, 2024 - Jupyter Notebook
Optimized Fused RMSNorm implementation with CUDA. Features vectorized memory access (float4), warp-level reductions, and efficient backward pass for LLM training
Updated Dec 24, 2025 - Python
Nano versions of generative models, for fun. No SOTA here, nano first.
Updated Jul 27, 2025 - Jupyter Notebook
Build an LLM in PyTorch: BPE tokenizer, GPT-1/2 and LLaMA architectures, end-to-end training and inference.
Updated Feb 8, 2026 - Python
A from-scratch PyTorch LLM implementing Sparse Mixture-of-Experts (MoE) with Top-2 gating. Integrates modern Llama-3 components (RMSNorm, SwiGLU, RoPE, GQA) and a custom-coded Byte-Level BPE tokenizer. Pre-trained on a curated corpus of existential & dark philosophical literature.
Updated Jan 7, 2026 - Python
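For readers unfamiliar with the Llama-style components named above, here is a minimal PyTorch sketch of a SwiGLU feed-forward block; the class and weight names are illustrative assumptions, not code from this repo:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class SwiGLU(nn.Module):
    """Gated feed-forward block used in Llama-style transformers (sketch)."""

    def __init__(self, dim: int, hidden: int):
        super().__init__()
        # Three bias-free projections: gate, up, and down.
        self.w_gate = nn.Linear(dim, hidden, bias=False)
        self.w_up = nn.Linear(dim, hidden, bias=False)
        self.w_down = nn.Linear(hidden, dim, bias=False)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # SiLU (Swish) activation on the gate branch, multiplied
        # elementwise with the up projection, then projected back down.
        return self.w_down(F.silu(self.w_gate(x)) * self.w_up(x))

# Usage: a (batch, dim) input passes through unchanged in shape.
ffn = SwiGLU(dim=8, hidden=16)
out = ffn(torch.randn(2, 8))
```

The gating is what distinguishes SwiGLU from a plain two-layer MLP: the SiLU-activated branch modulates the linear branch elementwise before the output projection.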
🚀 Build your own LLM easily with OpenLabLM, a lightweight, hackable codebase tailored for hobbyists using a single consumer GPU.
Updated Feb 15, 2026 - Python