Learn How To Observe, Manage, and Scale Agentic AI Apps Using Azure AI Foundry - with this hands-on workshop
[ICML 2025] Official code for the paper "RoSTE: An Efficient Quantization-Aware Supervised Fine-Tuning Approach for Large Language Models"
Code for supervised fine-tuning (SFT) and reinforcement learning (RL)
LoRA fine-tuning pipeline for tool-calling chat LLMs with config-driven datasets, deterministic prompts, and built-in tool-call evaluation.
Automatic music tagging using foundation models
Fine-tuned Meta's LLaMA 3.2 1B for text summarization using QLoRA (4-bit quantization + LoRA), achieving a 40%+ improvement in ROUGE-2 over the base model on the CNN/DailyMail dataset.
🎯 Fine-tuning LLMs using LlamaFactory for financial intent understanding | Evaluating open-source models on the OpenFinData benchmark | Full implementation with multiple models (Qwen2.5/ChatGLM3/Baichuan2/Llama3)
Fine-tune Qwen2.5-VL-7B with LoRA to predict human-rated emotion intensity (1–7) from images, with a ResNet18 regression baseline, full preprocessing/SFT pipeline, and evaluation (MAE/RMSE + bias analysis).
Fine-tuning Llama-3 8B using Unsloth & QLoRA to automate SME customer service logic with 99% accuracy.
🦙 Llama2-FineTuning: Fine-tune LLAMA 2 with Custom Datasets Using LoRA and QLoRA Techniques
Supervised Fine-Tuning with QLoRA
Fine-tuning various models from the Llama 3.1 family on the Mult-It dataset
End-to-end Supervised Fine-Tuning (SFT) pipeline for TinyLlama-1.1B-Chat, specialized in trademark similarity risk assessment using heuristic-labeled SFT data, CPU-only LoRA training, adapter validation, full-weight merge, GGUF export, quantization (Q4_K_M), and local inference deployment via llama.cpp.
Compact TensorFlow language model for Election Commission of India (ECI) domain pretraining and assistant-masked SFT.
Fine-tune Qwen3-0.6B for resume parsing using LoRA
STaR (Self-Taught Reasoner) implementation on GSM8K — Zero-Shot CoT vs. vanilla SFT vs. STaR with Llama 3.2-3B
A Multimodal AI medical assistant
Fine-tuned LLaMA 3 (8B) using Unsloth with 4-bit quantization and LoRA-based PEFT to enable memory-efficient, accelerated training. Conducted supervised fine-tuning on the Alpaca Cleaned dataset using FP16 precision, gradient checkpointing, and 8-bit AdamW optimization, achieving effective instruction tuning on limited GPU resources.
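A recurring recipe across these repositories is QLoRA-style supervised fine-tuning: load a base model in 4-bit, attach low-rank adapters, and train on instruction data. Below is a minimal sketch using the Hugging Face transformers, peft, and trl libraries; the model ID, dataset, and hyperparameters are illustrative assumptions, not taken from any specific repo listed above.

```python
# Minimal QLoRA SFT sketch. Model ID, dataset, and hyperparameters are
# illustrative assumptions, not copied from any repository listed above.
import torch
from datasets import load_dataset
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import LoraConfig
from trl import SFTConfig, SFTTrainer

model_id = "meta-llama/Llama-3.2-1B"  # assumed base model

# 4-bit NF4 quantization of the frozen base weights (the "Q" in QLoRA)
bnb = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.float16,
)
model = AutoModelForCausalLM.from_pretrained(model_id, quantization_config=bnb)

# Trainable low-rank adapters on the attention projections only
lora = LoraConfig(
    r=16, lora_alpha=32, lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"], task_type="CAUSAL_LM",
)

dataset = load_dataset("yahma/alpaca-cleaned", split="train")  # assumed dataset

def to_text(example):
    # Flatten an Alpaca-style record into a single prompt/response string
    return (f"### Instruction:\n{example['instruction']}\n\n"
            f"### Response:\n{example['output']}")

trainer = SFTTrainer(
    model=model,
    train_dataset=dataset,
    peft_config=lora,
    formatting_func=to_text,
    args=SFTConfig(
        output_dir="sft-out",
        per_device_train_batch_size=2,
        gradient_checkpointing=True,   # trade compute for memory
        optim="paged_adamw_8bit",      # 8-bit AdamW, as in several repos above
        num_train_epochs=1,
    ),
)
trainer.train()
trainer.save_model("sft-out/adapter")  # saves only the LoRA adapter weights
```

After training, the adapter can be merged into the base model's full weights for export (e.g., to GGUF for llama.cpp, as the TinyLlama pipeline above does); the merge step is what makes the quantized, locally deployable artifact possible.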