# qwen3

Here are 32 public repositories matching this topic...

A higher-performance OpenAI-compatible LLM service than `vllm serve`: a pure C++ implementation built on GRPS + TensorRT-LLM + Tokenizers.cpp, supporting chat and function calling, AI agents, distributed multi-GPU inference, multimodal inputs, and a Gradio chat interface.

  • Updated May 14, 2025
  • Python
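Since the service above is OpenAI-compatible, clients talk to it using the standard chat-completions request shape. Below is a minimal sketch of such a request body with a tool definition for function calling; the model name, tool name, and parameters are illustrative assumptions, not taken from the repository.

```python
import json

# Illustrative OpenAI-style chat-completions payload with one tool.
# Model and tool names are hypothetical placeholders.
payload = {
    "model": "qwen3",
    "messages": [
        {"role": "user", "content": "What is the weather in Paris?"}
    ],
    # Function calling follows the OpenAI "tools" schema: the server
    # may reply with a tool_call instead of plain text.
    "tools": [
        {
            "type": "function",
            "function": {
                "name": "get_weather",
                "description": "Look up the current weather for a city",
                "parameters": {
                    "type": "object",
                    "properties": {"city": {"type": "string"}},
                    "required": ["city"],
                },
            },
        }
    ],
}

# Serialize to the JSON body that would be POSTed to /v1/chat/completions.
body = json.dumps(payload)
print(body[:40])
```

Any OpenAI SDK or plain HTTP client can send this body to the service's `/v1/chat/completions` endpoint.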

Smart proxy for LLM APIs that enables model-specific parameter control, automatic mode switching (such as Qwen3's /think and /no_think soft switches), and <think>-tag filtering. Useful for pairing advanced models with apps that lack parameter customization.

  • Updated May 19, 2025
  • Python
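The two proxy behaviors described above can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the proxy's actual implementation: the helper names `strip_think` and `force_no_think` are hypothetical, and it assumes Qwen3's convention that appending `/no_think` to a prompt suppresses reasoning and that reasoning arrives wrapped in `<think>...</think>` tags.

```python
import re

# Matches a <think>...</think> reasoning block, including newlines,
# plus any whitespace that follows it.
THINK_RE = re.compile(r"<think>.*?</think>\s*", flags=re.DOTALL)


def strip_think(text: str) -> str:
    """Remove <think>...</think> reasoning blocks from a model reply."""
    return THINK_RE.sub("", text).strip()


def force_no_think(prompt: str) -> str:
    """Append Qwen3's /no_think soft switch unless a switch is already set."""
    if "/think" in prompt or "/no_think" in prompt:
        return prompt
    return prompt + " /no_think"


reply = "<think>Let me reason step by step...</think>The answer is 42."
print(strip_think(reply))        # -> The answer is 42.
print(force_no_think("Hello"))   # -> Hello /no_think
```

A real proxy would apply `force_no_think` to outgoing requests and `strip_think` to the streamed or complete response before relaying it to the client app.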
