-
Notifications
You must be signed in to change notification settings - Fork 11.8k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
vulkan: small fixes
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#13626
opened May 19, 2025 by
netrunnereve
Loading…
cuda: fix CMAKE_CUDA_COMPILER not found error (#13528)
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#13625
opened May 19, 2025 by
lizhenneng
Loading…
scripts: update pyproject.toml - deprecated poetry config + support uv
#13615
opened May 18, 2025 by
borgoat
Loading…
SYCL: Add non contiguous support in RMS_NORM and NORM kernels
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#13611
opened May 18, 2025 by
qnixsynapse
•
Draft
ggml: aarch64: Implement SVE F32 kernels for Mamba Model
ggml
changes relating to the ggml tensor library for machine learning
#13602
opened May 17, 2025 by
vineelabhinav
Loading…
ggml : add memset_tensor for rpc
ggml
changes relating to the ggml tensor library for machine learning
#13601
opened May 17, 2025 by
gkpln3
Loading…
ggml : fix race-condition in ggml-rpc
ggml
changes relating to the ggml tensor library for machine learning
#13600
opened May 17, 2025 by
gkpln3
Loading…
SYCL: Avoid using SYCL-Graph for unsupported nodes
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#13587
opened May 16, 2025 by
EwanC
Loading…
CUDA: skip fully masked-out KV in FA vec kernel
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#13584
opened May 16, 2025 by
JohannesGaessler
Loading…
server : separate the notion of position and KV tokens, remove prompt truncation
breaking change
Changes that break ABIs, APIs, file formats, or other forms of backwards compatibility.
examples
python
python script changes
server
#13576
opened May 15, 2025 by
ngxson
Loading…
gguf-py : add support for sub_type (in arrays) in GGUFWriter add_key_value method
python
python script changes
#13561
opened May 15, 2025 by
CISC
Loading…
Granite Four
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
python
python script changes
testing
Everything test related
#13550
opened May 14, 2025 by
gabe-l-hart
•
Draft
2 tasks
sycl : reviewing the backend documentation
documentation
Improvements or additions to documentation
examples
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#13544
opened May 14, 2025 by
Alcpz
Loading…
sycl: disable reorder for sycl mulmat
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#13536
opened May 14, 2025 by
sgeor255
Loading…
ci : upgraded oneAPI version in SYCL workflows and dockerfile
devops
improvements to build systems and github actions
#13532
opened May 14, 2025 by
Alcpz
Loading…
cuda: set cuda compiler path (#13527)
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#13528
opened May 14, 2025 by
lizhenneng
Loading…
webui: Add editing assistant messages (#11849)
examples
server
#13522
opened May 14, 2025 by
lr1729
Loading…
convert: Swap GLM4 EOS / EOT token
python
python script changes
#13505
opened May 13, 2025 by
henk717
Loading…
feat(server): Add tool call support to WebUI (LLama Server)
examples
server
#13501
opened May 13, 2025 by
samolego
Loading…
Previous Next
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.