Pull requests: ggerganov/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
remove convert-lora-to-ggml.py
refactoring
Refactoring
review complexity : low
Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#7204
opened May 10, 2024 by
slaren
Loading…
Fix(server): stopped_word always true
bugfix
fixes an issue or bug
review complexity : low
Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
server/webui
#7198
opened May 10, 2024 by
kurikomoe
Loading…
Add support for properly optimized Windows ARM64 builds with LLVM and MSVC
devops
improvements to build systems and github actions
review complexity : high
Generally require indepth knowledge of LLMs
#7191
opened May 10, 2024 by
max-krasnyansky
Loading…
CUDA: add FP32 FlashAttention vector kernel
Nvidia GPU
Issues specific to Nvidia GPUs
performance
Speed related topics
review complexity : high
Generally require indepth knowledge of LLMs
#7188
opened May 9, 2024 by
JohannesGaessler
Loading…
main : add token healing
enhancement
New feature or request
review complexity : high
Generally require indepth knowledge of LLMs
#7187
opened May 9, 2024 by
mare5x
Loading…
convert-hf : support bfloat16 conversion
performance
Speed related topics
review complexity : medium
Generally require more time to grok but manageable by beginner to medium expertise level
#7158
opened May 9, 2024 by
compilade
Loading…
3 of 4 tasks
ggml : rewrite silu and softmax for cpu
refactoring
Refactoring
review complexity : high
Generally require indepth knowledge of LLMs
#7154
opened May 9, 2024 by
jart
Loading…
Server: Fix system_prompt handling
bugfix
fixes an issue or bug
help wanted
Extra attention is needed
review complexity : low
Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#7153
opened May 8, 2024 by
ngxson
Loading…
perplexity: add BF16 vs. FP16 results
documentation
Improvements or additions to documentation
need feedback
Testing and feedback with results are needed
review complexity : low
Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#7150
opened May 8, 2024 by
JohannesGaessler
Loading…
chore: Add model vocab support
enhancement
New feature or request
review complexity : medium
Generally require more time to grok but manageable by beginner to medium expertise level
#7117
opened May 7, 2024 by
teleprint-me
•
Draft
tokenization: no double BOS tokens
refactoring
Refactoring
review complexity : medium
Generally require more time to grok but manageable by beginner to medium expertise level
#7107
opened May 6, 2024 by
JohannesGaessler
Loading…
Scripting & documenting debugging one test without anything else in the loop.
documentation
Improvements or additions to documentation
review complexity : low
Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#7096
opened May 6, 2024 by
josh-ramer
Loading…
Add left recursion check: quit early instead of going into an infinite loop
bugfix
fixes an issue or bug
review complexity : medium
Generally require more time to grok but manageable by beginner to medium expertise level
#7083
opened May 5, 2024 by
nuchi
Loading…
Script to convert Grok-1 weights from raw JAX pickle files.
enhancement
New feature or request
python
python script changes
review complexity : medium
Generally require more time to grok but manageable by beginner to medium expertise level
docs: Fix typo and update description for --embeddings flag
documentation
Improvements or additions to documentation
review complexity : low
Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#7026
opened May 1, 2024 by
louixs
Loading…
Added support for the ArcticForCausalLM.
enhancement
New feature or request
review complexity : medium
Generally require more time to grok but manageable by beginner to medium expertise level
#7020
opened May 1, 2024 by
fairydreaming
Loading…
Fix flash attention for ROCm
enhancement
New feature or request
review complexity : high
Generally require indepth knowledge of LLMs
#7011
opened Apr 30, 2024 by
jdecourval
•
Draft
add chatglm3-6b model support [help wanted]
enhancement
New feature or request
help wanted
Extra attention is needed
review complexity : medium
Generally require more time to grok but manageable by beginner to medium expertise level
new tokenizer-verifier tool to check gguf tokenizer parameters
enhancement
New feature or request
review complexity : low
Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#6988
opened Apr 29, 2024 by
anisse
Loading…
server: avoid breaking KV cache when prompt >= n_ctx
enhancement
New feature or request
review complexity : medium
Generally require more time to grok but manageable by beginner to medium expertise level
move ndk code to a new library
android
Issues specific to Android
refactoring
Refactoring
review complexity : low
Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#6951
opened Apr 27, 2024 by
eltonkola
Loading…
Option to split during conversion
enhancement
New feature or request
help wanted
Extra attention is needed
python
python script changes
review complexity : medium
Generally require more time to grok but manageable by beginner to medium expertise level
#6942
opened Apr 27, 2024 by
christianazinn
•
Draft
Updated server_queue to delete tasks from queue when server is shutdown. Feature Request #6421
demo
Demonstrate some concept or idea, not intended to be merged
review complexity : low
Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#6941
opened Apr 27, 2024 by
rahsuri
Loading…
Implemented basic interface for llamacheck and link to weights, adapt…
demo
Demonstrate some concept or idea, not intended to be merged
review complexity : medium
Generally require more time to grok but manageable by beginner to medium expertise level
#6940
opened Apr 27, 2024 by
Ferruolo
Loading…
Fix clip build on windows + clang
bugfix
fixes an issue or bug
review complexity : low
Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
Previous Next
ProTip!
Adding no:label will show everything without a label.