Pull requests: ggerganov/llama.cpp

Pull requests list

remove convert-lora-to-ggml.py
Labels: refactoring, review complexity : low
#7204 opened May 10, 2024 by slaren
Fix(server): stopped_word always true
Labels: bugfix, review complexity : low, server/webui
#7198 opened May 10, 2024 by kurikomoe
Add support for properly optimized Windows ARM64 builds with LLVM and MSVC
Labels: devops, review complexity : high
#7191 opened May 10, 2024 by max-krasnyansky
CUDA: add FP32 FlashAttention vector kernel
Labels: Nvidia GPU, performance, review complexity : high
#7188 opened May 9, 2024 by JohannesGaessler
main : add token healing
Labels: enhancement, review complexity : high
#7187 opened May 9, 2024 by mare5x
convert-hf : support bfloat16 conversion
Labels: performance, review complexity : medium
#7158 opened May 9, 2024 by compilade
3 of 4 tasks
ggml : rewrite silu and softmax for cpu
Labels: refactoring, review complexity : high
#7154 opened May 9, 2024 by jart
Server: Fix system_prompt handling
Labels: bugfix, help wanted, review complexity : low
#7153 opened May 8, 2024 by ngxson
perplexity: add BF16 vs. FP16 results
Labels: documentation, need feedback, review complexity : low
#7150 opened May 8, 2024 by JohannesGaessler
chore: Add model vocab support
Labels: enhancement, review complexity : medium
#7117 opened May 7, 2024 by teleprint-me (Draft)
tokenization: no double BOS tokens
Labels: refactoring, review complexity : medium
#7107 opened May 6, 2024 by JohannesGaessler
Scripting & documenting debugging one test without anything else in the loop.
Labels: documentation, review complexity : low
#7096 opened May 6, 2024 by josh-ramer
Add left recursion check: quit early instead of going into an infinite loop
Labels: bugfix, review complexity : medium
#7083 opened May 5, 2024 by nuchi
Script to convert Grok-1 weights from raw JAX pickle files.
Labels: enhancement, python, review complexity : medium
#7058 opened May 3, 2024 by heiner (Draft)
docs: Fix typo and update description for --embeddings flag
Labels: documentation, review complexity : low
#7026 opened May 1, 2024 by louixs
Added support for the ArcticForCausalLM.
Labels: enhancement, review complexity : medium
#7020 opened May 1, 2024 by fairydreaming
Fix flash attention for ROCm
Labels: enhancement, review complexity : high
#7011 opened Apr 30, 2024 by jdecourval (Draft)
add chatglm3-6b model support [help wanted]
Labels: enhancement, help wanted, review complexity : medium
#6999 opened Apr 30, 2024 by mnlife (Draft)
new tokenizer-verifier tool to check gguf tokenizer parameters
Labels: enhancement, review complexity : low
#6988 opened Apr 29, 2024 by anisse
server: avoid breaking KV cache when prompt >= n_ctx
Labels: enhancement, review complexity : medium
#6958 opened Apr 28, 2024 by prfd (Draft)
move ndk code to a new library
Labels: android, refactoring, review complexity : low
#6951 opened Apr 27, 2024 by eltonkola
Option to split during conversion
Labels: enhancement, help wanted, python, review complexity : medium
#6942 opened Apr 27, 2024 by christianazinn (Draft)
Updated server_queue to delete tasks from queue when server is shutdown. Feature Request #6421
Labels: demo, review complexity : low
#6941 opened Apr 27, 2024 by rahsuri
Implemented basic interface for llamacheck and link to weights, adapt…
Labels: demo, review complexity : medium
#6940 opened Apr 27, 2024 by Ferruolo
Fix clip build on windows + clang
Labels: bugfix, review complexity : low
#6934 opened Apr 26, 2024 by dhiltgen (Draft)