Skip to content

Actions: predibase/lorax

docs

Actions

Loading...

Show workflow options

Create status badge

223 workflow runs
223 workflow runs
Event

Filter by event

Status

Filter by status

Branch
Actor

Filter by actor

try out an integration test workflow (#516)
docs #223: Commit cfc1e19 pushed by noyoshi
June 14, 2024 17:21 36s main
June 14, 2024 17:21 36s
Fix issue with GQA initialization for Qwen2 (#514)
docs #222: Commit 9bed4da pushed by arnavgarg1
June 13, 2024 19:37 35s main
June 13, 2024 19:37 35s
fix batching bug (#513)
docs #221: Commit 835d19c pushed by tgaddair
June 12, 2024 21:38 33s main
June 12, 2024 21:38 33s
Fixed case where loaded lora adapter has no segments (#510)
docs #220: Commit 432be6e pushed by tgaddair
June 12, 2024 04:03 31s main
June 12, 2024 04:03 31s
feat: return usage in ChatCompletionStreamResponse (#506)
docs #219: Commit 4187cab pushed by tgaddair
June 11, 2024 16:33 30s main
June 11, 2024 16:33 30s
Add distilbert (#508)
docs #218: Commit 84fb56d pushed by magdyksaleh
June 10, 2024 22:09 27s main
June 10, 2024 22:09 27s
Bert to gpu (#507)
docs #217: Commit f5e71bd pushed by magdyksaleh
June 10, 2024 21:31 33s main
June 10, 2024 21:31 33s
Add support for batching to embedder models (#503)
docs #216: Commit e8f3d33 pushed by tgaddair
June 8, 2024 05:34 31s main
June 8, 2024 05:34 31s
hqq upgrades (#491)
docs #215: Commit 1b528e0 pushed by tgaddair
June 6, 2024 16:25 36s main
June 6, 2024 16:25 36s
Fixed phi-3 with Su Rotary Embedding (#499)
docs #214: Commit c71861a pushed by tgaddair
June 5, 2024 16:21 32s main
June 5, 2024 16:21 32s
Revert AWQ to stable commit (#498)
docs #213: Commit b2ea56e pushed by tgaddair
June 4, 2024 21:57 32s main
June 4, 2024 21:57 32s
Update Makefile-awq (#493)
docs #212: Commit b1db967 pushed by tgaddair
June 4, 2024 05:23 29s main
June 4, 2024 05:23 29s
Bump client to v0.6.1 (#496)
docs #211: Commit 319183e pushed by tgaddair
June 3, 2024 22:43 31s main
June 3, 2024 22:43 31s
June 3, 2024 20:37 29s
Fix quant cache OOM (#494)
docs #209: Commit 26e0982 pushed by tgaddair
May 30, 2024 05:36 29s main
May 30, 2024 05:36 29s
add missed dtypes for 8bit kv cache (#490)
docs #208: Commit 7d6b1d4 pushed by tgaddair
May 28, 2024 21:13 34s main
May 28, 2024 21:13 34s
Fix issue with Medusa batch load signature (#492)
docs #207: Commit f2193f0 pushed by tgaddair
May 28, 2024 16:19 33s main
May 28, 2024 16:19 33s
Embedder Service v0 with FlashBert (#385)
docs #206: Commit e37549e pushed by tgaddair
May 25, 2024 21:15 34s main
May 25, 2024 21:15 34s
fix: load tokenizer/config with trust_remote_code (#476)
docs #205: Commit feb69c4 pushed by tgaddair
May 25, 2024 20:14 28s main
May 25, 2024 20:14 28s
start porting latest tgi (#480)
docs #204: Commit a2ca687 pushed by tgaddair
May 24, 2024 22:52 30s main
May 24, 2024 22:52 30s
Fix for the LM_HEAD issue (#475)
docs #203: Commit da90421 pushed by tgaddair
May 23, 2024 20:28 28s main
May 23, 2024 20:28 28s
chore: update infer.rs (#487)
docs #202: Commit 2481e70 pushed by tgaddair
May 23, 2024 19:34 28s main
May 23, 2024 19:34 28s
Bump lorax client v0.6.0 (#488)
docs #201: Commit bd7db80 pushed by tgaddair
May 23, 2024 16:44 26s main
May 23, 2024 16:44 26s
Bump Lorax Client to 3.9 (#486)
docs #200: Commit 0a33ea8 pushed by tgaddair
May 22, 2024 17:49 33s main
May 22, 2024 17:49 33s
Support jointly trained Medusa + LoRA adapters (#482)
docs #199: Commit a1ff52d pushed by tgaddair
May 22, 2024 17:48 30s main
May 22, 2024 17:48 30s