Skip to content

WIP: Use DirectStorage with CUDA interop to more efficient load tensors #2218

WIP: Use DirectStorage with CUDA interop to more efficient load tensors

WIP: Use DirectStorage with CUDA interop to more efficient load tensors #2218

Triggered via pull request June 10, 2024 16:39
@mtavenrathmtavenrath
reopened #7796
Status Failure
Total duration 1d 12h 56m 20s
Artifacts

bench.yml

on: pull_request_target
Matrix: bench-server-baseline
Fit to window
Zoom out
Zoom in

Annotations

3 errors
bench-server-baseline (phi-2, q8_0)
This request was automatically failed because there were no enabled runners online to process the request for more than 1 days.
bench-server-baseline (phi-2, q4_0)
This request was automatically failed because there were no enabled runners online to process the request for more than 1 days.
bench-server-baseline (phi-2, f16)
This request was automatically failed because there were no enabled runners online to process the request for more than 1 days.