Issues: ray-project/kuberay
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
[Feature] Support GCS fault tolerance without external dependencies like Redis
enhancement
New feature or request
triage
#2162
opened May 22, 2024 by
shaikhismail
2 tasks done
[Bug] Fail the job, if the head node crashes
bug
Something isn't working
triage
#2161
opened May 21, 2024 by
peterghaddad
2 tasks done
[Bug] Readiness probe failed: timeout on minikube
bug
Something isn't working
raycluster
#2158
opened May 20, 2024 by
anovv
1 of 2 tasks
[Feature] Should we also set PublishNotReadyAddresses if the service is not headless?
enhancement
New feature or request
raycluster
#2157
opened May 19, 2024 by
rueian
2 tasks done
[Feature] Checkpoint API to recover from checkpoint from previous runs
1.2.0
enhancement
New feature or request
long-running job
rayjob
#2155
opened May 17, 2024 by
sathyanarays
1 of 2 tasks
[Bug] RayJob falsely marked as "Running" when driver fails
1.2.0
bug
Something isn't working
long-running job
rayjob
#2154
opened May 17, 2024 by
sathyanarays
1 of 2 tasks
FT GCS should handle draining of node where head pod is scheduled
1.2.0
enhancement
New feature or request
gcs ft
long-running job
rayjob
#2153
opened May 17, 2024 by
abatilo
1 of 2 tasks
[Bug] RayJob does not work when Something isn't working
raycluster
app.kubernetes.io/name
is set
1.2.0
bug
#2147
opened May 14, 2024 by
kwohlfahrt
2 tasks done
[Feature] RayService CRD to have ImagePullSecret Reference
enhancement
New feature or request
triage
#2137
opened May 10, 2024 by
roverkinz
1 of 2 tasks
[Bug] [raycluster-controller] Kuberay cannot recreate new raycluster header pod when it has been evicted by kubelet as disk pressure
bug
Something isn't working
triage
#2125
opened May 8, 2024 by
xjhust
2 tasks done
[Feature] [API Server] [RFC] Add persistence for job history using a SQL database
apiserver
enhancement
New feature or request
#2114
opened May 2, 2024 by
han-steve
2 tasks done
[Bug] [API Server] Can't specify cluster rayVersion in Ray Job
apiserver
bug
Something isn't working
#2109
opened Apr 30, 2024 by
han-steve
1 task done
[Bug] Ray Head access to extra GPU resources
bug
Something isn't working
gpu
#2098
opened Apr 23, 2024 by
shaowei-su
2 tasks done
[Bug] Misleading error message in RayService when upgrading to KubeRay v1.1.0
bug
Something isn't working
rayservice
#2088
opened Apr 18, 2024 by
kevin85421
1 of 2 tasks
[Bug] Priority Class Name from worker group spec not forwarded to final templated yaml files
bug
Something isn't working
triage
#2086
opened Apr 17, 2024 by
sam-h-bean
2 tasks done
[Bug] What's the relationship between watching Something isn't working
rayservice
Endpoints
and RayService e2e tests?
bug
#2085
opened Apr 17, 2024 by
kevin85421
1 of 2 tasks
[Feature] Ray cluster launcher support for GKE
enhancement
New feature or request
triage
#2083
opened Apr 16, 2024 by
richardsliu
2 tasks done
[Feature] Publish New feature or request
triage
python-client
to PyPI
enhancement
#2078
opened Apr 12, 2024 by
danielgafni
2 tasks done
[Feature] KubeRay Scalability Benchmarking
enhancement
New feature or request
#2069
opened Apr 5, 2024 by
andrewsykim
2 tasks done
[Feature] Support dynamic refresh of watched namespaces
enhancement
New feature or request
operation
#2061
opened Apr 4, 2024 by
marton-bod
2 tasks done
[Bug] Fail to reconcileServe
bug
Something isn't working
triage
#2057
opened Apr 1, 2024 by
LronDC
2 tasks done
[Feature] Support extended kube-scheduler as batch scheduler
enhancement
New feature or request
triage
#2052
opened Mar 29, 2024 by
KunWuLuan
2 tasks done
[Feature] [API Server] Allow autoscaling in python api server client
apiserver
enhancement
New feature or request
#2029
opened Mar 20, 2024 by
smit-kiri
1 of 2 tasks
[Bug] As Rayservice CR - serveConfigV2 has the list of the Applications. If the number of applications increases the applications and deployment configuration part increases which might increase the CR size. In this case as the size of cr object is limited by etcd how should we handle the case where number of applications increases
bug
Something isn't working
triage
#2027
opened Mar 19, 2024 by
rajendra-avesha
1 of 2 tasks
Previous Next
ProTip!
Follow long discussions with comments:>50.