Skip to content

Leaderworkerset v0.2.0

Latest
Compare
Choose a tag to compare
@liurupeng liurupeng released this 19 Apr 18:47
78268be

Features:

  • Support RollingUpdate with MaxUnavailable
  • Allow Prometheus to gather metrics gathered by controller-runtime
  • Fix TPU env var assignment when leader pod doesn't request TPU
  • User guide to deploy multi-host inference with Saxml
  • Increase qps limit for pod scheduling
  • Setup E2E test and improve test coverage

Acknowledgments

Thanks to our contributors in this release, in alphabetic order:
@ahg-g @Bslabe123 @Edwinhr716 @googs1025 @kannon92 @kerthcet @liurupeng @nayihz @Zeel-Patel