Skip to content
View HandH1998's full-sized avatar
  • Beijing
  • 11:00 (UTC +08:00)
Block or Report

Block or report HandH1998

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. vllm-project/vllm vllm-project/vllm Public

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Python 21.8k 3.1k

  2. bytedance/lightseq bytedance/lightseq Public

    LightSeq: A High Performance Library for Sequence Processing and Generation

    C++ 3.1k 325

  3. microsoft/Megatron-DeepSpeed microsoft/Megatron-DeepSpeed Public

    Forked from NVIDIA/Megatron-LM

    Ongoing research training transformer language models at scale, including: BERT & GPT-2

    Python 1.7k 328

  4. AniZpZ/AutoSmoothQuant AniZpZ/AutoSmoothQuant Public

    An easy-to-use package for implementing SmoothQuant for LLMs

    Python 66 4

  5. zhengjian2322/net2net zhengjian2322/net2net Public

    Python 4 1

  6. QQQ QQQ Public

    QQQ is an innovative and hardware-optimized W4A8 quantization solution.

    Python 22