Skip to content
@mit-han-lab

MIT HAN Lab

Efficient AI Computing. PI: Song Han

Pinned

  1. streaming-llm streaming-llm Public

    [ICLR 2024] Efficient Streaming Language Models with Attention Sinks

    Python 6.2k 352

  2. smoothquant smoothquant Public

    [ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

    Python 1k 116

  3. llm-awq llm-awq Public

    [MLSys 2024] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

    Python 1.9k 134

  4. bevfusion bevfusion Public

    [ICRA'23] BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation

    Python 2k 368

  5. once-for-all once-for-all Public

    [ICLR 2020] Once for All: Train One Network and Specialize it for Efficient Deployment

    Python 1.8k 332

  6. temporal-shift-module temporal-shift-module Public

    [ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding

    Python 2k 417

Repositories

Showing 10 of 50 repositories
  • TinyChatEngine Public

    TinyChatEngine: On-Device LLM Inference Library

    C++ 557 MIT 52 23 2 Updated May 13, 2024
  • lmquant Public
    Python 45 Apache-2.0 0 0 0 Updated May 12, 2024
  • qserve Public

    QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving

    Python 179 Apache-2.0 3 4 0 Updated May 11, 2024
  • efficientvit Public

    EfficientViT is a new family of vision models for efficient high-resolution vision.

    Python 1,437 Apache-2.0 126 67 0 Updated May 10, 2024
  • llm-awq Public

    [MLSys 2024] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

    Python 1,867 MIT 134 97 6 Updated May 8, 2024
  • distrifuser Public

    [CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models

    Python 444 MIT 11 1 0 Updated May 5, 2024
  • spatten-llm Public

    [HPCA'21] SpAtten: Efficient Sparse Attention Architecture with Cascade Token and Head Pruning

    Scala 48 MIT 3 1 0 Updated May 4, 2024
  • torchquantum Public

    A PyTorch-based framework for Quantum Classical Simulation, Quantum Machine Learning, Quantum Neural Networks, Parameterized Quantum Circuits with support for easy deployments on real quantum computers.

    Jupyter Notebook 1,194 MIT 169 58 (5 issues need help) 4 Updated Apr 30, 2024
  • smoothquant Public

    [ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

    Python 1,037 MIT 116 53 1 Updated Apr 29, 2024
  • sparsevit Public

    [CVPR'23] SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution Vision Transformer

    Python 53 Apache-2.0 2 1 0 Updated Apr 24, 2024

People

This organization has no public members. You must be a member to see who’s a part of this organization.