perf: bf16 kernels #1664

Closed
wants to merge 17 commits into from

Commits on Dec 20, 2023

  1. Add bf16 kernels

    rok committed Dec 20, 2023
    15be40f
  2. lint

    rok committed Dec 20, 2023
    23cab08
  3. changing build script

    rok committed Dec 20, 2023
    0322f75
  4. simd flags for avx512bf16

    rok committed Dec 20, 2023
    2f71829
  5. change build flags

    rok committed Dec 20, 2023
    53da635
  6. linx bf16 flag

    rok committed Dec 20, 2023
    355db1a
  7. Change instructions string

    rok committed Dec 20, 2023
    a74d351
  8. changes

    rok committed Dec 20, 2023
    e8867e0
  9. Add avx512bf16 feature flag

    rok committed Dec 20, 2023
    f74600d
  10. Work

    rok committed Dec 20, 2023
    f02949c
  11. minor change

    rok committed Dec 20, 2023
    7e4194f
  12. add norm_l2

    rok committed Dec 20, 2023
    458e5fa
  13. minor

    rok committed Dec 20, 2023
    d26a66b
  14. Review feedback

    rok committed Dec 20, 2023
    ad9b2e8
  15. lint

    rok committed Dec 20, 2023
    fca0038
  16. remove mac target, limit to linux

    rok committed Dec 20, 2023
    cf64297

Commits on Dec 21, 2023

  1. map cosine and dot kernels

    rok committed Dec 21, 2023
    c1c3a16