Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

add built-in matrix multiplication with sizes between 2x2 and 8192x8192 #27

Open
tugrul512bit opened this issue Apr 13, 2017 · 0 comments
Assignees

Comments

@tugrul512bit
Copy link
Owner

batched 2x2 4x4 16x16 32x32
single 8k x 8k with sub-matrix partitioning to increase load balancing

N-levels of partitioning (4,16,64,256 sub matrices)
or M-levels of batching

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
Cekirdekler.dll
Awaiting triage
Development

No branches or pull requests

1 participant