[NDTensors] Fix contracting dense with diag on GPU #1453

kmp5VT · 2024-05-20T15:49:05Z

Description

In reference to this bug report

This bug is two parted. First I am working to make it possible to call tr on GPU based Tensors. The issue here is a scalar indexing problem where the code tries to getdiagindex of the tensor. For now I have tried to use @allowscalar and expose to solve the issue. Timing wise I have found CUDA based tr with @allowscalar is roughly 2x faster than CPU based trace.

The second bug is that delta functions are not being properly constructed on GPU. This is, impart, due to delta being a UniformDiag which does not carry information about where the diag will be allocated and assumes CPU. I am not sure yet how to fix this problem. Potentially we could replace the datatype of UniformDiag from <:Number to <:UnallocatedFill{<:Number}

Checklist:

Trace works on CPU and GPU.
Performance of trace on GPU is equal to or better than CPU
Delta functions can be constructed on GPU
All current unittests pass
Unittests are made for the failing cases which prompted this PR.
Ensure that all solutions also work with quantum numbers and block sparsity

mtfishman · 2024-05-20T15:57:29Z

I am not sure yet how to fix this problem. Potentially we could replace the datatype of UniformDiag from <:Number to <:UnallocatedFill{<:Number}

I would prefer not going that route since that is a much more involved change that would likely require rewriting a lot of the UniformDiag code (which is best left for the new DiagonalArrays design). It would be better to try to solve it in a more narrow way.

NDTensors/src/diag/tensoralgebra/contract.jl

NDTensors/ext/NDTensorsCUDAExt/indexing.jl

Co-authored-by: Matt Fishman <[email protected]>

kmp5VT · 2024-05-22T18:43:51Z

@mtfishman So I am pushing an idea on how to fix the issue with \delta it involves reworking the dense function to look like this

dense(T::Tensor) = dense(unwrap_array_type(T), T)
dense(datat::Type{<:AbstractArray}, T::Tensor) = setstorage(T, adapt(datat, dense(storage(T))))

Since the dense for Diag already looks like this

function dense(T::DiagTensor)
  return dense(unwrap_array_type(T), T)
end

Update* I just checked and the code I pushed fixes both of the errors in the bug report on metal. Testing the other backends now.

mtfishman · 2024-05-22T19:07:50Z

What about using expose for that dispatch?

kmp5VT · 2024-05-22T19:36:22Z

@mtfishman For the dispatch of the dense function?

mtfishman · 2024-05-22T20:21:58Z

@mtfishman For the dispatch of the dense function?

Yes.

codecov-commenter · 2024-05-28T16:03:08Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 44.14%. Comparing base (e08e131) to head (0dec0e5).
Report is 7 commits behind head on main.

❗ Current head 0dec0e5 differs from pull request most recent head 422159f

Please upload reports for the commit 422159f to get more accurate results.

❗ Your organization needs to install the Codecov GitHub app to enable full functionality.

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #1453      +/-   ##
==========================================
+ Coverage   43.65%   44.14%   +0.49%     
==========================================
  Files         136      144       +8     
  Lines        8806     9374     +568     
==========================================
+ Hits         3844     4138     +294     
- Misses       4962     5236     +274

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

…_1450

NDTensors/ext/NDTensorsGPUArraysCoreExt/contract.jl

Co-authored-by: Matt Fishman <[email protected]>

NDTensors/test/test_diag.jl

mtfishman · 2024-05-29T12:51:54Z

@kmp5VT this looks good to me.

The only remaining thing I can see to do is fix tensor contractions involving block sparse tensors with diagonal blocks. What's the status of that? I think we can fix that in a follow-up PR, I guess that can be fixed in a similar way (say by calling denseblocks and then moving the tensor to GPU).

kmp5VT · 2024-05-29T13:56:07Z

@kmp5VT this looks good to me.

The only remaining thing I can see to do is fix tensor contractions involving block sparse tensors with diagonal blocks. What's the status of that? I think we can fix that in a follow-up PR, I guess that can be fixed in a similar way (say by calling denseblocks and then moving the tensor to GPU).

@mtfishman Thats a good point. I hadn't checked the blocksparse implementation. I ran this code

using ITensorMPS: MPO, siteinds
using LinearAlgebra: tr
using Metal: mtl
s = siteinds("S=1/2", 2; conserve_qns=true)
o = MPO(s, "Id")
tr(mtl(o))

and it currently fails because of scalar indexing (because its fed into the cpu dense * diag code). I found that in diagblocksparse I can expose the block calls to contract which fixes the behavior

mtfishman · 2024-05-29T15:41:51Z

Great, glad to see it was a simple fix, simpler than I was picturing.

Seems like the only thing left is to add a test for the block sparse case, after that is this good to go?

NDTensors/test/test_blocksparse.jl

NDTensors/ext/NDTensorsGPUArraysCoreExt/contract.jl

mtfishman · 2024-05-31T18:38:21Z

Besides the final comment, this looks good, thanks.

kmp5VT added 2 commits May 20, 2024 11:42

Make an expose version of getdiagindex

5ae6ae6

format

ae40536

kmp5VT marked this pull request as draft May 20, 2024 15:49

mtfishman reviewed May 20, 2024

View reviewed changes

NDTensors/src/diag/tensoralgebra/contract.jl Outdated Show resolved Hide resolved

mtfishman changed the title ~~[NDTensors][bug] Tr on GPU~~ [NDTensors] Fix contracting dense with diag on GPU May 20, 2024

kmp5VT added 6 commits May 20, 2024 12:39

revert initial commit

534f2eb

Update to work through contract

a1a80a1

format

3a9244c

Merge branch 'main' into kmp5/debug/issue_1450

55354d8

Add a test for UniformDiag on GPU. Working on more

50fe36d

forgot @test

29a506d

mtfishman reviewed May 21, 2024

View reviewed changes

NDTensors/ext/NDTensorsCUDAExt/indexing.jl Outdated Show resolved Hide resolved

kmp5VT and others added 2 commits May 22, 2024 10:34

Update NDTensors/ext/NDTensorsCUDAExt/indexing.jl

b129479

Co-authored-by: Matt Fishman <[email protected]>

Merge branch 'main' into kmp5/debug/issue_1450

4bb306d

mtfishman added NDTensors Requires changes to the NDTensors.jl library. GPU labels May 27, 2024

kmp5VT and others added 3 commits May 28, 2024 11:25

Remove GPUArraysCore from Library to make Extension

2059288

Merge branch 'ITensor:main' into kmp5/redo/just_refactor_contract

9a3cac0

Force arraytype to be a vector for now

16d432b

Updates to test

08b41c6

kmp5VT force-pushed the kmp5/debug/issue_1450 branch from 628e330 to 2059288 Compare May 28, 2024 16:09

kmp5VT added 2 commits May 28, 2024 12:44

Merge branch 'kmp5/redo/just_refactor_contract' into kmp5/debug/issue…

915e17b

…_1450

Add GPUArraysCore to extras

9a48ca0

kmp5VT force-pushed the kmp5/debug/issue_1450 branch from 1cdde58 to 9a48ca0 Compare May 28, 2024 16:45

kmp5VT added 4 commits May 28, 2024 17:37

Merge branch 'main' into kmp5/debug/issue_1450

5c6bd99

Restrict to unifiedDiag

ac0b84d

format

7628db7

Merge branch 'main' into kmp5/debug/issue_1450

195132a

mtfishman reviewed May 28, 2024

View reviewed changes

NDTensors/ext/NDTensorsGPUArraysCoreExt/contract.jl Outdated Show resolved Hide resolved

Update NDTensors/ext/NDTensorsGPUArraysCoreExt/contract.jl

fe7e547

Co-authored-by: Matt Fishman <[email protected]>

mtfishman reviewed May 29, 2024

View reviewed changes

NDTensors/test/test_diag.jl Outdated Show resolved Hide resolved

Use approx over == and add rtol

422159f

Fix blocksparse behavior

e62442a

mtfishman marked this pull request as ready for review May 29, 2024 18:36

kmp5VT added 3 commits May 29, 2024 15:45

Add todo message

2e5dd99

We can test blocksparse contract using SVD.

dd2d6db

format

de4bcc4

kmp5VT commented May 29, 2024

View reviewed changes

NDTensors/test/test_blocksparse.jl Outdated Show resolved Hide resolved

kmp5VT added 7 commits May 29, 2024 15:49

add contract

e1829fd

Merge branch 'main' into kmp5/debug/issue_1450

5bab226

Revert SVD tests

0ff79ea

Remove contract

d839bc7

Add contract tests for BlockSparseDiag

e19fc95

format

3cf5bbe

Merge branch 'main' into kmp5/debug/issue_1450

e459c54

mtfishman reviewed May 31, 2024

View reviewed changes

NDTensors/ext/NDTensorsGPUArraysCoreExt/contract.jl Show resolved Hide resolved

alphabetize

6f45c0a

mtfishman merged commit 99baf1d into ITensor:main May 31, 2024
15 checks passed

This was referenced Jun 20, 2024

[NDTensors] [BUG] tr on GPU #1450

Closed

Using 'expand' with GPU backend ITensor/ITensorMPS.jl#24

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[NDTensors] Fix contracting dense with diag on GPU #1453

[NDTensors] Fix contracting dense with diag on GPU #1453

kmp5VT commented May 20, 2024 •

edited

mtfishman commented May 20, 2024 •

edited

kmp5VT commented May 22, 2024 •

edited

mtfishman commented May 22, 2024

kmp5VT commented May 22, 2024

mtfishman commented May 22, 2024

codecov-commenter commented May 28, 2024 •

edited

mtfishman commented May 29, 2024

kmp5VT commented May 29, 2024

mtfishman commented May 29, 2024

mtfishman commented May 31, 2024

[NDTensors] Fix contracting dense with diag on GPU #1453

[NDTensors] Fix contracting dense with diag on GPU #1453

Conversation

kmp5VT commented May 20, 2024 • edited

Description

Checklist:

mtfishman commented May 20, 2024 • edited

kmp5VT commented May 22, 2024 • edited

mtfishman commented May 22, 2024

kmp5VT commented May 22, 2024

mtfishman commented May 22, 2024

codecov-commenter commented May 28, 2024 • edited

Codecov Report

mtfishman commented May 29, 2024

kmp5VT commented May 29, 2024

mtfishman commented May 29, 2024

mtfishman commented May 31, 2024

kmp5VT commented May 20, 2024 •

edited

mtfishman commented May 20, 2024 •

edited

kmp5VT commented May 22, 2024 •

edited

codecov-commenter commented May 28, 2024 •

edited