Features/436 svd #1009
base: features/436-SVD
Conversation
…m matrices it is working fine.
…re more efficient and better now.
…put matrix into 3 parts/tensors using prep_halo
…ss does bi_diagonalize for its own part and sends the changes to neighbouring ranks. Currently, halfway through it.
…pdate halo git s properly to get the correct bidiagonal matrix finally.
… Implemented as given in 2-3 research papers.
…re. Slicing properly for each matrix is becoming a little difficult.
…to not properly applying one of the transformations on Ej.
…e final resulting matrix is also coming out perfectly bidiagonal, but some checking should be done for larger matrices with mostly zeros.
…ay to the bi_diagonalize() function. Now it is working perfectly fine in multiple processes.
…rrors occurred as expected and resolved almost all of them. Now the algorithm is even better; as of now there are no errors and the resulting matrix is coming out exactly bidiagonal for m>=n and diagonal for m<n.
…. Should check the command with which the code should be run...
…s and now should keep comm.send at the correct location.
… one process until the next one gives it something. This should be managed accordingly.
…unning, the program is not completing.
… inputs. Implementation uses 2 processes.
for more information, see https://pre-commit.ci
…into features/436-SVD
looks great so far!
I left a few comments on how to make the code a bit cleaner. The next step is to break the algorithm down into smaller parts and explain in comments what is going on in each of them. This will greatly help the next person who reads the code. For example, where does tag 3 come from?
I would rather see comments that explain too much than comments that explain too little; we can always shorten them later.
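(Illustration, not from the PR: one way to answer the "where does tag 3 come from?" question in code is to name the tags as constants. The constant names and meanings below are guesses, assuming an mpi4py-style communicator as used in the diff.)

from mpi4py import MPI

comm = MPI.COMM_WORLD

# Hypothetical tag names; the actual role of each tag in bcg.py may differ.
TAG_SUBMATRIX = 1    # initial distribution of matrix blocks
TAG_U_FACTOR = 2     # left transformation factors sent from rank 0
TAG_HALO_UPDATE = 3  # halo exchange with a neighbouring rank

# e.g. instead of comm.recv(source=0, tag=2):
# Uj = comm.recv(source=0, tag=TAG_U_FACTOR)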
heat/core/linalg/bcg.py
Outdated
if j == 2:
    if rank == 1:
these can be combined into a single statement
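For reference, the combined form of the two conditions in the diff above:

if j == 2 and rank == 1:
    ...  # body unchanged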
Done sir, 👍
heat/core/linalg/bcg.py
Outdated
if Ej.size(0) > 0 and Ej.size(1) > 0:
    Uj = comm.recv(source=0, tag=2)
    Ej = torch.matmul(Uj.float(), Ej.float())
do you need to call .float() here? They should already be the same dtype that they were when they were sent.
"RuntimeError: expected scalar type Float but found Double" is coming if I remove it at some places. So, I kept them where ever required.
heat/core/linalg/bcg.py
Outdated
# print("b: ", b)
U1, vt1 = ht.eye(m, dtype=ht.float64), ht.eye(n, dtype=ht.float64)
since we don't know that the input will be a float64 matrix, you should use the dtype of the input DNDarray (arr.dtype)
Changed it to arr.dtype
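The resulting line would look roughly like this (m, n, and arr as in the surrounding function):

import heat as ht

# Identity factors inherit the dtype of the input DNDarray
# instead of hard-coding float64.
U1, vt1 = ht.eye(m, dtype=arr.dtype), ht.eye(n, dtype=arr.dtype)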
heat/core/linalg/bcg.py
Outdated
# print("This one:", arr) | ||
# req4 = comm.irecv(source=0,tag=1) | ||
# arr = req4.wait() | ||
Ej = arr[p_left:p_right, i + (j - 2) * b + 1 : i + 1 + (j - 1) * b] |
since you use this multiple times, it would probably help readability if you saved i + (j - 2) * b + 1 as a variable. The same goes for the other slice start/stop indices.
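Sketched with illustrative names for the bounds:

# Column window of the block handled in this step; names are illustrative.
col_start = i + (j - 2) * b + 1
col_stop = i + 1 + (j - 1) * b
Ej = arr[p_left:p_right, col_start:col_stop]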
…research paper mentioned here will be the best way to get a good idea.
@Hmm-its-me have you merged the latest version of your base branch? That will include the latest version of the CI. You should have a PR from @bhagemeier waiting to be merged. Thanks!
Thanks, @ClaudiaComito ma'am, I have merged it, and now all the checks pass 👍
@Sai-Suraj-27 can you resolve conflicts? I merged …
@ClaudiaComito Done... 👍
@Sai-Suraj-27 you haven't pulled in the latest changes. Check out the …
@ClaudiaComito Sorry, my bad, I have pulled the latest changes 👍
I will try to complete the review as soon as possible
Review of "Features/436-SVD #1009"
Since I have not been involved in mentoring this GSoC'22 project, the following comments are rather general, may reflect gaps in my own knowledge, and sometimes do not address specific parts of the code.
The code is well-structured and formatted, thus readable, and appropriate documentation is available for almost all functions, in particular for all functions that are exposed to the user. However, I have the following comments:
- As far as I understand from the title of the PR, the original purpose of this branch was to add an SVD. The two main contributions I can find, namely bi_diagonalize and block_diagonalize, are important steps towards this and --from a mathematical point of view-- certainly also the main steps. Nevertheless, I believe that it would make sense to shift block_diagonalize from the file svd.py to utils.py, since this routine is an intermediate step that will usually not be exposed to the user. This is roughly the same for the unit tests, where bi_diagonalize is tested under the name svd, which is a bit misleading. (Such refactoring, however, could also be done at a later stage of the project…)
- The function block_diagonalize still prints some output for debugging. This additional output can be removed.
- The function block_diagonalize does not seem to scale well (see the plot attached). I tested this on up to 24 MPI processes (2 threads each) on up to 2 nodes of the HDFML cluster. I would have expected a decrease in computing time when increasing the number of processes. Nevertheless, it may be the case that my test matrices (size 7500x7500) were just too small to see scaling effects…
- The function bi_diagonalize is not found, neither by heat.linalg.bi_diagonalize nor heat.linalg.svd.bi_diagonalize. Is this intended or did I do something wrong? (See the sketch after this list.)
- Anyway, after having a look at the code of bi_diagonalize (without testing it practically), I have the impression that the code looks very "serial" (loops over the matrix sizes). For large matrices this may result in an infeasibly large computing time.
- As far as I see, the unit tests for bi_diagonalize are still missing.
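On the fourth point, a sketch of how bi_diagonalize could be exposed, assuming Heat follows its usual pattern of an __all__ list in the module plus a star-import in the subpackage __init__; the file layout follows this PR and may differ in reality:

# heat/core/linalg/svd.py
__all__ = ["bi_diagonalize", "block_diagonalize"]

# heat/core/linalg/__init__.py
from .svd import *

# after which this should resolve:
#   import heat as ht
#   B = ht.linalg.bi_diagonalize(arr)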
Since this PR does not contain high-level routines ready to be used by non-experts and it does not seem to be active, I am closing it (as discussed in the second-to-last PR talk) in order to clean up our PR list. Re-opening is possible, of course.
Reopening this PR after internal discussion. More work is needed, so setting it to Draft.
This pull request is stale because it has been open for 60 days with no activity. |
Description
Bulge chasing algorithm to transform the given matrix into a bidiagonal matrix (upper bidiagonal when m >= n, lower bidiagonal when m < n).
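For illustration only (not part of this PR), a small torch check that an output really is upper bidiagonal, i.e. zero outside the main diagonal and the first superdiagonal:

import torch

def is_upper_bidiagonal(A: torch.Tensor, atol: float = 1e-8) -> bool:
    # Mark the main diagonal and the first superdiagonal, then check
    # that every entry outside that mask is (numerically) zero.
    m, n = A.shape
    mask = torch.zeros(m, n, dtype=torch.bool)
    for k in range(min(m, n)):
        mask[k, k] = True
        if k + 1 < n:
            mask[k, k + 1] = True
    return bool(torch.all(A[~mask].abs() < atol))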
Changes proposed:
Type of change
Changes Remaining
skip ci