
test: switch from curl to urllib for HTTP requests #29970

Open · wants to merge 1 commit into master from feature/requests-library-usage
Conversation


@iw4p iw4p commented Apr 26, 2024

Switched from invoking curl via subprocess for HTTP requests to using the Python requests library, improving cleanliness and maintainability.

@DrahtBot
Contributor

DrahtBot commented Apr 26, 2024

The following sections might be updated with supplementary metadata relevant to reviewers and maintainers.

Code Coverage

For detailed information about the code coverage, see the test coverage report.

Reviews

See the guideline for information on the review process.

Type Reviewers
ACK AngusP

If your review is incorrectly listed, please react with 👎 to this comment and the bot will ignore it on the next update.

Comment on lines 143 to 145
response = requests.get(tarballUrl)
with open(tarball, 'wb') as file:
file.write(response.content)
Member

This is not a refactor. It is a behavior change when an error occurs, for example a network error or a file error.

Member

If you're going to read the entire data into memory, you might as well do the hashing at the same time? Currently, it reads back the same tarball it wrote and passes it to the hasher, which seems kind of a waste.
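The one-pass approach suggested here could look roughly like the following sketch (`copy_and_hash` and `download_and_hash` are hypothetical names for illustration, not functions from the PR):

```python
import hashlib
from urllib.request import urlopen

def copy_and_hash(src, dst, chunk_size=64 * 1024):
    """Stream src to dst while feeding the same bytes to a SHA-256
    hasher, so the file never has to be read back just for hashing."""
    hasher = hashlib.sha256()
    while True:
        chunk = src.read(chunk_size)
        if not chunk:
            break
        hasher.update(chunk)
        dst.write(chunk)
    return hasher.hexdigest()

def download_and_hash(url, dest_path):
    # One pass over the data: write to disk and hash at the same time.
    with urlopen(url) as response, open(dest_path, 'wb') as out:
        return copy_and_hash(response, out)
```

This also avoids holding the whole tarball in memory, unlike `response.content` in the snippet under review.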

Author

You're right about the behavior change. I am still not sure whether to use the script or util tag.

Member

It's important to be careful about behavior changes, but I do think this is an improvement: using requests.head is better practice than calling curl and then looking for a string in the output.

Member

laanwj commented Apr 26, 2024

This is a script used for the tests, so adding test tag.

@iw4p iw4p changed the title from "refactor: switch from curl to requests for HTTP requests" to "test: switch from curl to requests for HTTP requests" Apr 26, 2024
Author

iw4p commented Apr 26, 2024

The title's tag has also been changed from refactor to test.

@iw4p iw4p force-pushed the feature/requests-library-usage branch from 38176a7 to ddd27cb on April 26, 2024 17:42
Author

iw4p commented Apr 26, 2024

@laanwj I investigated and found that you modified import requests here. Is there a shell script where I can add pip install requests so that the previous releases, depends DEBUG job passes?

edit: It seems bitcoin/ci/test/00_setup_env_native_previous_releases.sh is responsible for providing everything the environment needs. Shall I add pip install requests there?

Member

laanwj commented Apr 26, 2024

Yes, that's usually how it'd be done.

But gah, requests is an external dependency? That's unfortunate, i don't think we should be adding dependencies unless absolutely necessary. Probably better to do this with Python's built-in urllib or http.client functionality, or keep it as it is.
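A dependency-free existence check using only the standard library might look like this sketch (`tarball_exists` is a hypothetical name; per the review comment above, the existing code calls curl and looks for a string in the output):

```python
from urllib.error import URLError
from urllib.request import Request, urlopen

def tarball_exists(url):
    """HEAD request replacing the curl-and-grep check: any HTTP error
    (e.g. 404) or network failure is treated as 'not there'."""
    try:
        with urlopen(Request(url, method='HEAD')) as response:
            return response.status == 200
    except URLError:  # HTTPError is a subclass of URLError
        return False
```

`response.status` requires Python 3.9+; older versions would use `response.getcode()` instead.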

@iw4p iw4p force-pushed the feature/requests-library-usage branch from ddd27cb to 29e9377 on April 27, 2024 05:30
@iw4p iw4p changed the title from "test: switch from curl to requests for HTTP requests" to "test: switch from curl to urllib for HTTP requests" Apr 27, 2024
@iw4p iw4p force-pushed the feature/requests-library-usage branch from 29e9377 to 7fe94f7 on April 27, 2024 05:34
Author

iw4p commented Apr 27, 2024

I replaced requests with urllib and tested the script by downloading a .tar.gz file; it works fine for me.

['curl', '--remote-name', tarballUrl]
]
try:
response = urlopen(Request(tarballUrl, method='HEAD'))
Member

Is it required to check for 404 in a separate, additional request?

Member

That's how the current code works too: it first does a HEAD request to check that the file is there, then goes on to download it. i don't know the original reasoning behind this, though.
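Collapsed into a single request, the check-then-download logic could be sketched as follows (`fetch_tarball` is a hypothetical helper, not the script's actual code):

```python
from urllib.error import HTTPError
from urllib.request import urlopen

def fetch_tarball(url, dest_path):
    """Single GET: a missing file surfaces as an HTTPError on the same
    request that downloads the data, so no separate HEAD is needed."""
    try:
        with urlopen(url) as response, open(dest_path, 'wb') as out:
            out.write(response.read())
    except HTTPError as err:
        if err.code == 404:
            return False  # mirrors the old "tarball not found" branch
        raise
    return True
```

Since `urlopen` raises before the destination file is opened, a 404 leaves no partial file behind.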

Author

No, I could write it more pythonically with one request, but I wrote it this way to stay close to the previous code. If you agree with one request, I can change it.
Should this change be in a new commit, or should I force-push it onto the previous commit?

Member

Should this change be in a new commit or should I force it on the previous commit?

Squashing seems fine?

Author

I force-pushed to rewrite the git history so that the final change sits in a single commit, ignoring the old ones; before that, I had only used squashing for merging.

Contributor

@AngusP AngusP left a comment

crACK 7fe94f7

if ret:
return ret
try:
response = urlopen(tarballUrl)
Contributor

AFAIK there's a minor behaviour change here: urlopen will follow redirects, whereas curl won't by default

$ curl -I https://httpbin.org/absolute-redirect/3
HTTP/2 302
# ...
>>> from urllib.request import urlopen
>>> response = urlopen("https://httpbin.org/absolute-redirect/3")
>>> response.code
200  # Not 302 because redirects were followed

This should be fine, but worth a mention.

Author

Great point, thank you.

@@ -5,7 +5,7 @@
# file COPYING or http://www.opensource.org/licenses/mit-license.php.
#
# Download or build previous releases.
# Needs curl and tar to download a release, or the build dependencies when
# Needs urllib built-in python library and tar to download a release, or the build dependencies when
Member

If this is part of the standard library, I don't think you need to list it as a requirement.
