issue with fetch_lawschool_gpa #499

Open
jr2021 opened this issue Oct 11, 2023 · 1 comment

Comments

jr2021 commented Oct 11, 2023

Hello,

I'm having a problem downloading the lawschool GPA dataset with the fetch_lawschool_gpa function. The call fails with an error thrown in tempeh's seaphe_datasets.py file:

  File "/home/robertsj/miniconda3/envs/hpobench_fairmohpo/lib/python3.9/site-packages/tempeh/datasets/seaphe_datasets.py", line 39, in load_lawschool_data
    with zipfile.ZipFile(temp_file_name, 'r') as zip_ref:
  File "/home/robertsj/miniconda3/envs/hpobench_fairmohpo/lib/python3.9/zipfile.py", line 1266, in __init__
    self._RealGetContents()
  File "/home/robertsj/miniconda3/envs/hpobench_fairmohpo/lib/python3.9/zipfile.py", line 1333, in _RealGetContents
    raise BadZipFile("File is not a zip file")
zipfile.BadZipFile: File is not a zip file

I believe the issue is that response.content contains an HTML page served for the URL http://www.seaphe.org/databases/LSAC/LSAC_SAS.zip rather than the zip archive itself. Do you know if tempeh has made any major changes or dropped support for the Lawschool dataset?
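
One way to check this hypothesis is to inspect the downloaded bytes before they reach zipfile.ZipFile. The snippet below is a minimal sketch, not tempeh's code: the helper names looks_like_zip, describe_payload, and make_sample_zip are hypothetical, and it assumes the bytes from response.content are available as a bytes object. It only uses the standard library and does not touch the network.

```python
import io
import zipfile


def looks_like_zip(payload: bytes) -> bool:
    # zipfile.is_zipfile accepts a file-like object, so the bytes
    # can be checked in memory without writing a temp file.
    return zipfile.is_zipfile(io.BytesIO(payload))


def describe_payload(payload: bytes) -> str:
    # Hypothetical helper: classify the downloaded bytes as
    # "zip", "html", or "unknown".
    if looks_like_zip(payload):
        return "zip"
    head = payload[:512].lstrip().lower()
    if head.startswith(b"<!doctype html") or head.startswith(b"<html"):
        return "html"
    return "unknown"


def make_sample_zip() -> bytes:
    # Build a tiny in-memory archive to exercise the check.
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w") as archive:
        archive.writestr("sample.txt", "hello")
    return buffer.getvalue()


if __name__ == "__main__":
    print(describe_payload(make_sample_zip()))
    print(describe_payload(b"<html><body>Not Found</body></html>"))
```

If describe_payload(response.content) reports "html" for the download, that would be consistent with the server returning a web page instead of the archive, which is what BadZipFile suggests here.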

Are you able to reproduce this error, or could you point me in another direction?

Best,

Jake Robertson
University of Freiburg

kvarsh commented Oct 11, 2023

Also see #492
