-
Notifications
You must be signed in to change notification settings - Fork 1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Suggestion: Add Excel sheet for the list of companies #200
Comments
Google Sheets link Alternatively, code to generate .csv: import requests
import bs4 as bs
import pandas as pd
url = 'https://github.com/j-delaney/easy-application'
html = requests.get(url)
# source: https://stackoverflow.com/a/42285792
soup = bs.BeautifulSoup(html.text, 'lxml')
parsed_table = soup.find_all('table')[0]
data = [[td.text + '_' + td.a['href'] if td.find('a') else
''.join(td.stripped_strings)
for td in row.find_all('td')]
for row in parsed_table.find_all('tr')]
df = pd.DataFrame(data[1:], columns=['Url', 'Location'])
sp = df['Url'].str.split('_', n=1, expand=True)
df['Company'] = sp[0]
df['Url'] = sp[1]
df = df[['Company', 'Url', 'Location']]
print(df)
df.to_csv('~/Downloads/j-delaney_easy-application.csv') |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
How about adding an Excel version of the list of companies so you can track the applications better? This Excel can then be downloaded to your laptop for your record, similar to a fork.
I have written a Python script and can share it if you want.
The text was updated successfully, but these errors were encountered: