Skip to content

Kiminjo/GitHub-crawler

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

16 Commits
 
 
 
 
 
 

Repository files navigation

GitHub crawler

Crawler for crawling github repository information for master's degree research

Crawling GitHub repositories

In this project, I crawled github repositories related autonomous vehicle for my master's degree research. I used PyGitHub for crawling, if you want to know more about this library, please check here.

If you would like to know more about my degree research conducted using the data collected through this crawler, please check here.

Crawled features

data data type
repository name str
repository ID int
owner ID int
owner type str
repository full name str
topcis list
contributors list
contributor counts int
stargazer counts int
forker counts int
created date date
last updated datae date
readme str



How to Run

Preliminaries

  • Specify the keywords you want to collect in crawling_material.py

  • Specify the date range you want to collect in main of Github_crawling.py

Run the code

Run python Github_crawling.py in terminal



Software Requirements

  • python >= 3.5
  • PyGithub
  • pandas
  • numpy

About

GitHub crawler for Graduate research

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages