This repository contains the exploratory data analysis (EDA) conducted on the Olympic dataset using SQL. The dataset consists of information about individual athletes participating in Olympic events, including details such as athlete's name, age, height, weight, team, NOC, games, sports, events, and medals.
- How many Olympic games have been held?
- List down all Olympics games held so far.
- Mention the total number of nations that participated in each Olympic game.
- Which year saw the highest and lowest number of countries participating in the Olympics?
- Which nation has participated in all of the Olympic games?
- Identify the sport played in all summer Olympics.
- Which sports were played only once in the Olympics?
- Fetch the total number of sports played in each Olympic game.
- Fetch details of the oldest athletes to win a gold medal.
- Find the ratio of male and female athletes who participated in all Olympic games.
- Fetch the top 5 athletes who have won the most gold medals.
- Fetch the top 5 athletes who have won the most medals (gold/silver/bronze).
- Fetch the top 5 most successful countries in the Olympics (defined by the number of medals won).
- List down total gold, silver, and bronze medals won by each country.
- List down total gold, silver, and bronze medals won by each country corresponding to each Olympic game.
- Identify which country won the most gold, most silver, and most bronze medals in each Olympic game.
- Identify which country won the most gold, most silver, most bronze medals, and the most medals in each Olympic game.
- Which countries have never won a gold medal but have won silver/bronze medals?
- In which sport/event, India has won the highest number of medals?
- Break down all Olympic games where India won a medal for Hockey and the number of medals in each Olympic game.
- Dataset: The dataset used for this analysis is available in the
athlete_events.csv
file. - Queries: SQL queries used for analysis are provided in the SQL script file.
- Notebooks: Jupyter notebooks or SQL notebooks used for analysis can be found in the
notebooks
directory.
Clone the repository and explore the provided SQL script to understand the analysis steps and findings.
This project is licensed under the MIT License.
- Dataset Source: Kaggle - 120 years of Olympic history: athletes and results
Feel free to contribute, suggest improvements, or report issues.