Celiac disease diagnoses are steadily increasing in Italy, and the goal of this project is to analyze the current situation, compare it with the past, and investigate the phenomenon in detail.
The data were collected by downloading the PDF "Relazione annuale al Parlamento sulla celiachia - Anno 2021", extracting the tables inside it with Excel in a CSV format (read more here). After some data cleaning and pre-processing, like handling whitespaces, null values, incorrect commas and punctuation, data types and more, I started performing my analysis by performing queries with SQL.
- https://www.epicentro.iss.it/celiachia/epidemiologia-italia
- https://www.epicentro.iss.it/celiachia/aggiornamenti
βββ README
βββ data
β βββ raw
βββ notebooks
βββ figures