
COVIDcast (COVID-19) Epidemiological Data | Delphi Research Group (CMU)


rearc-data/delphi-covidcast-covid-19


COVIDcast (COVID-19) Epidemiological Data | Delphi Research Group (CMU)

You can subscribe to the AWS Data Exchange product utilizing the automation featured in this repository by visiting https://aws.amazon.com/marketplace/pp/prodview-uusmw5j7egnck.

Main Overview

This resource contains an archived collection of datasets from Carnegie Mellon University Delphi Research Group's COVID-19 Surveillance Streams Data (COVIDcast) - itself an endpoint in Delphi's Epidata open API for Epidemiological Data.

Delphi's COVIDcast datasets are based on a variety of data sources including a CMU-run Facebook health survey, a Google-run health survey, lab test results provided by Quidel Inc, search data released by Google Health Trends, and outpatient doctor visits provided by a national health system.

Data Source

The datasets in this product are delivered with descriptive S3 prefixes that mirror the parameters used when interacting directly with the COVIDcast API. A file_format parameter has been added as a base prefix so you can easily switch between the JSON Lines and CSV versions of this data.

/<file_format>/<data_source>/<signal>/<time_type>/<geo_type>.<file_extension>

For example, to access a JSON Lines file covering Facebook survey data for COVID-Like Illnesses (CLI) broken into daily entries by two-letter state codes, you would visit the file at the following path:

/jsonl/fb-survey/raw_cli/day/state.jsonl
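
As a rough illustration of the prefix layout above, the following Python sketch assembles a path from its parameters. The function name covidcast_key is hypothetical, and the sketch assumes the file extension matches the file_format value (as it does in the jsonl example); the extension used for CSV files is not stated here, so treat that part as an assumption.

```python
def covidcast_key(file_format, data_source, signal, time_type, geo_type):
    """Build a path following the documented prefix layout.

    Hypothetical helper. Assumption: the file extension is the same
    string as file_format, which holds for the documented jsonl example.
    """
    file_extension = file_format
    return f"/{file_format}/{data_source}/{signal}/{time_type}/{geo_type}.{file_extension}"


# Reproduces the documented example path for daily, state-level raw CLI data.
example_key = covidcast_key("jsonl", "fb-survey", "raw_cli", "day", "state")
print(example_key)
```

Keeping the parameters in the same order as the prefix makes it easy to swap a single component, for instance a different geo_type, without retyping the whole path.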

Individual data entries include the following fields:

geo_value | time_value | direction | value | stderr | sample_size
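
A minimal sketch of reading these fields from JSON Lines content in Python, using only the standard library. The helper name read_entries is hypothetical, and the sample row is made up purely to show the shape of an entry; it is not real COVIDcast data, and the value types shown are assumptions.

```python
import json

# Field names as listed in this README.
EXPECTED_FIELDS = (
    "geo_value", "time_value", "direction", "value", "stderr", "sample_size",
)


def read_entries(lines):
    """Yield one dict per non-empty JSON Lines row, keeping the listed fields.

    Fields missing from a row come back as None rather than raising.
    """
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines between records
        record = json.loads(line)
        yield {field: record.get(field) for field in EXPECTED_FIELDS}


# Made-up sample row for illustration only; values and types are assumptions.
sample_lines = [
    '{"geo_value": "pa", "time_value": 20200501, "direction": null, '
    '"value": 1.0, "stderr": 0.1, "sample_size": 100.0}',
]
entries = list(read_entries(sample_lines))
print(entries[0]["geo_value"])
```

In practice the same function can be passed an open file object for a downloaded .jsonl file, since iterating a text file yields one line at a time.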

To learn about the valid parameters and fields used in Delphi's COVIDcast data, please visit the COVIDcast API documentation and the Delphi Epidata GitHub repository, where this project is actively maintained.

More Information

Contact Details

  • If you find any issues with this product or have suggestions for enhancements, open a GitHub issue and we will gladly take a look at it. Better yet, submit a pull request. Any contributions you make are greatly appreciated ❤️.
  • If you are interested in any other open datasets, please create a request on our project board here.
  • If you have questions about this source data, please contact a member of the Delphi Research Group by opening an issue on this project's GitHub page.
  • If you have any other questions or feedback, send us an email at [email protected].

About Rearc

Rearc is a cloud, software and services company. We believe that empowering engineers drives innovation. Cloud-native architectures, modern software and data practices, and the ability to safely experiment can enable engineers to realize their full potential. We have partnered with several enterprises and startups to help them achieve agility. Our approach is simple — empower engineers with the best tools possible to make an impact within their industry.
