Skip to content

Commit

Permalink
Updated datasets 2023-05-03 UTC
Browse files Browse the repository at this point in the history
  • Loading branch information
actions-user committed May 3, 2023
1 parent 6779b83 commit 68b8751
Show file tree
Hide file tree
Showing 6 changed files with 221 additions and 202 deletions.
2 changes: 1 addition & 1 deletion aws_open_datasets.json
Original file line number Diff line number Diff line change
Expand Up @@ -2270,7 +2270,7 @@
"ManagedBy": "[Common Crawl](https://commoncrawl.org/)",
"UpdateFrequency": "Monthly",
"License": "This data is available for anyone to use under the [Common Crawl Terms of Use](h",
"Tags": "aws-pds, encyclopedic, natural language processing, internet",
"Tags": "aws-pds, encyclopedic, natural language processing, internet, web archive",
"RequesterPays": null,
"ControlledAccess": null,
"Host": null,
Expand Down
2 changes: 1 addition & 1 deletion aws_open_datasets.tsv
Original file line number Diff line number Diff line change
Expand Up @@ -121,7 +121,7 @@ Cloud to Street - Microsoft Flood and Clouds Dataset Flood and Cloud Training Da
Co-Produced Climate Data to Support California's Resilience Investments Data catalog arn:aws:s3:::cadcat us-west-2 S3 Bucket ['[Browse Bucket](https://cadcat.s3.amazonaws.com/index.html)', '[Data Catalog](https://cadcat.s3.amazonaws.com/cae.yaml)'] https://analytics.cal-adapt.org/data/ [email protected] Cal-Adapt Analytics Engine https://analytics.cal-adapt.org/ Infrequent, Irregular Varies, see dataset specific metadata atmosphere, aws-pds, climate, climate model, earth observation, geoscience, geospatial, meteorological, simulations, weather, zarr
CoMMpass from the Multiple Myeloma Research Foundation RNA-Seq Gene Expression Quantification arn:aws:s3:::gdc-mmrf-commpass-phs000748-2-open us-east-1 S3 Bucket https://gdc.cancer.gov/about-gdc/contributed-genomic-data-cancer-research/founda [email protected] [Center for Translational Data Science at The University of Chicago](https://ctd Genomic Data Commons (GDC) is source of truth for this dataset; GDC offers month NIH Genomic Data Sharing Policy: https://gdc.cancer.gov/access-data/data-access- aws-pds, cancer, genomic, genetic, whole genome sequencing, STRIDES
ComStock ComStock model output arn:aws:s3:::nrel-pds-building-stock/comstock/ us-west-2 S3 Bucket ['[Browse Dataset](https://data.openei.org/s3_viewer?bucket=nrel-pds-building-stock)'] https://comstock.nrel.gov/ [email protected] [National Renewable Energy Laboratory](https://www.nrel.gov/) Annually [ComStock License](https://github.com/NREL/ComStock/blob/main/LICENSE.txt) aws-pds, energy
Common Crawl Crawl data (WARC and ARC format) arn:aws:s3:::commoncrawl us-east-1 S3 Bucket https://commoncrawl.org/the-data/get-started/ https://commoncrawl.org/connect/contact-us/ [Common Crawl](https://commoncrawl.org/) Monthly This data is available for anyone to use under the [Common Crawl Terms of Use](h aws-pds, encyclopedic, natural language processing, internet True
Common Crawl Crawl data (WARC and ARC format) arn:aws:s3:::commoncrawl us-east-1 S3 Bucket https://commoncrawl.org/the-data/get-started/ https://commoncrawl.org/connect/contact-us/ [Common Crawl](https://commoncrawl.org/) Monthly This data is available for anyone to use under the [Common Crawl Terms of Use](h aws-pds, encyclopedic, natural language processing, internet, web archive True
Common Screens - Cloudfront CDN distribution for hotlinking screenshots Cloudfront CDN distribution for hotlinking screenshots us-west-2 CloudFront Distribution https://commonscreens.com/?page_id=1492 [email protected] [Common Screens](https://commonscreens.com/) Monthly [Attribution 4.0 International (CC BY 4.0)](https://creativecommons.org/licenses aws-pds, encyclopedic, natural language processing, internet dqh5x5k6xg3n1.cloudfront.net
Common Screens - Common Screens (jpeg and csv format) Common Screens (jpeg and csv format) arn:aws:s3:::common-screens us-west-2 S3 Bucket https://commonscreens.com/?page_id=1492 [email protected] [Common Screens](https://commonscreens.com/) Monthly [Attribution 4.0 International (CC BY 4.0)](https://creativecommons.org/licenses aws-pds, encyclopedic, natural language processing, internet
Community Earth System Model Large Ensemble (CESM LENS) Project data files arn:aws:s3:::ncar-cesm-lens us-west-2 S3 Bucket https://doi.org/10.26024/wt24-5j82 [email protected] [National Center for Atmospheric Research](https://ncar.ucar.edu/) Rare. The LENS experiment is complete, but we may occasionally copy additional f https://www.ucar.edu/terms-of-use/data climate, model, climate model, atmosphere, oceans, land, ice, geospatial, aws-pds, sustainability, zarr
Expand Down
Loading

0 comments on commit 68b8751

Please sign in to comment.