THIS PROJECT IS BEING REPLACED BY LI
This project is being replaced by Li, the next-generation serverless crawler for COVID-19 data.
The reasons for the switchover are documented in Issue 782.
We are not actively accepting PRs for this repository, but are still using this repo to track issues.
Scraper code written for this project is not compatible with Li; see that project for examples. We have some helpers to assist in migrating code; see this document.
Thanks very much!
A crawler that scrapes COVID-19 Coronavirus data from government and curated data sources.
This project exists to scrape, de-duplicate, and cross-check county-level data on the COVID-19 coronavirus pandemic.
Every piece of data includes GeoJSON and population data, cites the source from which the data was obtained, and includes a rating of the source's technical quality (completeness, machine readability, best practices -- not accuracy).
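To make the de-duplicate and cross-check idea concrete, here is a minimal sketch in Python. It is not this project's actual implementation: the field names (`county`, `state`, `date`, `cases`, `source`), the `cross_check` function, and the sample records are all hypothetical and made up for illustration. See the Data Fields documentation for the real schema.

```python
from collections import defaultdict


def cross_check(records):
    """Group records by place and date, drop repeats, and flag disagreements.

    All field names here are assumptions for illustration only.
    """
    grouped = defaultdict(list)
    for rec in records:
        key = (rec["county"], rec["state"], rec["date"])
        # De-duplicate: skip a record if the same source already
        # reported the same count for this county and date.
        already_seen = any(
            r["source"] == rec["source"] and r["cases"] == rec["cases"]
            for r in grouped[key]
        )
        if not already_seen:
            grouped[key].append(rec)

    report = {}
    for key, recs in grouped.items():
        counts = {r["cases"] for r in recs}
        report[key] = {
            "sources": sorted({r["source"] for r in recs}),
            "cases": sorted(counts),
            # Cross-check: sources agree only if they report a single count.
            "consistent": len(counts) == 1,
        }
    return report


# Made-up sample records; these are not real figures or real sources.
sample = [
    {"county": "Example County", "state": "XX", "date": "2020-04-01",
     "cases": 10, "source": "source-a"},
    {"county": "Example County", "state": "XX", "date": "2020-04-01",
     "cases": 10, "source": "source-a"},
    {"county": "Example County", "state": "XX", "date": "2020-04-01",
     "cases": 12, "source": "source-b"},
    {"county": "Other County", "state": "XX", "date": "2020-04-01",
     "cases": 3, "source": "source-a"},
]

report = cross_check(sample)
for key, summary in report.items():
    print(key, summary)
```

In this sketch a disagreement between sources is only flagged, not resolved. Choosing which source to trust is a separate decision that could, for example, take the source rating into account.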
Where's the data?
How often is it updated?
We upload fresh data every day at around 9 PM PST.
How do I use this data?
Read the Data Fields documentation for details on exactly what each field in the dataset means.
How can I run the crawler locally?
Check out our Getting Started guide to help get our project running on your local machine.
NOTE: This project is being replaced by Li, the next-generation serverless crawler for COVID-19 data.
You can contribute to this project in two big ways:
Contribute to the project core
Check the Issues for any task we need to get done. If you are new to open source, look for the "Good first issue" label.
Contribute a source
Contributions for any place in the world are welcome. See the community-curated list of verified data sources to find a new data source to add, and be sure to update the "Scraped?" column when you do.
To help you contribute a new source, please read the Sources and Scrapers guide before you start!
Before you send a pull request with your scraper, run the scraper by following the instructions in the guide to make sure the data is valid.
This project is licensed under the permissive BSD 2-clause license.
The data produced by this project is public domain.
This project uses data from ISO-3166 Country and Dependent Territories Lists with UN Regional Codes under the Creative Commons Attribution-ShareAlike 4.0 International License.
Please cite this project if you use it in your visualization or reporting.
Data obtained from Corona Data Scraper