Serratus is a collaborative Open Science project for ultra-rapid discovery of known and unknown coronaviruses in response to the COVID-19 pandemic.

Preprint: Petabase-scale sequence alignment catalyses viral discovery

Serratus Mountain in Squamish, BC. Canada


While there is an intense research effort to sequence and analyze SARS-CoV-2 isolates for tracking real-time virus evolution, our understanding of the virus's evolutionary origins is limited by incomplete genomic characterization of other members of the Coronaviridae (CoV) and related families.

We are re-analyzing all RNA-seq, meta-genomics, meta-transcriptomics and environmental sequencing data in the NCBI Short Read Archive to discover new sequence biodiversity of coronaviruses. That is >3.4 million biological samples or >5 petabases of sequencing data.

Contribute to Serratus

The Serratus team is actively looking to collaborate with all scientists and developers.



Learn more about the Serratus architecture

Serratus Usage

Learn more on the Serratus Wiki

Serratus Usage

Building Containers

Access Data Releases

Tantalus: R interface for Serratus Data

Data Release Policy

Our primary goal is to generate the coronavirus data to accelerate the global research efforts in fighting SARS-CoV-2. To achieve this:

  • All software development is open-source and freely available (GPLv3)
  • All sequencing data generated, raw and processed, will be freely and immediatly available in the public domain in accordance with the Bermuda Principles set out by the Human Genome Project.

Contributing Team


Ultra-deep search for novel viruses

Serratus Info

⭐ Stars 72
πŸ”— Homepage
πŸ”— Source Code
πŸ•’ Last Update 9 months ago
πŸ•’ Created a year ago
🐞 Open Issues 50
βž— Star-Issue Ratio 1
😎 Author ababaian