Skip to main content



In this repo we are linking to various material that have to do with our investigations of the origin and evolution of SARS-CoV-1, the virus that caused the 2002-2003 SARS epidemic.


SARS-CoV-1 sequence data from humans, civets, raccoon dogs, and ferret badgers were downloaded from NCBI. Metadata (metadata.csv) was scraped from primary publications and cross-checked when possible. Certain sequences (exclude.csv) were removed due to poor quality, duplicates, and because they had unknown sources (e.g., extensive tissue culture passage, lab constructs, etc). Final sequences were quality checked and indels correct as necessary (fix.csv).

Alignments were created using MAFFT, found free of recombination using GARD, and trees were built using RAxML (no bootstraps)

Andersen Lab
The Scripps Research Institute
La Jolla, CA, USA
[email protected]


GitHub Commits