We are sequencing West Nile virus from California, with an emphasis on San Diego, Kern, and Sacramento/Yolo counties, to understand 1) how the virus spreads between regions, 2) how it is maintained locally between seasons, and 3) which factors promote local outbreaks. Our goal is to generate thousands of new West Nile virus genomes from infected birds and mosquitoes. This research is part of the WestNile 4K Project.
Collaborations and data sources
The samples from San Diego County were provided by Nikos Gurfield and Saran Grewal of the San Diego County Vector Control Program. The samples from all other California counties, including Sacramento-Yolo and Kern, were provided by Ying Fang and Chris Barker of the Barker Lab, University of California, Davis, and by Sarah Wheeler of the Sacramento-Yolo Mosquito and Vector Control Program.
The BAM files are available on Google Cloud.
Sequencing is performed with PrimalSeq, an amplicon-based sequencing scheme. Our full protocol is available online here. Sequencing data are aligned using bwa and processed using iVar (Grubaugh et al. Genome Biology 2019).
Below is a table showing the count of sequenced genomes by county.
Average depth and percent coverage of genome are available in a tsv file.
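For readers who want to recompute these two metrics from the BAM files, the following is a minimal sketch. It assumes per-position depths have been exported in the three-column, tab-separated format written by `samtools depth -a` (reference name, position, depth). The function names and the `min_depth` threshold of 10 reads are illustrative assumptions, not the values used to build our tsv file.

```python
from typing import Iterable, List, Tuple


def parse_depth_lines(lines: Iterable[str]) -> List[int]:
    """Read per-position depths from `samtools depth -a` style lines.

    Each line is expected to hold: reference name, 1-based position, depth.
    """
    depths = []
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        _ref, _pos, depth = line.split("\t")[:3]
        depths.append(int(depth))
    return depths


def summarize_depth(depths: List[int], min_depth: int = 10) -> Tuple[float, float]:
    """Return (average depth, percent of positions with depth >= min_depth).

    min_depth is a hypothetical threshold chosen for illustration only.
    """
    if not depths:
        return 0.0, 0.0
    average = sum(depths) / len(depths)
    covered = sum(1 for d in depths if d >= min_depth)
    return average, 100.0 * covered / len(depths)


if __name__ == "__main__":
    example = ["ref\t1\t20", "ref\t2\t0", "ref\t3\t10", "ref\t4\t30"]
    average, percent = summarize_depth(parse_depth_lines(example))
    print(f"average depth: {average:.1f}, percent coverage: {percent:.1f}")
```

Passing `-a` to `samtools depth` matters here: without it, positions with zero depth are omitted from the output and percent coverage would be overestimated.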
The following sequences with <50% coverage of the coding region weren't included in downstream analysis. They are highlighted in red in the figure above.
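The 50% cutoff can be sketched as follows. This is an illustration only: it assumes each consensus sequence is in reference coordinates, that uncovered positions are written as `N`, and that the coding region boundaries are supplied by the caller. The function names and the coordinates in the example are hypothetical.

```python
from typing import Dict, Tuple


def coding_coverage(sequence: str, cds_start: int, cds_end: int) -> float:
    """Percent of coding-region positions with a called base (A, C, G or T).

    cds_start and cds_end are 1-based, inclusive coordinates. Positions
    missing from a truncated sequence count as uncovered.
    """
    region_length = cds_end - cds_start + 1
    if region_length <= 0:
        return 0.0
    region = sequence[cds_start - 1:cds_end].upper()
    called = sum(1 for base in region if base in "ACGT")
    return 100.0 * called / region_length


def split_by_coverage(
    sequences: Dict[str, str],
    cds_start: int,
    cds_end: int,
    threshold: float = 50.0,
) -> Tuple[Dict[str, str], Dict[str, str]]:
    """Split sequences into (kept, excluded) around the coverage threshold.

    Sequences with coverage below the threshold are excluded.
    """
    kept, excluded = {}, {}
    for name, sequence in sequences.items():
        if coding_coverage(sequence, cds_start, cds_end) < threshold:
            excluded[name] = sequence
        else:
            kept[name] = sequence
    return kept, excluded


if __name__ == "__main__":
    # Toy sequences with a hypothetical coding region at positions 3-10.
    toy = {"sample_a": "NNACGTACGTNN", "sample_b": "NNNNNNNACGNN"}
    kept, excluded = split_by_coverage(toy, cds_start=3, cds_end=10)
    print("kept:", sorted(kept), "excluded:", sorted(excluded))
```

Ambiguity codes other than `N` are also counted as uncovered in this sketch, which is a conservative choice rather than a statement of how our pipeline handles them.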
Multiple sequence alignment
Alignments were performed using MAFFT. The PHI test was used to test for recombination, and RDP4 was used to narrow down sequences with potential contamination. These sequences are in consensus_sequences/contaminated_sequences.
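To reproduce the alignment step, one possible approach is sketched below. It assumes the `mafft` executable is installed and on the PATH, and it uses MAFFT's `--auto` mode because the exact options we ran are not listed here. The helper names are hypothetical. The sketch also includes a small check that every aligned sequence has the same length, which is a useful sanity test before passing an alignment to recombination tools.

```python
import shutil
import subprocess
from typing import Dict, Iterable, List


def mafft_command(input_fasta: str) -> List[str]:
    """Build a MAFFT invocation; --auto lets MAFFT pick a strategy."""
    return ["mafft", "--auto", input_fasta]


def run_mafft(input_fasta: str, output_fasta: str) -> None:
    """Run MAFFT and write the alignment, which it prints to stdout."""
    if shutil.which("mafft") is None:
        raise RuntimeError("mafft executable not found on PATH")
    with open(output_fasta, "w") as handle:
        subprocess.run(mafft_command(input_fasta), stdout=handle, check=True)


def read_fasta(lines: Iterable[str]) -> Dict[str, str]:
    """Parse FASTA lines into {name: sequence}, joining wrapped lines."""
    records: Dict[str, str] = {}
    name = None
    for line in lines:
        line = line.strip()
        if line.startswith(">"):
            fields = line[1:].split()
            name = fields[0] if fields else ""
            records[name] = ""
        elif line and name is not None:
            records[name] += line
    return records


def is_rectangular(records: Dict[str, str]) -> bool:
    """True when all aligned sequences share one length."""
    return len({len(seq) for seq in records.values()}) <= 1


if __name__ == "__main__":
    aligned = read_fasta([">a", "AC-GT", ">b", "ACG", "GT"])
    print("alignment is rectangular:", is_rectangular(aligned))
```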
Disclaimer. Please note that this data is still based on work in progress and should be considered preliminary. If you intend to include any of these data in publications, please let us know – otherwise please feel free to download and use without restrictions. We have shared this data with the hope that people will download and use it, as well as scrutinize it so we can improve our methods and analyses. Please contact us if you have any questions or comments – we’ll buy beers for #ResearchParasites that spot flaws and faults in the data and come up with improvements!
The Scripps Research Institute
La Jolla, CA, USA
Yale School of Public Health
Nikos Gurfield, Saran Grewal
San Diego Vector Control
Chris Barker, Sarah Wheeler, Ying Fang, Lark Coffey, William Reisen
University of California, Davis
Josh Quick, Nick Loman
University of Birmingham