Nextstrain


Nextstrain offers real-time dashboards for tracking viral diseases such as influenza, Zika, and Ebola. The platform supports both community-developed and internally-maintained dashboards for a range of pathogens. Utilizing an open-source, reproducible bioinformatics pipeline, Nextstrain integrates data from repositories like the National Center for Biotechnology Information (NCBI) and the Global Initiative on Sharing All Influenza Data (GISAID) to assist researchers and policymakers in making informed epidemiological decisions 1. Its suite of tools, including Augur and Auspice, enables robust, reproducible phylogenetic analysis and visualization, facilitating the rapid and comprehensive dissemination of results 2,3.

Updated: January 19th, 2026

Overview

Researchers and policy makers can use the genetic evolution of viruses to make critical epidemiological decisions, aiding in understanding emerging disease trends and informing policy responses to outbreaks 3. Often, however, these analyses are conducted in isolation, with results and data rarely made public until after publication, and they do not always capture how viral evolution unfolds over time 1. In response to this bottleneck, collaborators from the Fred Hutchinson Cancer Research Center and the University of Basel launched Nextstrain in 2015 4.

This platform provides near real-time dashboards for viral diseases of high public health interest—from endemic infections (seasonal influenza, dengue) to emerging outbreaks (avian influenza, Zika, Ebola)—plus access to the underlying bioinformatics tools 3. Its framework enables ongoing monitoring of pathogen evolution and can expand to additional diseases developed and maintained by the research community. This approach allows robust, reproducible, rapid analysis and dissemination of results that keep pace with sequencing data generation 1,2.

Data for Nextstrain maintained dashboards are sourced from public repositories such as the National Center for Biotechnology Information (NCBI), Global Initiative on Sharing All Influenza Data (GISAID), and Virus Pathogen Resource (ViPR), as well as GitHub repositories and other genomic data sources. The visualization integrates sequence data with other data types such as geographic information, serology, or host species in a consistent and concise format accessible to health professionals, epidemiologists, virologists and the public alike 1,2.

Nextstrain displays dashboards developed by both internal maintainers (Nextstrain-maintained pathogen analyses) and community contributors (Scalable Sharing with Nextstrain Groups), all using the same reproducible open-source bioinformatics pipeline 57. Nextstrain currently maintains dashboards for the following endemic infections and emerging outbreaks:

  • RNA Viruses - Respiratory: Avian influenza, HMPV (Human metapneumovirus), Measles, Mumps, RSV (Respiratory syncytial virus), Rubella, SARS-CoV-2, Seasonal CoV (Seasonal coronaviruses), Seasonal influenza

  • RNA Viruses - Vector-borne (Arboviruses): Dengue, Oropouche, West Nile Virus, Yellow Fever Virus, Zika

  • RNA Viruses - Hemorrhagic Fever: Ebola, Lassa, Nipah

  • RNA Viruses - Gastrointestinal: Norovirus

  • RNA Viruses - Neurotropic: Enterovirus, Rabies

  • DNA Viruses: Mpox (Monkeypox)

  • Bacteria: Tuberculosis

Nextstrain offers an open-source toolkit—a collection of core functions and packages—that powers both the insights on their main webpage and enables researchers to analyze their own data. These tools can be accessed and installed for local use at: Installing Nextstrain 8,9.

  • Augur: A suite of bioinformatics tools that together provide complete phylogenetic analysis using configurable “recipes” (called builds) specific to different pathogens. Includes composable functions for filtering and aligning sequences, inferring trees, and integrating metadata 10.
Zoomable Image
Pipeline schematic showing some of the core computational modules and possible workflow entry points 8,10,11.
  • Auspice (Package and Webtool): Dynamic visualization tool for data generated by the Augur pipeline. It powers all pathogen dashboards on Nextstrain, including community-created dashboards 8,1113.

  • Nextstrain CLI: The Nextstrain CLI integrates Augur, Auspice, and deployment functionality, providing a unified interface to run pathogen workflows across computing environments (Docker, Conda, AWS Batch) and publish datasets to nextstrain.org or self-hosted pages 14.

Zoomable Image
Schematic of CLI-coordinated workflow integration 11,14.
  • Nextclade (CLI and Webtool): Analyzes virus genome sequences by aligning data to a reference genome, calling mutations, and placing sequences on a phylogeny. Reports clade assignments based on phylogenetic position (rather than mutational signatures) and sequence quality metrics 1518.

Gaining Access

Getting Data

Nextstrain is committed to open science while respecting data privacy and ownership. Researchers can access publicly available sequencing data through dashboards that draw from public repositories, such as GenBank and GISAID where applicable 1. The platform provides derived data, including phylogenetic trees, metadata, and dashboard screenshots that users can download and integrate with their own metadata files 19. Additionally, researchers can view previous data releases or compare different releases. See Viewing previous analyses for detailed guidance 20.

To access publicly available datasets, navigate to any Nextstrain dashboard and scroll to the bottom of the page to locate the download button, or press the “d” key to open the data download popup window. Available download options include:

  • Phylogenetic Trees: Download newick trees with branch lengths (in terms of genetic divergence or time). Trees include all data and do not reflect current filtering.

  • Metadata: Per-strain metadata for all samples or current visualization. Includes traits and authorship. An “authors” metadata table is also available.

  • Diversity (entropy) data: Download data behind the entropy (diversity) panel. The data mirrors what is displayed and includes nucleotide positions and base change counts.

  • SVG Screenshot: Download a vector image for editing and display. Screenshots are licensed under CC-BY.

Sharing Data

Nextstrain enables users to run analyses and share them through multiple methods. These include storing data on nextstrain.org, utilizing GitHub repositories, creating groups on Nextstrain, and using public URLs https://docs.nextstrain.org/projects/auspice/en/stable/server/overview.html?. See Contributing to Nextstrain and Share Analyses through Nextstrain for detailed guidance 7,21,22.

Publications

This section presents a selection of PubMed articles that utilize the dataset and are authored by individuals affiliated with the Yale University. These articles are provided to inspire researchers and students to use the data in their own work.

  • Genomic epidemiology of West Nile virus in Europe.
    R Tobias Koch, Diana Erazo, Arran J Folly, Nicholas Johnson, Simon Dellicour, Nathan D Grubaugh, Chantal B F Vogels
    One health (Amsterdam, Netherlands) 2023 Dec 16 doi: 10.1016/j.onehlt.2023.100664
    PMID: 38193029
Back to top

References

1.
Hadfield, J. et al. Nextstrain: Real-time tracking of pathogen evolution. Bioinformatics 34, 4121–4123 (2018).
2.
Trevor Bedfo, PhD et al. What is nextstrain? https://docs.nextstrain.org/en/latest/learn/about.html.
3.
Trevor Bedfo, PhD et al. Nextstrain. https://nextstrain.org.
4.
Nextstrain. Wikipedia (2025).
5.
Trevor Bedfo, PhD et al. Scalable sharing with nextstrain groups. Nextclade. https://nextstrain.org/groups.
6.
Trevor Bedfo, PhD et al. Nextstrain-maintained pathogen analyses. Nextclade. https://nextstrain.org/pathogens.
7.
Trevor Bedfo, PhD et al. Contributing to nextstrain. https://docs.nextstrain.org/en/latest/guides/contribute/index.html.
8.
Disease Control {and} Prevention (CDC), C. for. Module 3.1 – getting started with nextstrain. Advanced molecular detection (AMD). https://www.cdc.gov/advanced-molecular-detection/php/training/module-3-1.html (2024).
9.
Trevor Bedfo, PhD et al. Installing nextstrain. https://docs.nextstrain.org/en/latest/install.html.
10.
11.
Trevor Bedfo, PhD et al. Parts of a whole. https://docs.nextstrain.org/en/latest/learn/parts.html.
12.
Trevor Bedfo, PhD et al. Auspice: An open-source interactive tool for visualising phylogenomic data. https://docs.nextstrain.org/projects/auspice/en/stable/index.html.
13.
Trevor Bedfo, PhD et al. Auspice.us. Auspice. https://auspice.us/.
14.
Trevor Bedfo, PhD et al. Nextstrain CLI. https://docs.nextstrain.org/projects/cli/en/stable/.
15.
Trevor Bedfo, PhD et al. Nextclade. Nextclade. https://clades.nextstrain.org/.
16.
17.
Trevor Bedfo, PhD et al. 4. Clade assignment. Nextclade. https://docs.nextstrain.org/projects/nextclade/en/stable/user/faq.html#.
18.
Trevor Bedfo, PhD et al. Frequently asked questions. Nextclade. https://docs.nextstrain.org/projects/nextclade/en/stable/user/faq.html#.
19.
Trevor Bedfo, PhD et al. Download the data displayed through nextstrain.org. https://docs.nextstrain.org/en/latest/guides/share/download-data.html.
20.
21.
Thomas Sibley. Welcome to discussion.nextstrain.org! Nextstrain discussion. https://discussion.nextstrain.org/t/welcome-to-discussion-nextstrain-org/30 (2020).
22.
Trevor Bedfo, PhD et al. Share analyses through nextstrain. Nextclade. https://docs.nextstrain.org/en/latest/guides/share/index.html.
23.
24.
Trevor Bedfo, PhD et al. Welcome to nextstrain’s documentation! https://docs.nextstrain.org/en/latest/index.html#.
25.
Trevor Bedfo, PhD et al. Nextstrain. https://github.com/nextstrain.
26.
Nextstrain discussion. Nextstrain discussion. https://discussion.nextstrain.org/.
27.
Grubaugh Lab. Nextstrain. Grubaugh lab. https://grubaughlab.com/nextstrain/.