What’s coming in Ensembl 102 / Ensembl Genomes 49

Ensembl 102 and Ensembl Genomes 49 are due to be released in October 2020. As with all releases, we cannot guarantee that everything listed here will make it into the final release.

Major Data Updates

New Genomes

Plants:

  • Saltwater cress (Eutrema salsugineum)
  • Lavender scallops (Kalanchoe fedtschenkoi)
  • Valley oak (Quercus lobata)

New Assemblies and/or Annotation

Mammals:

  • Tasmanian Devil (Sarcophilus harrisii)

Plants:

  • Cotton (Gossypium raimondii)
  • White yam (Dioscorea rotundata)

Metazoa:

  • Purple sea urchin (Strongylocentrotus purpuratus)
  • Red fire ant (Solenopsis invicta)
  • European honey bee (Apis mellifera)
  • Jewel wasp (Nasonia vitripennis)
  • Honey bee mite (Varroa destructor)

Bacteria:

  • New batch update of bacterial and archaeal genomes and annotation from ENA:
    • 22,088 new genomes
    • 34,804 genomes have been removed, due to redundancy
  • Updated annotation of pathogen-host interaction data from PHI-base
  • Alignments to Rfam covariance models (Rfam 12.2), visible in a new track called ‘Rfam models’
  • Updated protein features for all species using InterProScan 77.0

Other Updates and Highlights

  • Update to translate all non-ATG start codons as Methionine
  • Plant reactome mappings for plant species from Gramene
  • Updated repeat element annotation using a custom plant library (nrTEplants)
  • Retirement of Ensembl 81 archive site (jul2015.archive.ensembl.org)
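
To illustrate the start codon change listed above, here is a minimal Python sketch of the idea, not Ensembl's actual implementation. It assumes the standard genetic code and a coding sequence with a complete 5' end; the function name `translate_cds` is hypothetical. The first codon is always reported as methionine (M), whether it is ATG or an alternative start such as CTG, TTG or GTG.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered
# TTT, TTC, TTA, TTG, TCT, ... with bases cycling through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(cds: str) -> str:
    """Translate a coding sequence, reporting the start codon as 'M'.

    Hypothetical helper for illustration only. Assumes the sequence
    begins at the annotated start codon. Translation stops at the
    first stop codon; unrecognised codons are reported as 'X'.
    """
    cds = cds.upper()
    usable_length = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    protein = []
    for position in range(0, usable_length, 3):
        codon = cds[position:position + 3]
        if position == 0:
            # The initiator codon is translated as methionine even when
            # it is not ATG.
            amino_acid = "M"
        else:
            amino_acid = CODON_TABLE.get(codon, "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    print(translate_cds("ATGGCCTAA"))  # canonical ATG start
    print(translate_cds("CTGGCCTAA"))  # non-ATG start (CTG)
```

Both calls give the same peptide, because the CTG start is read as methionine rather than leucine. Sequences with an incomplete 5' end would need different handling and are outside the scope of this sketch.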