Ensembl 90 has been released!

Ensembl 90 is now live and it’s absolutely massive! Read on to find out why:

New species and annotation

Ensembl 90 is our biggest release ever in terms of species.

Images of the new rodents

New/updated rodents in Ensembl 90: Kangaroo rat (updated genes), Guinea pig (updated genes), Golden hamster (new), Degu (new), Damara mole rat (new), Brazilian Guinea pig (new), Algerian mouse (new), Lesser Egyptian jerboa (new), Ryukyu mouse (new; imported annotation), Prairie vole (new), Naked mole rat (new; two assemblies available), Chinese hamster ovary (new; imported annotation), Northern American deer mouse (new), Long-tailed chinchilla (new), Shrew mouse (new; imported assembly), Upper Galilee mountains blind mole rat (new), Squirrel (updated genes) and Chinese hamster (new).

We’ve got 19 new or updated rodent genomes. Of these, sixteen were annotated withĀ our new clade-based system, which makes use of the similarity between species’ genomes to automatically annotate genes onto the homologous regions:

We have also imported three new rodent genomes and their annotation:

This is the first ever Ensembl release where we’ve imported annotation from external resources, but our rigorous quality control makes us confident that these species’ annotation will meet the high standard expected ofĀ Ensembl genes. It’s also the first time we’ve supported more than oneĀ genome assembly perĀ species (naked mole rat and Chinese hamster) in one Ensembl database, which will allow you to continue to work with your preferred assembly, within the Ensembl framework.Ā We planĀ to continue to import high quality gene sets, where available, and to use our quicker clade-based annotation, so expect lots more new genomes appearing in futureĀ Ensembl releases.

Aside from rodents, we’ve got a new pig genome assembly,Ā Sscrofa11.1 from the Swine Genome Sequencing Consortium. The assembly is created from a single Duroc sow, named TJ Tabasco. The genome was annotated using species specific RNA-Seq data, PacBio long reads and cDNAs, as well asĀ proteins from related vertebrates.

We also have updates to our human, mouse and zebrafish gene sets. This brings us to human GENCODE 27, updating to the human genome patch version GRCh38.p10 with the latest updates from the Ensembl automatic and Havana manual annotation,Ā andĀ mouse GENCODE M15, with the latest Ensembl and Havana genes. Zebrafish annotation incorporates new gene models from RNA-seq and adds pri-miRNAs to the other features database.

Variation data

We have updates to ourĀ variation data coming in for human:Ā COSMIC 81 somatic variants, HGMD 2016.4, dbSNP 150 and DGVa structural variants. For both our main database and our GRCh37 database, now have alleleĀ frequencies fromĀ TOPMed (Trans-Omics for Precision Medicine)Ā andĀ UK10K, and,Ā for pre-existing dbSNP variants,Ā gnomAD. We also have DGVa structural variant updates forĀ Cow, Dog and Mouse.

We have updated phenotype data in Human (NHGRI-EBI GWAS Catalog, OMIM and MIM morbid, ClinVar, Cosmic Gene Census, DDG2P and Orphanet), Mouse (IMPC, MGI),Ā Cat, Chicken, Chimpanzee, Cow, Dog, Horse, Macaque, Pig, RatĀ (RGD), Sheep, Turkey and ZebrafishĀ (ZFIN).

Microarray probe mapping

Ensembl provide probe mapping for a number of popular commercially-available microarrays, mapping probesets to genomic loci and Ensembl genes. You can get mapping to genes through the transcriptĀ pages in the browser and BioMart. We have updated our probe mappings for:

  • Ciona intestinalis
  • Caenorhabditis elegans
  • Chicken
  • Chimpanzee
  • Cow
  • Dog
  • Fruitfly
  • Human
  • Macaque
  • Mouse
  • Mouse strains: 129S1/SvImJ, A/J, AKR/J,Ā  BALB/cJ,Ā C3H/HeJ, C57BL/6NJ, CAST/EiJ, CBA/J, DBA/2J, FVB/NJ, LP/J, NOD/ShiLtJ, NZO/HlLtJ, PWK/PhJ, SPRET/EiJ and WSB/EiJ
  • Pig
  • Platypus
  • Rabbit
  • Rat
  • Saccharomyces cerevisiae
  • Xenopus
  • Zebrafish

Interface updates

You’ll now be able to adjust the y-axis scale of custom wiggle tracks in the browser. We’ll be releasing a blog post about this soon.

File format updates

We’ve updated our sequence ontology terms in our GFF3 files to improve consistency and remove bugs. Read more in this blog post.

Find out more

We’ll be holding a release webinar on Wednesday 6th SeptemberĀ at 4pm BST. Register here to learn more about the exciting updates to Ensembl, and ask your questions to the team.