Ever come across a transcript that seems to span multiple genes? These are called ‘readthrough transcripts’, or sometimes ‘conjoined genes’, and they’re more common than you might think. Read on to find out about what they are and what they do, and how we annotate these at Ensembl.
Author: Erin (Outreach)
In this blog we catch up with Ensembl’s 2018 Google Summer of Code (GSoC) students and hear about their now completed projects, and their reflections on the experience. You may have already seen our previous blog post which we published as they were just beginning their projects. Read on to find out how they went, what they learnt and what valuable advice they can pass on to aspiring GSoC students.
It’s probably reasonable to assume that the coding sequence (CDS) of a protein-coding transcript model is the feature that is of primary interest to most people who use Ensembl. However, both the 5’ and 3’ untranslated regions (UTRs) are important biological entities in their own right, and it is vital that we in Ensembl do the best we can to represent them accurately. However, the annotation of these UTRs is complicated, so we’re going to focus on exploring the annotation process for 3’ UTRs in this article (Figure 1).
The path to sequencing the wheat genome has been no easy ride, due to its large and highly repetitive genome. This new assembly from the IWGSC bridges many gaps from the initial genome sequencing effort. Read on to find out more about this exciting new genome assembly!
We’ve been tending the Ensembl Plants garden with great care, have cultivated several new species and updated a number of genome assemblies, including a new chromosome-level wheat genome from IWGSC. The protist team have also been busy and are proud to announce an updated assembly for Leishmania major.
Read on to find out more about these exciting new updates and more in this new release of Ensembl Genomes!
Are you feline excited for our new pawsome release?!
Ensembl 93 has been released, bringing with it two new big cat genomes for tiger and leopard, and an update to the domestic cat assembly. If cats aren’t your thing, we also have a huge new dbSNP import for human and a brand new regulatory build and GENCODE update for mouse.
We also have a new hagfish genome, important changes to our VEP REST endpoints, and many more exciting developments so read on to find out more!
Both Ensembl release 93 and Ensembl Genomes release 40 are scheduled for late June and early July 2018, respectively.
Included are a number of new genomes and genebuilds for vertebrates and plants (including leopard, Amur tiger, hagfish, pigeon pea, carrot and adzuki bean) and significant updates to the mouse GENCODE annotation and regulatory build. This release will also bring a new import of variants from dbSNP for human, and allele frequencies for dog variation data!