Ensembl Blog

News about the Ensembl Project and its genome browser


New Ensembl motif features

15th October 2018 by Emily (Outreach)

In its latest release, Ensembl has completely reviewed how it reports potential Transcription Factor (TF) binding sites. TFs are proteins that play a key role in regulating gene expression. They bind to specific DNA regions characterised by approximate sequence patterns, known as transcription factor binding motifs (TFBMs). These motifs are generally represented as a Position Specific Frequency Matrix, or Binding Matrix. Ensembl scans genomes for occurrences of these motifs and reports a Motif Feature at each possible location.
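To make the idea of scanning a sequence with a Binding Matrix concrete, here is a minimal sketch. It is not Ensembl's pipeline: the matrix values, the pseudocount, the uniform background and the threshold are all invented for illustration, and only the forward strand of an A/C/G/T-only sequence is scanned.

```python
import math

# Toy position frequency matrix: counts of each base at each motif position.
# These numbers are made up for illustration, not taken from Ensembl.
PFM = {
    "A": [8, 0, 0, 1],
    "C": [1, 0, 9, 1],
    "G": [0, 10, 0, 1],
    "T": [1, 0, 1, 7],
}

def log_odds_matrix(pfm, pseudocount=1.0, background=0.25):
    """Convert counts to log2 odds against a uniform background (an assumption)."""
    width = len(pfm["A"])
    matrix = []
    for i in range(width):
        total = sum(pfm[b][i] for b in "ACGT") + 4 * pseudocount
        matrix.append({
            b: math.log2(((pfm[b][i] + pseudocount) / total) / background)
            for b in "ACGT"
        })
    return matrix

def scan(sequence, pfm, threshold):
    """Return (start, score) for every window scoring at or above the threshold."""
    matrix = log_odds_matrix(pfm)
    width = len(matrix)
    hits = []
    for start in range(len(sequence) - width + 1):
        window = sequence[start:start + width]
        score = sum(matrix[i][base] for i, base in enumerate(window))
        if score >= threshold:
            hits.append((start, score))
    return hits

# The toy matrix favours "AGCT", which occurs once in this sequence.
hits = scan("TTAGCTAA", PFM, threshold=4.0)
```

Each hit plays the role of a candidate Motif Feature. A real scanner would also check the reverse strand and choose its threshold statistically rather than by hand.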

New Motifs

We have extended our characterisation of TF binding by importing 632 human and 85 mouse TFBMs derived from SELEX experiments. This new collection greatly expands our repertoire of known motifs and covers a significant fraction of all known transcription factors.

The new motifs have been mapped onto putative regulatory elements of the Ensembl Regulatory Build using MOODS. In a given epigenome, if a ChIP-seq data set is available for a Transcription Factor, we annotate each of its Motif Features as either experimentally verified or unverified, depending on whether it fully overlaps a ChIP-seq peak. If no ChIP-seq data is available, all Motif Features are considered unverified. Since ChIP-seq experiments are epigenome specific, the Motif Feature annotation varies across epigenomes. In mouse, our resources do not contain ChIP-seq data sets for any of the new Transcription Factor Binding Matrices, so all mouse Motif Features are considered unverified.
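The verification rule described above can be sketched as a simple interval check. This is an illustrative reading of the rule, not Ensembl's code: the function names are hypothetical, coordinates are assumed to be inclusive (start, end) pairs on the same chromosome, and both lists are assumed to belong to one Transcription Factor in one epigenome.

```python
def is_fully_within(feature, peak):
    """True if the feature interval lies entirely inside the peak interval."""
    (f_start, f_end), (p_start, p_end) = feature, peak
    return p_start <= f_start and f_end <= p_end

def annotate(motif_features, chipseq_peaks):
    """Label each motif feature as 'verified' or 'unverified'.

    chipseq_peaks is None when no ChIP-seq data set exists for this
    transcription factor in this epigenome, so nothing can be verified.
    """
    labels = []
    for feature in motif_features:
        verified = chipseq_peaks is not None and any(
            is_fully_within(feature, peak) for peak in chipseq_peaks
        )
        labels.append("verified" if verified else "unverified")
    return labels

motifs = [(100, 110), (195, 205), (300, 310)]
peaks = [(90, 200)]
labels = annotate(motifs, peaks)
```

In this toy example the second motif only partially overlaps the peak, so it stays unverified. Running the same motifs against a different epigenome's peaks could give different labels, which is why the annotation is epigenome specific.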

New Binding Matrix visualisation


Our new sequence logo visualisation

We developed a new visualisation for the sequence logos of regulatory motifs, which is simpler and more accurate. Rather than the commonly-used stretched base visualisation with coloured letters, this uses solid blocks of colour to represent the information content at each base. We chose this new display because it scales well, both horizontally and vertically, without losing legibility. Data can be downloaded from the image and the image itself can be exported in SVG format to enable reuse and integration into publications and presentations.
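For readers unfamiliar with information content, the sketch below computes it per motif position using the standard sequence-logo definition for DNA (2 bits minus the Shannon entropy of the column), without any small-sample correction. How Ensembl computes the block heights in its display is not stated here, so treat this as a general illustration with hypothetical function names.

```python
import math

def information_content(column):
    """Information content in bits (0 to 2 for DNA) of one matrix column.

    column maps each base to its count at this motif position.
    """
    total = sum(column.values())
    entropy = 0.0
    for count in column.values():
        if count > 0:
            p = count / total
            entropy -= p * math.log2(p)
    return 2.0 - entropy

def base_heights(column):
    """Share of the column's information content attributed to each base."""
    total = sum(column.values())
    ic = information_content(column)
    return {base: (count / total) * ic for base, count in column.items()}

conserved = information_content({"A": 10, "C": 0, "G": 0, "T": 0})
uniform = information_content({"A": 5, "C": 5, "G": 5, "T": 5})
split = base_heights({"A": 5, "C": 5, "G": 0, "T": 0})
```

A fully conserved position carries 2 bits and a position where all four bases are equally likely carries none, which is what makes highly constrained positions stand out in a logo.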

Where can I find this data?

On the “Location View”, click on “Configure this page” and then select the “Configure Region Image” tab. Under “Regulation”, select “Other regulatory regions” and enable the “Motif Features” track. A new track containing all Motif Features in the region will be displayed, highlighting verified (black) and unverified (grey) motifs.

Motifs can be shown in the region view by configuring the page. Motifs that are verified are shown in black.

In human, experimentally verified Motif Features are also displayed in the Epigenome activity tracks.

Where the motifs are bound in that cell type, they are displayed in the epigenome regulatory feature activity tracks.

Stable IDs

Binding Matrices and Motif Features have been given stable identifiers. In human, they look like ENSPFMXXXX (Position Frequency Matrix) and ENSMXXXXXXXXXXX (Motif) respectively. Mouse follows a similar pattern (ENSMUSPFMXXXX and ENSMUSMXXXXXXXXXXX).
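If you need to recognise these identifiers in your own scripts, a small pattern table may help. The sketch below assumes that each X stands for exactly one digit, giving 4 digits for matrices and 11 for motifs as written above. That reading is an assumption, so check it against real identifiers before relying on it. The function name and labels are hypothetical.

```python
import re

# Assumption: every "X" in the patterns above is a single digit.
PATTERNS = {
    ("human", "binding_matrix"): re.compile(r"ENSPFM\d{4}"),
    ("human", "motif_feature"): re.compile(r"ENSM\d{11}"),
    ("mouse", "binding_matrix"): re.compile(r"ENSMUSPFM\d{4}"),
    ("mouse", "motif_feature"): re.compile(r"ENSMUSM\d{11}"),
}

def classify(stable_id):
    """Return (species, feature_type) for a stable ID, or None if unrecognised."""
    for key, pattern in PATTERNS.items():
        if pattern.fullmatch(stable_id):
            return key
    return None

# Made-up identifiers that follow the assumed format.
example_matrix = classify("ENSPFM0001")
example_motif = classify("ENSMUSM" + "0" * 10 + "1")
```

Using `fullmatch` rather than `match` ensures that a longer or differently prefixed identifier, such as a gene ID, is not misclassified.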
