For the third year in a row, we’re lucky to have student developers working with us as part of Google Summer of Code. We’ve got three GSOC-ers this year, working on some really exciting projects: Zeyu Tony Yang, working on primary genome analysis, Nabil Ibtehaz, working on transcript-level orthology and Somesh Chaturvedi, working on retrieving reference sequences with APIs.
GSOC is a project set up by Google that places students in open source projects to take on a short independent coding project, and pays them for it. We have to pass rigorous selection criteria to be allowed to offer projects on GSOC, and the students have to be selected by both Google and us to take part. It means the GSOC-ers are the Top Gun of student developers. We think this is a really great opportunity, both for open source projects like us, who get a fresh pair of eyes to take a look at something that we’ve maybe put on the back-burner, and for the students, who get experience working on a real-world coding project during their university summer break.
As of Ensembl release 93, which is due at the end of the month, the Gene Variant Image view will be retired for human. We have elected to retire this page because we feel that the density of known genetic variation is too great for this view to be informative in its current form.
Map of low-middle income countries, according to the OECD
We’re excited to announce that we have received a grant from the Wellcome Trust to deliver Ensembl training in low-middle income countries at no charge.Continue reading
Some Variant Effect Predictor (VEP) jobs are small, just ten or fewer variants, and that’s easy. Some VEP jobs are big, if you do variant calling on one whole human genome, that’s five million variants! The more variants you have, the more computing power the VEP needs to process them, which can make it slow. But there are ways to speed it up.
The VEP (Variant Effect Predictor) is our most popular tool and is incredibly useful for annotating genetic variants with the genes they hit and what effect they have on them. But did you know you can filter your results? Both in the web interface and using the script?
Ensembl and NCBI have been working to align the GENCODE and RefSeq reference transcripts. As part of that effort, we are also developing plans to define a primary transcript for every gene as well as a minimal set of clinically relevant transcripts. To guide that effort, we have developed a small survey to get input on how to define the primary transcript and whether this would be important to your work.
The survey should only take 10 minutes or less and you will have the opportunity to sign up for follow-up info about this project if you are interested.
We think conferences are great opportunities to use talks and posters to tell people about all the cool stuff we’re developing, provide training with workshops and learn more about what’s going on in our fields of interest. Ensembl team members attend many conferences a year and 2018 is no exception, we’re planning to attend twenty (so far)!
This month we’re meeting Carla from our comparative genomics team (which we call compara).