WS265 release

loci with two different protein products

There are a small number of loci which code for two very different protein products. These include dicistronic mRNA operons and loci which have a few small exons in common then have alternate splicing leading to many different exons being used.

The details of some of these can be found in the WormBook chapter “Operon and non-operon gene clusters in the C. elegans genome”.

Previously, these have been curated as isoforms of a single gene. This was causing problems because the description of gene
function would be based on one isoform, leaving the other isoform undescribed or described incorrectly, based on the first isoform.
There are 42 C. elegans and 47 C. briggsae known loci that have now been split to have different Gene IDs.

WS264 release

C. elegans sORFs

sORFs.org is a public repository of small open reading frames (sORFs) identified by ribosome profiling (RIBO-seq).

It contains predicted sORF regions for several species, including C. elegans.

We have annotated 118 predicted sORF regions as coding (CDS) isoforms of the existing genes. It is likely that in the next release, where these isoforms do not overlap with existing isoforms, these sORF regions will be changed to be individual genes and not isoforms.

52 of these annotated sORF regions do not start with the canonical Methionine AUG initiation codon. It is possible that they use a non-canonical initiation codon. Some of these non-canonical initiation codons are not the expected non-canonical initiation codon Isoleucine, but code for residues like Valine.

Trichuris muris

This release we see the integrated of the Edinburgh strain of Trichuris muris version TMUE3.1. This species has been fully integrated as a core species meaning there are stable IDs and tracking with inclusion in all additional pipelines and analysis.
The genome assembly and gene annotation has been taken directly from the Pathogen Genomics group at the WTSI. Additional mapping of gene mergers, splits and transfer of IDs from the TMUE2.2 has been done to allow users to identify their genes of interest.

Caenorhabditis nigoni

This release includes the Caenorhabditis nigoni genome assembly and gene set described in “Rapid genome shrinkage in a self-fertile nematode reveals sperm competition proteins” by Da Yin, et. al (Science 359,55-61 2018) as non-core species set.
This species should be of special interest, due to its phylogenetic closeness to C.briggsae and its differences in sexual reproduction.

The data is available as files on the WormBase FTP site, as well as the JBrowse genome browser.

WormBase Release WS263

We would like to announce the availability of the WormBase WS263 release on the WormBase website and FTP.

Some of the highlights of this release are:

New Pristionchus Assembly and Gene Set

The new data made available through WormBase is described in “Single-Molecule Sequencing Reveals the Chromosome-Scale Genomic Architecture of the Nematode Model Organism Pristionchus pacificus by Christian Roedelsperger, et al. in Cell.

ID mapping was carried out to try and preserve the existing IDs. The published IDs were also incorporated into the database so that they are searchable on wormbase.org.

New PantherDB data

the PantherDB orthology set included in WormBase has been updated to the latest version (13.0).

DNAseI hypersensitive Sites

We have created 42,728 Features with a Method=DNAseI_hypersensitive_site to mark DNase hypersensitivity sites found in C. elegans ’embryo’ and ‘L1 arrest’ life-stages in non-coding regions from the paper WBPaper00053259. These sites, together with Transcription Factor footprints, will be added to the tracks available in the genome browsers.

Genome Res. 2017 Dec;27(12):2108-2119. “Genome-wide discovery of active regulatory elements and transcription factor footprints in Caenorhabditis elegans using DNase-seq.”  Ho MCW, Quintero-Cadena P, Sternberg PW.

WormBase Version WS262

If you noticed, version WS262 has been out for several days.  The version number present at the top of the WormBase home page is now a link to the Release Notes which is a concise summary listing the various data types and their numbers.  Check it out if you want to get a quick view of the breadth and depth of the data in WormBase and to see what has changed since the last release.  Also find the list of data files available for this release, on our FTP site.