miRcode - transcriptome-wide microRNA target prediction including lncRNAs

Transcriptome-wide microRNA target prediction including lncRNAs

miRcode 11 (June 2012) based on: UCSC GRCh37/hg19, GENCODE 11 transcripts, Multiz 46 species, TargetScan6 families

Data info

Browse

Download

FAQ

Larsson lab

Source data information

The current miRcode release is based on the hg19 genome assembly and relies on data sources listed below.

GENCODE transcripts/genes

Transcripts in the GENCODE 11 annotation were analyzed in all regions, and results aggregated on a per-gene basis. For easy-of-use, we classify genes into a few broad categories, but all of GENCODE is included and searchable.

Total transcripts¹		179,905
Total genes¹		53,520
LncRNA genes²	All	10,419
	Intergenic³	5,680
	Overlapping	4,739
Coding genes⁴		19,999
Pseudogenes⁵		12,549
Other⁶		10,553

¹Ambigously mapped transcripts are excluded, leading to subtle differences compared to official counts.
²Having no coding spliceforms, and mature transcripts >200 nt.
³Not overlapping with any transcript of a coding gene.
⁴Genes producing at least one coding, non-NMD, isoform, although several non-coding transcripts may also be produced.
⁵GENCODE pseudogenes are also included in this miRcode release.
⁶Remaining genes (e.g. tRNAs, snoRNAs, all-NMD coding genes).

Multiz alignments

The Multiz 46-way vertebrate alignment was used for evaluating site conservation.

Primates	9
Placental mammals	23
Non-mammal vertebrates	13

TargetScan microRNA family definitions

Analyses are based on microRNA seed families as defined by TargetScan 6, as these are widely adopted.

Inquiries can be addressed to erik.larsson@gu.se