NonCode aReNA database

The NonCode aReNA database is an integrated collection of manually curated and automatically annotated non-coding transcripts.
It currently integrates the following public sources:
- VEGA (release 67, Nov 2016)
- ENSEMBL (release 87, Dec 2016)
- NCBI REFSEQ (release 79, Nov 2016)
In addition, NonCode aReNA includes the following specialized curated resources:
- miRBase database of microRNAs (v. 21, last release, Jun 2014)
- GtRNAdb database of tRNAs (last release)
- piRNABank database of piRNAs (last release)
- TAIR Arabidopsis Information Resource (v. 10 - last stable version)

These resources share many entries, so redundancy was reduced through cross-referencing. Redundant entries are resolved by giving priority to manually curated data sources. The priority of the public data sources is: RefSeq -> Vega -> Ensembl
Similarity-based redundancy is also taken into account.
Currently, the NonCode aReNA database covers three model organisms: Homo sapiens, Mus musculus and Arabidopsis thaliana.
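
The priority rule above (RefSeq -> Vega -> Ensembl) can be illustrated with a minimal sketch. This is not the pipeline actually used to build the database. It assumes each entry carries a hypothetical cross-reference key ("xref") that links equivalent transcripts across sources. The field names, the function name and the example identifiers are invented for illustration, and similarity-based redundancy is not covered.

```python
# Priority order stated in the text: RefSeq first, then Vega, then Ensembl.
# A lower rank means a higher priority.
SOURCE_PRIORITY = {"REFSEQ": 0, "VEGA": 1, "ENSEMBL": 2}


def reduce_redundancy(records):
    """Keep one record per cross-reference key, preferring higher-priority sources.

    Each record is a dict with hypothetical fields:
      "xref"   - key shared by equivalent entries across sources (assumption)
      "source" - one of the keys of SOURCE_PRIORITY
      "id"     - source-specific identifier (illustrative only)
    """
    best = {}
    for rec in records:
        rank = SOURCE_PRIORITY[rec["source"]]
        kept = best.get(rec["xref"])
        # Replace the kept record only if the new one has a strictly higher priority.
        if kept is None or rank < SOURCE_PRIORITY[kept["source"]]:
            best[rec["xref"]] = rec
    return list(best.values())


# Illustrative input: identifiers are placeholders, not real accessions.
records = [
    {"xref": "geneA", "source": "ENSEMBL", "id": "ens-1"},
    {"xref": "geneA", "source": "REFSEQ", "id": "ref-1"},
    {"xref": "geneB", "source": "VEGA", "id": "vega-2"},
    {"xref": "geneB", "source": "ENSEMBL", "id": "ens-2"},
    {"xref": "geneC", "source": "ENSEMBL", "id": "ens-3"},
]

for rec in reduce_redundancy(records):
    print(rec["source"], rec["id"])
```

In this sketch an entry from a lower-priority source survives only when no higher-priority source provides an equivalent entry, which is why the Ensembl record for "geneC" is kept.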

DB Statistics

Species               num. transcripts
arabidopsis thaliana  1448
homo sapiens          175040
mus musculus          118544


Source     Species               num. transcripts
ENSEMBL    homo sapiens          5399
ENSEMBL    mus musculus          5392
GTRNADB    homo sapiens          53
GTRNADB    mus musculus          22
MIRBASE    homo sapiens          4468
MIRBASE    mus musculus          3094
PIRNABANK  homo sapiens          23439
PIRNABANK  mus musculus          39986
REFSEQ     homo sapiens          61568
REFSEQ     mus musculus          28049
TAIR       arabidopsis thaliana  1448
VEGA       homo sapiens          80113
VEGA       mus musculus          42001


Biotype                       num. transcripts
lncRNA                        65704
piRNA                         63425
retained_intron               44945
processed_transcript          41861
misc_RNA                      25957
lincRNA                       19160
antisense                     14640
miRNA                         8752
snRNA                         3323
snoRNA                        2610
sense_intronic                1258
miRNA_primary_transcript      1221
rRNA                          915
tRNA                          764
sense_overlapping             358
bidirectional_promoter_lncRNA 129
scaRNA                        80
Mt_tRNA                       44
3prime_overlapping_ncrna      37
guide_RNA                     31
ribozyme                      28
antisense_RNA                 14