Bioinformatics summary - current status

Species 454 ID Small RNA sequence count cDNA targets used
total data mapped
zero mis-matches
sequence
count
sequence
source
reads distinct reads distinct
Maize:
Run # 1 157,944 65,272 108,399 42,887 36,563 ZmGI
release 16
TIGR
Run # 2 119,098 81,719 67,896 43988

Rice: Run # 1 53,750 25,009 35,486 12,868 62,827 OSA1
release 4
TIGR
Run # 2 164,623 122,523 121,167 88,481

RoApx 319,964 185,467 274,648 149,401
ShApx 303,622 219,754 262,040 183,861
Infl 429,729 283,688 363,637 229,151
Leaf 355,902 147,153 261,191 91,105
N.B. In keeping with the predominant notation in the small RNA community, the column formally labeled as 'unique' in the above table is now labelled as 'distinct', and refers to the sequence and not to a map position. That is, the total nucleotide sequences can be reduced to a set of distinct sequences, some of which may be mapped to multiple places within the genome sequence.


Plant small RNAs, which include miRNAs, siRNAs and trans-acting siRNAs (ta-siRNAs), represent a class of regulatory molecule that is increasingly being seen as a significant component of epigenetic processes as well as an important component of gene networks involved in development and in homeostasis. Here we present a bioinformatic resource for cereal crops consisting of large-scale datasets of maize and rice small RNA sequences generated by 454 Life Science sequencing. The small RNA sequences have been mapped to the rice genome and available maize genome sequence and are presented in two genome browser datasets using the Generic Genome Browser (Lincoln Stein). Potential target sequences representing mature mRNA sequences have been predicted using the FASTH software from the Zuker lab. and access to the resulting small RNA target pair (SRTP) dataset has been made available through a mysql based relational database. Within the genome browser the small RNAs have links to the SRTP database that will return a list of potential targets. The SRTP database may also be searched independently using both small RNA and target transcript queries. Data linking and integration is the main focus of this interface and to this aim links are present in the SRTP results pages back to the browser and the SRTP database as well as external sites. The resource will be updated as more sequences become available.