Log on / register
BioMed Central home | Journals A-Z | Feedback | Support | My details
Open AccessHighly AccessResearch

HuMiTar: A sequence-based method for prediction of human microRNA targets

Jishou Ruan1 email, Hanzhe Chen1 email, Lukasz Kurgan2 email, Ke Chen2 email, Chunsheng Kang3 email and Peiyu Pu3 email

Chern Institute for Mathematics, College of Mathematics and LPMC, Nankai University, Tianjin, PR China

Department of Electrical and Computer Engineering, University of Alberta, Canada

Neuro-oncology laboratory, General Hospital of the Tianjin Medical University, Tianjin, PR China

author email corresponding author email

Algorithms for Molecular Biology 2008, 3:16doi:10.1186/1748-7188-3-16

Published: 22 December 2008

Abstract

Background

MicroRNAs (miRs) are small noncoding RNAs that bind to complementary/partially complementary sites in the 3' untranslated regions of target genes to regulate protein production of the target transcript and to induce mRNA degradation or mRNA cleavage. The ability to perform accurate, high-throughput identification of physiologically active miR targets would enable functional characterization of individual miRs. Current target prediction methods include traditional approaches that are based on specific base-pairing rules in the miR's seed region and implementation of cross-species conservation of the target site, and machine learning (ML) methods that explore patterns that contrast true and false miR-mRNA duplexes. However, in the case of the traditional methods research shows that some seed region matches that are conserved are false positives and that some of the experimentally validated target sites are not conserved.

Results

We present HuMiTar, a computational method for identifying common targets of miRs, which is based on a scoring function that considers base-pairing for both seed and non-seed positions for human miR-mRNA duplexes. Our design shows that certain non-seed miR nucleotides, such as 14, 18, 13, 11, and 17, are characterized by a strong bias towards formation of Watson-Crick pairing. We contrasted HuMiTar with several representative competing methods on two sets of human miR targets and a set of ten glioblastoma oncogenes. Comparison with the two best performing traditional methods, PicTar and TargetScanS, and a representative ML method that considers the non-seed positions, NBmiRTar, shows that HuMiTar predictions include majority of the predictions of the other three methods. At the same time, the proposed method is also capable of finding more true positive targets as a trade-off for an increased number of predictions. Genome-wide predictions show that the proposed method is characterized by 1.99 signal-to-noise ratio and linear, with respect to the length of the mRNA sequence, computational complexity. The ROC analysis shows that HuMiTar obtains results comparable with PicTar, which are characterized by high true positive rates that are coupled with moderate values of false positive rates.

Conclusion

The proposed HuMiTar method constitutes a step towards providing an efficient model for studying translational gene regulation by miRs.


© 1999-2010 BioMed Central Ltd unless otherwise stated. Part of Springer Science+Business Media.