Log on / register
BioMed Central home | Journals A-Z | Feedback | Support | My details
Open AccessHighly AccessResearch

Analysis of computational approaches for motif discovery

Nan Li email and Martin Tompa email

Department of Computer Science and Engineering, Box 352350, University of Washington, Seattle, WA 98195-2350, USA

author email corresponding author email

Algorithms for Molecular Biology 2006, 1:8doi:10.1186/1748-7188-1-8

Published: 19 May 2006

Abstract

Recently, we performed an assessment of 13 popular computational tools for discovery of transcription factor binding sites (M. Tompa, N. Li, et al., "Assessing Computational Tools for the Discovery of Transcription Factor Binding Sites", Nature Biotechnology, Jan. 2005). This paper contains follow-up analysis of the assessment results, and raises and discusses some important issues concerning the state of the art in motif discovery methods: 1. We categorize the objective functions used by existing tools, and design experiments to evaluate whether any of these objective functions is the right one to optimize. 2. We examine various features of the data sets that were used in the assessment, such as sequence length and motif degeneracy, and identify which features make data sets hard for current motif discovery tools. 3. We identify an important feature that has not yet been used by existing tools and propose a new objective function that incorporates this feature.


© 1999-2008 BioMed Central Ltd unless otherwise stated < info@biomedcentral.com >   Terms and conditions