<?xml version = '1.0' encoding = 'UTF-8'?>
<?xml-stylesheet href="/rss/styledrssBMC.css" type="text/css"?>
<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns="http://purl.org/rss/1.0/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:extra="http://www.biomedcentral.com/xml/schemas/extra/" xmlns:prism="http://prismstandard.org/namespaces/1.2/basic/" xmlns:cc="http://web.resource.org/cc/">
	<channel rdf:about="http://www.biomedcentral.com/rss">
		<extra:info rdf:parseType="Literal">
			<html:div xmlns:html="http://www.w3.org/1999/xhtml" style="font:14px Verdana, Geneva, Arial, Helvetica, sans-serif">
				<html:span style="font-weight:bold">This is an RSS newsfeed from BioMed Central</html:span>
				<html:br/>
				<html:span style="font-size: 12px;">It is intended to be used with an RSS reader. For more information about RSS newsfeeds from BioMed Central, visit <html:br/><html:a href="http://www.biomedcentral.com/info/about/rss/" style="color:#3333CC; font-size:12px;">http://www.biomedcentral.com/info/about/rss/</html:a><html:br/>
				</html:span>
			</html:div>
		</extra:info>
		<title>Algorithms for Molecular Biology - Most viewed articles</title>
		<link>http://www.almob.orgmostviewed/</link>
		<description>Most viewed articles in last 30 days from Algorithms for Molecular Biology (ISSN 1748-7188) published by 
				
				BioMed Central
		</description>
        <cc:license rdf:resource="http://creativecommons.org/licenses/by/2.0/"/>
         <items>
            <rdf:Seq>
            
				    <rdf:li rdf:resource="http://www.almob.org/content/3/1/8"/>			    
            
				    <rdf:li rdf:resource="http://www.almob.org/content/3/1/9"/>			    
            
				    <rdf:li rdf:resource="http://www.almob.org/content/3/1/7"/>			    
            
				    <rdf:li rdf:resource="http://www.almob.org/content/3/1/6"/>			    
            
				    <rdf:li rdf:resource="http://www.almob.org/content/3/1/10"/>			    
            
				    <rdf:li rdf:resource="http://www.almob.org/content/3/1/5"/>			    
            
				    <rdf:li rdf:resource="http://www.almob.org/content/1/1/6"/>			    
            
				    <rdf:li rdf:resource="http://www.almob.org/content/3/1/3"/>			    
            
				    <rdf:li rdf:resource="http://www.almob.org/content/3/1/4"/>			    
            
				    <rdf:li rdf:resource="http://www.almob.org/content/3/1/2"/>			    
            
            </rdf:Seq>
        </items>
    </channel>
    
		<item rdf:about="http://www.almob.org/content/3/1/8">
            
            <title>A weighted average difference method for detecting differentially expressed genes from microarray data</title>
			<description>Background:
Identification of differentially expressed genes (DEGs) under different experimental conditions is an important task in many microarray studies. However, choosing which method to use for a particular application is problematic because its performance depends on the evaluation metric, the dataset, and so on. In addition, when using the Affymetrix GeneChip&#174; system, researchers must select a preprocessing algorithm from a number of competing algorithms such as MAS, RMA, and DFW, for obtaining expression-level measurements. To achieve optimal performance for detecting DEGs, a suitable combination of gene selection method and preprocessing algorithm needs to be selected for a given probe-level dataset.
Results:
We introduce a new fold-change (FC)-based method, the weighted average difference method (WAD), for ranking DEGs. It uses the average difference and relative average signal intensity so that highly expressed genes are highly ranked on the average for the different conditions. The idea is based on our observation that known or potential marker genes (or proteins) tend to have high expression levels. We compared WAD with seven other methods; average difference (AD), FC, rank products (RP), moderated t statistic (modT), significance analysis of microarrays (samT), shrinkage t statistic (shrinkT), and intensity-based moderated t statistic (ibmT). The evaluation was performed using a total of 38 different binary (two-class) probe-level datasets: two artificial "spike-in" datasets and 36 real experimental datasets. The results indicate that WAD outperforms the other methods when sensitivity and specificity are considered simultaneously: the area under the receiver operating characteristic curve for WAD was the highest on average for the 38 datasets. The gene ranking for WAD was also the most consistent when subsets of top-ranked genes produced from three different preprocessed data (MAS, RMA, and DFW) were compared. Overall, WAD performed the best for MAS-preprocessed data and the FC-based methods (AD, WAD, FC, or RP) performed well for RMA and DFW-preprocessed data.
Conclusion:
WAD is a promising alternative to existing methods for ranking DEGs with two classes. Its high performance should increase researchers' confidence in microarray analyses.</description>
			<link>http://www.almob.org/content/3/1/8</link>		
			<dc:creator>Koji Kadota, Yuji Nakai and Kentaro Shimizu</dc:creator>
			<dc:source>Algorithms for Molecular Biology 2008, 3:8</dc:source>
			<dc:subject>Number of accesses: 764</dc:subject>
			<dc:date>2008-06-26</dc:date>
			<dc:identifier>doi:10.1186/1748-7188-3-8</dc:identifier>
			
			
							
					<prism:publicationName>Algorithms for Molecular Biology</prism:publicationName>
					
			
							
					<prism:issn>1748-7188</prism:issn>
					
			
							
					<prism:volume>3</prism:volume>
					
			
							
					<prism:startingPage>8</prism:startingPage>
					
			
							
					<prism:publicationDate>2008-06-26</prism:publicationDate>
					

            <cc:license rdf:resource="http://creativecommons.org/licenses/by/2.0/"/>
        </item>
	
		<item rdf:about="http://www.almob.org/content/3/1/9">
            
            <title>Metabolite-based clustering and visualization of mass spectrometry data using one-dimensional self-organizing maps</title>
			<description>Background:
One of the goals of global metabolomic analysis is to identify metabolic markers that are hidden within a large background of data originating from high-throughput analytical measurements. Metabolite-based clustering is an unsupervised approach for marker identification based on grouping similar concentration profiles of putative metabolites. A major problem of this approach is that in general there is no prior information about an adequate number of clusters.
Results:
We present an approach for data mining on metabolite intensity profiles as obtained from mass spectrometry measurements. We propose one-dimensional self-organizing maps for metabolite-based clustering and visualization of marker candidates. In a case study on the wound response of Arabidopsis thaliana, based on metabolite profile intensities from eight different experimental conditions, we show how the clustering and visualization capabilities can be used to identify relevant groups of markers.
Conclusion:
Our specialized realization of self-organizing maps is well-suitable to gain insight into complex pattern variation in a large set of metabolite profiles. In comparison to other methods our visualization approach facilitates the identification of interesting groups of metabolites by means of a convenient overview on relevant intensity patterns. In particular, the visualization effectively supports researchers in analyzing many putative clusters when the true number of biologically meaningful groups is unknown.</description>
			<link>http://www.almob.org/content/3/1/9</link>		
			<dc:creator>Peter Meinicke, Thomas Lingner, Alexander Kaever, Kirstin Feussner, Cornelia G&#246;bel, Ivo Feussner, Petr Karlovsky and Burkhard Morgenstern</dc:creator>
			<dc:source>Algorithms for Molecular Biology 2008, 3:9</dc:source>
			<dc:subject>Number of accesses: 655</dc:subject>
			<dc:date>2008-06-26</dc:date>
			<dc:identifier>doi:10.1186/1748-7188-3-9</dc:identifier>
			
			
							
					<prism:publicationName>Algorithms for Molecular Biology</prism:publicationName>
					
			
							
					<prism:issn>1748-7188</prism:issn>
					
			
							
					<prism:volume>3</prism:volume>
					
			
							
					<prism:startingPage>9</prism:startingPage>
					
			
							
					<prism:publicationDate>2008-06-26</prism:publicationDate>
					

            <cc:license rdf:resource="http://creativecommons.org/licenses/by/2.0/"/>
        </item>
	
		<item rdf:about="http://www.almob.org/content/3/1/7">
            
            <title>Noisy: Identification of problematic columns in multiple sequence alignments</title>
			<description>MotivationSequence-based methods for phylogenetic reconstruction from (nucleic acid) sequence data are notoriously plagued by two effects: homoplasies and alignment errors. Large evolutionary distances imply a large number of homoplastic sites. As most protein-coding genes show dramatic variations in substitution rates that are not uncorrelated across the sequence, this often leads to a patchwork pattern of (i) phylogenetically informative and (ii) effectively randomized regions. In highly variable regions, furthermore, alignment errors accumulate resulting in sometimes misleading signals in phylogenetic reconstruction.
Results:
We present here a method that, based on assessing the distribution of character states along a cyclic ordering of the taxa, allows the identification of phylogenetically uninformative homoplastic sites in a multiple sequence alignment. Removal of these sites appears to improve the performance of phylogenetic reconstruction algorithms as measured by various indices of "tree quality". In particular, we obtain more stable trees due to the exclusion of phylogenetically incompatible sites that most likely represent strongly randomized characters.SoftwareThe computer program noisy implements this approach. It can be employed to improving phylogenetic reconstruction capability with quite a considerable success rate whenever (1) the average bootstrap support obtained from the original alignment is low, and (2) there are sufficiently many taxa in the data set &#8211; at least, say, 12 to 15 taxa. The software can be obtained under the GNU Public License from http://www.bioinf.uni-leipzig.de/Software/noisy/.</description>
			<link>http://www.almob.org/content/3/1/7</link>		
			<dc:creator>Andreas WM Dress, Christoph Flamm, Guido Fritzsch, Stefan Gr&#252;newald, Matthias Kruspe, Sonja J Prohaska and Peter F Stadler</dc:creator>
			<dc:source>Algorithms for Molecular Biology 2008, 3:7</dc:source>
			<dc:subject>Number of accesses: 463</dc:subject>
			<dc:date>2008-06-24</dc:date>
			<dc:identifier>doi:10.1186/1748-7188-3-7</dc:identifier>
			
			
							
					<prism:publicationName>Algorithms for Molecular Biology</prism:publicationName>
					
			
							
					<prism:issn>1748-7188</prism:issn>
					
			
							
					<prism:volume>3</prism:volume>
					
			
							
					<prism:startingPage>7</prism:startingPage>
					
			
							
					<prism:publicationDate>2008-06-24</prism:publicationDate>
					

            <cc:license rdf:resource="http://creativecommons.org/licenses/by/2.0/"/>
        </item>
	
		<item rdf:about="http://www.almob.org/content/3/1/6">
            
            <title>DIALIGN-TX: greedy and progressive approaches for segment-based multiple sequence alignment</title>
			<description>Background:
DIALIGN-T is a reimplementation of the multiple-alignment program DIALIGN. Due to several algorithmic improvements, it produces significantly better alignments on locally and globally related sequence sets than previous versions of DIALIGN. However, like the original implementation of the program, DIALIGN-T uses a a straight-forward greedy approach to assemble multiple alignments from local pairwise sequence similarities. Such greedy approaches may be vulnerable to spurious random similarities and can therefore lead to suboptimal results. In this paper, we present DIALIGN-TX, a substantial improvement of DIALIGN-T that combines our previous greedy algorithm with a progressive alignment approach.
Results:
Our new heuristic produces significantly better alignments, especially on globally related sequences, without increasing the CPU time and memory consumption exceedingly. The new method is based on a guide tree; to detect possible spurious sequence similarities, it employs a vertex-cover approximation on a conflict graph. We performed benchmarking tests on a large set of nucleic acid and protein sequences For protein benchmarks we used the benchmark database BALIBASE 3 and an updated release of the database IRMBASE 2 for assessing the quality on globally and locally related sequences, respectively. For alignment of nucleic acid sequences, we used BRAliBase II for global alignment and a newly developed database of locally related sequences called DIRM-BASE 1. IRMBASE 2 and DIRMBASE 1 are constructed by implanting highly conserved motives at random positions in long unalignable sequences.
Conclusion:
On BALIBASE3, our new program performs significantly better than the previous program DIALIGN-T and outperforms the popular global aligner CLUSTAL W, though it is still outperformed by programs that focus on global alignment like MAFFT, MUSCLE and T-COFFEE. On the locally related test sets in IRMBASE 2 and DIRM-BASE 1, our method outperforms all other programs while MAFFT E-INSi is the only method that comes close to the performance of DIALIGN-TX.</description>
			<link>http://www.almob.org/content/3/1/6</link>		
			<dc:creator>Amarendran R Subramanian, Michael Kaufmann and Burkhard Morgenstern</dc:creator>
			<dc:source>Algorithms for Molecular Biology 2008, 3:6</dc:source>
			<dc:subject>Number of accesses: 372</dc:subject>
			<dc:date>2008-05-27</dc:date>
			<dc:identifier>doi:10.1186/1748-7188-3-6</dc:identifier>
			
			
							
					<prism:publicationName>Algorithms for Molecular Biology</prism:publicationName>
					
			
							
					<prism:issn>1748-7188</prism:issn>
					
			
							
					<prism:volume>3</prism:volume>
					
			
							
					<prism:startingPage>6</prism:startingPage>
					
			
							
					<prism:publicationDate>2008-05-27</prism:publicationDate>
					

            <cc:license rdf:resource="http://creativecommons.org/licenses/by/2.0/"/>
        </item>
	
		<item rdf:about="http://www.almob.org/content/3/1/10">
            
            <title>A stitch in time: Efficient computation of genomic DNA melting bubbles</title>
			<description>Background:
It is of biological interest to make genome-wide predictions of the locations of DNA melting bubbles using statistical mechanics models. Computationally, this poses the challenge that a generic search through all combinations of bubble starts and ends is quadratic.
Results:
An efficient algorithm is described, which shows that the time complexity of the task is O(NlogN) rather than quadratic. The algorithm exploits that bubble lengths may be limited, but without a prior assumption of a maximal bubble length. No approximations, such as windowing, have been introduced to reduce the time complexity. More than just finding the bubbles, the algorithm produces a stitch profile, which is a probabilistic graphical model of bubbles and helical regions. The algorithm applies a probability peak finding method based on a hierarchical analysis of the energy barriers in the Poland-Scheraga model.
Conclusions:
Exact and fast computation of genomic stitch profiles is thus feasible. Sequences of several megabases have been computed, only limited by computer memory. Possible applications are the genome-wide comparisons of bubbles with promotors, TSS, viral integration sites, and other melting-related regions.</description>
			<link>http://www.almob.org/content/3/1/10</link>		
			<dc:creator>Eivind Tostesen</dc:creator>
			<dc:source>Algorithms for Molecular Biology 2008, 3:10</dc:source>
			<dc:subject>Number of accesses: 257</dc:subject>
			<dc:date>2008-07-17</dc:date>
			<dc:identifier>doi:10.1186/1748-7188-3-10</dc:identifier>
			
			
							
					<prism:publicationName>Algorithms for Molecular Biology</prism:publicationName>
					
			
							
					<prism:issn>1748-7188</prism:issn>
					
			
							
					<prism:volume>3</prism:volume>
					
			
							
					<prism:startingPage>10</prism:startingPage>
					
			
							
					<prism:publicationDate>2008-07-17</prism:publicationDate>
					

            <cc:license rdf:resource="http://creativecommons.org/licenses/by/2.0/"/>
        </item>
	
		<item rdf:about="http://www.almob.org/content/3/1/5">
            
            <title>On the optimality of the neighbor-joining algorithm</title>
			<description>The popular neighbor-joining (NJ) algorithm used in phylogenetics is a greedy algorithm for finding the balanced minimum evolution (BME) tree associated to a dissimilarity map. From this point of view, NJ is "optimal" when the algorithm outputs the tree which minimizes the balanced minimum evolution criterion. We use the fact that the NJ tree topology and the BME tree topology are determined by polyhedral subdivisions of the spaces of dissimilarity maps R+(n2)
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaWenfgDOvwBHrxAJfwnHbqeg0uy0HwzTfgDPnwy1aaceaGae83gHi1aa0baaSqaaiabgUcaRaqaamaabmaabaqbaeqabiqaaaqaaiabd6gaUbqaaiabikdaYaaaaiaawIcacaGLPaaaaaaaaa@3BA1@ to study the optimality of the neighbor-joining algorithm. In particular, we investigate and compare the polyhedral subdivisions for n &#8804; 8. This requires the measurement of volumes of spherical polytopes in high dimension, which we obtain using a combination of Monte Carlo methods and polyhedral algorithms. Our results include a demonstration that highly unrelated trees can be co-optimal in BME reconstruction, and that NJ regions are not convex. We obtain the l2 radius for neighbor-joining for n = 5 and we conjecture that the ability of the neighbor-joining algorithm to recover the BME tree depends on the diameter of the BME tree.</description>
			<link>http://www.almob.org/content/3/1/5</link>		
			<dc:creator>Kord Eickmeyer, Peter Huggins, Lior Pachter and Ruriko Yoshida</dc:creator>
			<dc:source>Algorithms for Molecular Biology 2008, 3:5</dc:source>
			<dc:subject>Number of accesses: 257</dc:subject>
			<dc:date>2008-04-30</dc:date>
			<dc:identifier>doi:10.1186/1748-7188-3-5</dc:identifier>
			
			
							
					<prism:publicationName>Algorithms for Molecular Biology</prism:publicationName>
					
			
							
					<prism:issn>1748-7188</prism:issn>
					
			
							
					<prism:volume>3</prism:volume>
					
			
							
					<prism:startingPage>5</prism:startingPage>
					
			
							
					<prism:publicationDate>2008-04-30</prism:publicationDate>
					

            <cc:license rdf:resource="http://creativecommons.org/licenses/by/2.0/"/>
        </item>
	
		<item rdf:about="http://www.almob.org/content/1/1/6">
            
            <title>Multiple sequence alignment with user-defined anchor points</title>
			<description>Background:
Automated software tools for multiple alignment often fail to produce biologically meaningful results. In such situations, expert knowledge can help to improve the quality of alignments.
Results:
Herein, we describe a semi-automatic version of the alignment program DIALIGN that can take pre-defined constraints into account. It is possible for the user to specify parts of the sequences that are assumed to be homologous and should therefore be aligned to each other. Our software program can use these sites as anchor points by creating a multiple alignment respecting these constraints. This way, our alignment method can produce alignments that are biologically more meaningful than alignments produced by fully automated procedures. As a demonstration of how our method works, we apply our approach to genomic sequences around the Hox gene cluster and to a set of DNA-binding proteins. As a by-product, we obtain insights about the performance of the greedy algorithm that our program uses for multiple alignment and about the underlying objective function. This information will be useful for the further development of DIALIGN. The described alignment approach has been integrated into the TRACKER software system.</description>
			<link>http://www.almob.org/content/1/1/6</link>		
			<dc:creator>Burkhard Morgenstern, Sonja J Prohaska, Dirk P&#246;hler and Peter F Stadler</dc:creator>
			<dc:source>Algorithms for Molecular Biology 2006, 1:6</dc:source>
			<dc:subject>Number of accesses: 187</dc:subject>
			<dc:date>2006-04-19</dc:date>
			<dc:identifier>doi:10.1186/1748-7188-1-6</dc:identifier>
			
			
							
					<prism:publicationName>Algorithms for Molecular Biology</prism:publicationName>
					
			
							
					<prism:issn>1748-7188</prism:issn>
					
			
							
					<prism:volume>1</prism:volume>
					
			
							
					<prism:startingPage>6</prism:startingPage>
					
			
							
					<prism:publicationDate>2006-04-19</prism:publicationDate>
					

            <cc:license rdf:resource="http://creativecommons.org/licenses/by/2.0/"/>
        </item>
	
		<item rdf:about="http://www.almob.org/content/3/1/3">
            
            <title>A scoring matrix approach to detecting miRNA target sites</title>
			<description>Background:
Experimental identification of microRNA (miRNA) targets is a difficult and time consuming process. As a consequence several computational prediction methods have been devised in order to predict targets for follow up experimental validation. Current computational target prediction methods use only the miRNA sequence as input. With an increasing number of experimentally validated targets becoming available, utilising this additional information in the search for further targets may help to improve the specificity of computational methods for target site prediction.
Results:
We introduce a generic target prediction method, the Stacking Binding Matrix (SBM) that uses both information about the miRNA as well as experimentally validated target sequences in the search for candidate target sequences. We demonstrate the utility of our method by applying it to both animal and plant data sets and compare it with miRanda, a commonly used target prediction method.
Conclusion:
We show that SBM can be applied to target prediction in both plants and animals and performs well in terms of sensitivity and specificity. Open source code implementing the SBM method, together with documentation and examples are freely available for download from the address in the Availability and Requirements section.</description>
			<link>http://www.almob.org/content/3/1/3</link>		
			<dc:creator>Simon Moxon, Vincent Moulton and Jan T Kim</dc:creator>
			<dc:source>Algorithms for Molecular Biology 2008, 3:3</dc:source>
			<dc:subject>Number of accesses: 176</dc:subject>
			<dc:date>2008-03-31</dc:date>
			<dc:identifier>doi:10.1186/1748-7188-3-3</dc:identifier>
			
			
							
					<prism:publicationName>Algorithms for Molecular Biology</prism:publicationName>
					
			
							
					<prism:issn>1748-7188</prism:issn>
					
			
							
					<prism:volume>3</prism:volume>
					
			
							
					<prism:startingPage>3</prism:startingPage>
					
			
							
					<prism:publicationDate>2008-03-31</prism:publicationDate>
					

            <cc:license rdf:resource="http://creativecommons.org/licenses/by/2.0/"/>
        </item>
	
		<item rdf:about="http://www.almob.org/content/3/1/4">
            
            <title>Protein sequence and structure alignments within one framework</title>
			<description>Background:
Protein structure alignments are usually based on very different techniques to sequence alignments. We propose a method which treats sequence, structure and even combined sequence + structure in a single framework. Using a probabilistic approach, we calculate a similarity measure which can be applied to fragments containing only protein sequence, structure or both simultaneously.
Results:
Proof-of-concept results are given for the different problems. For sequence alignments, the methodology is no better than conventional methods. For structure alignments, the techniques are very fast, reliable and tolerant of a range of alignment parameters. Combined sequence and structure alignments may provide a more reliable alignment for pairs of proteins where pure structural alignments can be misled by repetitive elements or apparent symmetries.
Conclusion:
The probabilistic framework has an elegance in principle, merging sequence and structure descriptors into a single framework. It has a practical use in fast structural alignments and a potential use in finding those examples where sequence and structural similarities apparently disagree.</description>
			<link>http://www.almob.org/content/3/1/4</link>		
			<dc:creator>Gundolf Schenk, Thomas Margraf and Andrew E Torda</dc:creator>
			<dc:source>Algorithms for Molecular Biology 2008, 3:4</dc:source>
			<dc:subject>Number of accesses: 162</dc:subject>
			<dc:date>2008-04-01</dc:date>
			<dc:identifier>doi:10.1186/1748-7188-3-4</dc:identifier>
			
			
							
					<prism:publicationName>Algorithms for Molecular Biology</prism:publicationName>
					
			
							
					<prism:issn>1748-7188</prism:issn>
					
			
							
					<prism:volume>3</prism:volume>
					
			
							
					<prism:startingPage>4</prism:startingPage>
					
			
							
					<prism:publicationDate>2008-04-01</prism:publicationDate>
					

            <cc:license rdf:resource="http://creativecommons.org/licenses/by/2.0/"/>
        </item>
	
		<item rdf:about="http://www.almob.org/content/3/1/2">
            
            <title>Learning from positive examples when the negative class is undetermined- microRNA gene identification</title>
			<description>Background:
The application of machine learning to classification problems that depend only on positive examples is gaining attention in the computational biology community. We and others have described the use of two-class machine learning to identify novel miRNAs. These methods require the generation of an artificial negative class. However, designation of the negative class can be problematic and if it is not properly done can affect the performance of the classifier dramatically and/or yield a biased estimate of performance. We present a study using one-class machine learning for microRNA (miRNA) discovery and compare one-class to two-class approaches using na&#239;ve Bayes and Support Vector Machines. These results are compared to published two-class miRNA prediction approaches. We also examine the ability of the one-class and two-class techniques to identify miRNAs in newly sequenced species.
Results:
Of all methods tested, we found that 2-class naive Bayes and Support Vector Machines gave the best accuracy using our selected features and optimally chosen negative examples. One class methods showed average accuracies of 70&#8211;80% versus 90% for the two 2-class methods on the same feature sets. However, some one-class methods outperform some recently published two-class approaches with different selected features. Using the EBV genome as and external validation of the method we found one-class machine learning to work as well as or better than a two-class approach in identifying true miRNAs as well as predicting new miRNAs.
Conclusion:
One and two class methods can both give useful classification accuracies when the negative class is well characterized. The advantage of one class methods is that it eliminates guessing at the optimal features for the negative class when they are not well defined. In these cases one-class methods can be superior to two-class methods when the features which are chosen as representative of that positive class are well defined.AvailabilityThe OneClassmiRNA program is available at: 1</description>
			<link>http://www.almob.org/content/3/1/2</link>		
			<dc:creator>Malik Yousef, Segun Jung, Louise C Showe and Michael K Showe</dc:creator>
			<dc:source>Algorithms for Molecular Biology 2008, 3:2</dc:source>
			<dc:subject>Number of accesses: 141</dc:subject>
			<dc:date>2008-01-28</dc:date>
			<dc:identifier>doi:10.1186/1748-7188-3-2</dc:identifier>
			
			
							
					<prism:publicationName>Algorithms for Molecular Biology</prism:publicationName>
					
			
							
					<prism:issn>1748-7188</prism:issn>
					
			
							
					<prism:volume>3</prism:volume>
					
			
							
					<prism:startingPage>2</prism:startingPage>
					
			
							
					<prism:publicationDate>2008-01-28</prism:publicationDate>
					

            <cc:license rdf:resource="http://creativecommons.org/licenses/by/2.0/"/>
        </item>
		
	<cc:License rdf:about="http://creativecommons.org/licenses/by/2.0/">
         <cc:permits rdf:resource="http://creativecommons.org/ns#Reproduction"/>
         <cc:permits rdf:resource="http://creativecommons.org/ns#Distribution"/>
         <cc:permits rdf:resource="http://creativecommons.org/ns#DerivativeWorks"/>
	</cc:License>
</rdf:RDF>
