<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
	<ui>1748-7188-2-16</ui>
	<ji>1748-7188</ji>
	<fm>
		<dochead>Research</dochead>
		<bibl>
			<title>
				<p>Evaluating deterministic motif significance measures in protein databases</p>
			</title>
			<aug>
				<au id="A1" ca="yes">
					<snm>Ferreira</snm>
					<mnm>Gabriel</mnm>
					<fnm>Pedro</fnm>
					<insr iid="I1"/>
					<email>pedrogabriel@di.uminho.pt</email>
				</au>
				<au id="A2">
					<snm>Azevedo</snm>
					<mi>J</mi>
					<fnm>Paulo</fnm>
					<insr iid="I1"/>
					<email>pja@di.uminho.pt</email>
				</au>
			</aug>
			<insg>
				<ins id="I1">
					<p>Department of Informatics, University of Minho, Campus de Gualtar, 4710-057 Braga, Portugal</p>
				</ins>
			</insg>
			<source>Algorithms for Molecular Biology</source>
			<issn>1748-7188</issn>
			<pubdate>2007</pubdate>
			<volume>2</volume>
			<issue>1</issue>
			<fpage>16</fpage>
			<url>http://www.almob.org/content/2/1/16</url>
			<xrefbib>
				<pubidlist>
					<pubid idtype="pmpid">18157916</pubid>
					<pubid idtype="doi">10.1186/1748-7188-2-16</pubid>
				</pubidlist>
			</xrefbib>
		</bibl>
		<history>
			<rec>
				<date>
					<day>15</day>
					<month>5</month>
					<year>2007</year>
				</date>
			</rec>
			<acc>
				<date>
					<day>24</day>
					<month>12</month>
					<year>2007</year>
				</date>
			</acc>
			<pub>
				<date>
					<day>24</day>
					<month>12</month>
					<year>2007</year>
				</date>
			</pub>
		</history>
		<cpyrt>
			<year>2007</year>
			<collab>Ferreira and Azevedo; licensee BioMed Central Ltd.</collab>
			<note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
		</cpyrt>
		<abs>
			<sec>
				<st>
					<p>Abstract</p>
				</st>
				<sec>
					<st>
						<p>Background</p>
					</st>
					<p>Assessing the outcome of motif mining algorithms is an essential task, as the number of reported motifs can be very large. Significance measures play a central role in automatically ranking those motifs, and therefore alleviating the analysis work. Spotting the most interesting and relevant motifs is then dependent on the choice of the right measures. The combined use of several measures may provide more robust results. However caution has to be taken in order to avoid spurious evaluations.</p>
				</sec>
				<sec>
					<st>
						<p>Results</p>
					</st>
					<p>From the set of conducted experiments, it was verified that several of the selected significance measures show a very similar behavior in a wide range of situations therefore providing redundant information. Some measures have proved to be more appropriate to rank highly conserved motifs, while others are more appropriate for weakly conserved ones. Support appears as a very important feature to be considered for correct motif ranking. We observed that not all the measures are suitable for situations with poorly balanced class information, like for instance, when positive data is significantly less than negative data. Finally, a visualization scheme was proposed that, when several measures are applied, enables an easy identification of high scoring motifs.</p>
				</sec>
				<sec>
					<st>
						<p>Conclusion</p>
					</st>
					<p>In this work we have surveyed and categorized 14 significance measures for pattern evaluation. Their ability to rank three types of deterministic motifs was evaluated. Measures were applied in different testing conditions, where relations were identified. This study provides some pertinent insights on the choice of the right set of significance measures for the evaluation of deterministic motifs extracted from protein databases.</p>
				</sec>
			</sec>
		</abs>
	</fm>
	<bdy>
		<sec>
			<st>
				<p>Introduction</p>
			</st>
			<p>The mining of <it>sequence patterns</it>, also called <it>motifs</it>, is one of the most important tasks in protein sequence analysis and continues to be an active topic of research. The large number of proposals found in the literature sustain this claim. Sequence mining is the task of analyzing a set of possible related sequences and detecting subtrings that occur significantly among those sequences. Motif over-representation can be explained by the existence of segments that have been preserved through the natural evolution of the proteins and suggests that the regions described by those substrings play a structural and functional role in the protein's mechanisms <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr></abbrgrp>. Different types of motifs representation have been proposed and two main classes can be distinguished: <it>probabilistic </it>and <it>deterministic</it>. A probabilistic motif consists of a model that simulates the sequences or part of the sequences under consideration. When an input sequence is provided, a probability of being matched by the motif is yielded. Position Weight Matrices (PWM) and Hidden Markov Models (HMMs) are examples of probabilistic motifs. Deterministic motifs are commonly expressed by an enhanced regular expression syntax, either matching or not the input sequences. This paper is devoted to the evaluation of significance measures for deterministic motif discovery in protein databases. A critical aspect of the motif analysis process is that due to the completeness nature of deterministic mining algorithms the number of extracted motifs is often very large. Not all these motifs are particularly interesting and most of them certainly arise by chance. Therefore, it is crucial to propose scoring methods to discriminate the relevant and significant motifs.</p>
			<p>By itself, the definition of a significant motif is an interesting problem. One possible solution to assess this significance is to delegate this decision to a biologist. An expert would analyze the target proteins and decide which motifs have biological interest. As this approach is only feasible for small and medium scale experiments, an alternative is to automatically evaluate motifs according to their statistical or informative importance. As pointed by Hart et al. in <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>, statistical significance is often correlated with biological significance and provides a meaningful criterion for the analysis of relevant motifs.</p>
			<p>In addition to support a better understanding of the protein's structure and function, motifs have also a wide-range of other applications. They can be used to perform clustering <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>, family classification <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr></abbrgrp>, discovery of sub-families in large protein families <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>, gene expression analysis <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr></abbrgrp> and the study and discovery of homology relations <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>. The selection of the appropriate measures for a specific problem depends on how well they adjust to the problem. In the literature, many measures of interest and significance have been proposed. How to choose the most appropriate significance measure is still an open question.</p>
			<p>Similar to this problem is the discovery of significant association rules. In the work of Tan, Kumar and Srivastava <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>, a survey and general evaluation of itemset interest measures is presented. Such measures were used to describe the statistical relationship between the items in a itemset <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. This problem is different from the motif evaluation problem, since an item occurs only once per itemset, which is not the case of motifs, where an item (called symbol) may occur repeatedly. Transcription Factor Binding Sites (TFBS) can be described by motifs with very specific characteristics. Typically, they consist of small length contiguous motifs, highly degenerated, i.e., with many ambiguous positions. In Tompa <it>et al. </it><abbrgrp><abbr bid="B16">16</abbr></abbrgrp>, an assessment of 13 popular algorithms for the discovery of TFBS was performed. Later, Li and Tompa <abbrgrp><abbr bid="B17">17</abbr></abbrgrp> have categorized and examined the adequacy of three popular significance functions used by the algorithms described in <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>.</p>
			<p>Although, these studies were designed for problems other than protein motif analysis, they may bring important improvements to the field. For instance, the results of the unsupervised mining of massive protein datasets, such as the SwissProt <abbrgrp><abbr bid="B18">18</abbr></abbrgrp> comprehensive protein sequence database, are almost impossible to be properly analyzed. This can be mainly due to the inexistence of measures that objectively and automatically evaluate the biological significance of newly discovered motifs and allow the identification of the truly significant motifs among the irrelevant ones.</p>
			<p>Different measures evaluate different properties. Thus, the best solution for a particular problem may include the simultaneous use of several measures. Given that some of these measures will show consistent or even very similar results, it is important to identify such relations in order to avoid biased evaluations. We are also interested in studying the impact of different problem characteristics and how certain operations inherent to the mining process affect these measures.</p>
			<p>The contributions of this paper can be summarized as follows:</p>
			<p>&#8226; It surveys and categorizes significance measures presented in the bioinformatics, data mining, statistics and machine learning literature.</p>
			<p>&#8226; It provides a comprehensive evaluation of the selected measures, in the presence of different motif and dataset characteristics.</p>
			<p>&#8226; It proposes a methodology that combines the information provided by several measures in order to highlight the most interesting motifs.</p>
			<p>The remainder of the paper is organized in two parts. In the first part we describe the characteristics of the evaluated motifs and the sources where the evaluated data is obtained. Significance measures are then introduced according to the considered categorization. The second part is dedicated to the experimental evaluation. We start by describing how motifs are extracted and then go on to the analysis of ranking, consistency and variability of the measures in a wide range of situations. In section "Motif Ranking Visualizer", we propose a methodology for identifying high scoring motifs and demonstrate its application. Finally, we conclude with the main lessons learned.</p>
			<sec>
				<st>
					<p>Evaluating Deterministic Motifs</p>
				</st>
				<p>Deterministic motifs are described in a regular expression based language, which tends to be easily understandable by humans. These motifs can be divided in two types: <it>fixed-length </it>and <it>extensible-length</it>. Fixed-length motifs (a.k.a (<it>l, d</it>)-motifs <abbrgrp><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr></abbrgrp>) consist of a string with a fixed size of <it>l </it>symbols where <it>d </it>possible symbols may have a mismatch with the matched sequences in the database. Extensible-length motifs have an arbitrary length with an arbitrary number of symbols and gaps. Consider the following abstract pattern:</p>
				<p>
					<display-formula><it>A</it><sub>1 </sub>- <it>x</it>(<it>p</it><sub>1</sub>, <it>q</it><sub>1</sub>) - <it>A</it><sub>2 </sub>- <it>x</it>(<it>p</it><sub>2</sub>, <it>q</it><sub>2</sub>) - ... - <it>A</it><sub><it>n</it></sub></display-formula>
				</p>
				<p><it>A</it><sub><it>i </it></sub>is a sequence of consecutive amino acids, called <it>component </it>and -<it>x</it>(<it>p</it><sub><it>i</it></sub>, <it>q</it><sub><it>i</it></sub>)- represents a gap greater or equal than <it>p</it><sub><it>i </it></sub>and smaller or equal than <it>q</it><sub><it>i</it></sub>. A symbol is considered to be <it>concrete </it>if it represents one of the twenty amino acid symbols. Three types of extensible-length motifs can be distinguished:</p>
				<p>&#8226; <b>Contiguous Motifs </b>contain no gaps, i.e., <it>p</it><sub><it>i </it></sub>= <it>q</it><sub><it>i </it></sub>= 0, &#8704;<it>i</it>, e.g. IPCCPV.</p>
				<p>&#8226; <b>Rigid Gap Motifs </b>only contain gaps with a fixed length, i.e., <it>p</it><sub><it>i </it></sub>= <it>q</it><sub><it>i</it></sub>, &#8704;<it>i</it>. The symbol '.' is a wild-card symbol used to denote a gap of size one and it matches any symbol of the alphabet, e.g. <it>MN..A.CA</it></p>
				<p>&#8226; <b>Flexible Gap Motifs </b>allow a variable number of gaps between events of the sequence, i.e., <it>p</it><sub><it>i </it></sub>&#8804; <it>q</it><sub><it>i</it></sub>, &#8704;<it>i</it>, e.g. AN-x(1,3)-C-x(4,6)-D.</p>
				<p>Deterministic motifs are typically mined through combinatorial algorithms that perform an exhaustive traversal of the search space and perform filtering using the support metric. The <it>support </it>of a motif is the number of different sequences where it occurs. For a motif to pass the filter, its support has to be equal or greater than a user pre-defined threshold (see <abbrgrp><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr></abbrgrp> for a comprehensive overview). Support is an <it>apriori </it>measure of statistical significance. Generally, further assessment of motif significance is done as a post-processing step.</p>
				<p>In this scenario, two important facts justify the critical need for the evaluation of significance measures. First it provides means for an early pruning of irrelevant motifs. The combinatorial nature of the deterministic mining process may deliver an exponentially increasing number of motifs. Thus, efficient pruning of irrelevant motifs results in performance improvement of the algorithms. Second, motifs over-representation does not necessarily imply significance.</p>
				<p>In this work, three types of extensible-length motifs will be used to perform the evaluation of fourteen significance measures.</p>
			</sec>
			<sec>
				<st>
					<p>The Prosite Database</p>
				</st>
				<p>There is a significant number of motif repositories freely available at the Internet. Examples of well established and reliable databases are: Prosite <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>, Prints <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>, Blocks <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>, InterPro <abbrgrp><abbr bid="B28">28</abbr></abbrgrp> or eMotif <abbrgrp><abbr bid="B29">29</abbr></abbrgrp> (see <abbrgrp><abbr bid="B30">30</abbr></abbrgrp> for an overview). From the listed databases, Prosite deserves a special attention in the context of our work. Prosite <abbrgrp><abbr bid="B25">25</abbr></abbrgrp> is the oldest and best known sequence motif database. It is semi-manually annotated and its motifs are characterized for having a high biological significance. They provide a strong indication of a region in the protein with an important role. A family of protein sequences is then described by one or more motifs. Since this database is considered a standard, new algorithms and methods tend to use it as a benchmark test-bed.</p>
			</sec>
			<sec>
				<st>
					<p>The Dilimot Database</p>
				</st>
				<p>One of the characteristics of the Prosite motifs is that they are strongly conserved in the respective families, covering the majority or the totality of their sequences. In order to perform an evaluation on less conserved motifs, we have used the Dilimot database <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>. It provides a service for finding over-represented, short (3 to 8 amino acids), rigid gap motifs in a set of protein sequences. Additionally, it makes available high-confidence pre-computed motif sets from different species. In this work, several motifs from human related proteins will be used.</p>
			</sec>
			<sec>
				<st>
					<p>Significance Measures</p>
				</st>
				<p>As introduced by Brazma <it>et al. </it><abbrgrp><abbr bid="B22">22</abbr></abbrgrp>, a significance measure can be defined as a function of the form: <it>f</it>(<it>M, C</it>) &#8594; &#8477;, where <it>M </it>represents the motif being evaluated and <it>C </it>is a set of related proteins sequences usually called <it>target family </it>or positive data. This function returns a real value score that expresses how relevant or significant is <it>M </it>with respect to <it>C</it>. These scores may provide hints to biologically or statistically relevant motifs. If additional sequence information is available, for example where motifs are less expected to occur, both positive and negative information can then be considered in the evaluation. The function can be extended to include the negative dataset <inline-formula><m:math name="1748-7188-2-16-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>C</m:mi><m:mo>&#175;</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGafm4qamKbaebaaaa@2CFA@</m:annotation></m:semantics></m:math></inline-formula>: <it>f</it>(<it>M, C</it>, <inline-formula><m:math name="1748-7188-2-16-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>C</m:mi><m:mo>&#175;</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGafm4qamKbaebaaaa@2CFA@</m:annotation></m:semantics></m:math></inline-formula>) &#8594; &#8477;. The universe of all sequences <it>U </it>corresponds to <it>U </it>= <it>C </it>+ <inline-formula><m:math name="1748-7188-2-16-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>C</m:mi><m:mo>&#175;</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGafm4qamKbaebaaaa@2CFA@</m:annotation></m:semantics></m:math></inline-formula> and the size of each set of sequences is denoted as |<it>C</it>| and |<inline-formula><m:math name="1748-7188-2-16-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>C</m:mi><m:mo>&#175;</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGafm4qamKbaebaaaa@2CFA@</m:annotation></m:semantics></m:math></inline-formula>|, respectively. We now distinguish four possible cases of a motif <it>M </it>matching a sequence of <it>C</it>:</p>
				<p>&#8226; <it>True Positive </it>(<it>T</it><sub><it>P</it></sub>): a sequence that belongs to the target family and matches the motif.</p>
				<p>&#8226; <it>True Negative </it>(<it>T</it><sub><it>N</it></sub>): a sequence that does not belong to the target family and does not match the motif.</p>
				<p>&#8226; <it>False Negative </it>(<it>F</it><sub><it>N</it></sub>): a sequence that belongs to the target family and does not match the motif.</p>
				<p>&#8226; <it>False Positive </it>(<it>F</it><sub><it>P</it></sub>): a sequence that does not belong to the target family and matches the motif.</p>
				<p>Sagot <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>, suggests that motifs can be evaluated according to the following approaches: probability of matching a random sequence, sensitivity/specificity, information content and minimum description length (MDL). Since this categorization does not include all possible measures, nor distinguishes the type of information provided, a different categorization will be considered. Three categories are proposed:</p>
				<p>1. <it>Class-based </it>measures, which are calculated based on the information of the motif in relation to positive and negative data.</p>
				<p>2. <it>Information-Theoretic </it>measures, which are based solely on Information-theoretic models like probabilistic or entropy models. In this case the calculation is self-contained, i.e., the necessary information is found in the motif itself.</p>
				<p>3. <it>Hybrid </it>measures use both Information-theoretic and class information.</p>
			</sec>
			<sec>
				<st>
					<p>Class-based Measures</p>
				</st>
				<p>The ideal motif is one that matches all the sequences of the target family and no other sequence outside this family. It is also known as <it>signature </it>motif. In this context, the measures most widely used to express the quality of the motifs are: <it>sensitivity</it>, <it>specificity </it>and <it>positive predicted value </it>(see Table <tblr tid="T1">1</tblr>). Sensitivity (Sn), also called recall, measures the proportion of sequences of the target family correctly matched by the motif. Specificity (Sp) measures the proportion of sequences outside the target family that are not matched by the motif. Positive Predicted value (PPV), also called precision, measures the proportion of sequences that are covered by the motif and that belong to the target family. An ideal motif is one with 100% of Sn and PPV. These three measures yield a positive rank of motifs, i.e., their score is proportional to the rank. For comparison purposes, a negative rank measure <it>false positive rate </it>(Fpr) is also considered. This measure returns the proportion of negative instances that were incorrectly reported as being positive. In this case, the greater the score the worst the quality of the motif. Motifs can be ranked according to one or all of these measures. When a unique value is required to score a motif, a combination of these measures can be used. The <it>F-Measure </it>(F) <abbrgrp><abbr bid="B33">33</abbr></abbrgrp> and the <it>Pearson Correlation </it>(Corr) <abbrgrp><abbr bid="B22">22</abbr><abbr bid="B34">34</abbr></abbrgrp> (also known as Matthews Correlation Coefficient, for its application in secondary structure prediction <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>) are examples of such composed measures. As a last example of a class-based measure we refer to the <it>Discrimination power </it>(Dp) <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>. This measure is particularly useful as a filter, since Dp is proportionally associated to selectiveness. A characteristic of class-based measures is that they do not rely on the motif structure to be calculated. Hence, they can be applied to any type of deterministic motif. Although a myriad of class-based measures can be found, covering different aspects of a pattern quality, we only review those widely used in a biological context. Please refer to Table <tblr tid="T1">1</tblr> and <tblr tid="T2">2</tblr> for details on these measures.</p>
				<tbl id="T1">
					<title>
						<p>Table 1</p>
					</title>
					<caption>
						<p>List of the motif significance measures.</p>
					</caption>
					<tblbdy cols="5">
						<r>
							<c ca="center">
								<p>Symbol</p>
							</c>
							<c ca="center">
								<p>Measure</p>
							</c>
							<c ca="center">
								<p>Formula</p>
							</c>
							<c ca="center">
								<p>Range</p>
							</c>
							<c ca="center">
								<p>Type</p>
							</c>
						</r>
						<r>
							<c cspan="5">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>Sn</p>
							</c>
							<c ca="center">
								<p>Sensitivity</p>
							</c>
							<c ca="center">
								<p>
									<inline-formula>
										<m:math name="1748-7188-2-16-i2" xmlns:m="http://www.w3.org/1998/Math/MathML">
											<m:semantics>
												<m:mrow>
													<m:mi>S</m:mi>
													<m:mi>n</m:mi>
													<m:mo stretchy="false">(</m:mo>
													<m:mi>M</m:mi>
													<m:mo stretchy="false">)</m:mo>
													<m:mo>=</m:mo>
													<m:mfrac>
														<m:mrow>
															<m:msub>
																<m:mi>T</m:mi>
																<m:mi>P</m:mi>
															</m:msub>
														</m:mrow>
														<m:mrow>
															<m:msub>
																<m:mi>T</m:mi>
																<m:mi>P</m:mi>
															</m:msub>
															<m:mo>+</m:mo>
															<m:msub>
																<m:mi>F</m:mi>
																<m:mi>N</m:mi>
															</m:msub>
														</m:mrow>
													</m:mfrac>
												</m:mrow>
												<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaem4uamLaemOBa4MaeiikaGIaemyta0KaeiykaKIaeyypa0tcfa4aaSaaaeaacqWGubavdaWgaaqaaiabdcfaqbqabaaabaGaemivaq1aaSbaaeaacqWGqbauaeqaaiabgUcaRiabdAeagnaaBaaabaGaemOta4eabeaaaaaaaa@3B13@</m:annotation>
											</m:semantics>
										</m:math>
									</inline-formula>
								</p>
							</c>
							<c ca="center">
								<p>[0,1]</p>
							</c>
							<c ca="center">
								<p>C</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>Sp</p>
							</c>
							<c ca="center">
								<p>Specificity</p>
							</c>
							<c ca="center">
								<p>
									<inline-formula>
										<m:math name="1748-7188-2-16-i3" xmlns:m="http://www.w3.org/1998/Math/MathML">
											<m:semantics>
												<m:mrow>
													<m:mi>S</m:mi>
													<m:mi>p</m:mi>
													<m:mo stretchy="false">(</m:mo>
													<m:mi>M</m:mi>
													<m:mo stretchy="false">)</m:mo>
													<m:mo>=</m:mo>
													<m:mfrac>
														<m:mrow>
															<m:msub>
																<m:mi>T</m:mi>
																<m:mi>N</m:mi>
															</m:msub>
														</m:mrow>
														<m:mrow>
															<m:msub>
																<m:mi>T</m:mi>
																<m:mi>N</m:mi>
															</m:msub>
															<m:mo>+</m:mo>
															<m:msub>
																<m:mi>F</m:mi>
																<m:mi>P</m:mi>
															</m:msub>
														</m:mrow>
													</m:mfrac>
												</m:mrow>
												<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaem4uamLaemiCaaNaeiikaGIaemyta0KaeiykaKIaeyypa0tcfa4aaSaaaeaacqWGubavdaWgaaqaaiabd6eaobqabaaabaGaemivaq1aaSbaaeaacqWGobGtaeqaaiabgUcaRiabdAeagnaaBaaabaGaemiuaafabeaaaaaaaa@3B13@</m:annotation>
											</m:semantics>
										</m:math>
									</inline-formula>
								</p>
							</c>
							<c ca="center">
								<p>[0,1]</p>
							</c>
							<c ca="center">
								<p>C</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>PPV</p>
							</c>
							<c ca="center">
								<p>Positive Predicted Value</p>
							</c>
							<c ca="center">
								<p>
									<inline-formula>
										<m:math name="1748-7188-2-16-i4" xmlns:m="http://www.w3.org/1998/Math/MathML">
											<m:semantics>
												<m:mrow>
													<m:mi>P</m:mi>
													<m:mi>P</m:mi>
													<m:mi>V</m:mi>
													<m:mo stretchy="false">(</m:mo>
													<m:mi>M</m:mi>
													<m:mo stretchy="false">)</m:mo>
													<m:mo>=</m:mo>
													<m:mfrac>
														<m:mrow>
															<m:msub>
																<m:mi>T</m:mi>
																<m:mi>P</m:mi>
															</m:msub>
														</m:mrow>
														<m:mrow>
															<m:msub>
																<m:mi>T</m:mi>
																<m:mi>P</m:mi>
															</m:msub>
															<m:mo>+</m:mo>
															<m:msub>
																<m:mi>F</m:mi>
																<m:mi>P</m:mi>
															</m:msub>
														</m:mrow>
													</m:mfrac>
												</m:mrow>
												<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaemiuaaLaemiuaaLaemOvayLaeiikaGIaemyta0KaeiykaKIaeyypa0tcfa4aaSaaaeaacqWGubavdaWgaaqaaiabdcfaqbqabaaabaGaemivaq1aaSbaaeaacqWGqbauaeqaaiabgUcaRiabdAeagnaaBaaabaGaemiuaafabeaaaaaaaa@3C0A@</m:annotation>
											</m:semantics>
										</m:math>
									</inline-formula>
								</p>
							</c>
							<c ca="center">
								<p>[0,1]</p>
							</c>
							<c ca="center">
								<p>C</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>Fpr</p>
							</c>
							<c ca="center">
								<p>False Positive Rate</p>
							</c>
							<c ca="center">
								<p>
									<inline-formula>
										<m:math name="1748-7188-2-16-i5" xmlns:m="http://www.w3.org/1998/Math/MathML">
											<m:semantics>
												<m:mrow>
													<m:mi>F</m:mi>
													<m:mi>p</m:mi>
													<m:mi>r</m:mi>
													<m:mo stretchy="false">(</m:mo>
													<m:mi>M</m:mi>
													<m:mo stretchy="false">)</m:mo>
													<m:mo>=</m:mo>
													<m:mfrac>
														<m:mrow>
															<m:msub>
																<m:mi>F</m:mi>
																<m:mi>P</m:mi>
															</m:msub>
														</m:mrow>
														<m:mrow>
															<m:msub>
																<m:mi>F</m:mi>
																<m:mi>P</m:mi>
															</m:msub>
															<m:mo>+</m:mo>
															<m:msub>
																<m:mi>T</m:mi>
																<m:mi>N</m:mi>
															</m:msub>
														</m:mrow>
													</m:mfrac>
												</m:mrow>
												<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaemOrayKaemiCaaNaemOCaiNaeiikaGIaemyta0KaeiykaKIaeyypa0tcfa4aaSaaaeaacqWGgbGrdaWgaaqaaiabdcfaqbqabaaabaGaemOray0aaSbaaeaacqWGqbauaeqaaiabgUcaRiabdsfaunaaBaaabaGaemOta4eabeaaaaaaaa@3C4E@</m:annotation>
											</m:semantics>
										</m:math>
									</inline-formula>
								</p>
							</c>
							<c ca="center">
								<p>[-1,1]</p>
							</c>
							<c ca="center">
								<p>C</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>F</p>
							</c>
							<c ca="center">
								<p>F-Measure</p>
							</c>
							<c ca="center">
								<p>
									<inline-formula>
										<m:math name="1748-7188-2-16-i6" xmlns:m="http://www.w3.org/1998/Math/MathML">
											<m:semantics>
												<m:mrow>
													<m:mi>F</m:mi>
													<m:mo stretchy="false">(</m:mo>
													<m:mi>M</m:mi>
													<m:mo stretchy="false">)</m:mo>
													<m:mo>=</m:mo>
													<m:mfrac>
														<m:mrow>
															<m:mn>2</m:mn>
															<m:mo>&#215;</m:mo>
															<m:mi>S</m:mi>
															<m:mi>e</m:mi>
															<m:mi>n</m:mi>
															<m:mi>s</m:mi>
															<m:mi>i</m:mi>
															<m:mi>t</m:mi>
															<m:mi>i</m:mi>
															<m:mi>v</m:mi>
															<m:mi>i</m:mi>
															<m:mi>t</m:mi>
															<m:mi>y</m:mi>
															<m:mo>&#215;</m:mo>
															<m:mi>P</m:mi>
															<m:mi>P</m:mi>
															<m:mi>V</m:mi>
														</m:mrow>
														<m:mrow>
															<m:mi>S</m:mi>
															<m:mi>e</m:mi>
															<m:mi>n</m:mi>
															<m:mi>s</m:mi>
															<m:mi>i</m:mi>
															<m:mi>t</m:mi>
															<m:mi>i</m:mi>
															<m:mi>v</m:mi>
															<m:mi>i</m:mi>
															<m:mi>t</m:mi>
															<m:mi>y</m:mi>
															<m:mo>+</m:mo>
															<m:mi>P</m:mi>
															<m:mi>P</m:mi>
															<m:mi>V</m:mi>
														</m:mrow>
													</m:mfrac>
													<m:mo>=</m:mo>
													<m:mfrac>
														<m:mrow>
															<m:mn>2</m:mn>
															<m:mo>&#215;</m:mo>
															<m:msub>
																<m:mi>T</m:mi>
																<m:mi>P</m:mi>
															</m:msub>
														</m:mrow>
														<m:mrow>
															<m:mn>2</m:mn>
															<m:mo>&#215;</m:mo>
															<m:msub>
																<m:mi>T</m:mi>
																<m:mi>P</m:mi>
															</m:msub>
															<m:mo>+</m:mo>
															<m:msub>
																<m:mi>F</m:mi>
																<m:mi>N</m:mi>
															</m:msub>
															<m:mo>+</m:mo>
															<m:msub>
																<m:mi>F</m:mi>
																<m:mi>P</m:mi>
															</m:msub>
														</m:mrow>
													</m:mfrac>
												</m:mrow>
												<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaemOrayKaeiikaGIaemyta0KaeiykaKIaeyypa0tcfa4aaSaaaeaacqaIYaGmcqGHxdaTcqWGtbWucqWGLbqzcqWGUbGBcqWGZbWCcqWGPbqAcqWG0baDcqWGPbqAcqWG2bGDcqWGPbqAcqWG0baDcqWG5bqEcqGHxdaTcqWGqbaucqWGqbaucqWGwbGvaeaacqWGtbWucqWGLbqzcqWGUbGBcqWGZbWCcqWGPbqAcqWG0baDcqWGPbqAcqWG2bGDcqWGPbqAcqWG0baDcqWG5bqEcqGHRaWkcqWGqbaucqWGqbaucqWGwbGvaaGccqGH9aqpjuaGdaWcaaqaaiabikdaYiabgEna0kabdsfaunaaBaaabaGaemiuaafabeaaaeaacqaIYaGmcqGHxdaTcqWGubavdaWgaaqaaiabdcfaqbqabaGaey4kaSIaemOray0aaSbaaeaacqWGobGtaeqaaiabgUcaRiabdAeagnaaBaaabaGaemiuaafabeaaaaaaaa@7017@</m:annotation>
											</m:semantics>
										</m:math>
									</inline-formula>
								</p>
							</c>
							<c ca="center">
								<p>[0,1]</p>
							</c>
							<c ca="center">
								<p>1</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>Corr</p>
							</c>
							<c ca="center">
								<p>Correlation</p>
							</c>
							<c ca="center">
								<p>
									<inline-formula>
										<m:math name="1748-7188-2-16-i7" xmlns:m="http://www.w3.org/1998/Math/MathML">
											<m:semantics>
												<m:mrow>
													<m:mi>C</m:mi>
													<m:mo stretchy="false">(</m:mo>
													<m:mi>M</m:mi>
													<m:mo stretchy="false">)</m:mo>
													<m:mo>=</m:mo>
													<m:mfrac>
														<m:mrow>
															<m:msub>
																<m:mi>T</m:mi>
																<m:mi>P</m:mi>
															</m:msub>
															<m:mo>&#215;</m:mo>
															<m:msub>
																<m:mi>T</m:mi>
																<m:mi>N</m:mi>
															</m:msub>
															<m:mo>&#8722;</m:mo>
															<m:msub>
																<m:mi>F</m:mi>
																<m:mi>P</m:mi>
															</m:msub>
															<m:mo>&#215;</m:mo>
															<m:msub>
																<m:mi>F</m:mi>
																<m:mi>N</m:mi>
															</m:msub>
														</m:mrow>
														<m:mrow>
															<m:msqrt>
																<m:mrow>
																	<m:mo stretchy="false">(</m:mo>
																	<m:msub>
																		<m:mi>T</m:mi>
																		<m:mi>P</m:mi>
																	</m:msub>
																	<m:mo>+</m:mo>
																	<m:msub>
																		<m:mi>F</m:mi>
																		<m:mi>N</m:mi>
																	</m:msub>
																	<m:mo stretchy="false">)</m:mo>
																	<m:mo stretchy="false">(</m:mo>
																	<m:msub>
																		<m:mi>T</m:mi>
																		<m:mi>P</m:mi>
																	</m:msub>
																	<m:mo>+</m:mo>
																	<m:msub>
																		<m:mi>F</m:mi>
																		<m:mi>P</m:mi>
																	</m:msub>
																	<m:mo stretchy="false">)</m:mo>
																	<m:mo stretchy="false">(</m:mo>
																	<m:msub>
																		<m:mi>T</m:mi>
																		<m:mi>N</m:mi>
																	</m:msub>
																	<m:mo>+</m:mo>
																	<m:msub>
																		<m:mi>F</m:mi>
																		<m:mi>P</m:mi>
																	</m:msub>
																	<m:mo stretchy="false">)</m:mo>
																	<m:mo stretchy="false">(</m:mo>
																	<m:msub>
																		<m:mi>T</m:mi>
																		<m:mi>N</m:mi>
																	</m:msub>
																	<m:mo>+</m:mo>
																	<m:msub>
																		<m:mi>F</m:mi>
																		<m:mi>N</m:mi>
																	</m:msub>
																	<m:mo stretchy="false">)</m:mo>
																</m:mrow>
															</m:msqrt>
														</m:mrow>
													</m:mfrac>
												</m:mrow>
												<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaem4qamKaeiikaGIaemyta0KaeiykaKIaeyypa0tcfa4aaSaaaeaacqWGubavdaWgaaqaaiabdcfaqbqabaGaey41aqRaemivaq1aaSbaaeaacqWGobGtaeqaaiabgkHiTiabdAeagnaaBaaabaGaemiuaafabeaacqGHxdaTcqWGgbGrdaWgaaqaaiabd6eaobqabaaabaWaaOaaaeaacqGGOaakcqWGubavdaWgaaqaaiabdcfaqbqabaGaey4kaSIaemOray0aaSbaaeaacqWGobGtaeqaaiabcMcaPiabcIcaOiabdsfaunaaBaaabaGaemiuaafabeaacqGHRaWkcqWGgbGrdaWgaaqaaiabdcfaqbqabaGaeiykaKIaeiikaGIaemivaq1aaSbaaeaacqWGobGtaeqaaiabgUcaRiabdAeagnaaBaaabaGaemiuaafabeaacqGGPaqkcqGGOaakcqWGubavdaWgaaqaaiabd6eaobqabaGaey4kaSIaemOray0aaSbaaeaacqWGobGtaeqaaiabcMcaPaqabaaaaaaa@5DDA@</m:annotation>
											</m:semantics>
										</m:math>
									</inline-formula>
								</p>
							</c>
							<c ca="center">
								<p>[-1,1]</p>
							</c>
							<c ca="center">
								<p>C</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>Dp</p>
							</c>
							<c ca="center">
								<p>Discrimination Power</p>
							</c>
							<c ca="center">
								<p>
									<inline-formula>
										<m:math name="1748-7188-2-16-i8" xmlns:m="http://www.w3.org/1998/Math/MathML">
											<m:semantics>
												<m:mrow>
													<m:mi>D</m:mi>
													<m:mi>p</m:mi>
													<m:mo stretchy="false">(</m:mo>
													<m:mi>M</m:mi>
													<m:mo stretchy="false">)</m:mo>
													<m:mo>=</m:mo>
													<m:mfrac>
														<m:mrow>
															<m:msub>
																<m:mi>T</m:mi>
																<m:mi>P</m:mi>
															</m:msub>
														</m:mrow>
														<m:mrow>
															<m:mrow>
																<m:mo>|</m:mo>
																<m:mi>C</m:mi>
																<m:mo>|</m:mo>
															</m:mrow>
														</m:mrow>
													</m:mfrac>
													<m:mo>&#8722;</m:mo>
													<m:mfrac>
														<m:mrow>
															<m:msub>
																<m:mi>F</m:mi>
																<m:mi>P</m:mi>
															</m:msub>
														</m:mrow>
														<m:mrow>
															<m:mrow>
																<m:mo>|</m:mo>
																<m:mover accent="true">
																	<m:mi>C</m:mi>
																	<m:mo>&#175;</m:mo>
																</m:mover>
																<m:mo>|</m:mo>
															</m:mrow>
														</m:mrow>
													</m:mfrac>
												</m:mrow>
												<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaemiraqKaemiCaaNaeiikaGIaemyta0KaeiykaKIaeyypa0tcfa4aaSaaaeaacqWGubavdaWgaaqaaiabdcfaqbqabaaabaWaaqWaaeaacqWGdbWqaiaawEa7caGLiWoaaaGccqGHsisljuaGdaWcaaqaaiabdAeagnaaBaaabaGaemiuaafabeaaaeaadaabdaqaaiqbdoeadzaaraaacaGLhWUaayjcSdaaaaaa@41AF@</m:annotation>
											</m:semantics>
										</m:math>
									</inline-formula>
								</p>
							</c>
							<c ca="center">
								<p>[-1,1]</p>
							</c>
							<c ca="center">
								<p>C</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>IG</p>
							</c>
							<c ca="center">
								<p>Information Gain</p>
							</c>
							<c ca="center">
								<p>
									<inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1748-7188-2-16-i26">
											<m:semantics>
												<m:mrow>
													<m:mtable columnalign="left">
														<m:mtr columnalign="left">
															<m:mtd columnalign="left">
																<m:mrow>
																	<m:mi>I</m:mi><m:mi>G</m:mi><m:mrow><m:mo>(</m:mo>
																		<m:mi>M</m:mi>
																		<m:mo>)</m:mo></m:mrow><m:mo>=</m:mo><m:mi>I</m:mi><m:mi>n</m:mi><m:mi>f</m:mi><m:mi>o</m:mi><m:mrow><m:mo>(</m:mo>
																		<m:mi>M</m:mi>
																		<m:mo>)</m:mo></m:mrow><m:mo>&#215;</m:mo><m:mrow><m:mo>[</m:mo>
																		<m:mrow>
																			<m:mi>S</m:mi><m:mi>u</m:mi><m:mi>p</m:mi><m:mi>p</m:mi><m:mi>o</m:mi><m:mi>r</m:mi><m:mi>t</m:mi><m:mrow><m:mo>(</m:mo>
																				<m:mi>M</m:mi>
																				<m:mo>)</m:mo></m:mrow><m:mo>&#8722;</m:mo><m:mn>1</m:mn>
																		</m:mrow>
																		<m:mo>]</m:mo></m:mrow>
																</m:mrow>
															</m:mtd>
														</m:mtr>
														<m:mtr columnalign="left">
															<m:mtd columnalign="left">
																<m:mrow>
																	<m:mtext>	</m:mtext><m:mtext>where&#160;</m:mtext><m:mi>I</m:mi><m:mi>n</m:mi><m:mi>f</m:mi><m:mi>o</m:mi><m:mrow><m:mo>(</m:mo>
																		<m:mi>M</m:mi>
																		<m:mo>)</m:mo></m:mrow><m:mo>=</m:mo><m:mo>&#8722;</m:mo><m:mi>l</m:mi><m:mi>o</m:mi><m:msub>
																		<m:mi>g</m:mi>
																		<m:mrow>
																			<m:mrow><m:mo>|</m:mo>
																				<m:mi>&#931;</m:mi>
																				<m:mo>|</m:mo></m:mrow>
																		</m:mrow>
																	</m:msub>
																	<m:mi>p</m:mi><m:mrow><m:mo>(</m:mo>
																		<m:mi>M</m:mi>
																		<m:mo>)</m:mo></m:mrow>
																</m:mrow>
															</m:mtd>
														</m:mtr>
													</m:mtable>
												</m:mrow>
												<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbiqaaaBafaqaaeGabaaabaGaemysaKKaem4raC0aaeWaaeaacqWGnbqtaiaawIcacaGLPaaacqGH9aqpcqWGjbqscqWGUbGBcqWGMbGzcqWGVbWBdaqadaqaaiabd2eanbGaayjkaiaawMcaaiabgEna0oaadmaabaGaem4uamLaemyDauNaemiCaaNaemiCaaNaem4Ba8MaemOCaiNaemiDaq3aaeWaaeaacqWGnbqtaiaawIcacaGLPaaacqGHsislcqaIXaqmaiaawUfacaGLDbaaaeGabaaPbiaaxMaacqqG3bWDcqqGObaAcqqGLbqzcqqGYbGCcqqGLbqzcqqGGaaicqWGjbqscqWGUbGBcqWGMbGzcqWGVbWBdaqadaqaaiabd2eanbGaayjkaiaawMcaaiabg2da9iabgkHiTiabdYgaSjabd+gaVjabdEgaNnaaBaaaleaadaabdaqaaiabfo6atbGaay5bSlaawIa7aaqabaGccqWGWbaCdaqadaqaaiabd2eanbGaayjkaiaawMcaaaaaaaa@6C7F@</m:annotation>
											</m:semantics>
										</m:math>
									</inline-formula>
								</p>
							</c>
							<c ca="center">
								<p>[0, + &#8734;[</p>
							</c>
							<c ca="center">
								<p>IT</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>Pratt</p>
							</c>
							<c ca="center">
								<p>Pratt Measure</p>
							</c>
							<c ca="center">
								<p>
									<inline-formula>
										<m:math name="1748-7188-2-16-i9" xmlns:m="http://www.w3.org/1998/Math/MathML">
											<m:semantics>
												<m:mrow>
													<m:mtable>
														<m:mtr>
															<m:mtd>
																<m:mrow>
																	<m:mi>P</m:mi>
																	<m:mi>r</m:mi>
																	<m:mi>a</m:mi>
																	<m:mi>t</m:mi>
																	<m:mi>t</m:mi>
																	<m:mo stretchy="false">(</m:mo>
																	<m:mi>M</m:mi>
																	<m:mo stretchy="false">)</m:mo>
																	<m:mo>=</m:mo>
																	<m:mstyle displaystyle="true">
																		<m:msubsup>
																			<m:mo>&#8721;</m:mo>
																			<m:mi>i</m:mi>
																			<m:mi>n</m:mi>
																		</m:msubsup>
																		<m:mrow>
																			<m:msup>
																				<m:mi>I</m:mi>
																				<m:mo>&#8242;</m:mo>
																			</m:msup>
																			<m:mo stretchy="false">(</m:mo>
																			<m:msub>
																				<m:mi>A</m:mi>
																				<m:mi>i</m:mi>
																			</m:msub>
																			<m:mo stretchy="false">)</m:mo>
																			<m:mo>&#8722;</m:mo>
																			<m:mi>c</m:mi>
																		</m:mrow>
																	</m:mstyle>
																	<m:mo>&#8901;</m:mo>
																	<m:mstyle displaystyle="true">
																		<m:msubsup>
																			<m:mo>&#8721;</m:mo>
																			<m:mrow>
																				<m:mi>k</m:mi>
																				<m:mo>=</m:mo>
																				<m:mn>1</m:mn>
																			</m:mrow>
																			<m:mrow>
																				<m:mi>n</m:mi>
																				<m:mo>&#8722;</m:mo>
																				<m:mn>1</m:mn>
																			</m:mrow>
																		</m:msubsup>
																		<m:mrow>
																			<m:mo stretchy="false">(</m:mo>
																			<m:msub>
																				<m:mi>q</m:mi>
																				<m:mi>k</m:mi>
																			</m:msub>
																			<m:mo>&#8722;</m:mo>
																			<m:msub>
																				<m:mi>p</m:mi>
																				<m:mi>k</m:mi>
																			</m:msub>
																			<m:mo stretchy="false">)</m:mo>
																		</m:mrow>
																	</m:mstyle>
																</m:mrow>
															</m:mtd>
														</m:mtr>
														<m:mtr>
															<m:mtd>
																<m:mrow>
																	<m:mtext>where&#160;</m:mtext>
																	<m:msup>
																		<m:mi>I</m:mi>
																		<m:mo>&#8242;</m:mo>
																	</m:msup>
																	<m:mo stretchy="false">(</m:mo>
																	<m:msub>
																		<m:mi>A</m:mi>
																		<m:mi>i</m:mi>
																	</m:msub>
																	<m:mo stretchy="false">)</m:mo>
																	<m:mo>=</m:mo>
																	<m:mo>&#8722;</m:mo>
																	<m:mstyle displaystyle="true">
																		<m:msub>
																			<m:mo>&#8721;</m:mo>
																			<m:mrow>
																				<m:msub>
																					<m:mi>a</m:mi>
																					<m:mi>i</m:mi>
																				</m:msub>
																				<m:mo>&#8712;</m:mo>
																				<m:msub>
																					<m:mi>A</m:mi>
																					<m:mi>i</m:mi>
																				</m:msub>
																			</m:mrow>
																		</m:msub>
																		<m:mrow>
																			<m:mo stretchy="false">(</m:mo>
																			<m:mi>P</m:mi>
																			<m:mo stretchy="false">(</m:mo>
																			<m:msub>
																				<m:mi>a</m:mi>
																				<m:mi>i</m:mi>
																			</m:msub>
																			<m:mo stretchy="false">)</m:mo>
																			<m:mo>&#215;</m:mo>
																			<m:mi>l</m:mi>
																			<m:mi>o</m:mi>
																			<m:mi>g</m:mi>
																			<m:mo stretchy="false">(</m:mo>
																			<m:mi>P</m:mi>
																			<m:mo stretchy="false">(</m:mo>
																			<m:msub>
																				<m:mi>a</m:mi>
																				<m:mi>i</m:mi>
																			</m:msub>
																			<m:mo stretchy="false">)</m:mo>
																			<m:mo stretchy="false">)</m:mo>
																			<m:mo stretchy="false">)</m:mo>
																			<m:mo>+</m:mo>
																			<m:mstyle displaystyle="true">
																				<m:msub>
																					<m:mo>&#8721;</m:mo>
																					<m:mrow>
																						<m:msub>
																							<m:mi>a</m:mi>
																							<m:mi>i</m:mi>
																						</m:msub>
																						<m:mo>&#8712;</m:mo>
																						<m:msub>
																							<m:mi>A</m:mi>
																							<m:mi>i</m:mi>
																						</m:msub>
																					</m:mrow>
																				</m:msub>
																				<m:mrow>
																					<m:mo stretchy="false">(</m:mo>
																					<m:mfrac>
																						<m:mrow>
																							<m:mi>P</m:mi>
																							<m:mo stretchy="false">(</m:mo>
																							<m:msub>
																								<m:mi>a</m:mi>
																								<m:mi>i</m:mi>
																							</m:msub>
																							<m:mo stretchy="false">)</m:mo>
																						</m:mrow>
																						<m:mrow>
																							<m:mi>P</m:mi>
																							<m:mo stretchy="false">(</m:mo>
																							<m:msub>
																								<m:mi>A</m:mi>
																								<m:mi>i</m:mi>
																							</m:msub>
																							<m:mo stretchy="false">)</m:mo>
																						</m:mrow>
																					</m:mfrac>
																					<m:mo>&#215;</m:mo>
																					<m:mi>l</m:mi>
																					<m:mi>o</m:mi>
																					<m:mi>g</m:mi>
																					<m:mo stretchy="false">(</m:mo>
																					<m:mfrac>
																						<m:mrow>
																							<m:mi>P</m:mi>
																							<m:mo stretchy="false">(</m:mo>
																							<m:msub>
																								<m:mi>a</m:mi>
																								<m:mi>i</m:mi>
																							</m:msub>
																							<m:mo stretchy="false">)</m:mo>
																						</m:mrow>
																						<m:mrow>
																							<m:mi>P</m:mi>
																							<m:mo stretchy="false">(</m:mo>
																							<m:msub>
																								<m:mi>A</m:mi>
																								<m:mi>i</m:mi>
																							</m:msub>
																							<m:mo stretchy="false">)</m:mo>
																						</m:mrow>
																					</m:mfrac>
																					<m:mo stretchy="false">)</m:mo>
																					<m:mo stretchy="false">)</m:mo>
																				</m:mrow>
																			</m:mstyle>
																		</m:mrow>
																	</m:mstyle>
																</m:mrow>
															</m:mtd>
														</m:mtr>
														<m:mtr>
															<m:mtd>
																<m:mrow>
																	<m:mtext>and&#160;</m:mtext>
																	<m:mi>P</m:mi>
																	<m:mo stretchy="false">(</m:mo>
																	<m:msub>
																		<m:mi>A</m:mi>
																		<m:mi>i</m:mi>
																	</m:msub>
																	<m:mo stretchy="false">)</m:mo>
																	<m:mo>=</m:mo>
																	<m:mstyle displaystyle="true">
																		<m:msub>
																			<m:mo>&#8721;</m:mo>
																			<m:mrow>
																				<m:msub>
																					<m:mi>a</m:mi>
																					<m:mi>i</m:mi>
																				</m:msub>
																				<m:mo>&#8712;</m:mo>
																				<m:msub>
																					<m:mi>A</m:mi>
																					<m:mi>i</m:mi>
																				</m:msub>
																			</m:mrow>
																		</m:msub>
																		<m:mrow>
																			<m:mi>p</m:mi>
																			<m:mo stretchy="false">(</m:mo>
																			<m:msub>
																				<m:mi>a</m:mi>
																				<m:mi>i</m:mi>
																			</m:msub>
																			<m:mo stretchy="false">)</m:mo>
																		</m:mrow>
																	</m:mstyle>
																</m:mrow>
															</m:mtd>
														</m:mtr>
													</m:mtable>
												</m:mrow>
												<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaqbaeqabmqaaaqaaGqaciab=bfaqjab=jhaYjabdggaHjabdsha0jabdsha0jabcIcaOiabd2eanjabcMcaPiabg2da9maaqadabaGafmysaKKbauaacqGGOaakcqWGbbqqdaWgaaWcbaGaemyAaKgabeaakiabcMcaPiabgkHiTiabdogaJbWcbaGaemyAaKgabaGaemOBa4ganiabggHiLdGccqGHflY1daaeWaqaaiabcIcaOiabdghaXnaaBaaaleaacqWGRbWAaeqaaOGaeyOeI0IaemiCaa3aaSbaaSqaaiabdUgaRbqabaGccqGGPaqkaSqaaiabdUgaRjabg2da9iabigdaXaqaaiabd6gaUjabgkHiTiabigdaXaqdcqGHris5aaGcbaGaee4DaCNaeeiAaGMaeeyzauMaeeOCaiNaeeyzauMaeeiiaaIafmysaKKbauaacqGGOaakcqWGbbqqdaWgaaWcbaGaemyAaKgabeaakiabcMcaPiabg2da9iabgkHiTmaaqababaGaeiikaGIaemiuaaLaeiikaGIaemyyae2aaSbaaSqaaiabdMgaPbqabaGccqGGPaqkcqGHxdaTcqWFSbaBcqWFVbWBcqWFNbWzcqGGOaakcqWGqbaucqGGOaakcqWGHbqydaWgaaWcbaGaemyAaKgabeaakiabcMcaPiabcMcaPiabcMcaPiabgUcaRmaaqababaGaeiikaGYaaSaaaeaacqWGqbaucqGGOaakcqWGHbqydaWgaaWcbaGaemyAaKgabeaakiabcMcaPaqaaiabdcfaqjabcIcaOiabdgeabnaaBaaaleaacqWGPbqAaeqaaOGaeiykaKcaaiabgEna0kab=XgaSjab=9gaVjab=DgaNjabcIcaOmaalaaabaGaemiuaaLaeiikaGIaemyyae2aaSbaaSqaaiabdMgaPbqabaGccqGGPaqkaeaacqWGqbaucqGGOaakcqWGbbqqdaWgaaWcbaGaemyAaKgabeaakiabcMcaPaaacqGGPaqkcqGGPaqkaSqaaiabdggaHnaaBaaameaacqWGPbqAaeqaaSGaeyicI4Saemyqae0aaSbaaWqaaiabdMgaPbqabaaaleqaniabggHiLdaaleaacqWGHbqydaWgaaadbaGaemyAaKgabeaaliabgIGiolabdgeabnaaBaaameaacqWGPbqAaeqaaaWcbeqdcqGHris5aaGcbaGaeeyyaeMaeeOBa4MaeeizaqMaeeiiaaIaemiuaaLaeiikaGIaemyqae0aaSbaaSqaaiabdMgaPbqabaGccqGGPaqkcqGH9aqpdaaeqaqaaiabdchaWjabcIcaOiabdggaHnaaBaaaleaacqWGPbqAaeqaaOGaeiykaKcaleaacqWGHbqydaWgaaadbaGaemyAaKgabeaaliabgIGiolabdgeabnaaBaaameaacqWGPbqAaeqaaaWcbeqdcqGHris5aaaaaaa@C763@</m:annotation>
											</m:semantics>
										</m:math>
									</inline-formula>
								</p>
							</c>
							<c ca="center">
								<p>]- &#8734;, + &#8734;[</p>
							</c>
							<c ca="center">
								<p>IT</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>LogOdd</p>
							</c>
							<c ca="center">
								<p>LogOdd</p>
							</c>
							<c ca="center">
								<p>
									<inline-formula>
										<m:math name="1748-7188-2-16-i10" xmlns:m="http://www.w3.org/1998/Math/MathML">
											<m:semantics>
												<m:mrow>
													<m:mtext>Logodd</m:mtext>
													<m:mo stretchy="false">(</m:mo>
													<m:mtext>M</m:mtext>
													<m:mo stretchy="false">)</m:mo>
													<m:mo>=</m:mo>
													<m:mi>l</m:mi>
													<m:mi>o</m:mi>
													<m:mi>g</m:mi>
													<m:mo stretchy="false">(</m:mo>
													<m:mfrac>
														<m:mrow>
															<m:mfrac>
																<m:mrow>
																	<m:mi>S</m:mi>
																	<m:mi>u</m:mi>
																	<m:mi>p</m:mi>
																	<m:mi>p</m:mi>
																	<m:mi>o</m:mi>
																	<m:mi>r</m:mi>
																	<m:mi>t</m:mi>
																	<m:mo stretchy="false">(</m:mo>
																	<m:mi>M</m:mi>
																	<m:mo stretchy="false">)</m:mo>
																</m:mrow>
																<m:mrow>
																	<m:mi>N</m:mi>
																	<m:mi>u</m:mi>
																	<m:mi>m</m:mi>
																	<m:mi>S</m:mi>
																	<m:mi>e</m:mi>
																	<m:mi>q</m:mi>
																	<m:mi>s</m:mi>
																</m:mrow>
															</m:mfrac>
														</m:mrow>
														<m:mrow>
															<m:mi>P</m:mi>
															<m:mo stretchy="false">(</m:mo>
															<m:mi>M</m:mi>
															<m:mo stretchy="false">)</m:mo>
														</m:mrow>
													</m:mfrac>
													<m:mo stretchy="false">)</m:mo>
												</m:mrow>
												<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaeeitaWKaee4Ba8Maee4zaCMaee4Ba8MaeeizaqMaeeizaqMaeiikaGIaeeyta0KaeiykaKIaeyypa0JaemiBaWMaem4Ba8Maem4zaCMaeiikaGscfa4aaSaaaeaadaWcaaqaaiabdofatjabdwha1jabdchaWjabdchaWjabd+gaVjabdkhaYjabdsha0jabcIcaOiabd2eanjabcMcaPaqaaiabd6eaojabdwha1jabd2gaTjabdofatjabdwgaLjabdghaXjabdohaZbaaaeaacqWGqbaucqGGOaakcqWGnbqtcqGGPaqkaaGccqGGPaqkaaa@57F4@</m:annotation>
											</m:semantics>
										</m:math>
									</inline-formula>
								</p>
							</c>
							<c ca="center">
								<p>]- &#8734;, + &#8734;[</p>
							</c>
							<c ca="center">
								<p>IT</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>ZScore</p>
							</c>
							<c ca="center">
								<p>Z-Score</p>
							</c>
							<c ca="center">
								<p>
									<inline-formula>
										<m:math name="1748-7188-2-16-i11" xmlns:m="http://www.w3.org/1998/Math/MathML">
											<m:semantics>
												<m:mrow>
													<m:mtable>
														<m:mtr>
															<m:mtd>
																<m:mrow>
																	<m:mi>Z</m:mi>
																	<m:mi>s</m:mi>
																	<m:mi>c</m:mi>
																	<m:mi>o</m:mi>
																	<m:mi>r</m:mi>
																	<m:mi>e</m:mi>
																	<m:mo stretchy="false">(</m:mo>
																	<m:mi>M</m:mi>
																	<m:mo stretchy="false">)</m:mo>
																	<m:mo>=</m:mo>
																	<m:mfrac>
																		<m:mrow>
																			<m:mi>S</m:mi>
																			<m:mi>u</m:mi>
																			<m:mi>p</m:mi>
																			<m:mi>p</m:mi>
																			<m:mi>o</m:mi>
																			<m:mi>r</m:mi>
																			<m:mi>t</m:mi>
																			<m:mo stretchy="false">(</m:mo>
																			<m:mi>M</m:mi>
																			<m:mo stretchy="false">)</m:mo>
																			<m:mo>&#8722;</m:mo>
																			<m:mi>E</m:mi>
																			<m:mo stretchy="false">(</m:mo>
																			<m:mi>M</m:mi>
																			<m:mo stretchy="false">)</m:mo>
																		</m:mrow>
																		<m:mrow>
																			<m:mi>N</m:mi>
																			<m:mo stretchy="false">(</m:mo>
																			<m:mi>M</m:mi>
																			<m:mo stretchy="false">)</m:mo>
																		</m:mrow>
																	</m:mfrac>
																</m:mrow>
															</m:mtd>
														</m:mtr>
														<m:mtr>
															<m:mtd>
																<m:mrow>
																	<m:mtext>where&#160;</m:mtext>
																	<m:mi>E</m:mi>
																	<m:mo stretchy="false">(</m:mo>
																	<m:mi>M</m:mi>
																	<m:mo stretchy="false">)</m:mo>
																	<m:mo>=</m:mo>
																	<m:msub>
																		<m:mi>N</m:mi>
																		<m:mrow>
																			<m:mi>r</m:mi>
																			<m:mi>e</m:mi>
																			<m:mi>s</m:mi>
																			<m:mi>i</m:mi>
																			<m:mi>d</m:mi>
																		</m:mrow>
																	</m:msub>
																	<m:mo>&#215;</m:mo>
																	<m:mi>P</m:mi>
																	<m:mo stretchy="false">(</m:mo>
																	<m:mi>M</m:mi>
																	<m:mo stretchy="false">)</m:mo>
																	<m:mtext>&#160;and&#160;</m:mtext>
																	<m:mi>N</m:mi>
																	<m:mo stretchy="false">(</m:mo>
																	<m:mi>M</m:mi>
																	<m:mo stretchy="false">)</m:mo>
																	<m:mo>=</m:mo>
																	<m:msqrt>
																		<m:mrow>
																			<m:msub>
																				<m:mi>N</m:mi>
																				<m:mrow>
																					<m:mi>r</m:mi>
																					<m:mi>e</m:mi>
																					<m:mi>s</m:mi>
																					<m:mi>i</m:mi>
																					<m:mi>d</m:mi>
																				</m:mrow>
																			</m:msub>
																			<m:mo>&#215;</m:mo>
																			<m:mi>P</m:mi>
																			<m:mo stretchy="false">(</m:mo>
																			<m:mi>M</m:mi>
																			<m:mo stretchy="false">)</m:mo>
																			<m:mo>&#215;</m:mo>
																			<m:mo stretchy="false">(</m:mo>
																			<m:mn>1</m:mn>
																			<m:mo>&#8722;</m:mo>
																			<m:mi>P</m:mi>
																			<m:mo stretchy="false">(</m:mo>
																			<m:mi>M</m:mi>
																			<m:mo stretchy="false">)</m:mo>
																			<m:mo stretchy="false">)</m:mo>
																		</m:mrow>
																	</m:msqrt>
																</m:mrow>
															</m:mtd>
														</m:mtr>
													</m:mtable>
												</m:mrow>
												<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaqbaeqabiqaaaqaaiabdQfaAjabdohaZjabdogaJjabd+gaVjabdkhaYjabdwgaLjabcIcaOiabd2eanjabcMcaPiabg2da9KqbaoaalaaabaGaem4uamLaemyDauNaemiCaaNaemiCaaNaem4Ba8MaemOCaiNaemiDaqNaeiikaGIaemyta0KaeiykaKIaeyOeI0IaemyrauKaeiikaGIaemyta0KaeiykaKcabaGaemOta4KaeiikaGIaemyta0KaeiykaKcaaaGcbaGaee4DaCNaeeiAaGMaeeyzauMaeeOCaiNaeeyzauMaeeiiaaIaemyrauKaeiikaGIaemyta0KaeiykaKIaeyypa0JaemOta40aaSbaaSqaaiabdkhaYjabdwgaLjabdohaZjabdMgaPjabdsgaKbqabaGccqGHxdaTcqWGqbaucqGGOaakcqWGnbqtcqGGPaqkcqqGGaaicqqGHbqycqqGUbGBcqqGKbazcqqGGaaicqWGobGtcqGGOaakcqWGnbqtcqGGPaqkcqGH9aqpdaGcaaqaaiabd6eaonaaBaaaleaacqWGYbGCcqWGLbqzcqWGZbWCcqWGPbqAcqWGKbazaeqaaOGaey41aqRaemiuaaLaeiikaGIaemyta0KaeiykaKIaey41aqRaeiikaGIaeGymaeJaeyOeI0IaemiuaaLaeiikaGIaemyta0KaeiykaKIaeiykaKcaleqaaaaaaaa@8B4E@</m:annotation>
											</m:semantics>
										</m:math>
									</inline-formula>
								</p>
							</c>
							<c ca="center">
								<p>]- &#8734;, + &#8734;[</p>
							</c>
							<c ca="center">
								<p>IT</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>J</p>
							</c>
							<c ca="center">
								<p>J-Measure</p>
							</c>
							<c ca="center">
								<p>
									<inline-formula>
										<m:math name="1748-7188-2-16-i12" xmlns:m="http://www.w3.org/1998/Math/MathML">
											<m:semantics>
												<m:mrow>
													<m:mtable>
														<m:mtr>
															<m:mtd>
																<m:mrow>
																	<m:mi>J</m:mi>
																	<m:mo stretchy="false">(</m:mo>
																	<m:mi>C</m:mi>
																	<m:mo>;</m:mo>
																	<m:mi>M</m:mi>
																	<m:mo stretchy="false">)</m:mo>
																	<m:mo>=</m:mo>
																	<m:mi>P</m:mi>
																	<m:mo stretchy="false">(</m:mo>
																	<m:mi>M</m:mi>
																	<m:mo stretchy="false">)</m:mo>
																	<m:mo>&#215;</m:mo>
																	<m:mi>j</m:mi>
																	<m:mo stretchy="false">(</m:mo>
																	<m:mi>C</m:mi>
																	<m:mo>;</m:mo>
																	<m:mi>M</m:mi>
																	<m:mo stretchy="false">)</m:mo>
																</m:mrow>
															</m:mtd>
														</m:mtr>
														<m:mtr>
															<m:mtd>
																<m:mrow>
																	<m:mtext>where&#160;</m:mtext>
																	<m:mi>j</m:mi>
																	<m:mo stretchy="false">(</m:mo>
																	<m:mi>C</m:mi>
																	<m:mo>;</m:mo>
																	<m:mi>M</m:mi>
																	<m:mo stretchy="false">)</m:mo>
																	<m:mo>=</m:mo>
																	<m:mi>P</m:mi>
																	<m:mo stretchy="false">(</m:mo>
																	<m:mi>C</m:mi>
																	<m:mo>|</m:mo>
																	<m:mi>M</m:mi>
																	<m:mo stretchy="false">)</m:mo>
																	<m:mo>&#215;</m:mo>
																	<m:mi>l</m:mi>
																	<m:mi>o</m:mi>
																	<m:msub>
																		<m:mi>g</m:mi>
																		<m:mn>2</m:mn>
																	</m:msub>
																	<m:mfrac>
																		<m:mrow>
																			<m:mi>P</m:mi>
																			<m:mo stretchy="false">(</m:mo>
																			<m:mi>C</m:mi>
																			<m:mo>|</m:mo>
																			<m:mi>M</m:mi>
																			<m:mo stretchy="false">)</m:mo>
																		</m:mrow>
																		<m:mrow>
																			<m:mi>P</m:mi>
																			<m:mo stretchy="false">(</m:mo>
																			<m:mi>C</m:mi>
																			<m:mo stretchy="false">)</m:mo>
																		</m:mrow>
																	</m:mfrac>
																	<m:mo>+</m:mo>
																	<m:mo stretchy="false">(</m:mo>
																	<m:mn>1</m:mn>
																	<m:mo>&#8722;</m:mo>
																	<m:mi>P</m:mi>
																	<m:mo stretchy="false">(</m:mo>
																	<m:mi>C</m:mi>
																	<m:mo>|</m:mo>
																	<m:mi>M</m:mi>
																	<m:mo stretchy="false">)</m:mo>
																	<m:mo stretchy="false">)</m:mo>
																	<m:mo>&#215;</m:mo>
																	<m:mi>l</m:mi>
																	<m:mi>o</m:mi>
																	<m:msub>
																		<m:mi>g</m:mi>
																		<m:mn>2</m:mn>
																	</m:msub>
																	<m:mfrac>
																		<m:mrow>
																			<m:mo stretchy="false">(</m:mo>
																			<m:mn>1</m:mn>
																			<m:mo>&#8722;</m:mo>
																			<m:mi>P</m:mi>
																			<m:mo stretchy="false">(</m:mo>
																			<m:mi>C</m:mi>
																			<m:mo>|</m:mo>
																			<m:mi>M</m:mi>
																			<m:mo stretchy="false">)</m:mo>
																			<m:mo stretchy="false">)</m:mo>
																		</m:mrow>
																		<m:mrow>
																			<m:mo stretchy="false">(</m:mo>
																			<m:mn>1</m:mn>
																			<m:mo>&#8722;</m:mo>
																			<m:mi>P</m:mi>
																			<m:mo stretchy="false">(</m:mo>
																			<m:mi>C</m:mi>
																			<m:mo stretchy="false">)</m:mo>
																			<m:mo stretchy="false">)</m:mo>
																		</m:mrow>
																	</m:mfrac>
																</m:mrow>
															</m:mtd>
														</m:mtr>
													</m:mtable>
												</m:mrow>
												<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaqbaeqabiqaaaqaaiabdQeakjabcIcaOiabdoeadjabcUda7iabd2eanjabcMcaPiabg2da9iabdcfaqjabcIcaOiabd2eanjabcMcaPiabgEna0kabdQgaQjabcIcaOiabdoeadjabcUda7iabd2eanjabcMcaPaqaaiabbEha3jabbIgaOjabbwgaLjabbkhaYjabbwgaLjabbccaGiabdQgaQjabcIcaOiabdoeadjabcUda7iabd2eanjabcMcaPiabg2da9iabdcfaqjabcIcaOiabdoeadjabcYha8jabd2eanjabcMcaPiabgEna0kabdYgaSjabd+gaVjabdEgaNnaaBaaaleaacqaIYaGmaeqaaKqbaoaalaaabaGaemiuaaLaeiikaGIaem4qamKaeiiFaWNaemyta0KaeiykaKcabaGaemiuaaLaeiikaGIaem4qamKaeiykaKcaaOGaey4kaSIaeiikaGIaeGymaeJaeyOeI0IaemiuaaLaeiikaGIaem4qamKaeiiFaWNaemyta0KaeiykaKIaeiykaKIaey41aqRaemiBaWMaem4Ba8Maem4zaC2aaSbaaSqaaiabikdaYaqabaqcfa4aaSaaaeaacqGGOaakcqaIXaqmcqGHsislcqWGqbaucqGGOaakcqWGdbWqcqGG8baFcqWGnbqtcqGGPaqkcqGGPaqkaeaacqGGOaakcqaIXaqmcqGHsislcqWGqbaucqGGOaakcqWGdbWqcqGGPaqkcqGGPaqkaaaaaaaa@8BA3@</m:annotation>
											</m:semantics>
										</m:math>
									</inline-formula>
								</p>
							</c>
							<c ca="center">
								<p>[0, + &#8734;[</p>
							</c>
							<c ca="center">
								<p>H</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>I</p>
							</c>
							<c ca="center">
								<p>Mutual Information</p>
							</c>
							<c ca="center">
								<p>
									<inline-formula>
										<m:math name="1748-7188-2-16-i13" xmlns:m="http://www.w3.org/1998/Math/MathML">
											<m:semantics>
												<m:mrow>
													<m:mtable>
														<m:mtr>
															<m:mtd>
																<m:mrow>
																	<m:mi>I</m:mi>
																	<m:mo stretchy="false">(</m:mo>
																	<m:mi>Q</m:mi>
																	<m:mo>;</m:mo>
																	<m:mi>M</m:mi>
																	<m:mo stretchy="false">)</m:mo>
																	<m:mo>=</m:mo>
																	<m:mi>H</m:mi>
																	<m:mo stretchy="false">(</m:mo>
																	<m:mi>Q</m:mi>
																	<m:mo stretchy="false">)</m:mo>
																	<m:mo>&#8722;</m:mo>
																	<m:mi>H</m:mi>
																	<m:mo stretchy="false">(</m:mo>
																	<m:mi>Q</m:mi>
																	<m:mo>|</m:mo>
																	<m:mi>M</m:mi>
																	<m:mo stretchy="false">)</m:mo>
																	<m:mtext>&#160;where&#160;</m:mtext>
																	<m:mi>H</m:mi>
																	<m:mo stretchy="false">(</m:mo>
																	<m:mi>Q</m:mi>
																	<m:mo stretchy="false">)</m:mo>
																	<m:mo>=</m:mo>
																	<m:mo>&#8722;</m:mo>
																	<m:mstyle displaystyle="true">
																		<m:msub>
																			<m:mo>&#8721;</m:mo>
																			<m:mrow>
																				<m:mi>q</m:mi>
																				<m:mo>&#8712;</m:mo>
																				<m:mo>{</m:mo>
																				<m:mi>C</m:mi>
																				<m:mo>,</m:mo>
																				<m:mover accent="true">
																					<m:mi>C</m:mi>
																					<m:mo>&#175;</m:mo>
																				</m:mover>
																				<m:mo>}</m:mo>
																			</m:mrow>
																		</m:msub>
																		<m:mrow>
																			<m:mi>P</m:mi>
																			<m:mo stretchy="false">(</m:mo>
																			<m:mi>q</m:mi>
																			<m:mo stretchy="false">)</m:mo>
																		</m:mrow>
																	</m:mstyle>
																	<m:mo>&#215;</m:mo>
																	<m:mi>l</m:mi>
																	<m:mi>o</m:mi>
																	<m:msub>
																		<m:mi>g</m:mi>
																		<m:mn>2</m:mn>
																	</m:msub>
																	<m:mi>P</m:mi>
																	<m:mo stretchy="false">(</m:mo>
																	<m:mi>q</m:mi>
																	<m:mo stretchy="false">)</m:mo>
																</m:mrow>
															</m:mtd>
														</m:mtr>
														<m:mtr>
															<m:mtd>
																<m:mrow>
																	<m:mtext>and&#160;</m:mtext>
																	<m:mi>H</m:mi>
																	<m:mo stretchy="false">(</m:mo>
																	<m:mi>Q</m:mi>
																	<m:mo>|</m:mo>
																	<m:mi>M</m:mi>
																	<m:mo stretchy="false">)</m:mo>
																	<m:mo>=</m:mo>
																	<m:mo>&#8722;</m:mo>
																	<m:mi>P</m:mi>
																	<m:mo stretchy="false">(</m:mo>
																	<m:mi>M</m:mi>
																	<m:mo stretchy="false">)</m:mo>
																	<m:mo>&#215;</m:mo>
																	<m:mstyle displaystyle="true">
																		<m:msub>
																			<m:mo>&#8721;</m:mo>
																			<m:mrow>
																				<m:mi>q</m:mi>
																				<m:mo>&#8712;</m:mo>
																				<m:mo>{</m:mo>
																				<m:mi>C</m:mi>
																				<m:mo>,</m:mo>
																				<m:mover accent="true">
																					<m:mi>C</m:mi>
																					<m:mo>&#175;</m:mo>
																				</m:mover>
																				<m:mo>}</m:mo>
																			</m:mrow>
																		</m:msub>
																		<m:mrow>
																			<m:mi>P</m:mi>
																			<m:mo stretchy="false">(</m:mo>
																			<m:mi>q</m:mi>
																			<m:mo>|</m:mo>
																			<m:mi>M</m:mi>
																			<m:mo stretchy="false">)</m:mo>
																		</m:mrow>
																	</m:mstyle>
																	<m:mo>&#215;</m:mo>
																	<m:mi>l</m:mi>
																	<m:mi>o</m:mi>
																	<m:msub>
																		<m:mi>g</m:mi>
																		<m:mn>2</m:mn>
																	</m:msub>
																	<m:mi>P</m:mi>
																	<m:mo stretchy="false">(</m:mo>
																	<m:mi>q</m:mi>
																	<m:mo>|</m:mo>
																	<m:mi>M</m:mi>
																	<m:mo stretchy="false">)</m:mo>
																</m:mrow>
															</m:mtd>
														</m:mtr>
													</m:mtable>
												</m:mrow>
												<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaqbaeqabiqaaaqaaiabdMeajjabcIcaOiabdgfarjabcUda7iabd2eanjabcMcaPiabg2da9iabdIeaijabcIcaOiabdgfarjabcMcaPiabgkHiTiabdIeaijabcIcaOiabdgfarjabcYha8jabd2eanjabcMcaPiabbccaGiabbEha3jabbIgaOjabbwgaLjabbkhaYjabbwgaLjabbccaGiabdIeaijabcIcaOiabdgfarjabcMcaPiabg2da9iabgkHiTmaaqababaGaemiuaaLaeiikaGIaemyCaeNaeiykaKcaleaacqWGXbqCcqGHiiIZcqGG7bWEcqWGdbWqcqGGSaalcuWGdbWqgaqeaiabc2ha9bqab0GaeyyeIuoakiabgEna0kabdYgaSjabd+gaVjabdEgaNnaaBaaaleaacqaIYaGmaeqaaOGaemiuaaLaeiikaGIaemyCaeNaeiykaKcabaGaeeyyaeMaeeOBa4MaeeizaqMaeeiiaaIaemisaGKaeiikaGIaemyuaeLaeiiFaWNaemyta0KaeiykaKIaeyypa0JaeyOeI0IaemiuaaLaeiikaGIaemyta0KaeiykaKIaey41aq7aaabeaeaacqWGqbaucqGGOaakcqWGXbqCcqGG8baFcqWGnbqtcqGGPaqkaSqaaiabdghaXjabgIGiolabcUha7jabdoeadjabcYcaSiqbdoeadzaaraGaeiyFa0habeqdcqGHris5aOGaey41aqRaemiBaWMaem4Ba8Maem4zaC2aaSbaaSqaaiabikdaYaqabaGccqWGqbaucqGGOaakcqWGXbqCcqGG8baFcqWGnbqtcqGGPaqkaaaaaa@9B2C@</m:annotation>
											</m:semantics>
										</m:math>
									</inline-formula>
								</p>
							</c>
							<c ca="center">
								<p>[0, 1]</p>
							</c>
							<c ca="center">
								<p>H</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>S</p>
							</c>
							<c ca="center">
								<p>Surprise Measure</p>
							</c>
							<c ca="center">
								<p>
									<inline-formula>
										<m:math name="1748-7188-2-16-i14" xmlns:m="http://www.w3.org/1998/Math/MathML">
											<m:semantics>
												<m:mrow>
													<m:mi>S</m:mi>
													<m:mo stretchy="false">(</m:mo>
													<m:mi>M</m:mi>
													<m:mo stretchy="false">)</m:mo>
													<m:mo>=</m:mo>
													<m:mi>I</m:mi>
													<m:mi>n</m:mi>
													<m:mi>f</m:mi>
													<m:mi>o</m:mi>
													<m:mo stretchy="false">(</m:mo>
													<m:mi>M</m:mi>
													<m:mo stretchy="false">)</m:mo>
													<m:mo>&#215;</m:mo>
													<m:mi>P</m:mi>
													<m:mo stretchy="false">(</m:mo>
													<m:mi>C</m:mi>
													<m:mo>|</m:mo>
													<m:mi>M</m:mi>
													<m:mo stretchy="false">)</m:mo>
													<m:mo>=</m:mo>
													<m:mi>I</m:mi>
													<m:mi>n</m:mi>
													<m:mi>f</m:mi>
													<m:mi>o</m:mi>
													<m:mo stretchy="false">(</m:mo>
													<m:mi>M</m:mi>
													<m:mo stretchy="false">)</m:mo>
													<m:mo>&#215;</m:mo>
													<m:mfrac>
														<m:mrow>
															<m:mi>S</m:mi>
															<m:mi>u</m:mi>
															<m:mi>p</m:mi>
															<m:mi>p</m:mi>
															<m:mi>o</m:mi>
															<m:mi>r</m:mi>
															<m:mi>t</m:mi>
															<m:mo stretchy="false">(</m:mo>
															<m:mi>M</m:mi>
															<m:mo>&#8712;</m:mo>
															<m:mi>C</m:mi>
															<m:mo stretchy="false">)</m:mo>
														</m:mrow>
														<m:mrow>
															<m:mi>S</m:mi>
															<m:mi>u</m:mi>
															<m:mi>p</m:mi>
															<m:mi>p</m:mi>
															<m:mi>o</m:mi>
															<m:mi>r</m:mi>
															<m:mi>t</m:mi>
															<m:mo stretchy="false">(</m:mo>
															<m:mi>M</m:mi>
															<m:mo stretchy="false">)</m:mo>
														</m:mrow>
													</m:mfrac>
												</m:mrow>
												<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaem4uamLaeiikaGIaemyta0KaeiykaKIaeyypa0JaemysaKKaemOBa4MaemOzayMaem4Ba8MaeiikaGIaemyta0KaeiykaKIaey41aqRaemiuaaLaeiikaGIaem4qamKaeiiFaWNaemyta0KaeiykaKIaeyypa0JaemysaKKaemOBa4MaemOzayMaem4Ba8MaeiikaGIaemyta0KaeiykaKIaey41aqBcfa4aaSaaaeaacqWGtbWucqWG1bqDcqWGWbaCcqWGWbaCcqWGVbWBcqWGYbGCcqWG0baDcqGGOaakcqWGnbqtcqGHiiIZcqWGdbWqcqGGPaqkaeaacqWGtbWucqWG1bqDcqWGWbaCcqWGWbaCcqWGVbWBcqWGYbGCcqWG0baDcqGGOaakcqWGnbqtcqGGPaqkaaaaaa@690D@</m:annotation>
											</m:semantics>
										</m:math>
									</inline-formula>
								</p>
							</c>
							<c ca="center">
								<p>[0, + &#8734;[</p>
							</c>
							<c ca="center">
								<p>H</p>
							</c>
						</r>
					</tblbdy>
					<tblfn>
						<p>Description of the fourteen significance measures according to the respective type (C = Class based; IT = Information-Theoretic based; H = Hybrid). For each measure the abbreviation symbol used throughout the paper, the formula and the respective range.</p>
					</tblfn>
				</tbl>
				<tbl id="T2">
					<title>
						<p>Table 2</p>
					</title>
					<caption>
						<p>Auxiliary formulas.</p>
					</caption>
					<tblbdy cols="2">
						<r>
							<c ca="center">
								<p>Formula</p>
							</c>
							<c ca="center">
								<p>Range</p>
							</c>
						</r>
						<r>
							<c cspan="2">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>
									<inline-formula>
										<m:math name="1748-7188-2-16-i15" xmlns:m="http://www.w3.org/1998/Math/MathML">
											<m:semantics>
												<m:mrow>
													<m:mi>P</m:mi>
													<m:mo stretchy="false">(</m:mo>
													<m:mi>C</m:mi>
													<m:mo stretchy="false">)</m:mo>
													<m:mo>=</m:mo>
													<m:mfrac>
														<m:mrow>
															<m:msub>
																<m:mi>T</m:mi>
																<m:mi>P</m:mi>
															</m:msub>
															<m:mo>+</m:mo>
															<m:msub>
																<m:mi>F</m:mi>
																<m:mi>N</m:mi>
															</m:msub>
														</m:mrow>
														<m:mrow>
															<m:msub>
																<m:mi>T</m:mi>
																<m:mi>P</m:mi>
															</m:msub>
															<m:mo>+</m:mo>
															<m:msub>
																<m:mi>F</m:mi>
																<m:mi>N</m:mi>
															</m:msub>
															<m:mo>+</m:mo>
															<m:msub>
																<m:mi>F</m:mi>
																<m:mi>P</m:mi>
															</m:msub>
															<m:mo>+</m:mo>
															<m:msub>
																<m:mi>T</m:mi>
																<m:mi>N</m:mi>
															</m:msub>
														</m:mrow>
													</m:mfrac>
												</m:mrow>
												<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaemiuaaLaeiikaGIaem4qamKaeiykaKIaeyypa0tcfa4aaSaaaeaacqWGubavdaWgaaqaaiabdcfaqbqabaGaey4kaSIaemOray0aaSbaaeaacqWGobGtaeqaaaqaaiabdsfaunaaBaaabaGaemiuaafabeaacqGHRaWkcqWGgbGrdaWgaaqaaiabd6eaobqabaGaey4kaSIaemOray0aaSbaaeaacqWGqbauaeqaaiabgUcaRiabdsfaunaaBaaabaGaemOta4eabeaaaaaaaa@436B@</m:annotation>
											</m:semantics>
										</m:math>
									</inline-formula>
								</p>
							</c>
							<c ca="center">
								<p>[0,1]</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>
									<inline-formula>
										<m:math name="1748-7188-2-16-i16" xmlns:m="http://www.w3.org/1998/Math/MathML">
											<m:semantics>
												<m:mrow>
													<m:mi>P</m:mi>
													<m:mo stretchy="false">(</m:mo>
													<m:mi>C</m:mi>
													<m:mo>|</m:mo>
													<m:mi>M</m:mi>
													<m:mo stretchy="false">)</m:mo>
													<m:mo>=</m:mo>
													<m:mfrac>
														<m:mrow>
															<m:msub>
																<m:mi>T</m:mi>
																<m:mi>P</m:mi>
															</m:msub>
														</m:mrow>
														<m:mrow>
															<m:msub>
																<m:mi>T</m:mi>
																<m:mi>P</m:mi>
															</m:msub>
															<m:mo>+</m:mo>
															<m:msub>
																<m:mi>F</m:mi>
																<m:mi>P</m:mi>
															</m:msub>
														</m:mrow>
													</m:mfrac>
												</m:mrow>
												<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaemiuaaLaeiikaGIaem4qamKaeiiFaWNaemyta0KaeiykaKIaeyypa0tcfa4aaSaaaeaacqWGubavdaWgaaqaaiabdcfaqbqabaaabaGaemivaq1aaSbaaeaacqWGqbauaeqaaiabgUcaRiabdAeagnaaBaaabaGaemiuaafabeaaaaaaaa@3C3B@</m:annotation>
											</m:semantics>
										</m:math>
									</inline-formula>
								</p>
							</c>
							<c ca="center">
								<p>[0,1]</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>
									<inline-formula>
										<m:math name="1748-7188-2-16-i17" xmlns:m="http://www.w3.org/1998/Math/MathML">
											<m:semantics>
												<m:mrow>
													<m:mfrac>
														<m:mrow>
															<m:mi>P</m:mi>
															<m:mo stretchy="false">(</m:mo>
															<m:mi>C</m:mi>
															<m:mo>|</m:mo>
															<m:mi>M</m:mi>
															<m:mo stretchy="false">)</m:mo>
														</m:mrow>
														<m:mrow>
															<m:mi>P</m:mi>
															<m:mo stretchy="false">(</m:mo>
															<m:mi>C</m:mi>
															<m:mo stretchy="false">)</m:mo>
														</m:mrow>
													</m:mfrac>
													<m:mo>=</m:mo>
													<m:mfrac>
														<m:mrow>
															<m:msub>
																<m:mi>T</m:mi>
																<m:mi>P</m:mi>
															</m:msub>
															<m:mo>&#215;</m:mo>
															<m:mo stretchy="false">(</m:mo>
															<m:msub>
																<m:mi>T</m:mi>
																<m:mi>P</m:mi>
															</m:msub>
															<m:mo>+</m:mo>
															<m:msub>
																<m:mi>F</m:mi>
																<m:mi>N</m:mi>
															</m:msub>
															<m:mo>+</m:mo>
															<m:msub>
																<m:mi>F</m:mi>
																<m:mi>P</m:mi>
															</m:msub>
															<m:mo>+</m:mo>
															<m:msub>
																<m:mi>T</m:mi>
																<m:mi>N</m:mi>
															</m:msub>
															<m:mo stretchy="false">)</m:mo>
														</m:mrow>
														<m:mrow>
															<m:mo stretchy="false">(</m:mo>
															<m:msub>
																<m:mi>T</m:mi>
																<m:mi>P</m:mi>
															</m:msub>
															<m:mo>+</m:mo>
															<m:msub>
																<m:mi>F</m:mi>
																<m:mi>P</m:mi>
															</m:msub>
															<m:mo stretchy="false">)</m:mo>
															<m:mo>&#215;</m:mo>
															<m:mo stretchy="false">(</m:mo>
															<m:msub>
																<m:mi>T</m:mi>
																<m:mi>P</m:mi>
															</m:msub>
															<m:mo>+</m:mo>
															<m:msub>
																<m:mi>F</m:mi>
																<m:mi>N</m:mi>
															</m:msub>
															<m:mo stretchy="false">)</m:mo>
														</m:mrow>
													</m:mfrac>
												</m:mrow>
												<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaqcfa4aaSaaaeaacqWGqbaucqGGOaakcqWGdbWqcqGG8baFcqWGnbqtcqGGPaqkaeaacqWGqbaucqGGOaakcqWGdbWqcqGGPaqkaaGccqGH9aqpjuaGdaWcaaqaaiabdsfaunaaBaaabaGaemiuaafabeaacqGHxdaTcqGGOaakcqWGubavdaWgaaqaaiabdcfaqbqabaGaey4kaSIaemOray0aaSbaaeaacqWGobGtaeqaaiabgUcaRiabdAeagnaaBaaabaGaemiuaafabeaacqGHRaWkcqWGubavdaWgaaqaaiabd6eaobqabaGaeiykaKcabaGaeiikaGIaemivaq1aaSbaaeaacqWGqbauaeqaaiabgUcaRiabdAeagnaaBaaabaGaemiuaafabeaacqGGPaqkcqGHxdaTcqGGOaakcqWGubavdaWgaaqaaiabdcfaqbqabaGaey4kaSIaemOray0aaSbaaeaacqWGobGtaeqaaiabcMcaPaaaaaa@5C1B@</m:annotation>
											</m:semantics>
										</m:math>
									</inline-formula>
								</p>
							</c>
							<c ca="center">
								<p>[0,1]</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>
									<inline-formula>
										<m:math name="1748-7188-2-16-i18" xmlns:m="http://www.w3.org/1998/Math/MathML">
											<m:semantics>
												<m:mrow>
													<m:mfrac>
														<m:mrow>
															<m:mn>1</m:mn>
															<m:mo>&#8722;</m:mo>
															<m:mi>P</m:mi>
															<m:mo stretchy="false">(</m:mo>
															<m:mi>C</m:mi>
															<m:mo>|</m:mo>
															<m:mi>M</m:mi>
															<m:mo stretchy="false">)</m:mo>
														</m:mrow>
														<m:mrow>
															<m:mn>1</m:mn>
															<m:mo>&#8722;</m:mo>
															<m:mi>P</m:mi>
															<m:mo stretchy="false">(</m:mo>
															<m:mi>C</m:mi>
															<m:mo stretchy="false">)</m:mo>
														</m:mrow>
													</m:mfrac>
													<m:mo>=</m:mo>
													<m:mfrac>
														<m:mrow>
															<m:msub>
																<m:mi>F</m:mi>
																<m:mi>P</m:mi>
															</m:msub>
															<m:mo>&#215;</m:mo>
															<m:mo stretchy="false">(</m:mo>
															<m:msub>
																<m:mi>T</m:mi>
																<m:mi>P</m:mi>
															</m:msub>
															<m:mo>+</m:mo>
															<m:msub>
																<m:mi>F</m:mi>
																<m:mi>N</m:mi>
															</m:msub>
															<m:mo>+</m:mo>
															<m:msub>
																<m:mi>F</m:mi>
																<m:mi>P</m:mi>
															</m:msub>
															<m:mo>+</m:mo>
															<m:msub>
																<m:mi>T</m:mi>
																<m:mi>N</m:mi>
															</m:msub>
															<m:mo stretchy="false">)</m:mo>
														</m:mrow>
														<m:mrow>
															<m:mo stretchy="false">(</m:mo>
															<m:msub>
																<m:mi>T</m:mi>
																<m:mi>P</m:mi>
															</m:msub>
															<m:mo>+</m:mo>
															<m:msub>
																<m:mi>F</m:mi>
																<m:mi>P</m:mi>
															</m:msub>
															<m:mo stretchy="false">)</m:mo>
															<m:mo>&#215;</m:mo>
															<m:mo stretchy="false">(</m:mo>
															<m:msub>
																<m:mi>T</m:mi>
																<m:mi>N</m:mi>
															</m:msub>
															<m:mo>+</m:mo>
															<m:msub>
																<m:mi>F</m:mi>
																<m:mi>P</m:mi>
															</m:msub>
															<m:mo stretchy="false">)</m:mo>
														</m:mrow>
													</m:mfrac>
												</m:mrow>
												<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaqcfa4aaSaaaeaacqaIXaqmcqGHsislcqWGqbaucqGGOaakcqWGdbWqcqGG8baFcqWGnbqtcqGGPaqkaeaacqaIXaqmcqGHsislcqWGqbaucqGGOaakcqWGdbWqcqGGPaqkaaGccqGH9aqpjuaGdaWcaaqaaiabdAeagnaaBaaabaGaemiuaafabeaacqGHxdaTcqGGOaakcqWGubavdaWgaaqaaiabdcfaqbqabaGaey4kaSIaemOray0aaSbaaeaacqWGobGtaeqaaiabgUcaRiabdAeagnaaBaaabaGaemiuaafabeaacqGHRaWkcqWGubavdaWgaaqaaiabd6eaobqabaGaeiykaKcabaGaeiikaGIaemivaq1aaSbaaeaacqWGqbauaeqaaiabgUcaRiabdAeagnaaBaaabaGaemiuaafabeaacqGGPaqkcqGHxdaTcqGGOaakcqWGubavdaWgaaqaaiabd6eaobqabaGaey4kaSIaemOray0aaSbaaeaacqWGqbauaeqaaiabcMcaPaaaaaa@5FB9@</m:annotation>
											</m:semantics>
										</m:math>
									</inline-formula>
								</p>
							</c>
							<c ca="center">
								<p>[0,1]</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>
									<inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1748-7188-2-16-i27">
											<m:semantics>
												<m:mrow>
													<m:mi>I</m:mi><m:mi>n</m:mi><m:mi>f</m:mi><m:mi>o</m:mi><m:mrow><m:mo>(</m:mo>
														<m:mi>M</m:mi>
														<m:mo>)</m:mo></m:mrow><m:mo>=</m:mo><m:mo>&#8722;</m:mo><m:mi>l</m:mi><m:mi>o</m:mi><m:msub>
														<m:mi>g</m:mi>
														<m:mrow>
															<m:mrow><m:mo>|</m:mo>
																<m:mi>&#931;</m:mi>
																<m:mo>|</m:mo></m:mrow>
														</m:mrow>
													</m:msub>
													<m:mi>P</m:mi><m:mrow><m:mo>(</m:mo>
														<m:mi>M</m:mi>
														<m:mo>)</m:mo></m:mrow>
												</m:mrow>
												<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaemysaKKaemOBa4MaemOzayMaem4Ba82aaeWaaeaacqWGnbqtaiaawIcacaGLPaaacqGH9aqpcqGHsislcqWGSbaBcqWGVbWBcqWGNbWzdaWgaaWcbaWaaqWaaeaacqqHJoWuaiaawEa7caGLiWoaaeqaaOGaemiuaa1aaeWaaeaacqWGnbqtaiaawIcacaGLPaaaaaa@427E@</m:annotation>
											</m:semantics>
										</m:math>
									</inline-formula>
								</p>
							</c>
							<c ca="center">
								<p>[0, + &#8734;[</p>
							</c>
						</r>
					</tblbdy>
					<tblfn>
						<p>List of auxiliary formulas used for the calculation of measures from Table 1. The respective range is also provided.</p>
					</tblfn>
				</tbl>
			</sec>
			<sec>
				<st>
					<p>Information-Theoretic Measures</p>
				</st>
				<p>When analyzing the probabilistic aspects of genetic sequences, one of two models can be adopted: a Markov or a Bernoulli model. In Markov models, the probability distribution of a given symbol depends on the <it>n </it>previous symbols, where <it>n </it>determines the order of the Markov chain <abbrgrp><abbr bid="B8">8</abbr><abbr bid="B36">36</abbr></abbrgrp>.</p>
				<p>In Bernoulli models, sequences are generated according to an independent identically distributed (i.i.d.) process. Therefore, the occurrence of a motif <it>M </it>in a given sequence is assumed to be an i.i.d. process <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>. This means that both the input sequences and the occurrence of the amino acids are independent. Protein sequences where motifs are sought to be found are often biologically related.</p>
				<p>Although the independence of the positions along a sequence and in the motifs is not always verified, it can be considered reasonable to work under the assumption of an i.i.d. model <abbrgrp><abbr bid="B38">38</abbr></abbrgrp>. The probability <it>P </it>of a motif <it>M</it>, in the form <it>A</it><sub>1 </sub>- <it>x</it>(<it>p</it><sub>1</sub>, <it>q</it><sub>1</sub>) - <it>A</it><sub>2 </sub>- <it>x</it>(<it>p</it><sub>2</sub>, <it>q</it><sub>2</sub>) - ... - <it>A</it><sub><it>n</it></sub>, can be calculated according to formula 1.</p>
				<p>
					<display-formula id="M1"><it>P</it>(<it>M</it>) = <it>P</it>(<it>A</it><sub>1</sub>) &#215; <it>P</it>(-<it>x</it>(<it>p</it><sub>1</sub>, <it>q</it><sub>1</sub>)-) &#215; <it>P</it>(<it>A</it><sub>2</sub>) &#215; <it>P</it>(-<it>x</it>(<it>p</it><sub>2</sub>, <it>q</it><sub>2</sub>)-) &#215; ... &#215; <it>P</it>(<it>A</it><sub><it>n</it></sub>)</display-formula>
				</p>
				<p>Since the probability of matching any symbol from the alphabet (denoted by character '.') is one (<it>P</it>('.') = 1), then <it>P</it>(-<it>x</it>(<it>p</it>, <it>q</it>)-) = 1 and <inline-formula><m:math name="1748-7188-2-16-i19" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mi>P</m:mi><m:mo stretchy="false">(</m:mo><m:msub><m:mi>A</m:mi><m:mi>i</m:mi></m:msub><m:mo stretchy="false">)</m:mo><m:mo>=</m:mo><m:mstyle displaystyle="true"><m:msub><m:mo>&#8719;</m:mo><m:mrow><m:msub><m:mi>a</m:mi><m:mi>j</m:mi></m:msub><m:mo>&#8712;</m:mo><m:msub><m:mi>A</m:mi><m:mi>i</m:mi></m:msub></m:mrow></m:msub><m:mrow><m:mi>P</m:mi><m:mo stretchy="false">(</m:mo><m:msub><m:mi>a</m:mi><m:mi>j</m:mi></m:msub><m:mo stretchy="false">)</m:mo></m:mrow></m:mstyle></m:mrow><m:annotation encoding="MathType-MTEF">
MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaemiuaaLaeiikaGIaemyqae0aaSbaaSqaaiabdMgaPbqabaGccqGGPaqkcqGH9aqpdaqeqaqaaiabdcfaqjabcIcaOiabdggaHnaaBaaaleaacqWGQbGAaeqaaOGaeiykaKcaleaacqWGHbqydaWgaaadbaGaemOAaOgabeaaliabgIGiolabdgeabnaaBaaameaacqWGPbqAaeqaaaWcbeqdcqGHpis1aaaa@40DD@</m:annotation></m:semantics></m:math></inline-formula>. We consider that the probability of an amino acid <it>a</it><sub><it>j</it></sub>, <it>P</it>(<it>a</it><sub><it>j</it></sub>), is given by its frequency in the Swiss-Prot database <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>. If ambiguous positions occur in substring <it>A</it><sub><it>i</it></sub>, then its probability is given by formula 2.</p>
				<p>
					<display-formula id="M2">
						<m:math name="1748-7188-2-16-i20" xmlns:m="http://www.w3.org/1998/Math/MathML">
							<m:semantics>
								<m:mrow>
									<m:mi>P</m:mi>
									<m:mo stretchy="false">(</m:mo>
									<m:msub>
										<m:mi>A</m:mi>
										<m:mi>i</m:mi>
									</m:msub>
									<m:mo stretchy="false">)</m:mo>
									<m:mo>=</m:mo>
									<m:mstyle displaystyle="true">
										<m:munder>
											<m:mo>&#8719;</m:mo>
											<m:mrow>
												<m:msub>
													<m:mi>a</m:mi>
													<m:mi>j</m:mi>
												</m:msub>
												<m:mo>&#8712;</m:mo>
												<m:msub>
													<m:mi>A</m:mi>
													<m:mi>i</m:mi>
												</m:msub>
											</m:mrow>
										</m:munder>
										<m:mrow>
											<m:mo stretchy="false">(</m:mo>
											<m:mstyle displaystyle="true">
												<m:munderover>
													<m:mo>&#8721;</m:mo>
													<m:mrow>
														<m:mi>k</m:mi>
														<m:mo>=</m:mo>
														<m:mn>1</m:mn>
													</m:mrow>
													<m:mrow>
														<m:mrow>
															<m:mo>|</m:mo>
															<m:mrow>
																<m:msub>
																	<m:mi>A</m:mi>
																	<m:mi>i</m:mi>
																</m:msub>
															</m:mrow>
															<m:mo>|</m:mo>
														</m:mrow>
													</m:mrow>
												</m:munderover>
												<m:mrow>
													<m:mi>P</m:mi>
													<m:mo stretchy="false">(</m:mo>
													<m:msub>
														<m:mi>a</m:mi>
														<m:mi>j</m:mi>
													</m:msub>
													<m:mi>k</m:mi>
													<m:mo stretchy="false">)</m:mo>
													<m:mo stretchy="false">)</m:mo>
												</m:mrow>
											</m:mstyle>
										</m:mrow>
									</m:mstyle>
								</m:mrow>
								<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaemiuaaLaeiikaGIaemyqae0aaSbaaSqaaiabdMgaPbqabaGccqGGPaqkcqGH9aqpdaqeqbqaaiabcIcaOmaaqahabaGaemiuaaLaeiikaGIaemyyae2aaSbaaSqaaiabdQgaQbqabaGccqWGRbWAcqGGPaqkcqGGPaqkaSqaaiabdUgaRjabg2da9iabigdaXaqaamaaemaabaGaemyqae0aaSbaaWqaaiabdMgaPbqabaaaliaawEa7caGLiWoaa0GaeyyeIuoaaSqaaiabdggaHnaaBaaameaacqWGQbGAaeqaaSGaeyicI4Saemyqae0aaSbaaWqaaiabdMgaPbqabaaaleqaniabg+Givdaaaa@4FD3@</m:annotation>
							</m:semantics>
						</m:math>
					</display-formula>
				</p>
				<p>where <it>a</it><sub><it>jk </it></sub>stands for the <it>k-th </it>amino acid in position <it>j </it>of the substring <it>A</it><sub><it>i</it></sub>. For instance, the probability of the substring <it>A </it>- [<it>GC</it>] - &#183; &#183; - <it>V </it>is given by 0.0783 &#215; (0.0693 + 0.0152) &#215; 1 &#215; 1 &#215; 0.0671 = 4.44 &#215; 10<sup>-4</sup>. <it>Support</it>(<it>M</it>) is the number of times that a motif M occurs in different sequences of the database. <it>Support</it>(<it>M </it>&#8712; <it>C</it>) corresponds to the number of sequences in family C where M occurs.</p>
				<p>Information-Theoretic measures quantify the degree of information encoded in a motif. We provide examples of five of these measures.</p>
				<p><it>Information Gain </it>(IG) <abbrgrp><abbr bid="B39">39</abbr><abbr bid="B40">40</abbr></abbrgrp> is used to measure the amount of accumulated information by a motif in relation to an amino acid sequence. In this measure (see Table <tblr tid="T1">1</tblr>), the self-information content <it>Info(M) </it>(see Table <tblr tid="T2">2</tblr>) quantifies the information content associated with the motif, i.e., how likely is <it>M </it>to occur. (<it>Support</it>(<it>M</it>) - 1) gives the occurrence of motif M in the positive dataset. The minus one value of this component allows to easily reject motifs that trivially occur once.</p>
				<p>The Minimum Description Length (MDL) principle applied in <abbrgrp><abbr bid="B11">11</abbr><abbr bid="B38">38</abbr></abbrgrp>, is also an Information-theoretic measure and can be made equivalent to the IG measure. MDL is used to score the motifs and to measure the fitness of these motifs with respect to the input sequences. Assuming the hypothetical transmission of sequences, the idea is to measure how much can be saved in this transmission, if one knows about the presence of the motif. Neville-Manning <it>et al. </it><abbrgrp><abbr bid="B38">38</abbr></abbrgrp> demonstrated that <it>K </it>&#215; <it>log</it><sub>2 </sub><it>P</it>(<it>M</it>) is the saving obtained from a motif <it>M </it>over <it>K </it>covered sequences, which is equivalent to the IG formula.</p>
				<p>The <it>LogOdd </it>(LogOdd) measure provides the degree of surprise of a pattern. It compares the actual probability of occurrence (relative support value) with the expected probability of occurrence according to the background distribution. The formula presented in Table <tblr tid="T1">1</tblr> is a variant of the LogOdd formula introduced in <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>, which was first proposed to measure the significance of probabilistic patterns. This measure is particularly useful when comparing motifs with different lengths <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B41">41</abbr></abbrgrp>. Both IG and LogOdd measures can be applied to all types of deterministic patterns.</p>
				<p>The <it>Pratt </it>(Pratt) measure was introduced by Jonassen <it>et al. </it><abbrgrp><abbr bid="B42">42</abbr></abbrgrp> to rank extensible gap motifs obtained from the Pratt algorithm. Its value is calculated in two steps. In the first step, the information encoded by the motif is calculated. The second step corresponds to a penalty that is considered when gaps occur. The last measure used was the Z-Score measure. Although it is essentially a statistical measure, it was included in this group as it can be calculated based on the support, the motif information and the number of amino acids in the database (constant value). This measure can be used to filter out irrelevant motifs by selecting only those whose actual number of occurrences considerably exceeds its expected number. This criteria is based on the following biological motivation: if a motif occurs more than it is expected to occur by chance, then it should have a biological interest <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B37">37</abbr></abbrgrp>. Z-Score is one of the most widely used measures for motif evaluation, see for example <abbrgrp><abbr bid="B37">37</abbr><abbr bid="B43">43</abbr></abbrgrp>.</p>
				<p>In the Z-Score formula (see Table <tblr tid="T1">1</tblr>), <it>Support</it>(<it>M</it>) denotes the actual number of occurrences, <it>E</it>(<it>M</it>) the expected number of occurrences of <it>M</it>, and <it>N</it>(<it>M</it>) the square root of the expected variance.</p>
				<p>It was generally verified that statistically relevant motifs, discriminated through the Z-Score function, match functionally important regions of the proteins <abbrgrp><abbr bid="B37">37</abbr><abbr bid="B43">43</abbr></abbrgrp>. Another important conclusion obtained from <abbrgrp><abbr bid="B37">37</abbr></abbrgrp> is that for over-represented motifs, the non-maximal motifs (which are contained in other motifs) have a lower degree of surprise than the maximal ones. This result is a good example that significance measures can be used as a clever mechanism to prune motifs not only after, but also before, their significance is computed. The minimum support criterion provides a way to detect those motifs that occur frequently. Significance measures, like Z-Score or IG, allow to detect motifs that although not frequent occur more than expected or that represent a high degree of information. Both criteria are complementary in the task of automatically retrieving significant motifs from a database. Please refer to Table <tblr tid="T1">1</tblr> and <tblr tid="T2">2</tblr> for details on these measures.</p>
			</sec>
			<sec>
				<st>
					<p>Hybrid Measures</p>
				</st>
				<p>Considering measures that use both Information-theoretic and class-based features to determine the significance of a pattern, we selected two measures that are popular in the machine learning and data mining communities: the <it>J-Measure </it>(J) <abbrgrp><abbr bid="B44">44</abbr></abbrgrp> and the <it>Mutual Information </it>(I), which is derived from the Shannon's entropy theory <abbrgrp><abbr bid="B34">34</abbr><abbr bid="B45">45</abbr><abbr bid="B46">46</abbr></abbrgrp>.</p>
				<p>For a class space <it>Q </it>= {<it>C</it>, <inline-formula><m:math name="1748-7188-2-16-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>C</m:mi><m:mo>&#175;</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGafm4qamKbaebaaaa@2CFA@</m:annotation></m:semantics></m:math></inline-formula>}, the component <it>H</it>(<it>Q</it>) of the <it>I </it>measure (see Table <tblr tid="T1">1</tblr>) provides the degree of information encoded by <it>Q</it>. Given a motif <it>M</it>, component <it>H</it>(<it>Q</it>|<it>M</it>) measures the amount of uncertainty remaining about <it>Q </it>after <it>M </it>is known. The difference <it>H</it>(<it>Q</it>) - <it>H</it>(<it>Q</it>|<it>M</it>) provides the expected information gain about <it>Q </it>upon knowing <it>M</it>.</p>
				<p>The J measure is the product of two factors. The first factor, <it>P</it>(<it>M</it>), provides the prior probability of motif occurrence. The second factor, <it>j</it>(<it>C</it>; <it>M</it>), considers a target class <it>C </it>and its complement <inline-formula><m:math name="1748-7188-2-16-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>C</m:mi><m:mo>&#175;</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGafm4qamKbaebaaaa@2CFA@</m:annotation></m:semantics></m:math></inline-formula> and measures the goodness-of-fit of <it>M </it>with relation to class <it>C</it>. It is also called <it>cross-entropy </it><abbrgrp><abbr bid="B47">47</abbr></abbrgrp>.</p>
				<p>In addition, we redefine the <it>IG </it>measure to account for the distribution of motifs among the protein families, leading to the definition of a measure called <it>Surprise-Measure </it>(<it>S</it>). The <it>S </it>measure combines the information content (<it>Info</it>) of the motif <it>M </it>with the conditional probability of <it>M </it>matching a sequence (<it>s</it>) from the target class <it>C</it>. This probability is given by the relative occurrence of <it>M </it>in <it>C</it>, <inline-formula><m:math name="1748-7188-2-16-i21" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mfrac><m:mrow><m:mi>S</m:mi><m:mi>u</m:mi><m:mi>p</m:mi><m:mi>p</m:mi><m:mi>o</m:mi><m:mi>r</m:mi><m:mi>t</m:mi><m:mo stretchy="false">(</m:mo><m:mi>M</m:mi><m:mo>&#8712;</m:mo><m:mi>C</m:mi><m:mo stretchy="false">)</m:mo></m:mrow><m:mrow><m:mi>S</m:mi><m:mi>u</m:mi><m:mi>p</m:mi><m:mi>p</m:mi><m:mi>o</m:mi><m:mi>r</m:mi><m:mi>t</m:mi><m:mo stretchy="false">(</m:mo><m:mi>M</m:mi><m:mo stretchy="false">)</m:mo></m:mrow></m:mfrac></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaqcfa4aaSaaaeaacqWGtbWucqWG1bqDcqWGWbaCcqWGWbaCcqWGVbWBcqWGYbGCcqWG0baDcqGGOaakcqWGnbqtcqGHiiIZcqWGdbWqcqGGPaqkaeaacqWGtbWucqWG1bqDcqWGWbaCcqWGWbaCcqWGVbWBcqWGYbGCcqWG0baDcqGGOaakcqWGnbqtcqGGPaqkaaaaaa@4820@</m:annotation></m:semantics></m:math></inline-formula>, which corresponds to the positive predicted value of <it>M</it>. It expresses the amount of information provided by the motif and its quality as a class descriptor.</p>
				<p>These three measures can be easily calculated for all types of deterministic motifs. In general, one can interpret such measures as a way to quantify the uncertainty reduction of a sequence <it>s </it>belonging to the class <it>C</it>, given that <it>s </it>contains the motif <it>M</it>.</p>
				<p>In conclusion, the presented measures can be calculated based on two components of motif information: the class match information (<it>T</it><sub><it>P</it></sub>, <it>T</it><sub><it>N</it></sub>, <it>F</it><sub><it>P</it></sub>, <it>F</it><sub><it>N</it></sub>) component and the motif probability and gap information component. Class-based measures are calculated according to the first component, Information-theoretic measures based on the second and hybrid measures based on both. Table <tblr tid="T2">2</tblr> contains formulas to support a better understanding of Table <tblr tid="T1">1</tblr>.</p>
			</sec>
			<sec>
				<st>
					<p>Evaluation</p>
				</st>
				<p>We start by describing the algorithms applied to mine the three different types of motifs used in the experiments.</p>
				<p>To mine contiguous motifs we developed a simple algorithm based on the n-gram methodology. A <it>n-gram </it>is a word of <it>n </it>contiguous symbols. The algorithm takes as input a set of sequences and the <it>target motif</it>, which represents the motif to be primarily spotted. It extracts words with the length of the target motif (n = motif length) through window sliding. Each word is hashed into a table and the respective support count incremented. Finally, the score values for the different measures of all the scanned words are calculated. Due to their popularity within the bioinformatics community, Teiresias <abbrgrp><abbr bid="B48">48</abbr></abbrgrp> and Pratt <abbrgrp><abbr bid="B49">49</abbr></abbrgrp> were used to extract rigid and flexible gap motifs, respectively. Besides the input dataset, Teiresias algorithm accepts as input three parameters: minimum support, L and W, where L defines the minimum number of concrete symbols that a word of length W must contain. Pratt allows specifying the characteristics of the extracted motifs by setting a large number of parameters. It automatically scores the motifs according to the Pratt measure. With the exception of the minimum support value and the number of reported motifs all the remaining Pratt parameters were used assuming the default values recommended by the authors (program available at <abbrgrp><abbr bid="B50">50</abbr></abbrgrp>). Additional details for the use of these programs are provided whenever necessary.</p>
				<p>The consistency between two measures can be defined as follows:</p>
				<p><b>Definition 1. (Measure Consistency) </b><it>Given two measures M</it><sub>1</sub><it> and M</it><sub>2</sub><it> and the respective score value vectors V</it><sub><it>M</it>1</sub><it> and V</it><sub><it>M</it>2</sub>,<it> the respective consistency is determined by the Pearson's Correlation between its vectors, corr</it>(<it>V</it><sub><it>M</it>1</sub>, <it>V</it><sub><it>M</it>2</sub>).</p>
				<p>Informally, a motif is considered to be strongly conserved if it occurs in the majority of the input sequences, i.e., its relative support value is approximately 100%. Alternatively, it is considered weakly conserved if its relative support is considerably below 50%.</p>
			</sec>
			<sec>
				<st>
					<p>Ranking Analysis</p>
				</st>
				<p>In this first experiment, the ability of the introduced measures in ranking the three different types of motifs is evaluated. The general evaluation procedure was as follows: select a target motif from Prosite, Dilimot or synthetically generated motif. Gather the set of related protein sequences where false negatives may occur. The parameters of the algorithm are refined until the target motif is included in the reported solution. For motif ranking evaluation only positive information is considered. Since not all the elements of class match information are available, only Information-theoretic measures are used in the ranking evaluation. In order to assess the quality of the measures in ranking the target motifs, a metric called <it>R</it><sub><it>m </it></sub>(Formula 3) was used, where <it>N</it><sub><it>motifs </it></sub>is the total number of evaluated motifs and <it>Rank</it><sub><it>motifs </it></sub>the sum of the respective rank values. Measures with <it>R</it><sub><it>m </it></sub>closer to 1 are the best.</p>
				<p>
					<display-formula id="M3">
						<m:math name="1748-7188-2-16-i22" xmlns:m="http://www.w3.org/1998/Math/MathML">
							<m:semantics>
								<m:mrow>
									<m:msub>
										<m:mi>R</m:mi>
										<m:mi>m</m:mi>
									</m:msub>
									<m:mo>=</m:mo>
									<m:mfrac>
										<m:mrow>
											<m:msub>
												<m:mi>N</m:mi>
												<m:mrow>
													<m:mi>m</m:mi>
													<m:mi>o</m:mi>
													<m:mi>t</m:mi>
													<m:mi>i</m:mi>
													<m:mi>f</m:mi>
													<m:mi>s</m:mi>
												</m:mrow>
											</m:msub>
										</m:mrow>
										<m:mrow>
											<m:mi>R</m:mi>
											<m:mi>a</m:mi>
											<m:mi>n</m:mi>
											<m:msub>
												<m:mi>k</m:mi>
												<m:mrow>
													<m:mi>m</m:mi>
													<m:mi>o</m:mi>
													<m:mi>t</m:mi>
													<m:mi>i</m:mi>
													<m:mi>f</m:mi>
													<m:mi>s</m:mi>
												</m:mrow>
											</m:msub>
										</m:mrow>
									</m:mfrac>
								</m:mrow>
								<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaemOuai1aaSbaaSqaaiabd2gaTbqabaGccqGH9aqpjuaGdaWcaaqaaiabd6eaonaaBaaabaGaemyBa0Maem4Ba8MaemiDaqNaemyAaKMaemOzayMaem4CamhabeaaaeaacqWGsbGucqWGHbqycqWGUbGBcqWGRbWAdaWgaaqaaiabd2gaTjabd+gaVjabdsha0jabdMgaPjabdAgaMjabdohaZbqabaaaaaaa@47E2@</m:annotation>
							</m:semantics>
						</m:math>
					</display-formula>
				</p>
				<sec>
					<st>
						<p>Contiguous Motifs</p>
					</st>
					<p>Real protein sequence data was obtained from Prosite. Entries that contain contiguous motifs were selected and the respective sets of sequences retrieved. Additionally, synthetic protein data was generated. Each synthetic dataset consists of 50 sequences of length 300. For each dataset, a motif of a given length was randomly generated and planted once in all its sequences. The generation of sequences and motifs was done according to the Swiss-Prot amino acid frequency. Motifs were then extracted according to the described n-gram methodology.</p>
					<p>Table <tblr tid="T3">3</tblr> shows the ranking of 11 Prosite motifs and Table <tblr tid="T4">4</tblr> the results for a group of 8 synthetic protein datasets. In both cases, the target motifs are highly conserved with a support of around or equal to 100%.</p>
					<tbl id="T3">
						<title>
							<p>Table 3</p>
						</title>
						<caption>
							<p>Evaluation of contiguous motifs on Prosite data.</p>
						</caption>
						<tblbdy cols="11">
							<r>
								<c ca="center">
									<p>PS entry</p>
								</c>
								<c ca="center">
									<p>Motif</p>
								</c>
								<c ca="center">
									<p>NumSeqs</p>
								</c>
								<c ca="center">
									<p>DiffNGrams</p>
								</c>
								<c ca="center">
									<p>Rel. Supp(%)</p>
								</c>
								<c ca="center">
									<p>Supp Rank</p>
								</c>
								<c ca="center">
									<p>ZScore</p>
								</c>
								<c ca="center">
									<p>LogOdd</p>
								</c>
								<c ca="center">
									<p>Pratt</p>
								</c>
								<c ca="center">
									<p>IG</p>
								</c>
								<c ca="center">
									<p>Info</p>
								</c>
							</r>
							<r>
								<c cspan="11">
									<hr/>
								</c>
							</r>
							<r>
								<c ca="center">
									<p>PS00341</p>
								</c>
								<c ca="center">
									<p>IPCCPV</p>
								</c>
								<c ca="center">
									<p>9</p>
								</c>
								<c ca="center">
									<p>702</p>
								</c>
								<c ca="center">
									<p>77.8</p>
								</c>
								<c ca="center">
									<p>9</p>
								</c>
								<c ca="center">
									<p>21</p>
								</c>
								<c ca="center">
									<p>65</p>
								</c>
								<c ca="center">
									<p>166</p>
								</c>
								<c ca="center">
									<p>13</p>
								</c>
								<c ca="center">
									<p>217</p>
								</c>
							</r>
							<r>
								<c ca="center">
									<p>PS00415</p>
								</c>
								<c ca="center">
									<p>LRRRLSDS</p>
								</c>
								<c ca="center">
									<p>12</p>
								</c>
								<c ca="center">
									<p>3582</p>
								</c>
								<c ca="center">
									<p>91.6</p>
								</c>
								<c ca="center">
									<p>9</p>
								</c>
								<c ca="center">
									<p>503</p>
								</c>
								<c ca="center">
									<p>1058</p>
								</c>
								<c ca="center">
									<p>2103</p>
								</c>
								<c ca="center">
									<p>11</p>
								</c>
								<c ca="center">
									<p>1784</p>
								</c>
							</r>
							<r>
								<c ca="center">
									<p>PS00047</p>
								</c>
								<c ca="center">
									<p>GAKRH</p>
								</c>
								<c ca="center">
									<p>105</p>
								</c>
								<c ca="center">
									<p>653</p>
								</c>
								<c ca="center">
									<p>93.3</p>
								</c>
								<c ca="center">
									<p>21</p>
								</c>
								<c ca="center">
									<p>61</p>
								</c>
								<c ca="center">
									<p>109</p>
								</c>
								<c ca="center">
									<p>216</p>
								</c>
								<c ca="center">
									<p>27</p>
								</c>
								<c ca="center">
									<p>460</p>
								</c>
							</r>
							<r>
								<c ca="center">
									<p>PS00984</p>
								</c>
								<c ca="center">
									<p>CFWKYC</p>
								</c>
								<c ca="center">
									<p>19</p>
								</c>
								<c ca="center">
									<p>1256</p>
								</c>
								<c ca="center">
									<p>100</p>
								</c>
								<c ca="center">
									<p>1</p>
								</c>
								<c ca="center">
									<p>1</p>
								</c>
								<c ca="center">
									<p>1</p>
								</c>
								<c ca="center">
									<p>785</p>
								</c>
								<c ca="center">
									<p>1</p>
								</c>
								<c ca="center">
									<p>5</p>
								</c>
							</r>
							<r>
								<c ca="center">
									<p>PS00541</p>
								</c>
								<c ca="center">
									<p>SKRKYRK</p>
								</c>
								<c ca="center">
									<p>6</p>
								</c>
								<c ca="center">
									<p>144</p>
								</c>
								<c ca="center">
									<p>100</p>
								</c>
								<c ca="center">
									<p>1</p>
								</c>
								<c ca="center">
									<p>85</p>
								</c>
								<c ca="center">
									<p>110</p>
								</c>
								<c ca="center">
									<p>131</p>
								</c>
								<c ca="center">
									<p>3</p>
								</c>
								<c ca="center">
									<p>134</p>
								</c>
							</r>
							<r>
								<c ca="center">
									<p>PS00822</p>
								</c>
								<c ca="center">
									<p>PFDRHDW</p>
								</c>
								<c ca="center">
									<p>9</p>
								</c>
								<c ca="center">
									<p>2251</p>
								</c>
								<c ca="center">
									<p>100</p>
								</c>
								<c ca="center">
									<p>1</p>
								</c>
								<c ca="center">
									<p>1</p>
								</c>
								<c ca="center">
									<p>5</p>
								</c>
								<c ca="center">
									<p>204</p>
								</c>
								<c ca="center">
									<p>1</p>
								</c>
								<c ca="center">
									<p>400</p>
								</c>
							</r>
							<r>
								<c ca="center">
									<p>PS00419</p>
								</c>
								<c ca="center">
									<p>CDGPGRGGTC</p>
								</c>
								<c ca="center">
									<p>207</p>
								</c>
								<c ca="center">
									<p>32936</p>
								</c>
								<c ca="center">
									<p>100</p>
								</c>
								<c ca="center">
									<p>1</p>
								</c>
								<c ca="center">
									<p>1</p>
								</c>
								<c ca="center">
									<p>1</p>
								</c>
								<c ca="center">
									<p>3</p>
								</c>
								<c ca="center">
									<p>1</p>
								</c>
								<c ca="center">
									<p>158</p>
								</c>
							</r>
							<r>
								<c ca="center">
									<p>PS00349</p>
								</c>
								<c ca="center">
									<p>RKRKYFKKHEKR</p>
								</c>
								<c ca="center">
									<p>18</p>
								</c>
								<c ca="center">
									<p>2929</p>
								</c>
								<c ca="center">
									<p>100</p>
								</c>
								<c ca="center">
									<p>1</p>
								</c>
								<c ca="center">
									<p>38</p>
								</c>
								<c ca="center">
									<p>86</p>
								</c>
								<c ca="center">
									<p>2884</p>
								</c>
								<c ca="center">
									<p>19</p>
								</c>
								<c ca="center">
									<p>310</p>
								</c>
							</r>
							<r>
								<c ca="center">
									<p>PS00861</p>
								</c>
								<c ca="center">
									<p>GWTLNSAGYLLGP</p>
								</c>
								<c ca="center">
									<p>32</p>
								</c>
								<c ca="center">
									<p>888</p>
								</c>
								<c ca="center">
									<p>100</p>
								</c>
								<c ca="center">
									<p>1</p>
								</c>
								<c ca="center">
									<p>66</p>
								</c>
								<c ca="center">
									<p>301</p>
								</c>
								<c ca="center">
									<p>179</p>
								</c>
								<c ca="center">
									<p>1</p>
								</c>
								<c ca="center">
									<p>569</p>
								</c>
							</r>
							<r>
								<c ca="center">
									<p>PS01024</p>
								</c>
								<c ca="center">
									<p>EFDYLKSLEIEEKIN</p>
								</c>
								<c ca="center">
									<p>60</p>
								</c>
								<c ca="center">
									<p>5527</p>
								</c>
								<c ca="center">
									<p>100</p>
								</c>
								<c ca="center">
									<p>1</p>
								</c>
								<c ca="center">
									<p>620</p>
								</c>
								<c ca="center">
									<p>2427</p>
								</c>
								<c ca="center">
									<p>5266</p>
								</c>
								<c ca="center">
									<p>1</p>
								</c>
								<c ca="center">
									<p>5244</p>
								</c>
							</r>
							<r>
								<c ca="center">
									<p>PS00291</p>
								</c>
								<c ca="center">
									<p>AGAAAAGAVVGGLGGY</p>
								</c>
								<c ca="center">
									<p>136</p>
								</c>
								<c ca="center">
									<p>2423</p>
								</c>
								<c ca="center">
									<p>100</p>
								</c>
								<c ca="center">
									<p>1</p>
								</c>
								<c ca="center">
									<p>1033</p>
								</c>
								<c ca="center">
									<p>1770</p>
								</c>
								<c ca="center">
									<p>184</p>
								</c>
								<c ca="center">
									<p>3</p>
								</c>
								<c ca="center">
									<p>1984</p>
								</c>
							</r>
							<r>
								<c cspan="11">
									<hr/>
								</c>
							</r>
							<r>
								<c ca="center">
									<p>
										<it>R</it>
										<sub>
											<it>m</it>
										</sub>
									</p>
								</c>
								<c>
									<p/>
								</c>
								<c>
									<p/>
								</c>
								<c>
									<p/>
								</c>
								<c>
									<p/>
								</c>
								<c ca="center">
									<p>0.2340</p>
								</c>
								<c ca="center">
									<p>4.526E-3</p>
								</c>
								<c ca="center">
									<p>1.854E-3</p>
								</c>
								<c ca="center">
									<p>9.075E-4</p>
								</c>
								<c ca="center">
									<p>0.1358</p>
								</c>
								<c ca="center">
									<p>9.764E-4</p>
								</c>
							</r>
						</tblbdy>
						<tblfn>
							<p>Ranking results of eleven Prosite datasets (identified by the Prosite (PS) entry column). For each dataset, the number of protein sequences, the number of different n-grams (Diff NGrams), where <it>n </it>is equal to the motif length and the relative support of the target motifs (Rel. Supp) are presented. Motifs are ranked with Information-theoretic based measures. Ranks obtained by support (Supp Rank) and information gain (Info) are also provided for comparison purposes. Last row gives the <it>R</it><sub><it>m </it></sub>values of each measure, where best results are obtained by support and IG.</p>
						</tblfn>
					</tbl>
					<tbl id="T4">
						<title>
							<p>Table 4</p>
						</title>
						<caption>
							<p>Evaluation of contiguous motifs on protein synthetic data.</p>
						</caption>
						<tblbdy cols="6">
							<r>
								<c ca="center">
									<p>Motif</p>
								</c>
								<c ca="center">
									<p>Supp</p>
								</c>
								<c ca="center">
									<p>ZScore</p>
								</c>
								<c ca="center">
									<p>LogOdd</p>
								</c>
								<c ca="center">
									<p>Pratt</p>
								</c>
								<c ca="center">
									<p>IG</p>
								</c>
							</r>
							<r>
								<c cspan="6">
									<hr/>
								</c>
							</r>
							<r>
								<c ca="center">
									<p>SSN</p>
								</c>
								<c ca="center">
									<p>1</p>
								</c>
								<c ca="center">
									<p>3710</p>
								</c>
								<c ca="center">
									<p>1</p>
								</c>
								<c ca="center">
									<p>2130</p>
								</c>
								<c ca="center">
									<p>1</p>
								</c>
							</r>
							<r>
								<c ca="center">
									<p>IYKQ</p>
								</c>
								<c ca="center">
									<p>1</p>
								</c>
								<c ca="center">
									<p>1533</p>
								</c>
								<c ca="center">
									<p>2</p>
								</c>
								<c ca="center">
									<p>11817</p>
								</c>
								<c ca="center">
									<p>1</p>
								</c>
							</r>
							<r>
								<c ca="center">
									<p>NDFNE</p>
								</c>
								<c ca="center">
									<p>1</p>
								</c>
								<c ca="center">
									<p>1</p>
								</c>
								<c ca="center">
									<p>1</p>
								</c>
								<c ca="center">
									<p>13483</p>
								</c>
								<c ca="center">
									<p>1</p>
								</c>
							</r>
							<r>
								<c ca="center">
									<p>PLMPES</p>
								</c>
								<c ca="center">
									<p>1</p>
								</c>
								<c ca="center">
									<p>1</p>
								</c>
								<c ca="center">
									<p>2</p>
								</c>
								<c ca="center">
									<p>4973</p>
								</c>
								<c ca="center">
									<p>1</p>
								</c>
							</r>
							<r>
								<c ca="center">
									<p>MRKMVTAG</p>
								</c>
								<c ca="center">
									<p>1</p>
								</c>
								<c ca="center">
									<p>1</p>
								</c>
								<c ca="center">
									<p>6</p>
								</c>
								<c ca="center">
									<p>9818</p>
								</c>
								<c ca="center">
									<p>1</p>
								</c>
							</r>
							<r>
								<c ca="center">
									<p>TKYEETGAFK</p>
								</c>
								<c ca="center">
									<p>1</p>
								</c>
								<c ca="center">
									<p>1</p>
								</c>
								<c ca="center">
									<p>43</p>
								</c>
								<c ca="center">
									<p>7350</p>
								</c>
								<c ca="center">
									<p>1</p>
								</c>
							</r>
							<r>
								<c ca="center">
									<p>DRTGMHSIFFLP</p>
								</c>
								<c ca="center">
									<p>1</p>
								</c>
								<c ca="center">
									<p>1</p>
								</c>
								<c ca="center">
									<p>3</p>
								</c>
								<c ca="center">
									<p>11721</p>
								</c>
								<c ca="center">
									<p>1</p>
								</c>
							</r>
							<r>
								<c ca="center">
									<p>MTENKVGESICPAAPN</p>
								</c>
								<c ca="center">
									<p>1</p>
								</c>
								<c ca="center">
									<p>1</p>
								</c>
								<c ca="center">
									<p>29</p>
								</c>
								<c ca="center">
									<p>9589</p>
								</c>
								<c ca="center">
									<p>1</p>
								</c>
							</r>
							<r>
								<c cspan="6">
									<hr/>
								</c>
							</r>
							<r>
								<c ca="center">
									<p>
										<it>R</it>
										<sub>
											<it>m</it>
										</sub>
									</p>
								</c>
								<c ca="center">
									<p>1</p>
								</c>
								<c ca="center">
									<p>0.0015</p>
								</c>
								<c ca="center">
									<p>0.0919</p>
								</c>
								<c ca="center">
									<p>1.128E-4</p>
								</c>
								<c ca="center">
									<p>1</p>
								</c>
							</r>
						</tblbdy>
						<tblfn>
							<p>Ranking results for eight synthetic protein datasets. Each dataset contains 50 sequences of length 300. Target motifs have a support of 100%. Motifs are ranked with Information-theoretic measures and support. Last row gives the <it>R</it><sub><it>m </it></sub>values of each measure, where the best results are obtained by IG and support.</p>
						</tblfn>
					</tbl>
				</sec>
				<sec>
					<st>
						<p>Rigid Gap Motifs</p>
					</st>
					<p>Table <tblr tid="T5">5</tblr> shows the ranking of rigid gap motifs from ten datasets of the Dilimot database. This experiment was performed to evaluate weakly conserved motifs. Table <tblr tid="T6">6</tblr> presents the results for 8 datasets from Prosite. The evaluation is focused on long and strongly conserved rigid gap motifs. Teiresias algorithm was used to extract the motifs, were L and W parameters were set to conform the characteristics of the target motif and the minimum support set to 80% of its actual support.</p>
					<tbl id="T5">
						<title>
							<p>Table 5</p>
						</title>
						<caption>
							<p>Evaluation of rigid gap motifs on Dilimot datasets.</p>
						</caption>
						<tblbdy cols="8">
							<r>
								<c ca="center">
									<p>Motif</p>
								</c>
								<c ca="center">
									<p>NumSeqs</p>
								</c>
								<c ca="center">
									<p>Abs. Supp</p>
								</c>
								<c ca="center">
									<p>Supp Rank</p>
								</c>
								<c ca="center">
									<p>IG</p>
								</c>
								<c ca="center">
									<p>Pratt</p>
								</c>
								<c ca="center">
									<p>LogOdd</p>
								</c>
								<c ca="center">
									<p>Zscore</p>
								</c>
							</r>
							<r>
								<c cspan="8">
									<hr/>
								</c>
							</r>
							<r>
								<c ca="center">
									<p>LPSN</p>
								</c>
								<c ca="center">
									<p>15</p>
								</c>
								<c ca="center">
									<p>4</p>
								</c>
								<c ca="center">
									<p>1294</p>
								</c>
								<c ca="center">
									<p>520</p>
								</c>
								<c ca="center">
									<p>2429</p>
								</c>
								<c ca="center">
									<p>4</p>
								</c>
								<c ca="center">
									<p>6</p>
								</c>
							</r>
							<r>
								<c ca="center">
									<p>WS.WS</p>
								</c>
								<c ca="center">
									<p>34</p>
								</c>
								<c ca="center">
									<p>7</p>
								</c>
								<c ca="center">
									<p>15</p>
								</c>
								<c ca="center">
									<p>22</p>
								</c>
								<c ca="center">
									<p>31</p>
								</c>
								<c ca="center">
									<p>28</p>
								</c>
								<c ca="center">
									<p>28</p>
								</c>
							</r>
							<r>
								<c ca="center">
									<p>Q.RLQ..Q</p>
								</c>
								<c ca="center">
									<p>15</p>
								</c>
								<c ca="center">
									<p>4</p>
								</c>
								<c ca="center">
									<p>5259</p>
								</c>
								<c ca="center">
									<p>660</p>
								</c>
								<c ca="center">
									<p>5213</p>
								</c>
								<c ca="center">
									<p>1</p>
								</c>
								<c ca="center">
									<p>1</p>
								</c>
							</r>
							<r>
								<c ca="center">
									<p>P.LP.K</p>
								</c>
								<c ca="center">
									<p>24</p>
								</c>
								<c ca="center">
									<p>8</p>
								</c>
								<c ca="center">
									<p>1334</p>
								</c>
								<c ca="center">
									<p>336</p>
								</c>
								<