|
Descriptions of protein datasets. # Seq. gives the number of input protein sequences. Length gives the length of the protein motif searched for. |V| gives the number of vertices in the original graph constructed from the dataset. DEE gives the methods used to prune the graph, and are denoted by (1) clique-bounds DEE, (2) tighter constrained bounds and (3) graph decomposition. |VDEE| is the number of vertices in the graph after pruning. E-value lists the e-value of the motif found by the LP/DEE algorithm. |
||||||
| Dataset |
# Seq. |
Length |
|V| |
DEE |
|VDEE| |
E-value |
|
|
||||||
| Lipocalin |
5 |
16 |
844 |
(1) |
5 |
3.80 × 10-16 |
| Helix-Turn-Helix |
30 |
20 |
6870 |
(1,2,3) |
260 |
3.88 × 10-67 |
| Tumor Necrosis Factor |
10 |
17 |
2329 |
(1) |
10 |
1.50 × 10-40 |
| Zinc Metallopeptidase |
10 |
12 |
7761 |
(1,2) |
10 |
5.82 × 10-23 |
| Immunoglobulin Fold |
18 |
10 |
7498 |
(1,2,3) |
187 |
3.04 × 10-24 |
Zaslavsky and Singh Algorithms for Molecular Biology 2006 1:13 doi:10.1186/1748-7188-1-13 |
||||||