Email updates

Keep up to date with the latest news and content from Algorithms for Molecular Biology and BioMed Central.

Open Access Research

Ubiquity of synonymity: almost all large binary trees are not uniquely identified by their spectra or their immanantal polynomials

Frederick A Matsen1* and Steven N Evans2

Author Affiliations

1 Program in Computational Biology, Fred Hutchinson Cancer Research Center, Seattle, Washington, USA

2 Department of Statistics, University of California at Berkeley, Berkeley, California, USA

For all author emails, please log on.

Algorithms for Molecular Biology 2012, 7:14  doi:10.1186/1748-7188-7-14

Published: 21 May 2012

Abstract

Background

There are several common ways to encode a tree as a matrix, such as the adjacency matrix, the Laplacian matrix (that is, the infinitesimal generator of the natural random walk), and the matrix of pairwise distances between leaves. Such representations involve a specific labeling of the vertices or at least the leaves, and so it is natural to attempt to identify trees by some feature of the associated matrices that is invariant under relabeling. An obvious candidate is the spectrum of eigenvalues (or, equivalently, the characteristic polynomial).

Results

We show for any of these choices of matrix that the fraction of binary trees with a unique spectrum goes to zero as the number of leaves goes to infinity. We investigate the rate of convergence of the above fraction to zero using numerical methods. For the adjacency and Laplacian matrices, we show that the a priori more informative immanantal polynomials have no greater power to distinguish between trees.

Conclusion

Our results show that a generic large binary tree is highly unlikely to be identified uniquely by common spectral invariants.