PRABI-Doua

Pôle Rhône-Alpes de Bioinformatique Site Doua

Barre

Logo_nhPhyML

Bastien Boussau
bastien.boussau@univ-lyon1.fr
Laboratoire de Biométrie et Biologie Evolutive, UMR CNRS 5558


INTRODUCTION
DOWNLOAD
INSTALLATION
USAGE
OTHER OPTIONS
OUTPUT
ADVICE
CITATION
REFERENCES

INTRODUCTION

nhPhyML is a program built to compute phylogenetic trees under the non stationary, non homogeneous model of DNA sequence evolution of Galtier and Gouy (1998).
As such, it provides estimates of the G+C contents of ancient sequences.
It uses the algorithmic structure of PhyML (Guindon and Gascuel, 2003) adapted to the rooted and irreversible case (many thanks to Stéphane Guindon for providing PhyML source code).
The program can also be downloaded from PhyML homepage, under "PHYML unofficial versions".
It is provided as is and should be used with appropriate care.
Please report any bug to my address: bastien.boussau@univ-lyon1.fr.
It has been tested on Unix and Linux systems.
For more information about my work,  see  my lab page.

If you are interested not in the reconstruction of a phylogenetic tree but in studying the pattern of substitutions along a fixed topology and reconstructing ancestral sequences, I suggest you give a look at Bio++ and the BppSuite suite of software.

DOWNLOAD

You can either download an executable file for LINUX or download an archive containing all the source code. Example files can also be downloaded here.

INSTALLATION

Installation on Linux:

Installation on Mac OS X (thanks to Cedric Simillion for finding out how to install nhPhyML on Mac OS X)

  1. cd into the nhPhyml directory after extracting the archive
  2. remove the nhPhyml binary and all .o files that came with the archive
  3. open the Makefile in a text editor and remove the -static option from the CFLAGS line
  4. typing "make" now produces a working binary for OS X.

USAGE

nhPhyml needs a rooted tree (the root will never be moved throughout the tree space search) and a sequence file in phylip format. Note that if the tree is not rooted, you will get a "Seg Fault".

Where:

Only the sequence file and the tree file are mandatory. Default values for the other parameters are:
-format=i -positions=123 -tstv=e -rates=1 -topology=e -outseqs=n -eqfreq=unlim
When there is only one rate of evolution, no alpha is used.

OTHER OPTIONS

Those options are less central but might be useful to some users.

OUTPUT

5 or 6 files are produced in the directory containing the "SequenceFile" :

ADVICE

The tree space exploration is done as in PhyML v.2.2, by Nearest Neighbor Interchanges (NNIs). These topological rearrangements are local, and do not permit testing topologies distant from the input one, especially when the number of sequences is important. Therefore, it is recommended that you run the program using many different input trees. The resulting trees can be compared using CONSEL with the help of the LNF files.

CITATION

Please cite the following article when using nhPhyML:

I believe it would also be good to cite the articles by Galtier and Gouy (1998) as the model used in nhPhyML comes from this work, and by Guindon and Gascuel (2003) as much of nhPhyML code comes from PhyML.

REFERENCES

Bastien Boussau, PhD
bastien.boussau@univ-lyon1.fr
Laboratoire de Biométrie et Biologie Evolutive, UMR CNRS 5558