Institut Pasteur | C3BI | Bioinformatics and Biostatistics Hub | GIPhy


Groupe d'Inférence Phylogénétique

Genome Informatics and Phylogenetics


Welcome to the homepage of GIPhy, one of the expert groups of the Bioinformatics and Biostatistics Hub of the C3BI, Institut Pasteur, Paris, France. GIPhy's research focuses on biological classification: its members work on projects in systematics, taxonomy, homology and related fields.

Institutional webpage

More details about GIPhy (members, main projects, publication list)

Databases and Datasets

Empirical Models of Amino Acid Substitution || a complete list of amino acid replacement matrices for model-based sequence evolution analyses

PhyloM || phylogenetic markers (along with multiple sequence alignments and position specific scoring matrices) that are well-suited for the phylogenetic analysis of specific phyla

RVDB-prot || reference viral coding sequence and associated HMM database developed for enhancing virus detection from High-Throughput Sequencing data

Programs and Tools

AlienTrimmer || a tool for clipping and trimming High-Throughput Sequencing reads

BMGE || a tool for selecting characters or encoding character states from a multiple sequence alignment for phylogenetic inference

C2A/A2C || two tools for translating and back-translating codon and amino-acid sequence files, respectively

Concatenate || a tool for building a supermatrix of characters by concatenating multiple sequence alignments

contig_info || a tool for estimating standard descriptive statistics from contig sequences

eFASTA || a tool for extracting a nucleotide segment from a FASTA-formatted file

FASTA2AGP || a tool for creating an AGP file from a FASTA-formatted scaffold sequence file

findSynapomorphies || a tool for finding characters shared by a group of aligned sequences

Gklust || a tool for fast genome sequence clustering

JolyTree || a tool for inferring distance-based phylogenetic trees from unaligned genome sequences (available on Bioconda)

gbk2ENA || a tool for converting Genbank files into EMBL-like files suitable for submission to the ENA

REQ || a tool for estimating branch supports in distance-based phylogenetic trees with the rate of elementary quartets (available on Bioconda)

wgetGenBankWGS || a tool for downloading genome assembly FASTA files from the GenBank or RefSeq repositories

Supplementary Data

Supplementary data accompanying some of our published analyses