A novel methodology for large-scale phylogeny partition
- Authors: Prosperi, M; Ciccozzi, M; Fanti, I; Saladini, F; Pecorari, M; Borghi, V; Di Giambenedetto, S; Bruzzone, B; Capetti, A; Vivarelli, A; Rusconi, S; Re, M; Gismondo, M; Sighinolfi, L; Gray, R; Salemi, M; Zazzi, M; De Luca, A; Mancuso, S
- Publication year: 2011
- Type: Articolo in rivista (Articolo in rivista)
- Key words: Algorithms; Classification; Female; Gene Products, pol; HIV Infections; HIV-1; Humans; Male; Phylogeny; Chemistry (all); Biochemistry, Genetics and Molecular Biology (all); Physics and Astronomy (all)
- OA Link: http://hdl.handle.net/10447/215391
Abstract
Understanding the determinants of virus transmission is a fundamental step for effective design of screening and intervention strategies to control viral epidemics. Phylogenetic analysis can be a valid approach for the identification of transmission chains, and very-large data sets can be analysed through parallel computation. Here we propose and validate a new methodology for the partition of large-scale phylogenies and the inference of transmission clusters. This approach, on the basis of a depth-first search algorithm, conjugates the evaluation of node reliability, tree topology and patristic distance analysis. The method has been applied to identify transmission clusters of a phylogeny of 11,541 human immunodeficiency virus-1 subtype B pol gene sequences from a large Italian cohort. Molecular transmission chains were characterized by means of different clinical/demographic factors, such as the interaction between male homosexuals and male heterosexuals. Our method takes an advantage of a flexible notion of transmission cluster and can become a general framework to analyse other epidemics. © 2011 Macmillan Publishers Limited. All rights reserved.