Try a new search

Format these results:

Searched for:

in-biosketch:yes

person:baeleg01

Total Results:

109


The epidemic dynamics of hepatitis C virus subtypes 4a and 4d in Saudi Arabia

Al-Qahtani, Ahmed A; Baele, Guy; Khalaf, Nisreen; Suchard, Marc A; Al-Anazi, Mashael R; Abdo, Ayman A; Sanai, Faisal M; Al-Ashgar, Hamad I; Khan, Mohammed Q; Al-Ahdal, Mohammed N; Lemey, Philippe; Vrancken, Bram
The relatedness between viral variants sampled at different locations through time can provide information pertinent to public health that cannot readily be obtained through standard surveillance methods. Here, we use virus genetic data to identify the transmission dynamics that drive the hepatitis C virus subtypes 4a (HCV4a) and 4d (HCV4d) epidemics in Saudi Arabia. We use a comprehensive dataset of newly generated and publicly available sequence data to infer the HCV4a and HCV4d evolutionary histories in a Bayesian statistical framework. We also introduce a novel analytical method for an objective assessment of the migration intensity between locations. We find that international host mobility patterns dominate over within country spread in shaping the Saudi Arabia HCV4a epidemic, while this may be different for the HCV4d epidemic. This indicates that the subtypes 4a and 4d burden can be most effectively reduced by combining the prioritized screening and treatment of Egyptian immigrants with domestic prevention campaigns. Our results highlight that the joint investigation of evolutionary and epidemiological processes can provide valuable public health information, even in the absence of extensive metadata information.
PMCID:5359580
PMID: 28322313
ISSN: 2045-2322
CID: 5170202

Emerging Concepts of Data Integration in Pathogen Phylodynamics

Baele, Guy; Suchard, Marc A; Rambaut, Andrew; Lemey, Philippe
Phylodynamics has become an increasingly popular statistical framework to extract evolutionary and epidemiological information from pathogen genomes. By harnessing such information, epidemiologists aim to shed light on the spatio-temporal patterns of spread and to test hypotheses about the underlying interaction of evolutionary and ecological dynamics in pathogen populations. Although the field has witnessed a rich development of statistical inference tools with increasing levels of sophistication, these tools initially focused on sequences as their sole primary data source. Integrating various sources of information, however, promises to deliver more precise insights in infectious diseases and to increase opportunities for statistical hypothesis testing. Here, we review how the emerging concept of data integration is stimulating new advances in Bayesian evolutionary inference methodology which formalize a marriage of statistical thinking and evolutionary biology. These approaches include connecting sequence to trait evolution, such as for host, phenotypic and geographic sampling information, but also the incorporation of covariates of evolutionary and epidemic processes in the reconstruction procedures. We highlight how a full Bayesian approach to covariate modeling and testing can generate further insights into sequence evolution, trait evolution, and population dynamics in pathogen populations. Specific examples demonstrate how such approaches can be used to test the impact of host on rabies and HIV evolutionary rates, to identify the drivers of influenza dispersal as well as the determinants of rabies cross-species transmissions, and to quantify the evolutionary dynamics of influenza antigenicity. Finally, we briefly discuss how data integration is now also permeating through the inference of transmission dynamics, leading to novel insights into tree-generative processes and detailed reconstructions of transmission trees. [Bayesian inference; birth–death models; coalescent models; continuous trait evolution; covariates; data integration; discrete trait evolution; pathogen phylodynamics.
PMCID:5837209
PMID: 28173504
ISSN: 1076-836x
CID: 5170182

Domestication and Divergence of Saccharomyces cerevisiae Beer Yeasts

Gallone, Brigida; Steensels, Jan; Prahl, Troels; Soriaga, Leah; Saels, Veerle; Herrera-Malaver, Beatriz; Merlevede, Adriaan; Roncoroni, Miguel; Voordeckers, Karin; Miraglia, Loren; Teiling, Clotilde; Steffy, Brian; Taylor, Maryann; Schwartz, Ariel; Richardson, Toby; White, Christopher; Baele, Guy; Maere, Steven; Verstrepen, Kevin J
Whereas domestication of livestock, pets, and crops is well documented, it is still unclear to what extent microbes associated with the production of food have also undergone human selection and where the plethora of industrial strains originates from. Here, we present the genomes and phenomes of 157 industrial Saccharomyces cerevisiae yeasts. Our analyses reveal that today's industrial yeasts can be divided into five sublineages that are genetically and phenotypically separated from wild strains and originate from only a few ancestors through complex patterns of domestication and local divergence. Large-scale phenotyping and genome analysis further show strong industry-specific selection for stress tolerance, sugar utilization, and flavor production, while the sexual cycle and other phenotypes related to survival in nature show decay, particularly in beer yeasts. Together, these results shed light on the origins, evolutionary history, and phenotypic diversity of industrial yeasts and provide a resource for further selection of superior strains. PAPERCLIP.
PMCID:5018251
PMID: 27610566
ISSN: 1097-4172
CID: 5170142

SpreaD3: Interactive Visualization of Spatiotemporal History and Trait Evolutionary Processes

Bielejec, Filip; Baele, Guy; Vrancken, Bram; Suchard, Marc A; Rambaut, Andrew; Lemey, Philippe
Model-based phylogenetic reconstructions increasingly consider spatial or phenotypic traits in conjunction with sequence data to study evolutionary processes. Alongside parameter estimation, visualization of ancestral reconstructions represents an integral part of these analyses. Here, we present a complete overhaul of the spatial phylogenetic reconstruction of evolutionary dynamics software, now called SpreaD3 to emphasize the use of data-driven documents, as an analysis and visualization package that primarily complements Bayesian inference in BEAST (http://beast.bio.ed.ac.uk, last accessed 9 May 2016). The integration of JavaScript D3 libraries (www.d3.org, last accessed 9 May 2016) offers novel interactive web-based visualization capacities that are not restricted to spatial traits and extend to any discrete or continuously valued trait for any organism of interest.
PMCID:6398721
PMID: 27189542
ISSN: 1537-1719
CID: 5170132

Identifying predictors of time-inhomogeneous viral evolutionary processes

Bielejec, Filip; Baele, Guy; Rodrigo, Allen G; Suchard, Marc A; Lemey, Philippe
Various factors determine the rate at which mutations are generated and fixed in viral genomes. Viral evolutionary rates may vary over the course of a single persistent infection and can reflect changes in replication rates and selective dynamics. Dedicated statistical inference approaches are required to understand how the complex interplay of these processes shapes the genetic diversity and divergence in viral populations. Although evolutionary models accommodating a high degree of complexity can now be formalized, adequately informing these models by potentially sparse data, and assessing the association of the resulting estimates with external predictors, remains a major challenge. In this article, we present a novel Bayesian evolutionary inference method, which integrates multiple potential predictors and tests their association with variation in the absolute rates of synonymous and non-synonymous substitutions along the evolutionary history. We consider clinical and virological measures as predictors, but also changes in population size trajectories that are simultaneously inferred using coalescent modelling. We demonstrate the potential of our method in an application to within-host HIV-1 sequence data sampled throughout the infection of multiple patients. While analyses of individual patient populations lack statistical power, we detect significant evidence for an abrupt drop in non-synonymous rates in late stage infection and a more gradual increase in synonymous rates over the course of infection in a joint analysis across all patients. The former is predicted by the immune relaxation hypothesis while the latter may be in line with increasing replicative fitness during the asymptomatic stage.
PMCID:5072463
PMID: 27774306
ISSN: 2057-1577
CID: 5170162

Bayesian codon substitution modelling to identify sources of pathogen evolutionary rate variation

Baele, Guy; Suchard, Marc A; Bielejec, Filip; Lemey, Philippe
Phylodynamic reconstructions rely on a measurable molecular footprint of epidemic processes in pathogen genomes. Identifying the factors that govern the tempo and mode by which these processes leave a footprint in pathogen genomes represents an important goal towards understanding infectious disease evolution. Discriminating between synonymous and non-synonymous substitution rates is crucial for testing hypotheses about the sources of evolutionary rate variation. Here, we implement a codon substitution model in a Bayesian statistical framework to estimate absolute rates of synonymous and non-synonymous substitution in unknown evolutionary histories. To demonstrate how this model can provide critical insights into pathogen evolutionary dynamics, we adopt hierarchical phylogenetic modelling with fixed effects and apply it to two viral examples. Using within-host HIV-1 data from patients with different host genetic background and different disease progression rates, we show that viral populations undergo faster absolute synonymous substitution rates in patients with faster disease progression, probably reflecting faster replication rates. We also re-analyse rabies data from different bat species in the Americas to demonstrate that climate predicts absolute synonymous substitution rates, which can be attributed to climate-associated bat activity and viral transmission dynamics. In conclusion, our model to estimate absolute rates of synonymous and non-synonymous substitution can provide a powerful approach to investigate how host ecology can shape the tempo of pathogen evolution.
PMCID:5320644
PMID: 28348854
ISSN: 2057-5858
CID: 5170212

Sub-Epidemics Explain Localized High Prevalence of Reduced Susceptibility to Rilpivirine in Treatment-Naive HIV-1-Infected Patients: Subtype and Geographic Compartmentalization of Baseline Resistance Mutations

Theys, Kristof; Van Laethem, Kristel; Gomes, Perpetua; Baele, Guy; Pineda-Peña, Andrea-Clemencia; Vandamme, Anne-Mieke; Camacho, Ricardo J; Abecasis, Ana B
OBJECTIVE:The latest nonnucleoside reverse transcriptase inhibitor (NNRTI) rilpivirine (RPV) is indicated for human immunodeficiency virus type-1 (HIV-1) patients initiating antiretroviral treatment, but the extent of genotypic RPV resistance in treatment-naive patients outside clinical trials is poorly defined. STUDY DESIGN/METHODS:This retrospective observational study of clinical data from Belgium and Portugal evaluates genotypic information from HIV-1 drug-naive patients obtained for the purpose of drug resistance testing. Rilpivirine resistance-associated mutations (RPV-RAMs) were defined based on clinical trials, phenotypic studies, and expert-based resistance algorithms. Viral susceptibility to RPV alone and to the single-tablet regimen was estimated using expert-based resistance algorithms. RESULTS:In 4,631 HIV-1 treatment-naive patients infected with diverse HIV-1 subtypes, major RPV-RAMs were detected in 4.6%, while complete viral susceptibility to RPV was estimated in 95% of patients. Subtype C- and F1-infected patients displayed the highest levels of reduced viral susceptibility at baseline, respectively 13.2% and 9.3%, mainly due to subtype- and geographic-dependent occurrence of RPV-RAMs E138A and A98G as natural polymorphisms. Strikingly, a founder effect in Portugal resulted in a 138A prevalence of 13.2% in local subtype C-infected treatment-naive patients. The presence of transmitted drug resistance did not impact our estimates. CONCLUSION/CONCLUSIONS:RPV is the first HIV-1 inhibitor for which, in the absence of transmitted drug resistance, intermediate or high-level genotypic resistance can be detected in treatment-naive patients. The extent of RPV susceptibility in treatment-naive patients differs depending on the HIV-1 subtype and dynamics of local compartmentalized epidemics. The highest prevalence of reduced susceptibility was found to be 15.7% in Portuguese subtype C-infected treatment-naive patients. In this context, even in the absence of transmitted HIV-1 drug resistance (TDR), drug resistance testing at baseline should be considered extremely important before starting treatment with this NNRTI.
PMCID:4845676
PMID: 26651266
ISSN: 1931-8405
CID: 5170112

Genealogical Working Distributions for Bayesian Model Testing with Phylogenetic Uncertainty

Baele, Guy; Lemey, Philippe; Suchard, Marc A
Marginal likelihood estimates to compare models using Bayes factors frequently accompany Bayesian phylogenetic inference. Approaches to estimate marginal likelihoods have garnered increased attention over the past decade. In particular, the introduction of path sampling (PS) and stepping-stone sampling (SS) into Bayesian phylogenetics has tremendously improved the accuracy of model selection. These sampling techniques are now used to evaluate complex evolutionary and population genetic models on empirical data sets, but considerable computational demands hamper their widespread adoption. Further, when very diffuse, but proper priors are specified for model parameters, numerical issues complicate the exploration of the priors, a necessary step in marginal likelihood estimation using PS or SS. To avoid such instabilities, generalized SS (GSS) has recently been proposed, introducing the concept of "working distributions" to facilitate--or shorten--the integration process that underlies marginal likelihood estimation. However, the need to fix the tree topology currently limits GSS in a coalescent-based framework. Here, we extend GSS by relaxing the fixed underlying tree topology assumption. To this purpose, we introduce a "working" distribution on the space of genealogies, which enables estimating marginal likelihoods while accommodating phylogenetic uncertainty. We propose two different "working" distributions that help GSS to outperform PS and SS in terms of accuracy when comparing demographic and evolutionary models applied to synthetic data and real-world examples. Further, we show that the use of very diffuse priors can lead to a considerable overestimation in marginal likelihood when using PS and SS, while still retrieving the correct marginal likelihood using both GSS approaches. The methods used in this article are available in BEAST, a powerful user-friendly software package to perform Bayesian evolutionary analyses.
PMCID:5009437
PMID: 26526428
ISSN: 1076-836x
CID: 5170102

Quantifying Next Generation Sequencing Sample Pre-Processing Bias in HIV-1 Complete Genome Sequencing

Vrancken, Bram; Trovão, Nídia Sequeira; Baele, Guy; van Wijngaerden, Eric; Vandamme, Anne-Mieke; van Laethem, Kristel; Lemey, Philippe
Genetic analyses play a central role in infectious disease research. Massively parallelized "mechanical cloning" and sequencing technologies were quickly adopted by HIV researchers in order to broaden the understanding of the clinical importance of minor drug-resistant variants. These efforts have, however, remained largely limited to small genomic regions. The growing need to monitor multiple genome regions for drug resistance testing, as well as the obvious benefit for studying evolutionary and epidemic processes makes complete genome sequencing an important goal in viral research. In addition, a major drawback for NGS applications to RNA viruses is the need for large quantities of input DNA. Here, we use a generic overlapping amplicon-based near full-genome amplification protocol to compare low-input enzymatic fragmentation (Nexteraâ„¢) with conventional mechanical shearing for Roche 454 sequencing. We find that the fragmentation method has only a modest impact on the characterization of the population composition and that for reliable results, the variation introduced at all steps of the procedure--from nucleic acid extraction to sequencing--should be taken into account, a finding that is also relevant for NGS technologies that are now more commonly used. Furthermore, by applying our protocol to deep sequence a number of pre-therapy plasma and PBMC samples, we illustrate the potential benefits of a near complete genome sequencing approach in routine genotyping.
PMCID:4728572
PMID: 26751471
ISSN: 1999-4915
CID: 5170122

Bayesian Inference Reveals Host-Specific Contributions to the Epidemic Expansion of Influenza A H5N1

Trovão, Nídia Sequeira; Suchard, Marc A; Baele, Guy; Gilbert, Marius; Lemey, Philippe
Since its first isolation in 1996 in Guangdong, China, the highly pathogenic avian influenza virus (HPAIV) H5N1 has circulated in avian hosts for almost two decades and spread to more than 60 countries worldwide. The role of different avian hosts and the domestic-wild bird interface has been critical in shaping the complex HPAIV H5N1 disease ecology, but remains difficult to ascertain. To shed light on the large-scale H5N1 transmission patterns and disentangle the contributions of different avian hosts on the tempo and mode of HPAIV H5N1 dispersal, we apply Bayesian evolutionary inference techniques to comprehensive sets of hemagglutinin and neuraminidase gene sequences sampled between 1996 and 2011 throughout Asia and Russia. Our analyses demonstrate that the large-scale H5N1 transmission dynamics are structured according to different avian flyways, and that the incursion of the Central Asian flyway specifically was driven by Anatidae hosts coinciding with rapid rate of spread and an epidemic wavefront acceleration. This also resulted in long-distance dispersal that is likely to be explained by wild bird migration. We identify a significant degree of asymmetry in the large-scale transmission dynamics between Anatidae and Phasianidae, with the latter largely representing poultry as an evolutionary sink. A joint analysis of host dynamics and continuous spatial diffusion demonstrates that the rate of viral dispersal and host diffusivity is significantly higher for Anatidae compared with Phasianidae. These findings complement risk modeling studies and satellite tracking of wild birds in demonstrating a continental-scale structuring into areas of H5N1 persistence that are connected through migratory waterfowl.
PMCID:4831561
PMID: 26341298
ISSN: 1537-1719
CID: 5170092