Publication Date

2014

Document Type

Dissertation/Thesis

First Advisor

Johns, Mitrick A.

Degree Name

Ph.D. (Doctor of Philosophy)

Legacy Department

Department of Biological Sciences

LCSH

Bacillus (Bacteria)--Genetics; Genomics--Methods; Bioinformatics; Computer science

Abstract

A comprehensive study into the phylogeny of the Bacillus core genome was accomplished using two primary studies. The first study involved two previously studied methods, MUMmer and BSR, and one novel approach, RINC, to compare core genome phylogeny. BSR analysis revealed genomic rearrangements among the genomes of interest. MUMmer was used to confirm these genomic rearrangements and provide evidence that 76.2% of the inverted regions were statistically significant. Circular chromosome comparisons connecting homologous core genes revealed an ancestral inversion pivoted on the terminus in several species. The inversions were resolved, allowing for the identification of the location of the core genome before the inversion event. These analyses led to development of a novel approach to core genome phylogeny, RINC. Based on the tree produced, as well as agreement of groups of nodes with previous studies, the RINC approach was successful at comparing the core genome of Bacillus. RINC offers a simplistic yet powerful tool for core genome phylogeny.||In the second study, core genomes of Bacillus and Eudicot were determined individually from whole genome sequences of members of each genus, along with closely related species using BSRs. The Eudicot core genome was used as a control for evidence that results observed in the Bacillus core genomes were not due to horizontal gene transfer. Each core genome was analyzed with MrBayes and BUCKy to test the robustness of both approaches using sequences from different domains of life. Three multiple sequence alignments, ClustalW2, MUSCLE, and T-COFFEE, were used as inputs for MrBayes. Over 75% of all genes studied had a resolved gene tree topology after one of ten million generations of MrBayes, regardless of MSA. BUCKy calculated concordance trees for both core genomes. BUCKy tree topology was not affected by the MSAs. In both core genomes, high concordance factors were found on interior nodes, defining the relationship between groups of species. Outer nodes between outgroups had lower concordance factors, leading to the conclusion that genomes of more species would need to be included in order to resolve the phylogeny from more distant relatives.

Comments

Includes supplementary digital materials.||Advisors: Mitrick A. Johns.||Committee members: Melvin R. Duvall; Gordon D. Pusch; Thomas L. Sims; Wesley Swingley.

Extent

215 pages

Language

eng

Publisher

Northern Illinois University

Rights Statement

In Copyright

Rights Statement 2

NIU theses are protected by copyright. They may be viewed from Huskie Commons for any purpose, but reproduction or distribution in any format is prohibited without the written permission of the authors.

Media Type

Text

Share

COinS