Matching Items (2)
Filtering by

Clear all filters

134524-Thumbnail Image.png
Description
With the rising data output and falling costs of Next Generation Sequencing technologies, research into data compression is crucial to maintaining storage efficiency and costs. High throughput sequencers such as the HiSeqX Ten can produce up to 1.8 terabases of data per run, and such large storage demands are even

With the rising data output and falling costs of Next Generation Sequencing technologies, research into data compression is crucial to maintaining storage efficiency and costs. High throughput sequencers such as the HiSeqX Ten can produce up to 1.8 terabases of data per run, and such large storage demands are even more important to consider for institutions that rely on their own servers rather than large data centers (cloud storage)1. Compression algorithms aim to reduce the amount of space taken up by large genomic datasets by encoding the most frequently occurring symbols with the shortest bit codewords and by changing the order of the data to make it easier to encode. Depending on the probability distribution of the symbols in the dataset or the structure of the data, choosing the wrong algorithm could result in a compressed file larger than the original or a poorly compressed file that results in a waste of time and space2. To test efficiency among compression algorithms for each file type, 37 open-source compression algorithms were used to compress six types of genomic datasets (FASTA, VCF, BCF, GFF, GTF, and SAM) and evaluated on compression speed, decompression speed, compression ratio, and file size using the benchmark test lzbench. Compressors that outpreformed the popular bioinformatics compressor Gzip (zlib -6) were evaluated against one another by ratio and speed for each file type and across the geometric means of all file types. Compressors that exhibited fast compression and decompression speeds were also evaluated by transmission time through variable speed internet pipes in scenarios where the file was compressed only once or compressed multiple times.
ContributorsHowell, Abigail (Author) / Cartwright, Reed (Thesis director) / Wilson Sayres, Melissa (Committee member) / Taylor, Jay (Committee member) / Barrett, The Honors College (Contributor)
Created2017-05
156606-Thumbnail Image.png
Description
Persistent cooperation between unrelated conspecifics rarely occurs in mature eusocial insect societies. In this dissertation, I present evidence of non-kin cooperation in the Nearctic honey ant Myrmecocystus mendax. Using microsatellite markers, I show that mature colonies in the Sierra Ancha Mountain of central Arizona contain multiple unrelated matrilines, an observation

Persistent cooperation between unrelated conspecifics rarely occurs in mature eusocial insect societies. In this dissertation, I present evidence of non-kin cooperation in the Nearctic honey ant Myrmecocystus mendax. Using microsatellite markers, I show that mature colonies in the Sierra Ancha Mountain of central Arizona contain multiple unrelated matrilines, an observation that is consistent with primary polygyny. In contrast, similar analyses suggest that colonies in the Chiricahua Mountains of southeastern Arizona are primarily monogynous. These interpretations are consistent with field and laboratory observations. Whereas cooperative colony founding was observed frequently among groups of Sierra Ancha foundresses, founding in the Chiricahua population was restricted to individual foundresses. Furthermore, Sierra Ancha foundresses successfully established incipient laboratory colonies without undergoing queen culling following emergence of the first workers. Multi-queen laboratory Sierra Ancha colonies also produced more workers and repletes than haplometrotic colonies, and when brood raiding was induced between colonies, queens of those with more workers had a higher survival probability.

Microsatellite analyses of additional locations within the M. mendax range suggest that polygyny is also present in some other populations, especially in central-northern Arizona, albeit at lower frequencies than that in the Sierra Anchas. In addition, analyses of multiple types of genetic data, including microsatellites, the mitochondrial barcoding region, and over 2000 nuclear ultra-conserved elements indicate that M. mendax populations within the southwestern U.S. and northwestern Mexico are geographically structured, with strong support for the existence of two or more divergent clades as well as isolation-by-distance within clades. This structure is further shown to correlate with variation in queen number and hair length, a diagnostic taxonomic feature used to distinguish honey ant species.

Together, these findings suggest that regional ecological pressures (e.g. colony density , climate) may have acted on colony founding and social strategy to select for increasing workforce size and, along with genetic drift, have driven geographically isolated M. mendax populations to differentiate genetically and morphologically. The presence of colony fusion in the laboratory and life history traits in honey ant that are influenced by colony size, including repletism, brood raiding, and tournament, support this evolutionary scenario.
ContributorsEriksson, Ti (Author) / Gadau, Jürgen (Thesis advisor) / Taylor, Jay (Thesis advisor) / Fewell, Jennifer (Committee member) / Hӧlldobler, Bert (Committee member) / Johnson, Robert (Committee member) / Pratt, Stephen (Committee member) / Arizona State University (Publisher)
Created2018