Structural Equation Modeling (SEM) is a multivariate analysis methodology that could be used to examine the barrier effect that river systems have on genetic differentiation. In this project, river systems are described by three variables (Daily Average Discharge, Average River Width, and Seasonality), and genetic differentiation, measured as Fst, is regressed on them. These data were collected from the USGS database (U.S. Geological Survey, 2020), sequencing files from various literature sources, or Google Earth measurements. Several structural equation models are used to represent different system structures and are compared with more traditional methodologies such as Generalized Linear Modeling and Generalized Linear Mixed Modeling. Ultimately, results were limited by the small sample size; however, interesting patterns still emerged from the models. The SE models indicate that Discharge plays a primary role in the genetic differentiation of adjacent river populations. In addition, the results demonstrate that quantifying indirect effects, particularly those relating to discharge, gives more informative interpretations than traditional multivariate statistics alone. These findings prompt further investigation into this potential methodology.
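To illustrate what quantifying an indirect effect means, the sketch below decomposes a total effect into a direct part and an indirect part that runs through a mediator, using ordinary least squares. It is not the model fitted in the project: the path structure (discharge acting on Fst both directly and through river width), the function names, and the synthetic data are all assumptions made for illustration.

```python
import random

def cov(u, v):
    """Sample covariance of two equal-length sequences."""
    n = len(u)
    mean_u, mean_v = sum(u) / n, sum(v) / n
    return sum((a - mean_u) * (b - mean_v) for a, b in zip(u, v)) / (n - 1)

def path_effects(x, m, y):
    """Split the effect of x on y into a direct part and a part mediated by m.

    Fits m ~ x and y ~ x + m by least squares (closed form from covariances).
    """
    s_xx, s_mm, s_xm = cov(x, x), cov(m, m), cov(x, m)
    s_xy, s_my = cov(x, y), cov(m, y)
    det = s_xx * s_mm - s_xm ** 2
    a = s_xm / s_xx                              # slope of m ~ x
    direct = (s_xy * s_mm - s_my * s_xm) / det   # coefficient of x in y ~ x + m
    b = (s_my * s_xx - s_xy * s_xm) / det        # coefficient of m in y ~ x + m
    total = s_xy / s_xx                          # slope of y ~ x
    return {"direct": direct, "indirect": a * b, "total": total}

# Synthetic, purely illustrative data (not the project's measurements).
rng = random.Random(0)
discharge = [rng.gauss(0, 1) for _ in range(200)]
width = [0.8 * d + rng.gauss(0, 0.5) for d in discharge]
fst = [0.3 * d + 0.5 * w + rng.gauss(0, 0.5) for d, w in zip(discharge, width)]

effects = path_effects(discharge, width, fst)
```

For linear least-squares fits the total effect equals the direct effect plus the indirect effect, so a single regression of Fst on discharge would report only the sum and hide how much of it is carried by the mediator.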
The project intersects the environmental humanities, critical theory, and cultural studies with the Desert Southwest. It explores the fullness of desert places with regard to cultures, borders, and languages, as well as nonhuman forces and intensities like heat, light, and distance. Dispelling the dominant notion of desert as void or wasteland, it sets a stage to suit the polyvocality of desert place. My work is interdisciplinary because the desert demands it. It begins with Cormac McCarthy’s Blood Meridian in order to reorient readers towards the rupture of the US war with Mexico, which helped set the national and cultural borders in effect today. I then explore Denis Villeneuve’s film Sicario to emphasize the correlation between political hierarchy and verticality; those who can experience the desert from above are exempt from the conditions below, where Urrea’s The Devil’s Highway and Gaspar de Alba’s Desert Blood take place. The novels expose the immanence and violence of being on the ground in the desert and at the lower end of said hierarchies. Analyzing Yuri Herrera’s Signs Preceding the End of the World and Mora’s Encantado enables what I term a desert hauntology to produce a desert full of memory, myth, ancestors, and enchantment. Finally, the project puts visual artists James Turrell and Rafa Esparza in conversation to discover a desert phenomenology. The result is an investigation of how far is too far when decentering the human, and of what role place-based art plays in creating and empowering community.
John Ford was from Maine. Georgia O’Keeffe, from Wisconsin. Edward Abbey, Pennsylvania. As someone born and raised in the Desert Southwest, I’ve written the project I have yet to encounter.
This can make mutation detection difficult; and while increasing sequencing depth can often help, sequence-specific errors and other non-random biases cannot be detected by increased depth. The problem of accurate genotyping is exacerbated when there is not a reference genome or other auxiliary information available.
I explore several methods for sensitively detecting mutations in non-model organisms using an example Eucalyptus melliodora individual. I use the structure of the tree to find bounds on its somatic mutation rate and evaluate several algorithms for variant calling. I find that conventional methods are suitable if the genome of a close relative can be adapted to the study organism. However, with structured data, a likelihood framework that is aware of this structure is more accurate. I use the techniques developed here to evaluate a reference-free variant calling algorithm.
I also use this data to evaluate a k-mer based base quality score recalibrator (KBBQ), a tool I developed to recalibrate base quality scores attached to sequencing data. Base quality scores can help detect errors in sequencing reads, but are often inaccurate. The most popular method for correcting this issue requires a known set of variant sites, which is unavailable in most cases. I simulate data and show that errors in this set of variant sites can cause calibration errors. I then show that KBBQ accurately recalibrates base quality scores while requiring no reference or other information and performs as well as other methods.
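As background on what calibration means here, base quality scores are conventionally Phred-scaled: a score Q corresponds to an error probability of 10^(-Q/10). The sketch below converts between the two and compares reported scores with the error rate actually observed at each score. It is only a calibration check under stated assumptions, not the KBBQ algorithm: the function names, the add-one smoothing, and the example counts are all hypothetical.

```python
import math
from collections import defaultdict

def phred_to_prob(q):
    """Error probability implied by a Phred-scaled quality score."""
    return 10 ** (-q / 10)

def prob_to_phred(p):
    """Phred-scaled quality score for an error probability p (0 < p <= 1)."""
    return -10 * math.log10(p)

def empirical_quality(observations):
    """Map each reported score to the quality implied by observed errors.

    observations: iterable of (reported_q, is_error) pairs, one per base.
    Add-one smoothing (an assumption of this sketch) avoids log10(0) when
    no errors were seen at a given reported score.
    """
    counts = defaultdict(lambda: [0, 0])  # reported_q -> [errors, bases]
    for reported_q, is_error in observations:
        counts[reported_q][0] += int(is_error)
        counts[reported_q][1] += 1
    return {
        q: prob_to_phred((errors + 1) / (bases + 2))
        for q, (errors, bases) in counts.items()
    }

# Hypothetical example: 998 bases reported as Q30, 10 of which are errors.
observations = [(30, True)] * 10 + [(30, False)] * 988
observed = empirical_quality(observations)
```

In this made-up example the bases reported as Q30 behave more like Q20, which is the kind of overconfidence a recalibrator is meant to correct.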
Finally, I use the Eucalyptus data to investigate the impact of quality score calibration on the quality of output variant calls and show that improved base quality score calibration increases the sensitivity and reduces the false positive rate of a variant calling algorithm.
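For readers unfamiliar with these two metrics, the sketch below computes them by comparing a set of called variant positions with a set of true positions. The function name, the use of bare positions as variant identifiers, and the example values are assumptions; the abstract does not say how the metrics were computed in the thesis.

```python
def call_metrics(called, truth, n_sites):
    """Sensitivity and false positive rate of a variant call set.

    called, truth: collections of variant positions (hypothetical identifiers).
    n_sites: total number of sites examined, needed to count true negatives.
    Assumes truth is non-empty and that at least one site is a true non-variant.
    """
    called, truth = set(called), set(truth)
    tp = len(called & truth)      # true variants that were called
    fp = len(called - truth)      # calls at sites with no true variant
    fn = len(truth - called)      # true variants that were missed
    tn = n_sites - tp - fp - fn   # non-variant sites correctly left uncalled
    return {
        "sensitivity": tp / (tp + fn),
        "false_positive_rate": fp / (fp + tn),
    }

# Hypothetical example over 1000 sites.
metrics = call_metrics(called={100, 250, 500},
                       truth={100, 250, 400, 900},
                       n_sites=1000)
```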
Agassiz’s desert tortoise (Gopherus agassizii) is a long-lived species native to the Mojave Desert and is listed as threatened under the US Endangered Species Act. To aid conservation efforts for preserving the genetic diversity of this species, we generated a whole genome reference sequence with an annotation based on deep transcriptome sequences of adult skeletal muscle, lung, brain, and blood. The draft genome assembly for G. agassizii has a scaffold N50 length of 252 kbp and a total length of 2.4 Gbp. Genome annotation reveals 20,172 protein-coding genes in the G. agassizii assembly, and that gene structure is more similar to chicken than other turtles. We provide a series of comparative analyses demonstrating (1) that turtles are among the slowest-evolving genome-enabled reptiles, (2) amino acid changes in genes controlling desert tortoise traits such as shell development, longevity and osmoregulation, and (3) fixed variants across the Gopherus species complex in genes related to desert adaptations, including circadian rhythm and innate immune response. This G. agassizii genome reference and annotation is the first such resource for any tortoise, and will serve as a foundation for future analysis of the genetic basis of adaptations to the desert environment, allow for investigation into genomic factors affecting tortoise health, disease and longevity, and serve as a valuable resource for additional studies in this species complex.
Data Availability: All genomic and transcriptomic sequence files are available from the NIH-NCBI BioProject database (accession numbers PRJNA352725, PRJNA352726, and PRJNA281763). All genome assembly, transcriptome assembly, predicted protein, transcript, genome annotation, repeatmasker, phylogenetic trees, .vcf and GO enrichment files are available on Harvard Dataverse (doi:10.7910/DVN/EH2S9K).
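The scaffold N50 quoted for the assembly is a standard contiguity statistic: the length L such that scaffolds of length L or longer contain at least half of the total assembled bases. A minimal sketch of that definition follows; the function name and the toy lengths are illustrative and are not taken from the G. agassizii assembly.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    Sorts lengths from longest to shortest and returns the length at which
    the running total first reaches half of the overall total.
    """
    if not lengths:
        raise ValueError("lengths must not be empty")
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length

# Toy example: total is 300, and 80 + 70 = 150 reaches half of it.
toy_n50 = n50([80, 70, 50, 40, 30, 20, 10])
```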