Search Content

HBS1L-MYB loci involvement in Fetal Hemoglobin Expression

Description

This project studies two single nucleotide polymorphisms (SNPs) within the HBS1L-MYB loci. Both SNPs are associated with a heightened expression of fetal hemoglobin. DNA samples of NCAA athletes who have sickle cell trait were genotyped to find the allele frequency of each SNP. When comparing all populations using information provided…

This project studies two single nucleotide polymorphisms (SNPs) within the HBS1L-MYB loci. Both SNPs are associated with a heightened expression of fetal hemoglobin. DNA samples of NCAA athletes who have sickle cell trait were genotyped to find the allele frequency of each SNP. When comparing all populations using information provided from the Human Genome Project on Ensembl, the minor A allele has a frequency of 22% and the major, G, allele has a frequency of 78%. The frequency distribution of the minor allele in the population data was higher than the frequency obtained from the sampled data by 15%. This means that the samples, which are heterozygous for sickle cell, display a lower frequency for the mutation than the global population.

ContributorsCiambella, Michelle Lynn (Author) / Stone, Anne (Thesis director) / Foy, Joseph (Committee member) / Madrigal, Lorena (Committee member) / Barrett, The Honors College (Contributor) / School of Life Sciences (Contributor)

Created2014-05

Novel DNA Extraction Methods for Mollusks and the History and Significance of Bermuda Land Snails

Description

Bermuda Land Snails make up a genus called Poecilozonites that is endemic to Bermuda and is extensively present in its fossil record. These snails were also integral to the creation of the theory of punctuated equilibrium. The DNA of mollusks is difficult to sequence because of a class of proteins…

Bermuda Land Snails make up a genus called Poecilozonites that is endemic to Bermuda and is extensively present in its fossil record. These snails were also integral to the creation of the theory of punctuated equilibrium. The DNA of mollusks is difficult to sequence because of a class of proteins called mucopolysaccharides that are present in high concentrations in mollusk tissue, and are not removed with standard DNA extraction methods. They inhibit Polymerase Chain Reactions (PCRs) and interfere with Next Generation Sequencing methods. This paper will discuss the DNA extraction methods that were designed to remove the inhibitory proteins that were tested on another gastropod species (Pomacea canaliculata). These were chosen because they are invasive and while they are not pulmonates, they are similar enough to Bermuda Land Snails to reliably test extraction methods. The methods that were tested included two commercially available kits: the Qiagen Blood and Tissue Kit and the Omega Biotek Mollusc Extraction Kit, and one Hexadecyltrimethylammonium Bromide (CTAB) Extraction method that was modified for use on mollusk tissue. The Blood and Tissue kit produced some DNA, the mollusk kit produced almost none, and the CTAB Extraction Method produced the highest concentrations on average, and may prove to be the most viable option for future extractions. PCRs attempted with the extracted DNA have all failed, though it is likely due to an issue with reagents. Further spectrographic analysis of the DNA from the test extractions has shown that they were successful at removing mucopolysaccharides. When the protocol is optimized, it will be used to extract DNA from the tissue from six individuals from each of the two extant species of Bermuda Land Snails. This DNA will be used in several experiments involving Next Generation Sequencing, with the goal of assembling a variety of genome data. These data will then be used to a construct reference genome for Bermuda Land Snails. The genomes generated by this project will be used in population genetic analyses between individuals of the same species, and between individuals of different species. These analyses will then be used to aid in conservation efforts for the species.

ContributorsClark, Patrick Louis (Author) / Stone, Anne (Thesis director) / Winingear, Stevie (Committee member) / School of Life Sciences (Contributor, Contributor) / School of Human Evolution & Social Change (Contributor) / Barrett, The Honors College (Contributor)

Created2021-05

A composite genome approach to identify phylogenetically informative data from next-generation sequencing

Description

Background
Improvements in sequencing technology now allow easy acquisition of large datasets; however, analyzing these data for phylogenetics can be challenging. We have developed a novel method to rapidly obtain homologous genomic data for phylogenetics directly from next-generation sequencing reads without the use of a reference genome. This software, called SISRS,…

Background
Improvements in sequencing technology now allow easy acquisition of large datasets; however, analyzing these data for phylogenetics can be challenging. We have developed a novel method to rapidly obtain homologous genomic data for phylogenetics directly from next-generation sequencing reads without the use of a reference genome. This software, called SISRS, avoids the time consuming steps of de novo whole genome assembly, multiple genome alignment, and annotation.
Results
For simulations SISRS is able to identify large numbers of loci containing variable sites with phylogenetic signal. For genomic data from apes, SISRS identified thousands of variable sites, from which we produced an accurate phylogeny. Finally, we used SISRS to identify phylogenetic markers that we used to estimate the phylogeny of placental mammals. We recovered eight phylogenies that resolved the basal relationships among mammals using datasets with different levels of missing data. The three alternate resolutions of the basal relationships are consistent with the major hypotheses for the relationships among mammals, all of which have been supported previously by different molecular datasets.
Conclusions
SISRS has the potential to transform phylogenetic research. This method eliminates the need for expensive marker development in many studies by using whole genome shotgun sequence data directly. SISRS is open source and freely available at https://github.com/rachelss/SISRS/releases.

ContributorsSchwartz, Rachel (Author) / Harkins, Kelly (Author) / Stone, Anne (Author) / Cartwright, Reed (Author) / Biodesign Institute (Contributor) / Center for Evolution and Medicine (Contributor) / College of Liberal Arts and Sciences (Contributor) / School of Human Evolution and Social Change (Contributor) / School of Life Sciences (Contributor)

Created2015-06-11

Natural and Anthropogenic Hybridization in Two Species of Eastern Brazilian Marmosets (Callithrix jacchus and C. penicillata)

Description

Animal hybridization is well documented, but evolutionary outcomes and conservation priorities often differ for natural and anthropogenic hybrids. Among primates, an order with many endangered species, the two contexts can be hard to disentangle from one another, which carries important conservation implications. Callithrix marmosets give us a unique glimpse of…

Animal hybridization is well documented, but evolutionary outcomes and conservation priorities often differ for natural and anthropogenic hybrids. Among primates, an order with many endangered species, the two contexts can be hard to disentangle from one another, which carries important conservation implications. Callithrix marmosets give us a unique glimpse of genetic hybridization effects under distinct natural and human-induced contexts. Here, we use a 44 autosomal microsatellite marker panel to examine genome-wide admixture levels and introgression at a natural C. jacchus and C. penicillata species border along the Sao Francisco River in NE Brazil and in an area of Rio de Janeiro state where humans introduced these species exotically. Additionally, we describe for the first time autosomal genetic diversity in wild C. penicillata and expand previous C. jacchus genetic data. We characterize admixture within the natural zone as bimodal where hybrid ancestry is biased toward one parental species or the other. We also show evidence that Sao Francisco River islands are gateways for bidirectional gene flow across the species border. In the anthropogenic zone, marmosets essentially form a hybrid swarm with intermediate levels of admixture, likely from the absence of strong physical barriers to interspecific breeding. Our data show that while hybridization can occur naturally, the presence of physical, even if leaky, barriers to hybridization is important for maintaining species genetic integrity. Thus, we suggest further study of hybridization under different contexts to set well informed conservation guidelines for hybrid populations that often fit somewhere between "natural" and "man-made."

ContributorsMalukiewicz, Joanna (Author) / Boere, Vanner (Author) / Fuzessy, Lisieux F. (Author) / Grativol, Adriana D. (Author) / de Oliveira e Silva, Ita (Author) / Pereira, Luiz C. M. (Author) / Ruiz-Miranda, Carlos R. (Author) / Valenca, Yuri M. (Author) / Stone, Anne (Author) / College of Liberal Arts and Sciences (Contributor) / School of Human Evolution and Social Change (Contributor)

Created2015-06-10

Validation of qPCR Methods for the Detection of Mycobacterium in New World Animal Reservoirs

Description

Zoonotic pathogens that cause leprosy (Mycobacterium leprae) and tuberculosis (Mycobacterium tuberculosis complex, MTBC) continue to impact modern human populations. Therefore, methods able to survey mycobacterial infection in potential animal hosts are necessary for proper evaluation of human exposure threats. Here we tested for mycobacterial-specific single- and multi-copy loci using qPCR.…

Zoonotic pathogens that cause leprosy (Mycobacterium leprae) and tuberculosis (Mycobacterium tuberculosis complex, MTBC) continue to impact modern human populations. Therefore, methods able to survey mycobacterial infection in potential animal hosts are necessary for proper evaluation of human exposure threats. Here we tested for mycobacterial-specific single- and multi-copy loci using qPCR. In a trial study in which armadillos were artificially infected with M. leprae, these techniques were specific and sensitive to pathogen detection, while more traditional ELISAs were only specific. These assays were then employed in a case study to detect M. leprae as well as MTBC in wild marmosets. All marmosets were negative for M. leprae DNA, but 14 were positive for the mycobacterial rpoB gene assay. Targeted capture and sequencing of rpoB and other MTBC genes validated the presence of mycobacterial DNA in these samples and revealed that qPCR is useful for identifying mycobacterial-infected animal hosts.

ContributorsHousman, Genevieve (Author) / Malukiewicz, Joanna (Author) / Boere, Vanner (Author) / Grativol, Adriana D. (Author) / Pereira, Luiz Cezar M. (Author) / de Oliveira e Silva, Ita (Author) / Ruiz-Miranda, Carlos R. (Author) / Truman, Richard (Author) / Stone, Anne (Author) / College of Liberal Arts and Sciences (Contributor) / School of Human Evolution and Social Change (Contributor) / Biodesign Institute (Contributor) / Personalized Diagnostics (Contributor)

Created2015-11-16

A Curation of the Callithrix penicillata Draft Genome

Description

Callithrix penicillata, also known as the Black-tufted marmoset primarily lives in the Brazilian highlands and has had little research conducted on it. For this project I performed a genome curation on the newly assembled genome of this species. The scaffolds obtained by the Dovetail Genomics reads were organized and labeled…

Callithrix penicillata, also known as the Black-tufted marmoset primarily lives in the Brazilian highlands and has had little research conducted on it. For this project I performed a genome curation on the newly assembled genome of this species. The scaffolds obtained by the Dovetail Genomics reads were organized and labeled into chromosomes using the 2014 Callithrix jacchus genome as a reference. Then, using that same genome as a reference, 13 of the chromosomes were reverse complimented to be continuous with the 2014 Callithrix jacchus genome. The N50 statistics of the assembly were calculated and found to be 124 Mb. Quality scores were run for the final genome using referee and visualized with a bar plot, with 99% of sites scoring above 0. Heterozygosity was also calculated and found to be 0.3%. Finally, the final version of the genome was visually compared to the 2017 Callithrix jacchus genome and the GRCh38 human genome. This genome was submitted to the NCBIs database to await further approval.

ContributorsJohnson, Joelle Genevieve (Author) / Cartwright, Reed (Thesis director) / Stone, Anne (Committee member) / School of Molecular Sciences (Contributor) / Barrett, The Honors College (Contributor)

Created2019-12

Evaluating variant calling best practices

Description

Analyzing human DNA sequence data allows researchers to identify variants associated with disease, reconstruct the demographic histories of human populations, and further understand the structure and function of the genome. Identifying variants in whole genome sequences is a crucial bioinformatics step in sequence data processing and can be performed using…

Analyzing human DNA sequence data allows researchers to identify variants associated with disease, reconstruct the demographic histories of human populations, and further understand the structure and function of the genome. Identifying variants in whole genome sequences is a crucial bioinformatics step in sequence data processing and can be performed using multiple approaches. To investigate the consistency between different bioinformatics methods, we compared the accuracy and sensitivity of two genotyping strategies, joint variant calling and single-sample variant calling. Autosomal and sex chromosome variant call sets were produced by joint and single-sample calling variants for 10 female individuals. The accuracy of variant calls was assessed using SNP array genotype data collected from each individual. To compare the ability of joint and single-sample calling to capture low-frequency variants, folded site frequency spectra were constructed from variant call sets. To investigate the potential for these different variant calling methods to impact downstream analyses, we estimated nucleotide diversity for call sets produced using each approach. We found that while both methods were equally accurate when validated by SNP array sites, single-sample calling identified a greater number of singletons. However, estimates of nucleotide diversity were robust to these differences in the site frequency spectrum between call sets. Our results suggest that despite single-sample calling’s greater sensitivity for low-frequency variants, the differences between approaches have a minimal effect on downstream analyses. While joint calling may be a more efficient approach for genotyping many samples, in situations that preclude large sample sizes, our study suggests that single-sample calling is a suitable alternative.

ContributorsHowell, Emma (Co-author) / Wilson, Melissa (Thesis director) / Stone, Anne (Committee member) / Phung, Tanya (Committee member) / School of Life Sciences (Contributor) / Barrett, The Honors College (Contributor)

Created2020-05

Rapid evolution of BRCA1 and BRCA2in humans and other primates

Description

Background
The maintenance of chromosomal integrity is an essential task of every living organism and cellular repair mechanisms exist to guard against insults to DNA. Given the importance of this process, it is expected that DNA repair proteins would be evolutionarily conserved, exhibiting very minimal sequence change over time. However, BRCA1,…

Background
The maintenance of chromosomal integrity is an essential task of every living organism and cellular repair mechanisms exist to guard against insults to DNA. Given the importance of this process, it is expected that DNA repair proteins would be evolutionarily conserved, exhibiting very minimal sequence change over time. However, BRCA1, an essential gene involved in DNA repair, has been reported to be evolving rapidly despite the fact that many protein-altering mutations within this gene convey a significantly elevated risk for breast and ovarian cancers.
Results
To obtain a deeper understanding of the evolutionary trajectory of BRCA1, we analyzed complete BRCA1 gene sequences from 23 primate species. We show that specific amino acid sites have experienced repeated selection for amino acid replacement over primate evolution. This selection has been focused specifically on humans and our closest living relatives, chimpanzees (Pan troglodytes) and bonobos (Pan paniscus). After examining BRCA1 polymorphisms in 7 bonobo, 44 chimpanzee, and 44 rhesus macaque (Macaca mulatta) individuals, we find considerable variation within each of these species and evidence for recent selection in chimpanzee populations. Finally, we also sequenced and analyzed BRCA2 from 24 primate species and find that this gene has also evolved under positive selection.
Conclusions
While mutations leading to truncated forms of BRCA1 are clearly linked to cancer phenotypes in humans, there is also an underlying selective pressure in favor of amino acid-altering substitutions in this gene. A hypothesis where viruses are the drivers of this natural selection is discussed.

ContributorsLou, Dianne I. (Author) / McBee, Ross M. (Author) / Le, Uyen Q. (Author) / Stone, Anne (Author) / Wilkerson, Gregory K. (Author) / Demogines, Ann M. (Author) / Sawyer, Sara L. (Author) / College of Liberal Arts and Sciences (Contributor) / School of Human Evolution and Social Change (Contributor) / School of Life Sciences (Contributor)

Created2014-07-11

Using Ancient DNA Methods to Examine Dire Wolf Population History

Description

Dire wolves have recently risen to fame as a result of the popular television program Game of Thrones, and thus many viewers know dire wolves as the sigil and loyal companions of the Stark house. Far fewer recognize dire wolves by their scientific name, Canis dirus, or understand the population…

Dire wolves have recently risen to fame as a result of the popular television program Game of Thrones, and thus many viewers know dire wolves as the sigil and loyal companions of the Stark house. Far fewer recognize dire wolves by their scientific name, Canis dirus, or understand the population history of this ‘fearsome wolf’ species that roamed the Americas until the megafaunal mass extinction event of the Late Pleistocene. Although numerous studies have examined the species using morphological and geographical methods, thus far their results have been either inconclusive or contradictory. Remaining questions include the relationships dire wolves share with other members of the Canis genus and the internal structure of their populations. Advancements in ancient DNA recovery methods may make it possible to study dire wolf specimens at the molecular level for the first time and may therefore prove useful in clarifying the answers to these questions. Eighteen dire wolf specimens were collected from across the United States and subjected to ancient DNA extraction, library preparation, amplification and purification, bait preparation and capture, and next-generation sequencing. There was an average of 76.9 unique reads and 5.73% coverage when mapped to the Canis familiaris reference genome in ultraconserved regions of the mitochondrial genome. The results indicate that endogenous ancient DNA was not successfully recovered and perhaps ancient DNA recovery methods have not advanced to the point of retrieving informative amounts of DNA from particularly old, thermally degraded specimens. Nevertheless, the ever-changing nature of ancient DNA research makes it vital to continually test the limitations of the field and suggests that ancient DNA recovery methods will prove useful in illuminating dire wolf population history at some point in the future.

ContributorsSkerry, Katherine Marie (Author) / Stone, Anne (Thesis director) / Amdam, Gro (Committee member) / Larson, Greger (Committee member) / School of Human Evolution and Social Change (Contributor) / School of Nutrition and Health Promotion (Contributor) / School of Life Sciences (Contributor) / Barrett, The Honors College (Contributor)

Created2018-05

Standard mapping protocols misestimate sex-biased gene expression

Description

There are several challenges to accurately inferring levels of transcription using RNA-sequencing (RNA-seq) data, including detecting and correcting for reference genome mapping bias. One potential confounder of RNA-seq analysis results from the application of a standardized pipeline to samples of different sexes in species with chromosomal sex determination. The homology…

There are several challenges to accurately inferring levels of transcription using RNA-sequencing (RNA-seq) data, including detecting and correcting for reference genome mapping bias. One potential confounder of RNA-seq analysis results from the application of a standardized pipeline to samples of different sexes in species with chromosomal sex determination. The homology between the human X and Y chromosomes will routinely cause mismapping to occur, artificially biasing estimates of sex-biased gene transcription. For this reason we tested sex-specific mapping scenarios in humans on RNA-seq samples from the brains of 5 genetic females and 5 genetic males to assess how inferences of differential gene expression patterns change depending on the reference genome. We first applied a mapping protocol where we mapped all individuals to the entire human reference genome (complete), including the X and Y chromosomes, and computed differential expression between the set of genetic male and genetic female samples. We next mapped the genetic female samples (46,XX) to the human reference genome with the Y chromosome removed (Y-excluded) and the genetic male samples (46, XY) to the human reference genome (including the Y chromosome), but with the pseudoautosomal regions of the Y chromosome hard-masked (YPARs-masked) for the two sex-specific mappings. Using the complete and sex-specific mapping protocols, we compared the differential expression measurements of genetic males and genetic females from cuffDiff outputs. The second strategy called 33 additional genes as being differentially expressed between the two sexes when compared to the complete mapping protocol. This research provides a framework for a new standard of reference genome mappings to correct for sex-biased gene expression estimates that can be used in future studies.

ContributorsBrotman, Sarah Marie (Author) / Wilson Sayres, Melissa (Thesis director) / Crook, Sharon (Committee member) / Webster, Timothy (Committee member) / School of Life Sciences (Contributor) / School of Mathematical and Natural Sciences (Contributor) / Barrett, The Honors College (Contributor)

Created2017-05

Filtering by