This growing collection consists of scholarly works authored by ASU-affiliated faculty, staff, and community members, and it contains many open access articles. ASU-affiliated authors are encouraged to Share Your Work in KEEP.

Description

MicroRNAs (miRNAs) are short non-coding RNAs that regulate gene output at the post-transcriptional level by targeting degenerate elements primarily in 3′untranslated regions (3′UTRs) of mRNAs. Individual miRNAs can regulate networks of hundreds of genes, yet for the majority of miRNAs few, if any, targets are known. Misexpression of miRNAs is also a major contributor to cancer progression; thus, there is a critical need to validate miRNA targets in high-throughput to understand miRNAs' contribution to tumorigenesis. Here we introduce a novel high-throughput assay to detect miRNA targets in 3′UTRs, called Luminescent Identification of Functional Elements in 3′UTRs (3′LIFE). We demonstrate the feasibility of 3′LIFE using a data set of 275 human 3′UTRs and two cancer-relevant miRNAs, let-7c and miR-10b, and compare our results to alternative methods to detect miRNA targets throughout the genome. We identify a large number of novel gene targets for these miRNAs, with only 32% of hits being bioinformatically predicted and 27% directed by non-canonical interactions. Functional analysis of target genes reveals consistent roles for each miRNA as either a tumor suppressor (let-7c) or oncogenic miRNA (miR-10b) and shows that both preferentially target multiple genes within regulatory networks, suggesting 3′LIFE is a rapid and sensitive method to detect miRNA targets in high-throughput.

Contributors: Wolter, Justin (Author) / Kotagama, Kasuen (Author) / Pierre-Bez, Alexandra C. (Author) / Firago, Mari (Author) / Mangone, Marco (Author) / College of Liberal Arts and Sciences (Contributor)
Created: 2014-09-29
Description

Adaptation requires genetic variation, but founder populations are generally genetically depleted. Here we sequence two populations of an inbred ant that diverge in phenotype to determine how variability is generated. Cardiocondyla obscurior has the smallest of the sequenced ant genomes and its structure suggests a fundamental role of transposable elements (TEs) in adaptive evolution. Accumulations of TEs (TE islands) comprising 7.18% of the genome evolve faster than other regions with regard to single-nucleotide variants, gene/exon duplications and deletions and gene homology. A non-random distribution of gene families, larvae/adult specific gene expression and signs of differential methylation in TE islands indicate intragenomic differences in regulation, evolutionary rates and coalescent effective population size. Our study reveals a tripartite interplay between TEs, life history and adaptation in an invasive species.

Contributors: Schrader, Lukas (Author) / Kim, Jay W. (Author) / Ence, Daniel (Author) / Zimin, Aleksey (Author) / Klein, Antonia (Author) / Wyschetzki, Katharina (Author) / Weichselgartner, Tobias (Author) / Kemena, Carsten (Author) / Stoekl, Johannes (Author) / Schultner, Eva (Author) / Wurm, Yannick (Author) / Smith, Christopher D. (Author) / Yandell, Mark (Author) / Heinze, Juergen (Author) / Gadau, Juergen (Author) / Oettler, Jan (Author) / College of Liberal Arts and Sciences (Contributor)
Created: 2014-12-01
Description

Background: Immunosignaturing is a new peptide microarray based technology for profiling of humoral immune responses. Despite new challenges, immunosignaturing gives us the opportunity to explore new and fundamentally different research questions. In addition to classifying samples based on disease status, the complex patterns and latent factors underlying immunosignatures, which we attempt to model, may have a diverse range of applications.

Methods: We investigate the utility of a number of statistical methods to determine model performance and address challenges inherent in analyzing immunosignatures. Some of these methods include exploratory and confirmatory factor analyses, classical significance testing, structural equation and mixture modeling.

Results: We demonstrate an ability to classify samples based on disease status and show that immunosignaturing is a very promising technology for screening and presymptomatic screening of disease. In addition, we are able to model complex patterns and latent factors underlying immunosignatures. These latent factors may serve as biomarkers for disease and may play a key role in a bioinformatic method for antibody discovery.

Conclusion: Based on this research, we lay out an analytic framework illustrating how immunosignatures may be useful as a general method for screening and presymptomatic screening of disease as well as antibody discovery.

Contributors: Brown, Justin (Author) / Stafford, Phillip (Author) / Johnston, Stephen (Author) / Dinu, Valentin (Author) / College of Health Solutions (Contributor)
Created: 2011-08-19
Description

Background: Microarray image analysis processes scanned digital images of hybridized arrays to produce the input spot-level data for downstream analysis, so it can have a potentially large impact on that and subsequent analyses. Signal saturation is an optical effect that occurs when some pixel values for highly expressed genes or peptides exceed the upper detection threshold of the scanner software (2^16 - 1 = 65,535 for 16-bit images). In practice, spots with a sizable number of saturated pixels are often flagged and discarded. Alternatively, the saturated values are used without adjustments for estimating spot intensities. The resulting expression data tend to be biased downwards and can distort high-level analysis that relies on these data. Hence, it is crucial to effectively correct for signal saturation.

Results: We developed a flexible mixture model-based segmentation and spot intensity estimation procedure that accounts for saturated pixels by incorporating a censored component in the mixture model. As demonstrated with biological data and simulation, our method extends the dynamic range of expression data beyond the saturation threshold and is effective in correcting saturation-induced bias when the lost information is not tremendous. We further illustrate the impact of image processing on downstream classification, showing that the proposed method can increase diagnostic accuracy using data from a lymphoma cancer diagnosis study.

Conclusions: The presented method adjusts for signal saturation at the segmentation stage that identifies a pixel as part of the foreground, background or other. The cluster membership of a pixel can be altered versus treating saturated values as truly observed. Thus, the resulting spot intensity estimates may be more accurate than those obtained from existing methods that correct for saturation based on already segmented data. As a model-based segmentation method, our procedure is able to identify inner holes, fuzzy edges and blank spots that are common in microarray images. The approach is independent of microarray platform and applicable to both single- and dual-channel microarrays.
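
The saturation threshold and the downward bias described in this abstract can be illustrated with a small sketch. It is not the authors' censored mixture model; it only shows, under assumed toy values, how clipping at the 16-bit ceiling lowers a naive spot-intensity mean. The function names and the pixel values are hypothetical.

```python
# Hypothetical illustration only: the pixel values are made up, not study data.
SATURATION_LEVEL = 2**16 - 1  # 65,535, the largest value a 16-bit pixel can hold


def clip_to_scanner_range(true_intensities, ceiling=SATURATION_LEVEL):
    """Mimic a scanner that cannot record values above the ceiling."""
    return [min(value, ceiling) for value in true_intensities]


def saturated_fraction(pixels, ceiling=SATURATION_LEVEL):
    """Fraction of pixels recorded at (or above) the saturation ceiling."""
    if not pixels:
        return 0.0
    return sum(1 for p in pixels if p >= ceiling) / len(pixels)


def mean(values):
    """Plain arithmetic mean, used here as a naive spot-intensity estimate."""
    return sum(values) / len(values)


# Assumed "true" intensities for one spot; two of them exceed the ceiling.
true_spot = [60000, 70000, 80000]
observed_spot = clip_to_scanner_range(true_spot)

print(observed_spot)                         # recorded (clipped) values
print(saturated_fraction(observed_spot))     # share of saturated pixels
print(mean(true_spot), mean(observed_spot))  # naive mean is biased downward
```

Using the clipped values directly, as in the naive mean above, corresponds to the "used without adjustments" practice the abstract mentions; a flag-and-discard rule would instead threshold on `saturated_fraction`.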

Contributors: Yang, Yan (Author) / Stafford, Phillip (Author) / Kim, YoonJoo (Author) / College of Liberal Arts and Sciences (Contributor)
Created: 2011-11-30
Description

Background: Lizards are evolutionarily the most closely related vertebrates to humans that can lose and regrow an entire appendage. Regeneration in lizards involves differential expression of hundreds of genes that regulate wound healing, musculoskeletal development, hormonal response, and embryonic morphogenesis. While microRNAs are able to regulate large groups of genes, their role in lizard regeneration has not been investigated.

Results: MicroRNA sequencing of green anole lizard (Anolis carolinensis) regenerating tail and associated tissues revealed 350 putative novel and 196 known microRNA precursors. Eleven microRNAs were differentially expressed between the regenerating tail tip and base during maximum outgrowth (25 days post autotomy), including miR-133a, miR-133b, and miR-206, which have been reported to regulate regeneration and stem cell proliferation in other model systems. Three putative novel differentially expressed microRNAs were identified in the regenerating tail tip.

Conclusions: Differentially expressed microRNAs were identified in the regenerating lizard tail, including known regulators of stem cell proliferation. The identification of 3 putative novel microRNAs suggests that regulatory networks, either conserved in vertebrates and previously uncharacterized or specific to lizards, are involved in regeneration. These findings suggest that differential regulation of microRNAs may play a role in coordinating the timing and expression of hundreds of genes involved in regeneration.

Contributors: Hutchins, Elizabeth (Author) / Eckalbar, Walter (Author) / Wolter, Justin (Author) / Mangone, Marco (Author) / Kusumi, Kenro (Author) / College of Liberal Arts and Sciences (Contributor)
Created: 2016-05-05
Description

Background: High-throughput technologies such as DNA, RNA, protein, antibody and peptide microarrays are often used to examine differences across drug treatments, diseases, transgenic animals, and others. Typically one trains a classification system by gathering large amounts of probe-level data and selecting informative features, and then classifies test samples using a small number of features. As new microarrays are invented, classification systems that worked well for other array types may not be ideal. Expression microarrays, arguably one of the most prevalent array types, have been used for years to help develop classification algorithms. Many biological assumptions are built into classifiers that were designed for these types of data. One of the more problematic is the assumption of independence, both at the probe level and again at the biological level. Probes for RNA transcripts are designed to bind single transcripts. At the biological level, many genes have dependencies across transcriptional pathways where co-regulation of transcriptional units may make many genes appear as being completely dependent. Thus, algorithms that perform well for gene expression data may not be suitable when other technologies with different binding characteristics exist. The immunosignaturing microarray is based on complex mixtures of antibodies binding to arrays of random sequence peptides. It relies on many-to-many binding of antibodies to the random sequence peptides. Each peptide can bind multiple antibodies and each antibody can bind multiple peptides. This technology has been shown to be highly reproducible and appears promising for diagnosing a variety of disease states. However, it is not clear which classification algorithm is optimal for analyzing this new type of data.

Results: We characterized several classification algorithms to analyze immunosignaturing data. We selected several datasets that range from easy to difficult to classify, from simple monoclonal binding to complex binding patterns in asthma patients. We then classified the biological samples using 17 different classification algorithms. Using a wide variety of assessment criteria, we found ‘Naïve Bayes’ far more useful than other widely used methods due to its simplicity, robustness, speed and accuracy.

Conclusions: The ‘Naïve Bayes’ algorithm appears to accommodate the complex patterns hidden within multilayered immunosignaturing microarray data due to its fundamental mathematical properties.
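
For readers unfamiliar with the classifier named in this abstract, the following is a minimal sketch of a Gaussian naive Bayes classifier. It is not the authors' implementation: the Gaussian likelihood, the function names, the variance floor, and the toy two-peptide intensities are all assumptions made for illustration. The point it shows is the independence assumption discussed above, where each feature contributes its own log-likelihood term to the class score.

```python
import math
from collections import defaultdict


def fit_gaussian_nb(samples, labels, var_floor=1e-9):
    """Estimate a log prior plus per-feature mean and variance for each class."""
    grouped = defaultdict(list)
    for row, label in zip(samples, labels):
        grouped[label].append(row)

    model = {}
    n_total = len(samples)
    for label, rows in grouped.items():
        n = len(rows)
        columns = list(zip(*rows))  # one tuple of values per feature
        means = [sum(col) / n for col in columns]
        # var_floor (an assumption) avoids division by zero for constant features.
        variances = [
            sum((v - m) ** 2 for v in col) / n + var_floor
            for col, m in zip(columns, means)
        ]
        model[label] = (math.log(n / n_total), means, variances)
    return model


def predict(model, row):
    """Return the class with the highest log posterior score."""
    def log_posterior(params):
        log_prior, means, variances = params
        # Independence assumption: per-feature log-likelihoods simply add up.
        log_likelihood = sum(
            -0.5 * math.log(2 * math.pi * s2) - (v - m) ** 2 / (2 * s2)
            for v, m, s2 in zip(row, means, variances)
        )
        return log_prior + log_likelihood

    return max(model, key=lambda label: log_posterior(model[label]))


# Hypothetical toy data: intensities of two peptides for six samples.
toy_samples = [
    [1.0, 5.0], [1.2, 4.8], [0.9, 5.1],  # labelled "control"
    [3.0, 2.0], [3.2, 1.9], [2.9, 2.2],  # labelled "disease"
]
toy_labels = ["control"] * 3 + ["disease"] * 3

toy_model = fit_gaussian_nb(toy_samples, toy_labels)
print(predict(toy_model, [1.1, 5.0]))
print(predict(toy_model, [3.1, 2.0]))
```

Because every feature is scored separately, training cost grows linearly with the number of peptides, which is one plausible reading of the speed and simplicity the abstract reports.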

Contributors: Kukreja, Muskan (Author) / Johnston, Stephen (Author) / Stafford, Phillip (Author) / Biodesign Institute (Contributor)
Created: 2012-06-21
Description

Background: Tissue-specific RNA plasticity broadly impacts the development, tissue identity and adaptability of all organisms, but changes in composition, expression levels and its impact on gene regulation in different somatic tissues are largely unknown. Here we developed a new method, polyA-tagging and sequencing (PAT-Seq) to isolate high-quality tissue-specific mRNA from Caenorhabditis elegans intestine, pharynx and body muscle tissues and study changes in their tissue-specific transcriptomes and 3’UTRomes.

Results: We have identified thousands of novel genes and isoforms differentially expressed between these three tissues. The intestine transcriptome is expansive, expressing over 30% of C. elegans mRNAs, while muscle transcriptomes are smaller but contain characteristic unique gene signatures. Active promoter regions in all three tissues reveal both known and novel enriched tissue-specific elements, along with putative transcription factors, suggesting novel tissue-specific modes of transcription initiation. We have precisely mapped approximately 20,000 tissue-specific polyadenylation sites and discovered that about 30% of transcripts in somatic cells use alternative polyadenylation in a tissue-specific manner, with their 3’UTR isoforms significantly enriched with microRNA targets.

Conclusions: For the first time, PAT-Seq allowed us to directly study tissue specific gene expression changes in an in vivo setting and compare these changes between three somatic tissues from the same organism at single-base resolution within the same experiment. We pinpoint precise tissue-specific transcriptome rearrangements and for the first time link tissue-specific alternative polyadenylation to miRNA regulation, suggesting novel and unexplored tissue-specific post-transcriptional regulatory networks in somatic cells.

Contributors: Blazie, Stephen (Author) / Babb, Cody (Author) / Wilky, Henry (Author) / Rawls, Alan (Author) / Park, Jin (Author) / Mangone, Marco (Author) / College of Liberal Arts and Sciences (Contributor)
Created: 2015-01-20
Description

Background: 3′untranslated regions (3′UTRs) are poorly understood portions of eukaryotic mRNAs essential for post-transcriptional gene regulation. Sequence elements in 3′UTRs can be target sites for regulatory molecules such as RNA binding proteins and microRNAs (miRNAs), and these interactions can exert significant control on gene networks. However, many such interactions remain uncharacterized due to a lack of high-throughput (HT) tools to study 3′UTR biology. HT cloning efforts such as the human ORFeome exemplify the potential benefits of genomic repositories for studying human disease, especially in relation to the discovery of biomarkers and targets for therapeutic agents. Currently there are no publicly available human 3′UTR libraries. To address this we have prepared the first version of the human 3′UTRome (h3′UTRome v1) library. The h3′UTRome is produced to a single high quality standard using the same recombinational cloning technology used for the human ORFeome, enabling universal operating methods and high throughput experimentation. The library is thoroughly sequenced and annotated with simple online access to information, and made publicly available through gene repositories at low cost to all scientists with minimal restriction.

Results: The first release of the h3′UTRome library comprises 1,461 human 3′UTRs cloned into Gateway® entry vectors, ready for downstream analyses. It contains 3′UTRs for 985 transcription factors, 156 kinases, 171 RNA binding proteins, and 186 other genes involved in gene regulation and in disease. We demonstrate the feasibility of the h3′UTRome library by screening a panel of 87 3′UTRs for targeting by two miRNAs: let-7c, which is implicated in tumorigenesis, and miR-221, which is implicated in atherosclerosis and heart disease. The panel is enriched with genes involved in the RAS signaling pathway, putative novel targets for the two miRNAs, as well as genes implicated in tumorigenesis and heart disease.

Conclusions: The h3′UTRome v1 library is a modular resource that can be utilized for high-throughput screens to identify regulatory interactions between trans-acting factors and 3′UTRs. Importantly, the library can be customized based on the specifications of the researcher, allowing the systematic study of human 3′UTR biology.

Contributors: Kotagama, Kasuen (Author) / Babb, Cody (Author) / Wolter, Justin (Author) / Murphy, Ronan P. (Author) / Mangone, Marco (Author) / College of Liberal Arts and Sciences (Contributor)
Created: 2015-12-09
Description

We present a phylogeographic study of at least six reproductively isolated lineages of new world harvester ants within the Pogonomyrmex barbatus and P. rugosus species group. The genetic and geographic relationships within this clade are complex: Four of the identified lineages show genetic caste determination (GCD) and are divided into two pairs. Each pair has evolved under a mutualistic system that necessitates sympatry. These paired lineages are dependent upon one another because their GCD requires interlineage matings for the production of F1 hybrid workers, and intralineage matings are required to produce queens. This GCD system maintains genetic isolation among these interdependent lineages, while simultaneously requiring co-expansion and emigration as their distributions have changed over time. It has also been demonstrated that three of these four GCD lineages have undergone historical hybridization, but the narrower sampling range of previous studies has left questions on the hybrid parentage, breadth, and age of these groups. Thus, reconstructing the phylogenetic and geographic history of this group allows us to evaluate past insights and hypotheses and to plan future inquiries in a more complete historical biogeographic context. Using mitochondrial DNA sequences sampled across most of the morphospecies’ ranges in the U.S.A. and Mexico, we conducted a detailed phylogeographic study. Remarkably, our results indicate that one of the GCD lineage pairs has experienced a dramatic range expansion, despite the genetic load and fitness costs of the GCD system. Our analyses also reveal a complex pattern of vicariance and dispersal in Pogonomyrmex harvester ants that is largely concordant with models of late Miocene, Pliocene, and Pleistocene range shifts among various arid-adapted taxa in North America.

Contributors: Mott, Brendon (Author) / Gadau, Juergen (Author) / Anderson, Kirk E. (Author) / College of Liberal Arts and Sciences (Contributor)
Created: 2015-07-01
Description

Desaturase genes are essential for biological processes, including lipid metabolism, cell signaling, and membrane fluidity regulation. Insect desaturases are particularly interesting for their role in chemical communication, and potential contribution to speciation, symbioses, and sociality. Here, we describe the acyl-CoA desaturase gene families of 15 insects, with a focus on social Hymenoptera. Phylogenetic reconstruction revealed that the insect desaturases represent an ancient gene family characterized by eight subfamilies that differ strongly in their degree of conservation and frequency of gene gain and loss. Analyses of genomic organization showed that five of these subfamilies are represented in a highly microsyntenic region conserved across holometabolous insect taxa, indicating an ancestral expansion during early insect evolution. In three subfamilies, ants exhibit particularly large expansions of genes. Despite these expansions, however, selection analyses showed that desaturase genes in all insect lineages are predominantly undergoing strong purifying selection. Finally, for three expanded subfamilies, we show that ants exhibit variation in gene expression between species, and more importantly, between sexes and castes within species. This suggests functional differentiation of these genes and a role in the regulation of reproductive division of labor in ants. The dynamic pattern of gene gain and loss of acyl-CoA desaturases in ants may reflect changes in response to ecological diversification and an increased demand for chemical signal variability. This may provide an example of how gene family expansions can contribute to lineage-specific adaptations through structural and regulatory changes acting in concert to produce new adaptive phenotypes.

Contributors: Helmkampf, Martin (Author) / Cash, Elizabeth (Author) / Gadau, Juergen (Author) / College of Liberal Arts and Sciences (Contributor)
Created: 2015-02-01