Filtering by
- Language: English
![155356-Thumbnail Image.png](https://d1rbsgppyrdqq4.cloudfront.net/s3fs-public/styles/width_400/public/2021-08/155356-Thumbnail%20Image.png?versionId=5BmzscJee4NzjzPHHrvlo15mcbHt96or&X-Amz-Content-Sha256=UNSIGNED-PAYLOAD&X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Credential=AKIASBVQ3ZQ42ZLA5CUJ/20240616/us-west-2/s3/aws4_request&X-Amz-Date=20240616T030217Z&X-Amz-SignedHeaders=host&X-Amz-Expires=120&X-Amz-Signature=b8e21225664bc392b1d5a64756aea8610dcc4a9c97dc5896b805852a40de4be6&itok=3Qyt-4NM)
Almost every step during analysis and quantification requires the use of an often empirically determined threshold, which makes quantification of noise less accurate. In addition, each research group often develops their own data analysis pipeline making it impossible to compare data from different groups. To remedy this problem a streamlined and standardized scRNA-seq data analysis and normalization protocol was designed and developed. After analyzing multiple experiments we identified the possible pipeline stages, and tools needed. Our pipeline is capable of handling data with adapters and barcodes, which was not the case with pipelines from some experiments. Our pipeline can be used to analyze single experiment scRNA-seq data and also to compare scRNA-seq data across experiments. Various processes like data gathering, file conversion, and data merging were automated in the pipeline. The main focus was to standardize and normalize single-cell RNA-seq data to minimize technical noise introduced by disparate platforms.
![155320-Thumbnail Image.png](https://d1rbsgppyrdqq4.cloudfront.net/s3fs-public/styles/width_400/public/2021-08/155320-Thumbnail%20Image.png?versionId=t3F9fCVuhM07xtWc2d0Japo2FyzLRD8g&X-Amz-Content-Sha256=UNSIGNED-PAYLOAD&X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Credential=AKIASBVQ3ZQ42ZLA5CUJ/20240616/us-west-2/s3/aws4_request&X-Amz-Date=20240616T030217Z&X-Amz-SignedHeaders=host&X-Amz-Expires=120&X-Amz-Signature=8371c8b45d2f0514f4ccdc80ec76c5ecfdbb857cebff007e9338691d5c05d3a4&itok=WR6vz6NE)
![158125-Thumbnail Image.png](https://d1rbsgppyrdqq4.cloudfront.net/s3fs-public/styles/width_400/public/2021-09/158125-Thumbnail%20Image.png?versionId=zSmdVCsoZo1ZUVxq_XDbUcJhBAX_9iTG&X-Amz-Content-Sha256=UNSIGNED-PAYLOAD&X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Credential=AKIASBVQ3ZQ42ZLA5CUJ/20240616/us-west-2/s3/aws4_request&X-Amz-Date=20240616T030204Z&X-Amz-SignedHeaders=host&X-Amz-Expires=120&X-Amz-Signature=2cf048ae4ad94a140cedc676c43ed5c9ad20227480ad18bcf2ee73e877be4cec&itok=lzVB-uvO)
![158747-Thumbnail Image.png](https://d1rbsgppyrdqq4.cloudfront.net/s3fs-public/styles/width_400/public/2021-09/158747-Thumbnail%20Image.png?versionId=2lUACedlxkjRvge.qEaaBNL8zSe_LF8v&X-Amz-Content-Sha256=UNSIGNED-PAYLOAD&X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Credential=AKIASBVQ3ZQ42ZLA5CUJ/20240616/us-west-2/s3/aws4_request&X-Amz-Date=20240616T154146Z&X-Amz-SignedHeaders=host&X-Amz-Expires=120&X-Amz-Signature=74ab8a0e407641b3978a839759bd7a836630c203726d3a66735e440f4f54a7fe&itok=4Hjs5I4R)
![158301-Thumbnail Image.png](https://d1rbsgppyrdqq4.cloudfront.net/s3fs-public/styles/width_400/public/2021-09/158301-Thumbnail%20Image.png?versionId=dO7utKVbieR04oaA3P4p9CGWKF02Boi.&X-Amz-Content-Sha256=UNSIGNED-PAYLOAD&X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Credential=AKIASBVQ3ZQ42ZLA5CUJ/20240617/us-west-2/s3/aws4_request&X-Amz-Date=20240617T045649Z&X-Amz-SignedHeaders=host&X-Amz-Expires=120&X-Amz-Signature=6e39ac9d005e80776b622da1ee36a0be2d9cb4c474e80ad655db2233038ab8fe&itok=sIXT9-fI)
![161295-Thumbnail Image.png](https://d1rbsgppyrdqq4.cloudfront.net/s3fs-public/styles/width_400/public/2021-11/161295-Thumbnail%20Image.png?versionId=SKlZEh8YPsjVFHPbZTUP4Q9CvwqLF_DS&X-Amz-Content-Sha256=UNSIGNED-PAYLOAD&X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Credential=AKIASBVQ3ZQ42ZLA5CUJ/20240616/us-west-2/s3/aws4_request&X-Amz-Date=20240616T030204Z&X-Amz-SignedHeaders=host&X-Amz-Expires=120&X-Amz-Signature=5a629b90dd44eaee9ac4a242eb4ad02e88a10ed21487d38b38539dadeea608b7&itok=x57nYTdr)
![158849-Thumbnail Image.png](https://d1rbsgppyrdqq4.cloudfront.net/s3fs-public/styles/width_400/public/2021-09/158849-Thumbnail%20Image.png?versionId=vD7I2L0uxmV2bmST0RJYH6Bek3cqScIA&X-Amz-Content-Sha256=UNSIGNED-PAYLOAD&X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Credential=AKIASBVQ3ZQ42ZLA5CUJ/20240615/us-west-2/s3/aws4_request&X-Amz-Date=20240615T075944Z&X-Amz-SignedHeaders=host&X-Amz-Expires=120&X-Amz-Signature=9440c2b519662ed5c3a297e25393df65c8d9c0945b7a65673f9d6c72fc5a8181&itok=erXVHBxE)
This can make mutation detection difficult; and while increasing sequencing depth
can often help, sequence-specific errors and other non-random biases cannot be de-
tected by increased depth. The problem of accurate genotyping is exacerbated when
there is not a reference genome or other auxiliary information available.
I explore several methods for sensitively detecting mutations in non-model or-
ganisms using an example Eucalyptus melliodora individual. I use the structure of
the tree to find bounds on its somatic mutation rate and evaluate several algorithms
for variant calling. I find that conventional methods are suitable if the genome of a
close relative can be adapted to the study organism. However, with structured data,
a likelihood framework that is aware of this structure is more accurate. I use the
techniques developed here to evaluate a reference-free variant calling algorithm.
I also use this data to evaluate a k-mer based base quality score recalibrator
(KBBQ), a tool I developed to recalibrate base quality scores attached to sequencing
data. Base quality scores can help detect errors in sequencing reads, but are often
inaccurate. The most popular method for correcting this issue requires a known
set of variant sites, which is unavailable in most cases. I simulate data and show
that errors in this set of variant sites can cause calibration errors. I then show that
KBBQ accurately recalibrates base quality scores while requiring no reference or other
information and performs as well as other methods.
Finally, I use the Eucalyptus data to investigate the impact of quality score calibra-
tion on the quality of output variant calls and show that improved base quality score
calibration increases the sensitivity and reduces the false positive rate of a variant
calling algorithm.
![161497-Thumbnail Image.png](https://d1rbsgppyrdqq4.cloudfront.net/s3fs-public/styles/width_400/public/2021-11/161497-Thumbnail%20Image.png?versionId=blhZko905hY0N5RfaextGtay0vwlMgA4&X-Amz-Content-Sha256=UNSIGNED-PAYLOAD&X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Credential=AKIASBVQ3ZQ42ZLA5CUJ/20240616/us-west-2/s3/aws4_request&X-Amz-Date=20240616T005211Z&X-Amz-SignedHeaders=host&X-Amz-Expires=120&X-Amz-Signature=8e80c2701f4bc91c840a73d54742cd3c4fc51616366a62b23bbfd955a2227f4c&itok=l164ByUn)
Pathways of Distinction Analysis of Liver Cancer Data: Genetic Differences Between Males and Females
Neural progenitor cells (NPCs) derived from human pluripotent stem cells (hPSCs) are a multipotent cell population that is capable of nearly indefinite expansion and subsequent differentiation into the various neuronal and supporting cell types that comprise the CNS. However, current protocols for differentiating NPCs toward neuronal lineages result in a mixture of neurons from various regions of the CNS. In this study, we determined that endogenous WNT signaling is a primary contributor to the heterogeneity observed in NPC cultures and neuronal differentiation. Furthermore, exogenous manipulation of WNT signaling during neural differentiation, through either activation or inhibition, reduces this heterogeneity in NPC cultures, thereby promoting the formation of regionally homogeneous NPC and neuronal cultures. The ability to manipulate WNT signaling to generate regionally specific NPCs and neurons will be useful for studying human neural development and will greatly enhance the translational potential of hPSCs for neural-related therapies.
![129022-Thumbnail Image.png](https://d1rbsgppyrdqq4.cloudfront.net/s3fs-public/styles/width_400/public/2021-04/129022-Thumbnail%20Image.png?versionId=PbpXvdVnJ37hycP83F0FDCe7iaPVLmo2&X-Amz-Content-Sha256=UNSIGNED-PAYLOAD&X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Credential=AKIASBVQ3ZQ42ZLA5CUJ/20240617/us-west-2/s3/aws4_request&X-Amz-Date=20240617T102226Z&X-Amz-SignedHeaders=host&X-Amz-Expires=120&X-Amz-Signature=38cb3e6b0c83dba026385598fd86803e2f774b7e12e2a813dd706b56d5dc2ba6&itok=7M2xtaHb)
Background: Blindness has evolved repeatedly in cave-dwelling organisms, and many hypotheses have been proposed to explain this observation, including both accumulation of neutral loss-of-function mutations and adaptation to darkness. Investigating the loss of sight in cave dwellers presents an opportunity to understand the operation of fundamental evolutionary processes, including drift, selection, mutation, and migration.
Results: Here we model the evolution of blindness in caves. This model captures the interaction of three forces: (1) selection favoring alleles causing blindness, (2) immigration of sightedness alleles from a surface population, and (3) mutations creating blindness alleles. We investigated the dynamics of this model and determined selection-strength thresholds that result in blindness evolving in caves despite immigration of sightedness alleles from the surface. We estimate that the selection coefficient for blindness would need to be at least 0.005 (and maybe as high as 0.5) for blindness to evolve in the model cave-organism, Astyanax mexicanus.
Conclusions: Our results indicate that strong selection is required for the evolution of blindness in cave-dwelling organisms, which is consistent with recent work suggesting a high metabolic cost of eye development.