Filtering by
- Language: English
![155367-Thumbnail Image.png](https://d1rbsgppyrdqq4.cloudfront.net/s3fs-public/styles/width_400/public/2021-08/155367-Thumbnail%20Image.png?versionId=BHzrsjjP8UQoseJDQSXI4f6bsTC6K6Yz&X-Amz-Content-Sha256=UNSIGNED-PAYLOAD&X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Credential=AKIASBVQ3ZQ42ZLA5CUJ/20240616/us-west-2/s3/aws4_request&X-Amz-Date=20240616T013050Z&X-Amz-SignedHeaders=host&X-Amz-Expires=120&X-Amz-Signature=d32327f813e824d5c3b28eff5e5f58ffbd3081a8e39fb53d74361772e2dbf84a&itok=LvcLTtKz)
![155356-Thumbnail Image.png](https://d1rbsgppyrdqq4.cloudfront.net/s3fs-public/styles/width_400/public/2021-08/155356-Thumbnail%20Image.png?versionId=5BmzscJee4NzjzPHHrvlo15mcbHt96or&X-Amz-Content-Sha256=UNSIGNED-PAYLOAD&X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Credential=AKIASBVQ3ZQ42ZLA5CUJ/20240616/us-west-2/s3/aws4_request&X-Amz-Date=20240616T030217Z&X-Amz-SignedHeaders=host&X-Amz-Expires=120&X-Amz-Signature=b8e21225664bc392b1d5a64756aea8610dcc4a9c97dc5896b805852a40de4be6&itok=3Qyt-4NM)
Almost every step during analysis and quantification requires the use of an often empirically determined threshold, which makes quantification of noise less accurate. In addition, each research group often develops their own data analysis pipeline making it impossible to compare data from different groups. To remedy this problem a streamlined and standardized scRNA-seq data analysis and normalization protocol was designed and developed. After analyzing multiple experiments we identified the possible pipeline stages, and tools needed. Our pipeline is capable of handling data with adapters and barcodes, which was not the case with pipelines from some experiments. Our pipeline can be used to analyze single experiment scRNA-seq data and also to compare scRNA-seq data across experiments. Various processes like data gathering, file conversion, and data merging were automated in the pipeline. The main focus was to standardize and normalize single-cell RNA-seq data to minimize technical noise introduced by disparate platforms.
![155320-Thumbnail Image.png](https://d1rbsgppyrdqq4.cloudfront.net/s3fs-public/styles/width_400/public/2021-08/155320-Thumbnail%20Image.png?versionId=t3F9fCVuhM07xtWc2d0Japo2FyzLRD8g&X-Amz-Content-Sha256=UNSIGNED-PAYLOAD&X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Credential=AKIASBVQ3ZQ42ZLA5CUJ/20240616/us-west-2/s3/aws4_request&X-Amz-Date=20240616T030217Z&X-Amz-SignedHeaders=host&X-Amz-Expires=120&X-Amz-Signature=8371c8b45d2f0514f4ccdc80ec76c5ecfdbb857cebff007e9338691d5c05d3a4&itok=WR6vz6NE)
![158125-Thumbnail Image.png](https://d1rbsgppyrdqq4.cloudfront.net/s3fs-public/styles/width_400/public/2021-09/158125-Thumbnail%20Image.png?versionId=zSmdVCsoZo1ZUVxq_XDbUcJhBAX_9iTG&X-Amz-Content-Sha256=UNSIGNED-PAYLOAD&X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Credential=AKIASBVQ3ZQ42ZLA5CUJ/20240616/us-west-2/s3/aws4_request&X-Amz-Date=20240616T030204Z&X-Amz-SignedHeaders=host&X-Amz-Expires=120&X-Amz-Signature=2cf048ae4ad94a140cedc676c43ed5c9ad20227480ad18bcf2ee73e877be4cec&itok=lzVB-uvO)
![158747-Thumbnail Image.png](https://d1rbsgppyrdqq4.cloudfront.net/s3fs-public/styles/width_400/public/2021-09/158747-Thumbnail%20Image.png?versionId=2lUACedlxkjRvge.qEaaBNL8zSe_LF8v&X-Amz-Content-Sha256=UNSIGNED-PAYLOAD&X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Credential=AKIASBVQ3ZQ42ZLA5CUJ/20240616/us-west-2/s3/aws4_request&X-Amz-Date=20240616T014610Z&X-Amz-SignedHeaders=host&X-Amz-Expires=120&X-Amz-Signature=18211ee3874d36cf1f36a7ed52c9e8489161ab91b9ebe111c12881a98a5d1e83&itok=4Hjs5I4R)
![158301-Thumbnail Image.png](https://d1rbsgppyrdqq4.cloudfront.net/s3fs-public/styles/width_400/public/2021-09/158301-Thumbnail%20Image.png?versionId=dO7utKVbieR04oaA3P4p9CGWKF02Boi.&X-Amz-Content-Sha256=UNSIGNED-PAYLOAD&X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Credential=AKIASBVQ3ZQ42ZLA5CUJ/20240616/us-west-2/s3/aws4_request&X-Amz-Date=20240616T035330Z&X-Amz-SignedHeaders=host&X-Amz-Expires=120&X-Amz-Signature=3e5accd308e4b8ac97df4806ee1c81bf3d37d3524fef65b6eb9954d0f321c042&itok=sIXT9-fI)
![161295-Thumbnail Image.png](https://d1rbsgppyrdqq4.cloudfront.net/s3fs-public/styles/width_400/public/2021-11/161295-Thumbnail%20Image.png?versionId=SKlZEh8YPsjVFHPbZTUP4Q9CvwqLF_DS&X-Amz-Content-Sha256=UNSIGNED-PAYLOAD&X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Credential=AKIASBVQ3ZQ42ZLA5CUJ/20240616/us-west-2/s3/aws4_request&X-Amz-Date=20240616T030204Z&X-Amz-SignedHeaders=host&X-Amz-Expires=120&X-Amz-Signature=5a629b90dd44eaee9ac4a242eb4ad02e88a10ed21487d38b38539dadeea608b7&itok=x57nYTdr)
![158849-Thumbnail Image.png](https://d1rbsgppyrdqq4.cloudfront.net/s3fs-public/styles/width_400/public/2021-09/158849-Thumbnail%20Image.png?versionId=vD7I2L0uxmV2bmST0RJYH6Bek3cqScIA&X-Amz-Content-Sha256=UNSIGNED-PAYLOAD&X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Credential=AKIASBVQ3ZQ42ZLA5CUJ/20240615/us-west-2/s3/aws4_request&X-Amz-Date=20240615T075944Z&X-Amz-SignedHeaders=host&X-Amz-Expires=120&X-Amz-Signature=9440c2b519662ed5c3a297e25393df65c8d9c0945b7a65673f9d6c72fc5a8181&itok=erXVHBxE)
This can make mutation detection difficult; and while increasing sequencing depth
can often help, sequence-specific errors and other non-random biases cannot be de-
tected by increased depth. The problem of accurate genotyping is exacerbated when
there is not a reference genome or other auxiliary information available.
I explore several methods for sensitively detecting mutations in non-model or-
ganisms using an example Eucalyptus melliodora individual. I use the structure of
the tree to find bounds on its somatic mutation rate and evaluate several algorithms
for variant calling. I find that conventional methods are suitable if the genome of a
close relative can be adapted to the study organism. However, with structured data,
a likelihood framework that is aware of this structure is more accurate. I use the
techniques developed here to evaluate a reference-free variant calling algorithm.
I also use this data to evaluate a k-mer based base quality score recalibrator
(KBBQ), a tool I developed to recalibrate base quality scores attached to sequencing
data. Base quality scores can help detect errors in sequencing reads, but are often
inaccurate. The most popular method for correcting this issue requires a known
set of variant sites, which is unavailable in most cases. I simulate data and show
that errors in this set of variant sites can cause calibration errors. I then show that
KBBQ accurately recalibrates base quality scores while requiring no reference or other
information and performs as well as other methods.
Finally, I use the Eucalyptus data to investigate the impact of quality score calibra-
tion on the quality of output variant calls and show that improved base quality score
calibration increases the sensitivity and reduces the false positive rate of a variant
calling algorithm.
![161497-Thumbnail Image.png](https://d1rbsgppyrdqq4.cloudfront.net/s3fs-public/styles/width_400/public/2021-11/161497-Thumbnail%20Image.png?versionId=blhZko905hY0N5RfaextGtay0vwlMgA4&X-Amz-Content-Sha256=UNSIGNED-PAYLOAD&X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Credential=AKIASBVQ3ZQ42ZLA5CUJ/20240616/us-west-2/s3/aws4_request&X-Amz-Date=20240616T005211Z&X-Amz-SignedHeaders=host&X-Amz-Expires=120&X-Amz-Signature=8e80c2701f4bc91c840a73d54742cd3c4fc51616366a62b23bbfd955a2227f4c&itok=l164ByUn)
Pathways of Distinction Analysis of Liver Cancer Data: Genetic Differences Between Males and Females
![129516-Thumbnail Image.png](https://d1rbsgppyrdqq4.cloudfront.net/s3fs-public/styles/width_400/public/2021-04/129516-Thumbnail%20Image.png?versionId=d55V_4_vRQje8cErBvsvbjtUgxriG3Oi&X-Amz-Content-Sha256=UNSIGNED-PAYLOAD&X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Credential=AKIASBVQ3ZQ42ZLA5CUJ/20240605/us-west-2/s3/aws4_request&X-Amz-Date=20240605T195240Z&X-Amz-SignedHeaders=host&X-Amz-Expires=120&X-Amz-Signature=30b88a9613ca602dd8197cfa85e849931c11e90758f32b43c2200b3fe3d4c914&itok=Irdlf-2W)
Deposits of dark material appear on Vesta’s surface as features of relatively low-albedo in the visible wavelength range of Dawn’s camera and spectrometer. Mixed with the regolith and partially excavated by younger impacts, the material is exposed as individual layered outcrops in crater walls or ejecta patches, having been uncovered and broken up by the impact. Dark fans on crater walls and dark deposits on crater floors are the result of gravity-driven mass wasting triggered by steep slopes and impact seismicity. The fact that dark material is mixed with impact ejecta indicates that it has been processed together with the ejected material. Some small craters display continuous dark ejecta similar to lunar dark-halo impact craters, indicating that the impact excavated the material from beneath a higher-albedo surface. The asymmetric distribution of dark material in impact craters and ejecta suggests non-continuous distribution in the local subsurface. Some positive-relief dark edifices appear to be impact-sculpted hills with dark material distributed over the hill slopes.
Dark features inside and outside of craters are in some places arranged as linear outcrops along scarps or as dark streaks perpendicular to the local topography. The spectral characteristics of the dark material resemble that of Vesta’s regolith. Dark material is distributed unevenly across Vesta’s surface with clusters of all types of dark material exposures. On a local scale, some craters expose or are associated with dark material, while others in the immediate vicinity do not show evidence for dark material. While the variety of surface exposures of dark material and their different geological correlations with surface features, as well as their uneven distribution, indicate a globally inhomogeneous distribution in the subsurface, the dark material seems to be correlated with the rim and ejecta of the older Veneneia south polar basin structure. The origin of the dark material is still being debated, however, the geological analysis suggests that it is exogenic, from carbon-rich low-velocity impactors, rather than endogenic, from freshly exposed mafic material or melt, exposed or created by impacts.