This collection includes both ASU Theses and Dissertations, submitted by graduate students, and Barrett, The Honors College theses, submitted by undergraduate students.

Description
Currently, quantification of single-cell RNA species in their natural contexts is limited by the small number of transcripts that can be analyzed in parallel. Here, we describe a method to increase the multiplexing capacity of RNA analysis for single cells in situ. First, RNA transcripts are detected using fluorescence in situ hybridization (FISH). After imaging and data storage, the fluorescence signal is removed by photobleaching, which allows FISH to be reinitiated to detect other RNA species residing in the same cell. Through reiterative cycles of hybridization, imaging, and photobleaching, the identities, positions, and copy numbers of a large number of different RNA species can be quantified in individual cells in situ. Using this approach, we have quantified seven different transcripts in single HeLa cells with five reiterative RNA FISH cycles. This method has the potential to detect over 100 different RNA species in single cells in situ, and can be further applied in studies of systems biology, molecular diagnostics, and targeted therapies.
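The image-analysis side of this protocol (detecting fluorescent spots in each cycle and tallying copy numbers per probed species) can be illustrated with a minimal Python sketch. The thresholding, connected-component spot detection, and function names below are illustrative assumptions, not the actual pipeline used in the thesis.

```python
# A hypothetical sketch of spot counting across reiterative FISH cycles:
# each cycle's image is thresholded, connected spots are labeled, and the
# counts and centroids are recorded for the RNA species probed in that cycle.
import numpy as np
from scipy import ndimage

def count_spots(image, threshold):
    # Label connected pixel clusters above threshold; each cluster is treated
    # as one transcript spot.
    labeled, n_spots = ndimage.label(image > threshold)
    positions = ndimage.center_of_mass(image, labeled, range(1, n_spots + 1))
    return n_spots, positions

def quantify_reiterative_fish(cycle_images, cycle_species, threshold=0.5):
    # cycle_images: list of 2D fluorescence images, one per hybridization cycle
    #               (the signal is photobleached between cycles).
    # cycle_species: name of the RNA species probed in each cycle.
    results = {}
    for image, species in zip(cycle_images, cycle_species):
        copies, positions = count_spots(np.asarray(image, dtype=float), threshold)
        results[species] = {"copy_number": copies, "positions": positions}
    return results
```

In practice, spot detection for FISH data typically involves additional steps such as background correction, registration of images across cycles, and segmentation of individual cells.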
Contributors: Javangula, Saiswathi (Author) / Guo, Jia (Thesis director) / Liang, Jianming (Committee member) / School of Molecular Sciences (Contributor) / School of Nutrition and Health Promotion (Contributor) / Barrett, The Honors College (Contributor)
Created: 2016-12
Description
High-level inference tasks in video applications such as recognition, video retrieval, and zero-shot classification have become an active research area in recent years. One fundamental requirement for such applications is to extract high-quality features that maintain high-level information in the videos.

Many video feature extraction algorithms have been proposed, such as STIP, HOG3D, and Dense Trajectories. These algorithms are often referred to as "handcrafted" features because they were deliberately designed based on reasonable prior considerations. However, they may fail when dealing with high-level tasks or complex-scene videos. Motivated by the success of deep convolutional neural networks (CNNs) in extracting global representations of static images, researchers have applied similar techniques to video content. Typical approaches first extract spatial features by processing raw frames with deep convolutional architectures designed for static image classification, and then apply simple averaging, concatenation, or classifier-based fusion/pooling to the extracted features. I argue that features extracted in this way do not capture enough representative information: unlike images, videos should be characterized as temporal sequences of semantically coherent visual content, and thus need to be represented in a manner that accounts for both semantic and spatio-temporal information.
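For concreteness, a minimal sketch of this typical baseline (per-frame features from an image-classification CNN, fused by simple temporal averaging) is shown below; the ResNet-18 backbone is an illustrative assumption rather than a specific choice from the thesis.

```python
# A minimal sketch, assuming a ResNet-18 backbone, of the "typical" pipeline
# described above: per-frame features from an image-classification CNN,
# fused into a single video feature by simple temporal averaging.
import torch
import torch.nn as nn
from torchvision import models

# Image-classification CNN with its final classifier layer removed.
backbone = nn.Sequential(*list(models.resnet18(weights=None).children())[:-1]).eval()

def average_pooled_video_feature(frames):
    # frames: (time, 3, H, W) tensor of raw video frames.
    with torch.no_grad():
        feats = backbone(frames).flatten(1)  # (time, 512) per-frame features
    return feats.mean(dim=0)                 # temporal order is discarded here
```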

In this thesis, I propose a novel architecture that learns a semantic spatio-temporal embedding for videos to support high-level video analysis. The proposed method encodes spatial and temporal information separately, using a deep architecture consisting of two channels of convolutional neural networks (capturing appearance and local motion), each followed by a Fully Connected Gated Recurrent Unit (FC-GRU) encoder that captures the longer-term temporal structure of the CNN features. The resulting spatio-temporal representation (a vector) is mapped via a Fully Connected Multilayer Perceptron (FC-MLP) into the word2vec semantic embedding space, yielding a semantic interpretation of the video vector that supports high-level analysis. I evaluate the usefulness and effectiveness of this representation through experiments on action recognition, zero-shot video classification, and semantic (word-to-video) video retrieval, using the UCF101 action recognition dataset.
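A minimal PyTorch sketch of the described two-channel CNN + FC-GRU + FC-MLP pipeline follows; the ResNet-18 backbones, layer sizes, and 300-dimensional word2vec target are illustrative assumptions, not the thesis's exact configuration.

```python
# A sketch of the described architecture: appearance and motion CNN channels,
# each followed by a GRU encoder, then an MLP mapping into word2vec space.
import torch
import torch.nn as nn
from torchvision import models

class SemanticVideoEmbedder(nn.Module):
    def __init__(self, word2vec_dim=300, hidden_dim=512):
        super().__init__()
        # Two CNN channels: appearance (RGB frames) and local motion
        # (e.g. optical-flow frames); both reuse an ImageNet-style backbone here.
        self.appearance_cnn = nn.Sequential(*list(models.resnet18(weights=None).children())[:-1])
        self.motion_cnn = nn.Sequential(*list(models.resnet18(weights=None).children())[:-1])
        feat_dim = 512  # pooled ResNet-18 feature size
        # One GRU encoder per channel to capture longer-term temporal structure.
        self.appearance_gru = nn.GRU(feat_dim, hidden_dim, batch_first=True)
        self.motion_gru = nn.GRU(feat_dim, hidden_dim, batch_first=True)
        # Fully connected MLP mapping the fused video vector into word2vec space.
        self.mlp = nn.Sequential(
            nn.Linear(2 * hidden_dim, hidden_dim),
            nn.ReLU(),
            nn.Linear(hidden_dim, word2vec_dim),
        )

    def encode_channel(self, frames, cnn, gru):
        # frames: (batch, time, 3, H, W) -> per-frame CNN features -> GRU final state.
        b, t = frames.shape[:2]
        feats = cnn(frames.flatten(0, 1)).flatten(1)  # (b*t, feat_dim)
        feats = feats.view(b, t, -1)                  # (b, t, feat_dim)
        _, h = gru(feats)                             # h: (1, b, hidden_dim)
        return h[-1]

    def forward(self, rgb_frames, motion_frames):
        a = self.encode_channel(rgb_frames, self.appearance_cnn, self.appearance_gru)
        m = self.encode_channel(motion_frames, self.motion_cnn, self.motion_gru)
        return self.mlp(torch.cat([a, m], dim=1))     # video vector in word2vec space
```

Training such a model would typically minimize the distance between the predicted video vector and the word2vec embedding of the class label, which is what makes zero-shot classification and word-to-video retrieval possible via nearest-neighbor search in the semantic space.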
Contributors: Hu, Sheng-Hung (Author) / Li, Baoxin (Thesis advisor) / Turaga, Pavan (Committee member) / Liang, Jianming (Committee member) / Tong, Hanghang (Committee member) / Arizona State University (Publisher)
Created: 2016