Theses and Dissertations
Displaying 1 - 2 of 2
Filtering by
- All Subjects: People with visual disabilities
- All Subjects: Regularized Topic Model
- Creators: Li, Baoxin
Description
A lot of strides have been made in enabling technologies to aid individuals with visual impairment live an independent life. The advent of smart devices and participatory web has especially facilitated the possibility of new interactions to aide everyday tasks. Current systems however tend to be complex and require multiple cumbersome devices which invariably come with steep learning curves. Building new cyber-human systems with simple integrated interfaces while keeping in mind the specific requirements of the target users would help alleviate their mundane yet significant daily needs. Navigation is one such significant need that forms an integral part of everyday life and is one of the areas where individuals with visual impairment face the most discomfort. There is little technology out there to help travelers with navigating new routes. A number of research prototypes have been proposed but none of them are available to the general population. This may be due to the need for special equipment that needs expertise before deployment, or trained professionals needing to calibrate devices or because of the fact that the systems are just not scalable. Another area that needs assistance is the field of education. Lot of the classroom material and textbook material is not readily available in alternate formats for use. Another such area that requires attention is information delivery in the age of web 2.0. Popular websites like Facebook, Amazon, etc are designed with sighted people as target audience. While the mobile editions with their pared down versions make it easier to navigate with screen readers, the truth remains that there is still a long way to go in making such websites truly accessible.
ContributorsPaladugu, Devi Archana (Author) / Li, Baoxin (Thesis advisor) / Hedgpeth, Terri (Committee member) / Atkinson, Robert (Committee member) / Walker, Erin (Committee member) / Arizona State University (Publisher)
Created2016
Description
While discrete emotions like joy, anger, disgust etc. are quite popular, continuous
emotion dimensions like arousal and valence are gaining popularity within the research
community due to an increase in the availability of datasets annotated with these
emotions. Unlike the discrete emotions, continuous emotions allow modeling of subtle
and complex affect dimensions but are difficult to predict.
Dimension reduction techniques form the core of emotion recognition systems and
help create a new feature space that is more helpful in predicting emotions. But these
techniques do not necessarily guarantee a better predictive capability as most of them
are unsupervised, especially in regression learning. In emotion recognition literature,
supervised dimension reduction techniques have not been explored much and in this
work a solution is provided through probabilistic topic models. Topic models provide
a strong probabilistic framework to embed new learning paradigms and modalities.
In this thesis, the graphical structure of Latent Dirichlet Allocation has been explored
and new models tuned to emotion recognition and change detection have been built.
In this work, it has been shown that the double mixture structure of topic models
helps 1) to visualize feature patterns, and 2) to project features onto a topic simplex
that is more predictive of human emotions, when compared to popular techniques
like PCA and KernelPCA. Traditionally, topic models have been used on quantized
features but in this work, a continuous topic model called the Dirichlet Gaussian
Mixture model has been proposed. Evaluation of DGMM has shown that while modeling
videos, performance of LDA models can be replicated even without quantizing
the features. Until now, topic models have not been explored in a supervised context
of video analysis and thus a Regularized supervised topic model (RSLDA) that
models video and audio features is introduced. RSLDA learning algorithm performs
both dimension reduction and regularized linear regression simultaneously, and has outperformed supervised dimension reduction techniques like SPCA and Correlation
based feature selection algorithms. In a first of its kind, two new topic models, Adaptive
temporal topic model (ATTM) and SLDA for change detection (SLDACD) have
been developed for predicting concept drift in time series data. These models do not
assume independence of consecutive frames and outperform traditional topic models
in detecting local and global changes respectively.
emotion dimensions like arousal and valence are gaining popularity within the research
community due to an increase in the availability of datasets annotated with these
emotions. Unlike the discrete emotions, continuous emotions allow modeling of subtle
and complex affect dimensions but are difficult to predict.
Dimension reduction techniques form the core of emotion recognition systems and
help create a new feature space that is more helpful in predicting emotions. But these
techniques do not necessarily guarantee a better predictive capability as most of them
are unsupervised, especially in regression learning. In emotion recognition literature,
supervised dimension reduction techniques have not been explored much and in this
work a solution is provided through probabilistic topic models. Topic models provide
a strong probabilistic framework to embed new learning paradigms and modalities.
In this thesis, the graphical structure of Latent Dirichlet Allocation has been explored
and new models tuned to emotion recognition and change detection have been built.
In this work, it has been shown that the double mixture structure of topic models
helps 1) to visualize feature patterns, and 2) to project features onto a topic simplex
that is more predictive of human emotions, when compared to popular techniques
like PCA and KernelPCA. Traditionally, topic models have been used on quantized
features but in this work, a continuous topic model called the Dirichlet Gaussian
Mixture model has been proposed. Evaluation of DGMM has shown that while modeling
videos, performance of LDA models can be replicated even without quantizing
the features. Until now, topic models have not been explored in a supervised context
of video analysis and thus a Regularized supervised topic model (RSLDA) that
models video and audio features is introduced. RSLDA learning algorithm performs
both dimension reduction and regularized linear regression simultaneously, and has outperformed supervised dimension reduction techniques like SPCA and Correlation
based feature selection algorithms. In a first of its kind, two new topic models, Adaptive
temporal topic model (ATTM) and SLDA for change detection (SLDACD) have
been developed for predicting concept drift in time series data. These models do not
assume independence of consecutive frames and outperform traditional topic models
in detecting local and global changes respectively.
ContributorsLade, Prasanth (Author) / Panchanathan, Sethuraman (Thesis advisor) / Davulcu, Hasan (Committee member) / Li, Baoxin (Committee member) / Balasubramanian, Vineeth N (Committee member) / Arizona State University (Publisher)
Created2015