Search Content

Re-sonification of objects, events, and environments

Description

Digital sound synthesis allows the creation of a great variety of sounds. Focusing on interesting or ecologically valid sounds for music, simulation, aesthetics, or other purposes limits the otherwise vast digital audio palette. Tools for creating such sounds vary from arbitrary methods of altering recordings to precise simulations of vibrating…

Digital sound synthesis allows the creation of a great variety of sounds. Focusing on interesting or ecologically valid sounds for music, simulation, aesthetics, or other purposes limits the otherwise vast digital audio palette. Tools for creating such sounds vary from arbitrary methods of altering recordings to precise simulations of vibrating objects. In this work, methods of sound synthesis by re-sonification are considered. Re-sonification, herein, refers to the general process of analyzing, possibly transforming, and resynthesizing or reusing recorded sounds in meaningful ways, to convey information. Applied to soundscapes, re-sonification is presented as a means of conveying activity within an environment. Applied to the sounds of objects, this work examines modeling the perception of objects as well as their physical properties and the ability to simulate interactive events with such objects. To create soundscapes to re-sonify geographic environments, a method of automated soundscape design is presented. Using recorded sounds that are classified based on acoustic, social, semantic, and geographic information, this method produces stochastically generated soundscapes to re-sonify selected geographic areas. Drawing on prior knowledge, local sounds and those deemed similar comprise a locale's soundscape. In the context of re-sonifying events, this work examines processes for modeling and estimating the excitations of sounding objects. These include plucking, striking, rubbing, and any interaction that imparts energy into a system, affecting the resultant sound. A method of estimating a linear system's input, constrained to a signal-subspace, is presented and applied toward improving the estimation of percussive excitations for re-sonification. To work toward robust recording-based modeling and re-sonification of objects, new implementations of banded waveguide (BWG) models are proposed for object modeling and sound synthesis. Previous implementations of BWGs use arbitrary model parameters and may produce a range of simulations that do not match digital waveguide or modal models of the same design. Subject to linear excitations, some models proposed here behave identically to other equivalently designed physical models. Under nonlinear interactions, such as bowing, many of the proposed implementations exhibit improvements in the attack characteristics of synthesized sounds.

ContributorsFink, Alex M (Author) / Spanias, Andreas S (Thesis advisor) / Cook, Perry R. (Committee member) / Turaga, Pavan (Committee member) / Tsakalis, Konstantinos (Committee member) / Arizona State University (Publisher)

Created2013

New directions in sparse models for image analysis and restoration

Description

Effective modeling of high dimensional data is crucial in information processing and machine learning. Classical subspace methods have been very effective in such applications. However, over the past few decades, there has been considerable research towards the development of new modeling paradigms that go beyond subspace methods. This dissertation focuses…

Effective modeling of high dimensional data is crucial in information processing and machine learning. Classical subspace methods have been very effective in such applications. However, over the past few decades, there has been considerable research towards the development of new modeling paradigms that go beyond subspace methods. This dissertation focuses on the study of sparse models and their interplay with modern machine learning techniques such as manifold, ensemble and graph-based methods, along with their applications in image analysis and recovery. By considering graph relations between data samples while learning sparse models, graph-embedded codes can be obtained for use in unsupervised, supervised and semi-supervised problems. Using experiments on standard datasets, it is demonstrated that the codes obtained from the proposed methods outperform several baseline algorithms. In order to facilitate sparse learning with large scale data, the paradigm of ensemble sparse coding is proposed, and different strategies for constructing weak base models are developed. Experiments with image recovery and clustering demonstrate that these ensemble models perform better when compared to conventional sparse coding frameworks. When examples from the data manifold are available, manifold constraints can be incorporated with sparse models and two approaches are proposed to combine sparse coding with manifold projection. The improved performance of the proposed techniques in comparison to sparse coding approaches is demonstrated using several image recovery experiments. In addition to these approaches, it might be required in some applications to combine multiple sparse models with different regularizations. In particular, combining an unconstrained sparse model with non-negative sparse coding is important in image analysis, and it poses several algorithmic and theoretical challenges. A convex and an efficient greedy algorithm for recovering combined representations are proposed. Theoretical guarantees on sparsity thresholds for exact recovery using these algorithms are derived and recovery performance is also demonstrated using simulations on synthetic data. Finally, the problem of non-linear compressive sensing, where the measurement process is carried out in feature space obtained using non-linear transformations, is considered. An optimized non-linear measurement system is proposed, and improvements in recovery performance are demonstrated in comparison to using random measurements as well as optimized linear measurements.

ContributorsNatesan Ramamurthy, Karthikeyan (Author) / Spanias, Andreas (Thesis advisor) / Tsakalis, Konstantinos (Committee member) / Karam, Lina (Committee member) / Turaga, Pavan (Committee member) / Arizona State University (Publisher)

Created2013

Sparse methods in image understanding and computer vision

Description

Image understanding has been playing an increasingly crucial role in vision applications. Sparse models form an important component in image understanding, since the statistics of natural images reveal the presence of sparse structure. Sparse methods lead to parsimonious models, in addition to being efficient for large scale learning. In sparse…

Image understanding has been playing an increasingly crucial role in vision applications. Sparse models form an important component in image understanding, since the statistics of natural images reveal the presence of sparse structure. Sparse methods lead to parsimonious models, in addition to being efficient for large scale learning. In sparse modeling, data is represented as a sparse linear combination of atoms from a "dictionary" matrix. This dissertation focuses on understanding different aspects of sparse learning, thereby enhancing the use of sparse methods by incorporating tools from machine learning. With the growing need to adapt models for large scale data, it is important to design dictionaries that can model the entire data space and not just the samples considered. By exploiting the relation of dictionary learning to 1-D subspace clustering, a multilevel dictionary learning algorithm is developed, and it is shown to outperform conventional sparse models in compressed recovery, and image denoising. Theoretical aspects of learning such as algorithmic stability and generalization are considered, and ensemble learning is incorporated for effective large scale learning. In addition to building strategies for efficiently implementing 1-D subspace clustering, a discriminative clustering approach is designed to estimate the unknown mixing process in blind source separation. By exploiting the non-linear relation between the image descriptors, and allowing the use of multiple features, sparse methods can be made more effective in recognition problems. The idea of multiple kernel sparse representations is developed, and algorithms for learning dictionaries in the feature space are presented. Using object recognition experiments on standard datasets it is shown that the proposed approaches outperform other sparse coding-based recognition frameworks. Furthermore, a segmentation technique based on multiple kernel sparse representations is developed, and successfully applied for automated brain tumor identification. Using sparse codes to define the relation between data samples can lead to a more robust graph embedding for unsupervised clustering. By performing discriminative embedding using sparse coding-based graphs, an algorithm for measuring the glomerular number in kidney MRI images is developed. Finally, approaches to build dictionaries for local sparse coding of image descriptors are presented, and applied to object recognition and image retrieval.

ContributorsJayaraman Thiagarajan, Jayaraman (Author) / Spanias, Andreas (Thesis advisor) / Frakes, David (Committee member) / Tepedelenlioğlu, Cihan (Committee member) / Turaga, Pavan (Committee member) / Arizona State University (Publisher)

Created2013

Classifying everyday activity through label propagation with sparse training data

Description

We solve the problem of activity verification in the context of sustainability. Activity verification is the process of proving the user assertions pertaining to a certain activity performed by the user. Our motivation lies in incentivizing the user for engaging in sustainable activities like taking public transport or recycling. Such…

We solve the problem of activity verification in the context of sustainability. Activity verification is the process of proving the user assertions pertaining to a certain activity performed by the user. Our motivation lies in incentivizing the user for engaging in sustainable activities like taking public transport or recycling. Such incentivization schemes require the system to verify the claim made by the user. The system verifies these claims by analyzing the supporting evidence captured by the user while performing the activity. The proliferation of portable smart-phones in the past few years has provided us with a ubiquitous and relatively cheap platform, having multiple sensors like accelerometer, gyroscope, microphone etc. to capture this evidence data in-situ. In this research, we investigate the supervised and semi-supervised learning techniques for activity verification. Both these techniques make use the data set constructed using the evidence submitted by the user. Supervised learning makes use of annotated evidence data to build a function to predict the class labels of the unlabeled data points. The evidence data captured can be either unimodal or multimodal in nature. We use the accelerometer data as evidence for transportation mode verification and image data as evidence for recycling verification. After training the system, we achieve maximum accuracy of 94% when classifying the transport mode and 81% when detecting recycle activity. In the case of recycle verification, we could improve the classification accuracy by asking the user for more evidence. We present some techniques to ask the user for the next best piece of evidence that maximizes the probability of classification. Using these techniques for detecting recycle activity, the accuracy increases to 93%. The major disadvantage of using supervised models is that it requires extensive annotated training data, which expensive to collect. Due to the limited training data, we look at the graph based inductive semi-supervised learning methods to propagate the labels among the unlabeled samples. In the semi-supervised approach, we represent each instance in the data set as a node in the graph. Since it is a complete graph, edges interconnect these nodes, with each edge having some weight representing the similarity between the points. We propagate the labels in this graph, based on the proximity of the data points to the labeled nodes. We estimate the performance of these algorithms by measuring how close the probability distribution of the data after label propagation is to the probability distribution of the ground truth data. Since labeling has a cost associated with it, in this thesis we propose two algorithms that help us in selecting minimum number of labeled points to propagate the labels accurately. Our proposed algorithm achieves a maximum of 73% increase in performance when compared to the baseline algorithm.

ContributorsDesai, Vaishnav (Author) / Sundaram, Hari (Thesis advisor) / Li, Baoxin (Thesis advisor) / Turaga, Pavan (Committee member) / Arizona State University (Publisher)

Created2013

Low complexity differential geometric computations with applications to human activity analysis

Description

In this thesis, we consider the problem of fast and eﬃcient indexing techniques for time sequences which evolve on manifold-valued spaces. Using manifolds is a convenient way to work with complex features that often do not live in Euclidean spaces. However, computing standard notions of geodesic distance, mean etc. can…

In this thesis, we consider the problem of fast and eﬃcient indexing techniques for time sequences which evolve on manifold-valued spaces. Using manifolds is a convenient way to work with complex features that often do not live in Euclidean spaces. However, computing standard notions of geodesic distance, mean etc. can get very involved due to the underlying non-linearity associated with the space. As a result a complex task such as manifold sequence matching would require very large number of computations making it hard to use in practice. We believe that one can device smart approximation algorithms for several classes of such problems which take into account the geometry of the manifold and maintain the favorable properties of the exact approach. This problem has several applications in areas of human activity discovery and recognition, where several features and representations are naturally studied in a non-Euclidean setting. We propose a novel solution to the problem of indexing manifold-valued sequences by proposing an intrinsic approach to map sequences to a symbolic representation. This is shown to enable the deployment of fast and accurate algorithms for activity recognition, motif discovery, and anomaly detection. Toward this end, we present generalizations of key concepts of piece-wise aggregation and symbolic approximation for the case of non-Euclidean manifolds. Experiments show that one can replace expensive geodesic computations with much faster symbolic computations with little loss of accuracy in activity recognition and discovery applications. The proposed methods are ideally suited for real-time systems and resource constrained scenarios.

ContributorsAnirudh, Rushil (Author) / Turaga, Pavan (Thesis advisor) / Spanias, Andreas (Committee member) / Li, Baoxin (Committee member) / Arizona State University (Publisher)

Created2012

Feature extraction from compressive cameras with application to activity recognition

Description

Recent advances in camera architectures and associated mathematical representations now enable compressive acquisition of images and videos at low data-rates. While most computer vision applications of today are composed of conventional cameras, which collect a large amount redundant data and power hungry embedded systems, which compress the collected data for…

Recent advances in camera architectures and associated mathematical representations now enable compressive acquisition of images and videos at low data-rates. While most computer vision applications of today are composed of conventional cameras, which collect a large amount redundant data and power hungry embedded systems, which compress the collected data for further processing, compressive cameras offer the advantage of direct acquisition of data in compressed domain and hence readily promise to find applicability in computer vision, particularly in environments hampered by limited communication bandwidths. However, despite the significant progress in theory and methods of compressive sensing, little headway has been made in developing systems for such applications by exploiting the merits of compressive sensing. In such a setting, we consider the problem of activity recognition, which is an important inference problem in many security and surveillance applications. Since all successful activity recognition systems involve detection of human, followed by recognition, a potential fully functioning system motivated by compressive camera would involve the tracking of human, which requires the reconstruction of atleast the initial few frames to detect the human. Once the human is tracked, the recognition part of the system requires only the features to be extracted from the tracked sequences, which can be the reconstructed images or the compressed measurements of such sequences. However, it is desirable in resource constrained environments that these features be extracted from the compressive measurements without reconstruction. Motivated by this, in this thesis, we propose a framework for understanding activities as a non-linear dynamical system, and propose a robust, generalizable feature that can be extracted directly from the compressed measurements without reconstructing the original video frames. The proposed feature is termed recurrence texture and is motivated from recurrence analysis of non-linear dynamical systems. We show that it is possible to obtain discriminative features directly from the compressed stream and show its utility in recognition of activities at very low data rates.

ContributorsKulkarni, Kuldeep Sharad (Author) / Turaga, Pavan (Thesis advisor) / Spanias, Andreas (Committee member) / Frakes, David (Committee member) / Arizona State University (Publisher)

Created2012

A Study into the Social Impact that Live-In Musician-in-Residence Programs have on Residents in Independent Living Retirement Communities Through University Partnerships

Description

This research aims to identify ways in which student live-in Musician-in-Residence programs help meet the social needs of older adults through university partnerships. Independent Living retirement communities face a gap in music programming. Student live-in Musician-in-Residence programs like the one at Mirabella at Arizona State University (Mirabella at ASU) were…

This research aims to identify ways in which student live-in Musician-in-Residence programs help meet the social needs of older adults through university partnerships. Independent Living retirement communities face a gap in music programming. Student live-in Musician-in-Residence programs like the one at Mirabella at Arizona State University (Mirabella at ASU) were used to help determine how music impacted the quality of life of retirees and how it affected their relationships with a younger generation. Only residents in Independent Living were included in the study. Prior research has shown that when an older adult relocates to senior living, it can be viewed stereotypically as a sign that they are diminishing their capacity to live independently and are preparing to live the rest of their lives detached from society. Additionally, research shows that some retirement communities are unaware of how music programs can encourage the fostering of meaningful relationships for independent retired adults. As adults are retiring earlier, they are living healthier lives and require quality programming that reflects their active lifestyle. In this research, the questions asked provided qualitative responses and residents shared anecdotal reports of their experiences. Questions were divided into two categories, 1). Residential history and prior music experience, 2). Sense of belonging and retention. The results of this study suggest that intergenerational music programs contribute to maintaining older adults' social and emotional health by providing opportunities to engage in music through observation and participation. They also show that music programs serve as conduits for fostering relationships between seemingly disparate groups, in this case, the older and younger populations.

ContributorsCox, Tychiko Dustin (Author) / Swoboda, Deanna (Thesis advisor) / Hawkins, Gordon (Thesis advisor) / Wells, Christi Jay (Committee member) / Arizona State University (Publisher)

Created2023

A Formal Analysis and Transcribed Arrangements for Tuba and Piano of Works by Black Composers Before 1950

Description

This paper presents three new arrangements of works for solo tuba and piano, originally written by Black composers before 1950. The works presented here include Méphisto masqué by Edmond Dédé, Three Arabian Dances by Amanda Aldridge, and Warbling in the Moonlight by Alton Augustus Adams. Composer biographies, a formal analysis,…

This paper presents three new arrangements of works for solo tuba and piano, originally written by Black composers before 1950. The works presented here include Méphisto masqué by Edmond Dédé, Three Arabian Dances by Amanda Aldridge, and Warbling in the Moonlight by Alton Augustus Adams. Composer biographies, a formal analysis, and form diagrams are included for each piece, along with the new transcriptions.

ContributorsMatejek, Matthew Ryan (Author) / Swoboda, Deanna (Thesis advisor) / Edwards, Bradley (Committee member) / Shea, Nicholas (Committee member) / Arizona State University (Publisher)

Created2023

All My Spirit Tingled: A History, Recording, and Performance Guide to a Newly Commissioned Work for Trumpet by Composer Robert Tindle

Description

Over the past fifty years, the number of new compositions written for trumpet has increased tremendously. Fueled by close collaboration between composers, performers, and organizations, audiences are yearning to hear these new works. The purpose of this doctoral project is to provide insight into how a commission for solo trumpet,…

Over the past fifty years, the number of new compositions written for trumpet has increased tremendously. Fueled by close collaboration between composers, performers, and organizations, audiences are yearning to hear these new works. The purpose of this doctoral project is to provide insight into how a commission for solo trumpet, All My Spirit Tingled, came to fruition and how a burgeoning soloist may best learn this challenging repertoire. The first chapter provides background on the composer, his musical vision, and the chosen soloist for this commission. The second chapter provides a detailed performance guide of All My Spirit Tingled, including references to technical studies, etudes, and solos from trumpet literature that may provide further material with which to grow as a performer of this work. The dissertation provides a professional recording of the premiere to assist the reader throughout the performance guide. This document also includes program notes for the composition, as well as composer biographical information, a list of other works by Robert Tindle featuring brass instruments, and a transcript of the composer and performer interview.

ContributorsDeshler, David Woehrle (Author) / Burgstaller, Joesef (Thesis advisor) / Swoboda, Deanna (Committee member) / Temple, Alex (Committee member) / Arizona State University (Publisher)

Created2023

Analyses and Recordings of Selected Works by Germaine Tailleferre, Dora Pejačević, Teresa Milanollo, and Ika Peyron

Description

Saxophonists regularly transcribe works from the 19th and 20th centuries in order tobolster our repertoire from those eras. As one of the youngest concert instruments, few substantial works exist for the instrument prior to the mid 20th century. By regularly transcribing works that are standards in other instruments’ repertoires, we have perpetuated the…

Saxophonists regularly transcribe works from the 19th and 20th centuries in order tobolster our repertoire from those eras. As one of the youngest concert instruments, few substantial works exist for the instrument prior to the mid 20th century. By regularly transcribing works that are standards in other instruments’ repertoires, we have perpetuated the historical underrepresentation of female composers from the same time period. In answer to this, I have researched, analyzed, transcribed, and recorded four works originally for violin and piano written by female composers born in the 19th century. This program represents differing styles and nationalities, while being a cohesive program of works. The repertoire consists of a set of character pieces by Ika Peyron, sonatas by Dora Pejačević and Germaine Tailleferre, and finally a theme and variations by Teresa Milanollo to serve as a closer. Each chapter provides insights into my transcription process and tables of the alterations made to the original material, as well as short analyses of each piece. i

ContributorsDodge-Overstreet, Jessica (Author) / Creviston, Christopher (Thesis advisor) / Shea, Nicholas (Thesis advisor) / Swoboda, Deanna (Committee member) / Spring, Robert (Committee member) / Arizona State University (Publisher)

Created2023

Filtering by