Search Content

Invariant human pose feature extraction for movement recognition and pose estimation

Description

Reliable extraction of human pose features that are invariant to view angle and body shape changes is critical for advancing human movement analysis. In this dissertation, the multifactor analysis techniques, including the multilinear analysis and the multifactor Gaussian process methods, have been exploited to extract such invariant pose features from…

Reliable extraction of human pose features that are invariant to view angle and body shape changes is critical for advancing human movement analysis. In this dissertation, the multifactor analysis techniques, including the multilinear analysis and the multifactor Gaussian process methods, have been exploited to extract such invariant pose features from video data by decomposing various key contributing factors, such as pose, view angle, and body shape, in the generation of the image observations. Experimental results have shown that the resulting pose features extracted using the proposed methods exhibit excellent invariance properties to changes in view angles and body shapes. Furthermore, using the proposed invariant multifactor pose features, a suite of simple while effective algorithms have been developed to solve the movement recognition and pose estimation problems. Using these proposed algorithms, excellent human movement analysis results have been obtained, and most of them are superior to those obtained from state-of-the-art algorithms on the same testing datasets. Moreover, a number of key movement analysis challenges, including robust online gesture spotting and multi-camera gesture recognition, have also been addressed in this research. To this end, an online gesture spotting framework has been developed to automatically detect and learn non-gesture movement patterns to improve gesture localization and recognition from continuous data streams using a hidden Markov network. In addition, the optimal data fusion scheme has been investigated for multicamera gesture recognition, and the decision-level camera fusion scheme using the product rule has been found to be optimal for gesture recognition using multiple uncalibrated cameras. Furthermore, the challenge of optimal camera selection in multi-camera gesture recognition has also been tackled. A measure to quantify the complementary strength across cameras has been proposed. Experimental results obtained from a real-life gesture recognition dataset have shown that the optimal camera combinations identified according to the proposed complementary measure always lead to the best gesture recognition results.

ContributorsPeng, Bo (Author) / Qian, Gang (Thesis advisor) / Ye, Jieping (Committee member) / Li, Baoxin (Committee member) / Spanias, Andreas (Committee member) / Arizona State University (Publisher)

Created2011

Stanislav Binicki's opera Na uranku: genesis of critical analysis of the first Serbian opera

Description

The focus of this study was the first Serbian opera, Na Uranku (At Dawn). It was written by Stanislav Binièki (1872-1942) and was first performed in 1903 at the National Theatre in Belgrade. There were two objectives of this project: (1) a live concert performance of the opera, which produced…

The focus of this study was the first Serbian opera, Na Uranku (At Dawn). It was written by Stanislav Binièki (1872-1942) and was first performed in 1903 at the National Theatre in Belgrade. There were two objectives of this project: (1) a live concert performance of the opera, which produced an audio recording that can be found as an appendix; and, (2) an accompanying document containing a history and an analysis of the work. While Binièki's opera is recognized as an extraordinary artistic achievement, and a new genre of musical enrichment for Serbian music, little had been previously written either about the composer or the work. At Dawn is a romantic opera in the verismo tradition with national elements. The significance of this opera is not only in its artistic expression but also in how it helped the music of Serbia evolve. Early opera settings in Serbia in the mid-nineteenth to early twentieth century did not have the same wealth of history upon which to draw as had existed in the rich operatic oeuvre in Western Europe and Russia. Similarly, conditions for performance were not satisfactory, as were no professional orchestras or singers. Furthermore, audiences were not accustomed to this type of art form. The opera served as an educational instrument for the audience, not only training them to a different type of music but also evolving its national consciousness. Binièki's opera was a foundation on which later generations of composers built. The artistic value of this opera is emphasized. The musical language includes an assimilation of various influences from Western Europe and Russia, properly incorporated into the Serbian musical core. Audience reaction is discussed, a positive affirmation that Binièki was moving in the right direction in establishing a path for the further development of the artistic field of Serbian musical culture. A synopsis of the work as well as the requisite performing forces is also included.

ContributorsMinov, Jana (Author) / Russell, Timothy (Thesis advisor) / Levy, Benjamin (Committee member) / Schildkret, David (Committee member) / Rogers, Rodney (Committee member) / Reber, William (Committee member) / Arizona State University (Publisher)

Created2011

Korean arirang: history, genres, and adaptations In Edward Niedermaier's "Arirang Variations

Description

This study treats in some depth a contemporary solo piano work, "Arirang Variations" (2006) by Edward "Teddy" Niedermaier (b. 1983). Though Niedermaier is an American composer and pianist, he derives his inspiration for that work from four types of Korean arirang: "Arirang," "Raengsanmopan Older Babe Arirang," "Gangwondo Arirang" and "Kin…

This study treats in some depth a contemporary solo piano work, "Arirang Variations" (2006) by Edward "Teddy" Niedermaier (b. 1983). Though Niedermaier is an American composer and pianist, he derives his inspiration for that work from four types of Korean arirang: "Arirang," "Raengsanmopan Older Babe Arirang," "Gangwondo Arirang" and "Kin Arirang." The analysis of "Arirang Variations" focuses primarily on how the composer adapts arirang in each variation and develops them into his own musical language. A salient feature of Niedermaier's composition is his combination of certain contradictions: traditional and contemporary styles, and Western and Eastern musical styles. In order to discuss in detail the musical elements of arirang used in "Arirang Variations," scores of all the arirang Niedermaier references are included with the discussion of each. Unfortunately, sources concerning three of these were limited to a single book by Yon-gap Kim, Pukhan Arirang Yongu (A Study of North Korean Arirang), because "Raengsanmopan Older Babe Arirang," "Gangwondo Arirang" and "Kin Arirang"are North Korean versions of arirang. Since arirang are the most important Korean folk song genre, basic information concerning such features of Korean traditional musical elements as scales, vocal techniques, rhythms and types of folk songs are provided along with an overview of the history and origins of arirang. Given that each arirang has distinctive characteristics that vary by region, the four best-known types of arirang are introduced to demonstrate these differences.

ContributorsPark, Hyunjin (Author) / Meir, Baruch (Thesis advisor) / Campbell, Andrew (Committee member) / Levy, Benjamin (Committee member) / Thompson, Janice (Committee member) / Arizona State University (Publisher)

Created2011

Mining semantics from low-level features in multimedia computing

Description

Bridging semantic gap is one of the fundamental problems in multimedia computing and pattern recognition. The challenge of associating low-level signal with their high-level semantic interpretation is mainly due to the fact that semantics are often conveyed implicitly in a context, relying on interactions among multiple levels of concepts or…

Bridging semantic gap is one of the fundamental problems in multimedia computing and pattern recognition. The challenge of associating low-level signal with their high-level semantic interpretation is mainly due to the fact that semantics are often conveyed implicitly in a context, relying on interactions among multiple levels of concepts or low-level data entities. Also, additional domain knowledge may often be indispensable for uncovering the underlying semantics, but in most cases such domain knowledge is not readily available from the acquired media streams. Thus, making use of various types of contextual information and leveraging corresponding domain knowledge are vital for effectively associating high-level semantics with low-level signals with higher accuracies in multimedia computing problems. In this work, novel computational methods are explored and developed for incorporating contextual information/domain knowledge in different forms for multimedia computing and pattern recognition problems. Specifically, a novel Bayesian approach with statistical-sampling-based inference is proposed for incorporating a special type of domain knowledge, spatial prior for the underlying shapes; cross-modality correlations via Kernel Canonical Correlation Analysis is explored and the learnt space is then used for associating multimedia contents in different forms; model contextual information as a graph is leveraged for regulating interactions among high-level semantic concepts (e.g., category labels), low-level input signal (e.g., spatial/temporal structure). Four real-world applications, including visual-to-tactile face conversion, photo tag recommendation, wild web video classification and unconstrained consumer video summarization, are selected to demonstrate the effectiveness of the approaches. These applications range from classic research challenges to emerging tasks in multimedia computing. Results from experiments on large-scale real-world data with comparisons to other state-of-the-art methods and subjective evaluations with end users confirmed that the developed approaches exhibit salient advantages, suggesting that they are promising for leveraging contextual information/domain knowledge for a wide range of multimedia computing and pattern recognition problems.

ContributorsWang, Zhesheng (Author) / Li, Baoxin (Thesis advisor) / Sundaram, Hari (Committee member) / Qian, Gang (Committee member) / Ye, Jieping (Committee member) / Arizona State University (Publisher)

Created2011

An analytical overview of Dominick Argento's Evensong, of love and angels

Description

Many of the works of Dominick Argento have been researched and analyzed, but his choral work Evensong: Of Love and Angels s has received limited attention thus far. Written in memoriam for his wife Carolyn Bailey Argento, Evensong draws its musical material from her initials C.B.A. These letters, translated into…

Many of the works of Dominick Argento have been researched and analyzed, but his choral work Evensong: Of Love and Angels s has received limited attention thus far. Written in memoriam for his wife Carolyn Bailey Argento, Evensong draws its musical material from her initials C.B.A. These letters, translated into note names, form a conspicuous head motive that is present in each movement of the work, and it serves multiple functions: as a melodic feature, as the foundation for a twelve-tone row, and as a harmonic base. This paper provides an overview of the work's conception with specific relation to Argento's biographical details, compositional style, and work habits; a brief review of the critical reception of the work; and a succinct analysis of the form and cyclical materials found in each movement.

ContributorsPage, Carrie Leigh, 1980- (Author) / Rogers, Rodney (Thesis advisor) / DeMars, James (Committee member) / Levy, Benjamin (Committee member) / Oldani, Robert (Committee member) / Arizona State University (Publisher)

Created2011

Everyday arias: for soprano and orchestra

Description

Everyday Arias for soprano and orchestra was composed largely in Arizona and completed in February 2011. The text was taken from a small collection of the composer's own poetry referencing her memories of life in rural Mississippi. Everyday Arias endeavors to elevate these prosaic experiences and settings to art, expressing…

Everyday Arias for soprano and orchestra was composed largely in Arizona and completed in February 2011. The text was taken from a small collection of the composer's own poetry referencing her memories of life in rural Mississippi. Everyday Arias endeavors to elevate these prosaic experiences and settings to art, expressing the everyday as beautiful and worthy of artistic treatment. The primary compositional model for this work was Samuel Barber's Knoxville: Summer of 1915, but other influences included Charles Ives, Aaron Copland, Benjamin Britten, and Dominick Argento. Barber's and Argento's musical treatment of prose style seemed particularly appropriate to the goals of Everyday Arias. Ives and Copland used hymn tunes both to evoke certain associations of worship and as sources of interesting material. The vocal writing of all five composers was influential, but the orchestration techniques for winds are largely a product of studying Ives and Argento, while many string gestures are more obviously tied to Britten and - more historically - Debussy.The primary motive that weaves through the work features an ascending major second followed by a descending perfect fourth, in a long-short-long rhythmic pattern. As a melodic fragment, the motive is often inverted to a descending-ascending pattern, or distorted slightly by expanding the second interval to a perfect fifth, or used in retrograde. The motive was derived from the first measure of the melody "Toplady" (1830) by Thomas Hastings, better known as the hymn "Rock of Ages." In the first movement, the motive is used most frequently in sequences. The second movement treats the motive as a melodic element and as a unit in ostinati. The final movement humorously transforms it into a syncopated gesture to evoke ragtime.

ContributorsPage, Carrie Leigh (Composer) / Rogers, Rodney (Thesis advisor) / DeMars, James (Committee member) / Levy, Benjamin (Committee member) / Oldani, Robert (Committee member) / Arizona State University (Publisher)

Created2011

Finding provenance data in social media

Description

A statement appearing in social media provides a very significant challenge for determining the provenance of the statement. Provenance describes the origin, custody, and ownership of something. Most statements appearing in social media are not published with corresponding provenance data. However, the same characteristics that make the social media environment…

A statement appearing in social media provides a very significant challenge for determining the provenance of the statement. Provenance describes the origin, custody, and ownership of something. Most statements appearing in social media are not published with corresponding provenance data. However, the same characteristics that make the social media environment challenging, including the massive amounts of data available, large numbers of users, and a highly dynamic environment, provide unique and untapped opportunities for solving the provenance problem for social media. Current approaches for tracking provenance data do not scale for online social media and consequently there is a gap in provenance methodologies and technologies providing exciting research opportunities. The guiding vision is the use of social media information itself to realize a useful amount of provenance data for information in social media. This departs from traditional approaches for data provenance which rely on a central store of provenance information. The contemporary online social media environment is an enormous and constantly updated "central store" that can be mined for provenance information that is not readily made available to the average social media user. This research introduces an approach and builds a foundation aimed at realizing a provenance data capability for social media users that is not accessible today.

ContributorsBarbier, Geoffrey P (Author) / Liu, Huan (Thesis advisor) / Bell, Herbert (Committee member) / Li, Baoxin (Committee member) / Sen, Arunabha (Committee member) / Arizona State University (Publisher)

Created2011

Multi-label dimensionality reduction

Description

Multi-label learning, which deals with data associated with multiple labels simultaneously, is ubiquitous in real-world applications. To overcome the curse of dimensionality in multi-label learning, in this thesis I study multi-label dimensionality reduction, which extracts a small number of features by removing the irrelevant, redundant, and noisy information while considering…

Multi-label learning, which deals with data associated with multiple labels simultaneously, is ubiquitous in real-world applications. To overcome the curse of dimensionality in multi-label learning, in this thesis I study multi-label dimensionality reduction, which extracts a small number of features by removing the irrelevant, redundant, and noisy information while considering the correlation among different labels in multi-label learning. Specifically, I propose Hypergraph Spectral Learning (HSL) to perform dimensionality reduction for multi-label data by exploiting correlations among different labels using a hypergraph. The regularization effect on the classical dimensionality reduction algorithm known as Canonical Correlation Analysis (CCA) is elucidated in this thesis. The relationship between CCA and Orthonormalized Partial Least Squares (OPLS) is also investigated. To perform dimensionality reduction efficiently for large-scale problems, two efficient implementations are proposed for a class of dimensionality reduction algorithms, including canonical correlation analysis, orthonormalized partial least squares, linear discriminant analysis, and hypergraph spectral learning. The first approach is a direct least squares approach which allows the use of different regularization penalties, but is applicable under a certain assumption; the second one is a two-stage approach which can be applied in the regularization setting without any assumption. Furthermore, an online implementation for the same class of dimensionality reduction algorithms is proposed when the data comes sequentially. A Matlab toolbox for multi-label dimensionality reduction has been developed and released. The proposed algorithms have been applied successfully in the Drosophila gene expression pattern image annotation. The experimental results on some benchmark data sets in multi-label learning also demonstrate the effectiveness and efficiency of the proposed algorithms.

ContributorsSun, Liang (Author) / Ye, Jieping (Thesis advisor) / Li, Baoxin (Committee member) / Liu, Huan (Committee member) / Mittelmann, Hans D. (Committee member) / Arizona State University (Publisher)

Created2011

Musikalischer Parnassus by Johann Kaspar Ferdinand Fischer (1656-1746): guitar transcription and performance guide of Suites VI and VIII

Description

Guitar repertoire from the Baroque period consists primarily of transcriptions, which suggests that modern performers may explore more sources to identify eligible works to transcribe. The Musikalischer Parnassus, a collection of dance suites for harpsichord by Johann Kaspar Ferdinand Fischer (1656-1746), is worthy of such a transcription. This collection has…

Guitar repertoire from the Baroque period consists primarily of transcriptions, which suggests that modern performers may explore more sources to identify eligible works to transcribe. The Musikalischer Parnassus, a collection of dance suites for harpsichord by Johann Kaspar Ferdinand Fischer (1656-1746), is worthy of such a transcription. This collection has high artistic value and possesses a range and texture that make much of it playable on the guitar. The purpose of this research project is to introduce Fischer and his works to the classical guitar community, and also to explore the artistic qualities of Musikalischer Parnassus that qualify it for transcription for guitar. This document addresses the transcription process of two selected suites: VI, Euterpe and VIII, Polymnia by Fischer. The outcome is an edition for guitar and a performance guide, which includes interpretations and stylistic considerations for each movement.

ContributorsFang, Zhou, D.M.A (Author) / Koonce, Frank (Thesis advisor) / Levy, Benjamin (Committee member) / Rotaru, Catalin (Committee member) / Arizona State University (Publisher)

Created2013

Techniques for soundscape retrieval and synthesis

Description

The study of acoustic ecology is concerned with the manner in which life interacts with its environment as mediated through sound. As such, a central focus is that of the soundscape: the acoustic environment as perceived by a listener. This dissertation examines the application of several computational tools in the…

The study of acoustic ecology is concerned with the manner in which life interacts with its environment as mediated through sound. As such, a central focus is that of the soundscape: the acoustic environment as perceived by a listener. This dissertation examines the application of several computational tools in the realms of digital signal processing, multimedia information retrieval, and computer music synthesis to the analysis of the soundscape. Namely, these tools include a) an open source software library, Sirens, which can be used for the segmentation of long environmental field recordings into individual sonic events and compare these events in terms of acoustic content, b) a graph-based retrieval system that can use these measures of acoustic similarity and measures of semantic similarity using the lexical database WordNet to perform both text-based retrieval and automatic annotation of environmental sounds, and c) new techniques for the dynamic, realtime parametric morphing of multiple field recordings, informed by the geographic paths along which they were recorded.

ContributorsMechtley, Brandon Michael (Author) / Spanias, Andreas S (Thesis advisor) / Sundaram, Hari (Thesis advisor) / Cook, Perry R. (Committee member) / Li, Baoxin (Committee member) / Arizona State University (Publisher)

Created2013

Filtering by