Matching Items (83)
Description
In the digital humanities, there is a constant need to turn images and PDF files into plain text to apply analyses such as topic modelling, named entity recognition, and other techniques. However, although there exist different solutions to extract text embedded in PDF files or run OCR on images, they typically require additional training (for example, scholars have to learn how to use the command line) or are difficult to automate without programming skills. The Giles Ecosystem is a distributed system based on Apache Kafka that allows users to upload documents for text and image extraction. The system components are implemented using Java and the Spring Framework and are available under an Open Source license on GitHub (https://github.com/diging/).
Contributors: Lessios-Damerow, Julia (Contributor) / Peirson, Erick (Contributor) / Laubichler, Manfred (Contributor) / ASU-SFI Center for Biosocial Complex Systems (Contributor)
Created: 2017-09-28
Description

A two-part presentation from the ASU Library and Knowledge Enterprise Research Data Management Office. Presented at the 2023 Rocky Mountain Advanced Computing Consortium (RMACC).

Session 1: Data management planning is an integral step in the research data life cycle. Large amounts of data and lengthy code accompanying supercomputing runs are no exception. Planning before analysis will benefit research and the researcher by providing a clear strategy for collecting, storing, analyzing, and sharing the data at the end of the research cycle. Supercomputing can require significant storage beyond scratch space, but researchers typically need to be informed of what tools are appropriate and available. Framed within the planning phase of the life cycle, this session introduces ASU’s Storage Selector, a quick and easy tool for finding the most appropriate storage resources provided by the university, which helps researchers choose a suitable storage and management solution for their research data at the right time in their project. We will also explore the DMP Tool, developed by the California Digital Library, which provides a resource-rich platform for writing data management plans, including institution-specific guidance, feedback requests, and public plans that can be used as guides.

Session 2: This presentation overviews the ongoing working relationship between the ASU Library Open Science and Scholarly Communication division, Research Data Management Office, and Research Computing. We will explore these teams’ interdisciplinary relationships and interdependence as the institution increasingly supports open science practices and initiatives. We will include case studies regarding the decision-making process, data-sharing decisions, and opportunities and challenges when transferring research data from a high-performance computing environment to the ASU Research Data Repository. Finally, we will share lessons learned as we intentionally shepherd research data from active project management and storage to final publication and preservation.

Contributors: Harp, Matthew (Author) / Claypool, Kathryn (Author)
Created: 2023-05-17
Description

(Preprint.) Today's college and university learning landscapes are dynamic and characterized by increased student demand for highly flexible and self-paced online learning opportunities. Recent fiscal conditions in higher education make learning landscape development more challenging due to finite resources and competing priorities. Similarly, academic libraries are experiencing substantial budget and staff reductions. Despite these trends, academic libraries are in a strong position to contribute to surrounding learning landscapes by expanding student online learning opportunities and promoting the critical use of information. Evolving learning technologies available for free or at low cost provide higher education and libraries with the tools to respond to this fluid environment.

Contributors: Kammerlocher, Lisa (Author) / Couture, Julianne (Author) / Sparks, Olivia (Author) / Harp, Matthew (Author) / Allgood, Tammy (Author)
Created: 2011
Description

Marketing library resources, services and personnel to information-overloaded university students can be a challenge. Learn how Arizona State University Libraries produces the fun and informative Library Minute video series, how it’s used by instructors, and how it’s received by students.

Contributors: Perry, Anali Maughan (Author) / Harp, Matthew (Author)
Created: 2010-10-12
Description

It is known that in classical fluids turbulence typically occurs at high Reynolds numbers. But can turbulence occur at low Reynolds numbers? Here we investigate the transition to turbulence in the classic Taylor-Couette system in which the rotating fluids are manufactured ferrofluids with magnetized nanoparticles embedded in liquid carriers. We find that, in the presence of a magnetic field transverse to the symmetry axis of the system, turbulence can occur at Reynolds numbers that are at least one order of magnitude smaller than those in conventional fluids. This is established by extensive computational ferrohydrodynamics through a detailed investigation of transitions in the flow structure, and characterization of behaviors of physical quantities such as the energy, the wave number, and the angular momentum through the bifurcations. A finding is that, as the magnetic field is increased, onset of turbulence can be determined accurately and reliably. Our results imply that experimental investigation of turbulence may be feasible by using ferrofluids. Our study of transition to and evolution of turbulence in the Taylor-Couette ferrofluidic flow system provides insights into the challenging problem of turbulence control.

Contributors: Altmeyer, Sebastian (Author) / Do, Younghae (Author) / Lai, Ying-Cheng (Author) / Ira A. Fulton Schools of Engineering (Contributor)
Created: 2015-06-12
Description

A relatively unexplored issue in cybersecurity science and engineering is whether there exist intrinsic patterns of cyberattacks. Conventional wisdom favors absence of such patterns due to the overwhelming complexity of the modern cyberspace. Surprisingly, through a detailed analysis of an extensive data set that records the time-dependent frequencies of attacks over a relatively wide range of consecutive IP addresses, we successfully uncover intrinsic spatiotemporal patterns underlying cyberattacks, where the term “spatio” refers to the IP address space. In particular, we focus on analyzing macroscopic properties of the attack traffic flows and identify two main patterns with distinct spatiotemporal characteristics: deterministic and stochastic. Strikingly, there are very few sets of major attackers committing almost all the attacks, since their attack “fingerprints” and target selection scheme can be unequivocally identified according to the very limited number of unique spatiotemporal characteristics, each of which only exists on a consecutive IP region and differs significantly from the others. We utilize a number of quantitative measures, including the flux-fluctuation law, the Markov state transition probability matrix, and predictability measures, to characterize the attack patterns in a comprehensive manner. A general finding is that the attack patterns possess high degrees of predictability, potentially paving the way to anticipating and, consequently, mitigating or even preventing large-scale cyberattacks using macroscopic approaches.

Contributors: Chen, Yu-Zhong (Author) / Huang, Zi-Gang (Author) / Xu, Shouhuai (Author) / Lai, Ying-Cheng (Author) / Ira A. Fulton Schools of Engineering (Contributor)
Created: 2015-05-20
Description

Supply-demand processes take place on a large variety of real-world networked systems ranging from power grids and the internet to social networking and urban systems. In a modern infrastructure, supply-demand systems are constantly expanding, leading to constant increase in load requirement for resources and consequently, to problems such as low efficiency, resource scarcity, and partial system failures. Under certain conditions global catastrophe on the scale of the whole system can occur through the dynamical process of cascading failures. We investigate optimization and resilience of time-varying supply-demand systems by constructing network models of such systems, where resources are transported from the supplier sites to users through various links. Here by optimization we mean minimization of the maximum load on links, and system resilience can be characterized using the cascading failure size of users who fail to connect with suppliers.

We consider two representative classes of supply schemes: load-driven supply and fixed-fraction supply. Our findings are: (1) optimized systems are more robust since relatively smaller cascading failures occur when triggered by external perturbation to the links; (2) a large fraction of links can be free of load if resources are directed to transport through the shortest paths; (3) redundant links in the performance of the system can help to reroute the traffic but may undesirably transmit and enlarge the failure size of the system; (4) the patterns of cascading failures depend strongly upon the capacity of links; (5) the specific location of the trigger determines the specific route of cascading failure, but has little effect on the final cascading size; (6) system expansion typically reduces the efficiency; and (7) when the locations of the suppliers are optimized over a long expanding period, fewer suppliers are required. These results hold for heterogeneous networks in general, providing insights into designing optimal and resilient complex supply-demand systems that expand constantly in time.

Contributors: Zhang, Si-Ping (Author) / Huang, Zi-Gang (Author) / Dong, Jia-Qi (Author) / Eisenberg, Daniel (Author) / Seager, Thomas (Author) / Lai, Ying-Cheng (Author) / Ira A. Fulton Schools of Engineering (Contributor)
Created: 2015-06-23
Description

In 2014/2015, Arizona State University (ASU) Libraries, the Labriola National American Indian Data Center, and the ASU American Indian Studies Department completed an ASU Institute for Humanities Research (IHR) seed grant entitled “Carlos Montezuma’s Wassaja Newsletter: Digitization, Access and Context” to digitize all ASU-held issues of the newsletter Wassaja Freedom’s Signal for the Indian, which Yavapai activist-intellectual Carlos Montezuma, MD (1866-1923) self-published during 1916-1922. The grant team additionally selected a portion of the ASU Libraries Carlos Montezuma archival collection for digitization to provide a more complete picture of Dr. Carlos Montezuma’s life and work.

The ASU grant team produced a searchable online collection on the ASU Digital Repository and created an online exhibition in conjunction with the IHR Nexus Lab’s Developing Wassaja Project. The Nexus Lab’s role at ASU is to grow the digital humanities through interdisciplinary collaborations bringing together humanities, science, and technology. The Nexus Lab partnered with the grant team to create the Developing Wassaja Project which provided an opportunity for faculty, staff, and students at ASU to engage in electronic publication through web application development.

The resulting web platform, Wassaja: A Carlos Montezuma Project, provides context for this digitized collection and facilitates community interaction, including a partnership with Dr. Montezuma’s home community the Fort McDowell Yavapai Nation. In this webcast, Digital Projects Librarian Matthew Harp, Developing Wassaja Project team member Joe Buenker (subject librarian), and grant team member Joyce Martin (librarian and curator of the Labriola National American Indian Data Center) will discuss and demonstrate the resources created and the resulting partnership with the Fort McDowell Yavapai Nation. The webcast will focus on identifying collaborators and needed skills to engage in Digital Humanities research and on identifying the stages of a collaborative project.

Participants will gain insight into working directly with diverse communities; overcoming technical limitations of traditional institutional repositories; collaborative strategies with faculty, research centers, and cultural heritage societies; solutions for moving hidden collections into an engaging digital exhibition; integrating digital humanities research and instruction with library curation; and preparing for long-term costs and management issues.

Contributors: Harp, Matthew (Author) / Martin, Joyce (Author) / Buenker, Joseph (Author)
Created: 2016-03-23
Description

Limited to streaming only those videos a vendor hosted, ASU Libraries sought to expand collection options with a trial project for hosting content locally. Kaltura was selected as the platform, but Kaltura does not work out of the box. This presentation will cover how, using Drupal along with Kaltura, we built a working video hosting solution. The presentation will cover administrative hurdles, stumbling blocks, pitfalls, enhancements, and lessons learned along the way.

Contributors: Harp, Matthew (Author) / farrelly, deg (Author) / Kurtz, Jeremy (Author) / Allgood, Tammy (Author)
Created: 2012-06-25
Description

While PhD dissertations are typically accessible, many other terminal degree projects remain invisible and inaccessible to a greater audience. Over the past year and a half, librarians at Arizona State University collaborated with faculty and departmental administrators across a variety of fields to develop and create institutional repository collections that highlight and authoritatively share this type of student scholarship with schools, researchers, and future employers. This poster will present the benefits, challenges, and considerations required to successfully implement and manage these collections of applied final projects or capstone projects. Specifically, challenges related to metadata consistency, faculty buy-in, and developing an ingest process, as well as benefits related to increased visibility and improved educational and employment opportunities, will be discussed. This interactive presentation will also discuss lessons learned from the presenters’ experiences and how attendees can apply them to benefit their respective institutions.

Contributors: Harp, Matthew (Author) / Dyal, Samuel (Author) / Pardon, Kevin (Author) / Arizona State University. ASU Library (Contributor)
Created: 2017-05-02