Matching Items (2,883)
Filtering by

Clear all filters

156689-Thumbnail Image.png
Description
Since the advent of the internet and even more after social media platforms, the explosive growth of textual data and its availability has made analysis a tedious task. Information extraction systems are available but are generally too specific and often only extract certain kinds of information they deem necessary and

Since the advent of the internet and even more after social media platforms, the explosive growth of textual data and its availability has made analysis a tedious task. Information extraction systems are available but are generally too specific and often only extract certain kinds of information they deem necessary and extraction worthy. Using data visualization theory and fast, interactive querying methods, leaving out information might not really be necessary. This thesis explores textual data visualization techniques, intuitive querying, and a novel approach to all-purpose textual information extraction to encode large text corpus to improve human understanding of the information present in textual data.

This thesis presents a modified traversal algorithm on dependency parse output of text to extract all subject predicate object pairs from text while ensuring that no information is missed out. To support full scale, all-purpose information extraction from large text corpuses, a data preprocessing pipeline is recommended to be used before the extraction is run. The output format is designed specifically to fit on a node-edge-node model and form the building blocks of a network which makes understanding of the text and querying of information from corpus quick and intuitive. It attempts to reduce reading time and enhancing understanding of the text using interactive graph and timeline.
ContributorsHashmi, Syed Usama (Author) / Bansal, Ajay (Thesis advisor) / Bansal, Srividya (Committee member) / Gonzalez Sanchez, Javier (Committee member) / Arizona State University (Publisher)
Created2018
174861-Thumbnail Image.jpg
Created1925-19-39 (uncertain)
174868-Thumbnail Image.jpg
Created1934
174924-Thumbnail Image.jpg
Created1926
174931-Thumbnail Image.jpg
Created1926
174934-Thumbnail Image.jpg
Created1926
174981-Thumbnail Image.jpg
Created1928
Description

Human Papillomavirus, or HPV, is a viral pathogen that most commonly spreads through sexual contact. HPV strains 6 and 11 normally cause genital warts, while HPV strains 16 and 18 commonly cause cervical cancer, which causes cancerous cells to spread in the cervix. Physicians can detect those HPV strains, using

Human Papillomavirus, or HPV, is a viral pathogen that most commonly spreads through sexual contact. HPV strains 6 and 11 normally cause genital warts, while HPV strains 16 and 18 commonly cause cervical cancer, which causes cancerous cells to spread in the cervix. Physicians can detect those HPV strains, using a Pap smear, which is a diagnostic test that collects cells from the female cervix.

Created2021-04-06
Description

Johann Gregor Mendel studied patterns of trait inheritance in plants during the nineteenth century. Mendel, an Augustinian monk, conducted experiments on pea plants at St. Thomas’ Abbey in what is now Brno, Czech Republic. Twentieth century scientists used Mendel’s recorded observations to create theories about genetics.

Created2022-01-13