<?xml version="1.0"?>
<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/ http://www.openarchives.org/OAI/2.0/OAI-PMH.xsd"><responseDate>2026-05-13T11:02:40Z</responseDate><request verb="GetRecord" metadataPrefix="oai_dc">https://keep.lib.asu.edu/oai/request</request><GetRecord><record><header><identifier>oai:keep.lib.asu.edu:node-153003</identifier><datestamp>2024-12-20T18:25:12Z</datestamp><setSpec>oai_pmh:all</setSpec><setSpec>oai_pmh:repo_items</setSpec></header><metadata><oai_dc:dc xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/oai_dc/ http://www.openarchives.org/OAI/2.0/oai_dc.xsd"><dc:identifier>153003</dc:identifier>
          <dc:identifier>https://hdl.handle.net/2286/R.I.25942</dc:identifier>
                  <dc:rights>http://rightsstatements.org/vocab/InC/1.0/</dc:rights>
          <dc:rights>All Rights Reserved</dc:rights>
                  <dc:date>2014</dc:date>
                  <dc:format>viii, 90 p. : ill. (some col.)</dc:format>
                  <dc:type>Doctoral Dissertation</dc:type>
          <dc:type>Academic theses</dc:type>
          <dc:type>Text</dc:type>
                  <dc:language>eng</dc:language>
                  <dc:contributor>De, Sushovan</dc:contributor>
          <dc:contributor>Kambhampati, Subbarao</dc:contributor>
          <dc:contributor>Chen, Yi</dc:contributor>
          <dc:contributor>Candan, K. Selcuk</dc:contributor>
          <dc:contributor>Liu, Huan</dc:contributor>
          <dc:contributor>Arizona State University</dc:contributor>
                  <dc:description>Partial requirement for: Ph.D., Arizona State University, 2014</dc:description>
          <dc:description>Includes bibliographical references (p. 87-90)</dc:description>
          <dc:description>Field of study: Computer science</dc:description>
          <dc:description>Recent efforts in data cleaning have focused mostly on problems like data deduplication, record matching, and data standardization; few of these focus on fixing incorrect attribute values in tuples. Correcting values in tuples is typically performed by a minimum cost repair of tuples that violate static constraints like CFDs (which have to be provided by domain experts, or learned from a clean sample of the database). In this thesis, I provide a method for correcting individual attribute values in a structured database using a Bayesian generative model and a statistical error model learned from the noisy database directly. I thus avoid the necessity for a domain expert or master data. I also show how to efficiently perform consistent query answering using this model over a dirty database, in case write permissions to the database are unavailable. A Map-Reduce architecture to perform this computation in a distributed manner is also shown. I evaluate these methods over both synthetic and real data.</dc:description>
                  <dc:subject>Computer Science</dc:subject>
          <dc:subject>Consistent Query Answering</dc:subject>
          <dc:subject>Databases</dc:subject>
          <dc:subject>Data Cleaning</dc:subject>
          <dc:subject>Information retrieval</dc:subject>
          <dc:subject>Probabilistic databases</dc:subject>
          <dc:subject>Database management</dc:subject>
          <dc:subject>Structured programming</dc:subject>
          <dc:subject>Information retrieval</dc:subject>
                  <dc:title>Unsupervised Bayesian data cleaning techniques for structured data</dc:title></oai_dc:dc></metadata></record></GetRecord></OAI-PMH>
