<?xml version="1.0"?>
<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/ http://www.openarchives.org/OAI/2.0/OAI-PMH.xsd"><responseDate>2026-05-19T01:10:47Z</responseDate><request verb="GetRecord" metadataPrefix="oai_dc">https://keep.lib.asu.edu/oai/request</request><GetRecord><record><header><identifier>oai:keep.lib.asu.edu:node-201147</identifier><datestamp>2025-05-05T15:53:02Z</datestamp><setSpec>oai_pmh:all</setSpec><setSpec>oai_pmh:repo_items</setSpec></header><metadata><oai_dc:dc xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/oai_dc/ http://www.openarchives.org/OAI/2.0/oai_dc.xsd"><dc:identifier>201147</dc:identifier>
          <dc:identifier>https://hdl.handle.net/2286/R.2.N.201147</dc:identifier>
                  <dc:rights>http://rightsstatements.org/vocab/InC/1.0/</dc:rights>
          <dc:rights>All Rights Reserved</dc:rights>
                  <dc:date>2025</dc:date>
                  <dc:format>53 pages</dc:format>
                  <dc:type>Masters Thesis</dc:type>
          <dc:type>Academic theses</dc:type>
                  <dc:language>en</dc:language>
                  <dc:contributor>Liou, Kuan-Ru</dc:contributor>
          <dc:contributor>Wei, Hua</dc:contributor>
          <dc:contributor>Zou, Jia</dc:contributor>
          <dc:contributor>Choi, YooJung</dc:contributor>
          <dc:contributor>Arizona State University</dc:contributor>
                  <dc:description>Partial requirement for: M.S., Arizona State University, 2025</dc:description>
          <dc:description>Field of study: Computer Science</dc:description>
          <dc:description>Asking ambiguous questions is a natural aspect of human communication, making it essential for Large Language Models (LLMs) to effectively recognize and address ambiguities. However, a comprehensive analysis of how well LLMs detect and solve ambiguities is lacking. Besides, though several datasets exist on ambiguity, the absence of explicit explanations of ambiguity and annotations of ambiguity types limits the comprehensive evaluation. To address this issue, I introduce Abg-SciQA, a dataset designed to evaluate and help LLMs detect ambiguities and generate appropriate clarification questions using challenge questions in the area of social and nature science. Abg-SciQA encompasses four tasks: Ambiguity Detection, Ambiguity Type Classification, Clarification Question Generation, and Clarification-Based Question Answering, where each task has corresponding annotations. I evaluate the dataset using both closed-source and open-source LLMs and fine-tune it on open-source LLMs. My experiments show that the most state-of-the-art LLMs still encounter difficultiesin resolving ambiguity in natural questions, and fine-tuning on Abg-SciQA can significantly enhance their capabilities to understand and address ambiguities. Notably, in the Ambiguity Detection task, the F1 score of Llama2-7b improves significantly from 16.6% to 79.1%. On the other hand, Abg-SciQA remains a challenging benchmark for LLMs, revealing ample room for model improvement.

</dc:description>
                  <dc:subject>Computer Science</dc:subject>
          <dc:subject>Linguistics</dc:subject>
          <dc:subject>Ambiguity</dc:subject>
          <dc:subject>Dataset</dc:subject>
          <dc:subject>Fine-tuning</dc:subject>
          <dc:subject>LLMs</dc:subject>
                  <dc:title>Abg-SciQA: Benchmarking and Enhancing Ambiguity Detection and Clarification in Language Models</dc:title></oai_dc:dc></metadata></record></GetRecord></OAI-PMH>
