<?xml version="1.0"?>
<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/ http://www.openarchives.org/OAI/2.0/OAI-PMH.xsd"><responseDate>2026-05-25T03:12:43Z</responseDate><request verb="GetRecord" metadataPrefix="oai_dc">https://keep.lib.asu.edu/oai/request</request><GetRecord><record><header><identifier>oai:keep.lib.asu.edu:node-156938</identifier><datestamp>2024-12-20T18:25:12Z</datestamp><setSpec>oai_pmh:all</setSpec><setSpec>oai_pmh:repo_items</setSpec></header><metadata><oai_dc:dc xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/oai_dc/ http://www.openarchives.org/OAI/2.0/oai_dc.xsd"><dc:identifier>156938</dc:identifier>
          <dc:identifier>https://hdl.handle.net/2286/R.I.51713</dc:identifier>
                  <dc:rights>http://rightsstatements.org/vocab/InC/1.0/</dc:rights>
                  <dc:date>2018</dc:date>
                  <dc:format>109 pages</dc:format>
                  <dc:type>Masters Thesis</dc:type>
          <dc:type>Academic theses</dc:type>
          <dc:type>Text</dc:type>
                  <dc:language>eng</dc:language>
                  <dc:contributor>KAMBAM, KARTHIK</dc:contributor>
          <dc:contributor>Zhang, Wenlong</dc:contributor>
          <dc:contributor>Nedich, Angelia</dc:contributor>
          <dc:contributor>Ren, Yi</dc:contributor>
          <dc:contributor>Arizona State University</dc:contributor>
                  <dc:description>Masters Thesis Electrical Engineering 2018</dc:description>
          <dc:description>Coordination and control of Intelligent Agents as a team is considered in this thesis.&lt;br/&gt;&lt;br/&gt;Intelligent agents learn from experiences, and in times of uncertainty use the knowl-&lt;br/&gt;&lt;br/&gt;edge acquired to make decisions and accomplish their individual or team objectives.&lt;br/&gt;&lt;br/&gt;Agent objectives are defined using cost functions designed uniquely for the collective&lt;br/&gt;&lt;br/&gt;task being performed. Individual agent costs are coupled in such a way that group ob-&lt;br/&gt;&lt;br/&gt;jective is attained while minimizing individual costs. Information Asymmetry refers&lt;br/&gt;&lt;br/&gt;to situations where interacting agents have no knowledge or partial knowledge of cost&lt;br/&gt;&lt;br/&gt;functions of other agents. By virtue of their intelligence, i.e., by learning from past&lt;br/&gt;&lt;br/&gt;experiences agents learn cost functions of other agents, predict their responses and&lt;br/&gt;&lt;br/&gt;act adaptively to accomplish the team’s goal.&lt;br/&gt;&lt;br/&gt;Algorithms that agents use for learning others’ cost functions are called Learn-&lt;br/&gt;&lt;br/&gt;ing Algorithms, and algorithms agents use for computing actuation (control) which&lt;br/&gt;&lt;br/&gt;drives them towards their goal and minimize their cost functions are called Control&lt;br/&gt;&lt;br/&gt;Algorithms. Typically knowledge acquired using learning algorithms is used in con-&lt;br/&gt;&lt;br/&gt;trol algorithms for computing control signals. Learning and control algorithms are&lt;br/&gt;&lt;br/&gt;designed in such a way that the multi-agent system as a whole remains stable during&lt;br/&gt;&lt;br/&gt;learning and later at an equilibrium. An equilibrium is defined as the event/point&lt;br/&gt;&lt;br/&gt;where cost functions of all agents are optimized simultaneously. Cost functions are&lt;br/&gt;&lt;br/&gt;designed so that the equilibrium coincides with the goal state multi-agent system as&lt;br/&gt;&lt;br/&gt;a whole is trying to reach.&lt;br/&gt;&lt;br/&gt;In collective load transport, two or more agents (robots) carry a load from point&lt;br/&gt;&lt;br/&gt;A to point B in space. Robots could have different control preferences, for example,&lt;br/&gt;&lt;br/&gt;different actuation abilities, however, are still required to coordinate and perform&lt;br/&gt;&lt;br/&gt;load transport. Control preferences for each robot are characterized using a scalar&lt;br/&gt;&lt;br/&gt;parameter θ i unique to the robot being considered and unknown to other robots. &lt;br/&gt;&lt;br/&gt;With the aid of state and control input observations, agents learn control preferences&lt;br/&gt;&lt;br/&gt;of other agents, optimize individual costs and drive the multi-agent system to a goal&lt;br/&gt;&lt;br/&gt;state.&lt;br/&gt;&lt;br/&gt;Two learning and Control algorithms are presented. In the first algorithm(LCA-&lt;br/&gt;&lt;br/&gt;1), an existing work, each agent optimizes a cost function similar to 1-step receding&lt;br/&gt;&lt;br/&gt;horizon optimal control problem for control. LCA-1 uses recursive least squares as&lt;br/&gt;&lt;br/&gt;the learning algorithm and guarantees complete learning in two time steps. LCA-1 is&lt;br/&gt;&lt;br/&gt;experimentally verified as part of this thesis.&lt;br/&gt;&lt;br/&gt;A novel learning and control algorithm (LCA-2) is proposed and verified in sim-&lt;br/&gt;&lt;br/&gt;ulations and on hardware. In LCA-2, each agent solves an infinite horizon linear&lt;br/&gt;&lt;br/&gt;quadratic regulator (LQR) problem for computing control. LCA-2 uses a learning al-&lt;br/&gt;&lt;br/&gt;gorithm similar to line search methods, and guarantees learning convergence to true&lt;br/&gt;&lt;br/&gt;values asymptotically.&lt;br/&gt;&lt;br/&gt;Simulations and hardware implementation show that the LCA-2 is stable for a&lt;br/&gt;&lt;br/&gt;variety of systems. Load transport is demonstrated using both the algorithms. Ex-&lt;br/&gt;&lt;br/&gt;periments running algorithm LCA-2 are able to resist disturbances and balance the&lt;br/&gt;&lt;br/&gt;assumed load better compared to LCA-1.</dc:description>
                  <dc:subject>Electrical Engineering</dc:subject>
          <dc:subject>Robotics</dc:subject>
          <dc:subject>Systems science</dc:subject>
          <dc:subject>Control Theory</dc:subject>
          <dc:subject>Learning and Control Algorithms</dc:subject>
          <dc:subject>LQR</dc:subject>
          <dc:subject>Multi-agent coordination</dc:subject>
          <dc:subject>Numerical Methods</dc:subject>
          <dc:subject>Optimization</dc:subject>
                  <dc:title>Multi-Agent Coordination and Control under Information Asymmetry with Applications to Collective Load Transport</dc:title></oai_dc:dc></metadata></record></GetRecord></OAI-PMH>
