{"id":138962,"date":"2022-05-05T09:03:30","date_gmt":"2022-05-05T14:03:30","guid":{"rendered":"https:\/\/lifeboat.com\/blog\/2022\/05\/these-virtual-robot-arms-get-smarter"},"modified":"2022-05-05T09:03:30","modified_gmt":"2022-05-05T14:03:30","slug":"these-virtual-robot-arms-get-smarter","status":"publish","type":"post","link":"https:\/\/lifeboat.com\/blog\/2022\/05\/these-virtual-robot-arms-get-smarter","title":{"rendered":"These virtual robot arms get smarter"},"content":{"rendered":"<p><a class=\"aligncenter blog-photo\" href=\"https:\/\/lifeboat.com\/blog.images\/these-virtual-robot-arms-get-smarter2.jpg\"><\/a><\/p>\n<p>A virtual <a href=\"https:\/\/robotics-self-play.github.io\/\">robot arm has learned to solve a wide range of different puzzles<\/a>\u2014stacking blocks, setting the table, arranging chess pieces\u2014without having to be retrained for each task. It did this by playing against a second robot arm that was trained to give it harder and harder challenges.<\/p>\n<p><strong>Self play: <\/strong>Developed by researchers at <a href=\"https:\/\/www.technologyreview.com\/2020\/02\/17\/844721\/ai-openai-moonshot-elon-musk-sam-altman-greg-brockman-messy-secretive-reality\/\">OpenAI<\/a>, the identical robot arms\u2014Alice and Bob\u2014learn by playing a game against each other in a simulation, without human input. The robots use reinforcement learning, a technique in which AIs learn by trial and error which actions to take in different situations to achieve certain goals. The game involves moving objects around on a virtual tabletop. By arranging objects in specific ways, Alice tries to set puzzles that are hard for Bob to solve. Bob tries to solve Alice\u2019s puzzles. 
As they learn, Alice sets more complex puzzles and Bob gets better at solving them.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>A virtual robot arm has learned to solve a wide range of different puzzles\u2014stacking blocks, setting the table, arranging chess pieces\u2014without having to be retrained for each task. It did this by playing against a second robot arm that was trained to give it harder and harder challenges. Self play: Developed by researchers at [\u2026]<\/p>\n","protected":false},"author":662,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1509,6],"tags":[],"class_list":["post-138962","post","type-post","status-publish","format-standard","hentry","category-entertainment","category-robotics-ai"],"_links":{"self":[{"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/posts\/138962","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/users\/662"}],"replies":[{"embeddable":true,"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/comments?post=138962"}],"version-history":[{"count":0,"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/posts\/138962\/revisions"}],"wp:attachment":[{"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/media?parent=138962"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/categories?post=138962"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/tags?post=138962"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}