{"id":155331,"date":"2023-01-13T04:25:33","date_gmt":"2023-01-13T10:25:33","guid":{"rendered":"https:\/\/lifeboat.com\/blog\/2023\/01\/first-look-rl-cai-madison-claude-fine-tuned-52b-by-anthropic-announced-dec-2022-rlaif-v-rlhf"},"modified":"2023-01-13T04:25:33","modified_gmt":"2023-01-13T10:25:33","slug":"first-look-rl-cai-madison-claude-fine-tuned-52b-by-anthropic-announced-dec-2022-rlaif-v-rlhf","status":"publish","type":"post","link":"https:\/\/lifeboat.com\/blog\/2023\/01\/first-look-rl-cai-madison-claude-fine-tuned-52b-by-anthropic-announced-dec-2022-rlaif-v-rlhf","title":{"rendered":"First look \u2014 RL-CAI\/Madison\/Claude (fine-tuned 52B) by Anthropic \u2014 Announced Dec\/2022 (RLAIF v RLHF)"},"content":{"rendered":"<p><\/p>\n<p><iframe style=\"display: block; margin: 0 auto; width: 100%; aspect-ratio: 4\/3; object-fit: contain;\" src=\"https:\/\/www.youtube.com\/embed\/B7Mg8Hbcc0w?feature=oembed\" frameborder=\"0\" allow=\"accelerometer; autoplay; encrypted-media; gyroscope;\n   picture-in-picture\" allowfullscreen><\/iframe><\/p>\n<p>The Memo: <a href=\"https:\/\/lifearchitect.ai\/memo\/\">https:\/\/lifearchitect.ai\/memo\/<\/a><\/p>\n<p>Read the paper: <a href=\"https:\/\/arxiv.org\/abs\/2212.08073\">https:\/\/arxiv.org\/abs\/2212.08073<\/a><br \/> GitHub repo: <a href=\"https:\/\/github.com\/anthropics\/ConstitutionalHarmlessnessPaper\/tree\/main\/samples\">https:\/\/github.com\/anthropics\/ConstitutionalHarmlessnessPaper\/tree\/main\/samples<\/a>.<\/p>\n<p>Chapters:<br \/> 0:00 Opening.<br \/> 3:59 Demonstration.<br \/> 11:26 Explanation.<\/p>\n<p>Dr Alan D. Thompson is a world expert in artificial intelligence (AI), specialising in the augmentation of human intelligence, and advancing the evolution of \u2018integrated AI\u2019. 
Alan\u2019s applied AI research and visualisations are featured across major international media, including citations in the University of Oxford\u2019s debate on AI Ethics in December 2021.<\/p>\n<blockquote class=\"wp-embedded-content\" data-secret=\"VD2IYhSWYS\"><p><a href=\"https:\/\/lifearchitect.ai\/\">Home<\/a><\/p><\/blockquote>\n<p>Music:<br \/> Under licence.<\/p>\n<p>Liborio Conti \u2014 Looking Forward (The Memo outro)<\/p>\n","protected":false},"excerpt":{"rendered":"<p>The Memo: https:\/\/lifearchitect.ai\/memo\/ Read the paper: https:\/\/arxiv.org\/abs\/2212.08073 GitHub repo: https:\/\/github.com\/anthropics\/ConstitutionalHarmlessnessPaper\/tree\/main\/samples. Chapters: 0:00 Opening. 3:59 Demonstration. 11:26 Explanation. Dr Alan D. Thompson is a world expert in artificial intelligence (AI), specialising in the augmentation of human intelligence, and advancing the evolution of \u2018integrated AI\u2019. 
Alan\u2019s applied AI research and visualisations are featured across major international media, [\u2026]<\/p>\n","protected":false},"author":556,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[30,6],"tags":[],"class_list":["post-155331","post","type-post","status-publish","format-standard","hentry","category-ethics","category-robotics-ai"],"_links":{"self":[{"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/posts\/155331","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/users\/556"}],"replies":[{"embeddable":true,"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/comments?post=155331"}],"version-history":[{"count":0,"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/posts\/155331\/revisions"}],"wp:attachment":[{"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/media?parent=155331"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/categories?post=155331"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/tags?post=155331"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}