{"id":169837,"date":"2023-08-17T03:24:16","date_gmt":"2023-08-17T08:24:16","guid":{"rendered":"https:\/\/lifeboat.com\/blog\/2023\/08\/ai-and-the-issues-with-data-scraping"},"modified":"2023-08-17T03:24:16","modified_gmt":"2023-08-17T08:24:16","slug":"ai-and-the-issues-with-data-scraping","status":"publish","type":"post","link":"https:\/\/lifeboat.com\/blog\/2023\/08\/ai-and-the-issues-with-data-scraping","title":{"rendered":"AI And The Issues with Data Scraping"},"content":{"rendered":"<p><a class=\"aligncenter blog-photo\" href=\"https:\/\/lifeboat.com\/blog.images\/ai-and-the-issues-with-data-scraping2.jpg\"><\/a><\/p>\n<p>This post is also available in: <a href=\"https:\/\/i-hls.com\/he\/archives\/120432\" class=\"\"><img decoding=\"async\" style=\"display:inline; margin: 0;\" class=\"\" src=\"https:\/\/i-hls.com\/wp-content\/plugins\/sitepress-multilingual-cms\/res\/flags\/he.png\" alt=\"he\" title=\"\u05e2\u05d1\u05e8\u05d9\u05ea\"> \u05e2\u05d1\u05e8\u05d9\u05ea (Hebrew)<\/a><\/p>\n<p>Many artificial intelligence tools use public data to train their large language models, but now large social media sites are looking for ways to defend against data scraping. The problem is that scraping isn\u2019t currently illegal.<\/p>\n<p>According to Cybernews, data scraping refers to a computer program extracting data from the output generated by another program, and it is becoming a big problem for large social media sites like Twitter or Reddit.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>This post is also available in: \u05e2\u05d1\u05e8\u05d9\u05ea (Hebrew) Many artificial intelligence tools use public data to train their large language models, but now large social media sites are looking for ways to defend against data scraping. The problem is that scraping isn\u2019t currently illegal. 
According to Cybernews, data scraping refers to a computer program extracting [\u2026]<\/p>\n","protected":false},"author":662,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[6],"tags":[],"class_list":["post-169837","post","type-post","status-publish","format-standard","hentry","category-robotics-ai"],"_links":{"self":[{"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/posts\/169837","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/users\/662"}],"replies":[{"embeddable":true,"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/comments?post=169837"}],"version-history":[{"count":0,"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/posts\/169837\/revisions"}],"wp:attachment":[{"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/media?parent=169837"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/categories?post=169837"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/tags?post=169837"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}