{"id":33360,"date":"2017-01-10T23:05:05","date_gmt":"2017-01-11T07:05:05","guid":{"rendered":"http:\/\/lifeboat.com\/blog\/2017\/01\/building-a-google-for-the-dark-web"},"modified":"2017-06-04T08:20:38","modified_gmt":"2017-06-04T15:20:38","slug":"building-a-google-for-the-dark-web","status":"publish","type":"post","link":"https:\/\/lifeboat.com\/blog\/2017\/01\/building-a-google-for-the-dark-web","title":{"rendered":"Building a Google for the dark web"},"content":{"rendered":"<p><a class=\"aligncenter blog-photo\" href=\"https:\/\/lifeboat.com\/blog.images\/building-a-google-for-the-dark-web.jpg\"><\/a><\/p>\n<p>I can honestly state there is already one that folks are using; I would suggest DARPA should assess it and maybe acquire it. As it would give them a jump start and they can enhance it for their own needs.<\/p>\n<hr>\n<p>In today\u2019s data-rich world, companies, governments and individuals want to analyze anything and everything they can get their hands on \u2013 and the World Wide Web has loads of information. At present, the most easily indexed material from the web is text. But <a href=\"http:\/\/www.popsci.com\/dark-web-revealed\">as much as 89<\/a> to <a href=\"https:\/\/www.quora.com\/How-big-is-the-deep-web\/answer\/Joseph-Hirschhorn-Howard\">96 percent<\/a> of the content on the internet is actually something else \u2013 images, video, audio, <a href=\"http:\/\/www.iana.org\/assignments\/media-types\/media-types.xhtml\">in all thousands of different kinds of nontextual data types<\/a>.<\/p>\n<p>Further, the vast majority of online content isn\u2019t available in a form that\u2019s easily indexed by electronic archiving systems like Google\u2019s. Rather, it requires a user to log in, or it is provided dynamically by a program running when a user visits the page. If we\u2019re going to catalog online human knowledge, we need to be sure we can get to and recognize all of it, and that we can do so automatically.<\/p>\n<p>How can we teach computers to recognize, index and search all the different types of material that\u2019s available online? Thanks to federal efforts in the global fight against <a href=\"http:\/\/phys.org\/tags\/human+trafficking\/\" rel=\"tag\" class=\"\">human trafficking<\/a> and weapons dealing, my research forms the basis for a new tool that can help with this effort.<\/p>\n<p><!-- Link: <a href=\"http:\/\/phys.org\/news\/2017-01-google-dark-web.html\">http:\/\/phys.org\/news\/2017&#45;01-google-dark-web.html<\/a> --><\/p>\n","protected":false},"excerpt":{"rendered":"<p>I can honestly state there is already one that folks are using; I would suggest DARPA should assess it and maybe acquire it. As it would give them a jump start and they can enhance it for their own needs. In today\u2019s data-rich world, companies, governments and individuals want to analyze anything and everything they [\u2026]<\/p>\n","protected":false},"author":395,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[418,6],"tags":[],"class_list":["post-33360","post","type-post","status-publish","format-standard","hentry","category-internet","category-robotics-ai"],"_links":{"self":[{"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/posts\/33360","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/users\/395"}],"replies":[{"embeddable":true,"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/comments?post=33360"}],"version-history":[{"count":2,"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/posts\/33360\/revisions"}],"predecessor-version":[{"id":59197,"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/posts\/33360\/revisions\/59197"}],"wp:attachment":[{"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/media?parent=33360"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/categories?post=33360"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/tags?post=33360"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}