{"id":652,"date":"2024-01-11T15:08:32","date_gmt":"2024-01-11T14:08:32","guid":{"rendered":"https:\/\/www.francelabs.com\/blog\/?p=652"},"modified":"2024-01-11T15:08:33","modified_gmt":"2024-01-11T14:08:33","slug":"how-enterprise-search-can-help-you-for-gdpr-compliance","status":"publish","type":"post","link":"https:\/\/www.francelabs.com\/blog\/how-enterprise-search-can-help-you-for-gdpr-compliance\/","title":{"rendered":"How Enterprise Search can help you for GDPR compliance"},"content":{"rendered":"\n<p>Datafari, as an <a href=\"https:\/\/www.datafari.com\/en\">Enterprise Search solution<\/a>, has an overall visibility over all of the knowledge bases of an organization. As such, it is a good entry point to check where PII (Personally Identifiable Information) are stored. <\/p>\n\n\n\n<p>Indeed, as part of the GDPR requirements, any organization must maintain a list of where PII data are stored. But as soon as the knowledge base grows too much, it is impossible to manually maintain such a list. Distributing this task over the different departments of the organization is a good start, but it has its limits, for instance due to the possible misinterpretation from colleagues about what PII are. <\/p>\n\n\n\n<!--more-->\n\n\n\n<p>This is where Enterprise Search solutions come in handy: because they go through all of the internal documents and data, it is simple to add detection mechanisms to automate the generation of a list of documents that are potential candidates as PII holders.<\/p>\n\n\n\n<p>Such a feature is feasible for free, using the open source version of Datafari, aka Datafari Community Edition. We presented during the <a href=\"https:\/\/www.opensource-experience.com\/\">Open Source Experience event<\/a> in Paris end of 2023, a demo and a walkthrough on how to set it up. Thanks for this tutorial, you can have an end-to-end systems that detects regular expressions (think phone numbers, social security card numbers etc) as well as entities via Machine Learning (people names, organizations for instance) using a dedicated Spacy server leveraging the Transformers models. You can now do it yourself following this link that details the necessary steps: <a href=\"https:\/\/datafari.atlassian.net\/wiki\/spaces\/DATAFARI\/pages\/2910486529\/GDPR+Inventory+-+Identify+documents+with+privacy+related+data+with+Datafari\" target=\"_blank\" rel=\"noreferrer noopener\">using Datafari for GDPR PII inventory<\/a>.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><a href=\"https:\/\/www.francelabs.com\/blog\/wp-content\/uploads\/2024\/01\/image.png\"><img loading=\"lazy\" decoding=\"async\" width=\"361\" height=\"769\" src=\"https:\/\/www.francelabs.com\/blog\/wp-content\/uploads\/2024\/01\/image.png\" alt=\"\" class=\"wp-image-655\" srcset=\"https:\/\/www.francelabs.com\/blog\/wp-content\/uploads\/2024\/01\/image.png 361w, https:\/\/www.francelabs.com\/blog\/wp-content\/uploads\/2024\/01\/image-141x300.png 141w\" sizes=\"auto, (max-width: 361px) 100vw, 361px\" \/><\/a><\/figure>\n\n\n\n<p>Should you need more help to assist you, you can obviously reach out to us.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Datafari, as an Enterprise Search solution, has an overall visibility over all of the knowledge bases of an organization. As such, it is a good entry point to check where PII (Personally Identifiable Information) are stored. Indeed, as part of &hellip; <a href=\"https:\/\/www.francelabs.com\/blog\/how-enterprise-search-can-help-you-for-gdpr-compliance\/\">Continue reading <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[53,1],"tags":[],"class_list":["post-652","post","type-post","status-publish","format-standard","hentry","category-datafari","category-search"],"_links":{"self":[{"href":"https:\/\/www.francelabs.com\/blog\/wp-json\/wp\/v2\/posts\/652","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.francelabs.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.francelabs.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.francelabs.com\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.francelabs.com\/blog\/wp-json\/wp\/v2\/comments?post=652"}],"version-history":[{"count":4,"href":"https:\/\/www.francelabs.com\/blog\/wp-json\/wp\/v2\/posts\/652\/revisions"}],"predecessor-version":[{"id":657,"href":"https:\/\/www.francelabs.com\/blog\/wp-json\/wp\/v2\/posts\/652\/revisions\/657"}],"wp:attachment":[{"href":"https:\/\/www.francelabs.com\/blog\/wp-json\/wp\/v2\/media?parent=652"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.francelabs.com\/blog\/wp-json\/wp\/v2\/categories?post=652"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.francelabs.com\/blog\/wp-json\/wp\/v2\/tags?post=652"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}