Data anonymization patent landscape

  • Mirjana Pejić Bach, Department of Informatics, Faculty of Economics & Business, University of Zagreb
  • Jasmina Pivar, Department of Informatics, Faculty of Economics & Business, University of Zagreb
  • Ksenija Dumičić, Department of Statistics, Faculty of Economics & Business, University of Zagreb

Abstract

The omnipresent, unstoppable increase in digital data has led to a greater understanding of the importance of data privacy, and different approaches are used to implement it. The goal of this paper is to develop a data anonymization patent landscape by determining the following: (i) the trend in data anonymization patenting, (ii) the type of technical content protected in data anonymization, (iii) the organizations and countries most active in patenting data anonymization know-how, and (iv) the topics emerging most often in patent titles. Patents relating to data anonymization from 2001 to 2015 were retrieved from the PatSeer database and analyzed using a longitudinal approach in combination with text mining techniques.

The results indicate the following. The number of single patent families is growing, with a sharp increase after 2010, indicating a positive trend in the patenting of data anonymization solutions. The majority of patenting activity falls under the G (Physics) section. Organizations from the USA and Japan are the assignees of the majority of patents related to data anonymization. Text mining shows that the words used most often in the titles of data anonymization patents are “anonym*”, “method”, “data” and “system”. Additional words indicating the most frequent topics related to data anonymization include “equipment”, “software”, “protection”, “identification” and “encryption”, along with more specific topics such as “community”, “medical” and “service”.

Published
2017-03-31
Section
CRORR Journal Regular Issue