K-Anonymization Implementation Using Apache Spark
dc.contributor.author | Tortikar, Pratik | |
dc.date.accessioned | 2019-04-08T16:11:05Z | |
dc.date.available | 2019-04-08T16:11:05Z | |
dc.date.issued | 2019 | |
dc.description.abstract | This paper experiments with anonymizing data that can reveal a person’s identity, using the k-anonymity principle, under which each record is indistinguishable from at least k-1 others. The goal can be stated as: “Given person-specific field-structured data, produce a release of the data with scientific guarantees that the individuals who are the subjects of the data cannot be re-identified while the data remain practically useful.” With huge amounts of data in play, privacy-preserving techniques aim to protect sensitive records while keeping the released information meaningful, easing concerns about how personal data is handled. In this paper, we study the k-anonymity principle in the context of big data and introduce top-down k-anonymization, l-diversity, and t-closeness solutions for Apache Spark using Java. In an era of large data volumes, more scalable and efficient methods are needed to prevent data leakage: public health records and diagnoses, combined with attributes such as name, ZIP code, race, and education, can reveal identities and violate an individual’s privacy. | en_US |
dc.identifier.uri | https://hdl.handle.net/10365/29524 | |
dc.publisher | North Dakota State University | en_US |
dc.rights | NDSU Policy 190.6.2 | |
dc.rights.uri | https://www.ndsu.edu/fileadmin/policy/190.pdf | |
dc.subject.lcsh | Computer security. | |
dc.subject.lcsh | Data protection. | |
dc.subject.lcsh | Privacy, Right of. | |
dc.subject.lcsh | Big data. | |
dc.subject.lcsh | Spark (Electronic resource : Apache Software Foundation) | |
dc.title | K-Anonymization Implementation Using Apache Spark | en_US |
dc.type | Master's paper | en_US |
ndsu.advisor | Ludwig, Simone | |
ndsu.college | Engineering | en_US |
ndsu.degree | Master of Science (MS) | en_US |
ndsu.department | Computer Science | en_US |
ndsu.program | Software Engineering | en_US |
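As a rough illustration of the k-anonymity principle named in the abstract, the sketch below checks whether a small table is k-anonymous by grouping rows on their quasi-identifier columns and finding the smallest group. It is plain Java rather than Spark, and it is not the paper's implementation: the class name `KAnonymityCheck`, the method names, the column layout (ZIP code, age range, diagnosis), and the sample rows are all hypothetical and chosen only for this example.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical helper, not taken from the paper: checks k-anonymity of a small in-memory table.
public class KAnonymityCheck {

    // Size of the smallest group of rows sharing the same quasi-identifier values (0 if no rows).
    public static int smallestGroupSize(List<String[]> rows, int[] quasiIdColumns) {
        Map<List<String>, Integer> counts = new HashMap<>();
        for (String[] row : rows) {
            List<String> key = new ArrayList<>();
            for (int col : quasiIdColumns) {
                key.add(row[col]);
            }
            counts.merge(key, 1, Integer::sum);
        }
        int min = Integer.MAX_VALUE;
        for (int count : counts.values()) {
            min = Math.min(min, count);
        }
        return counts.isEmpty() ? 0 : min;
    }

    // A non-empty table is k-anonymous if every quasi-identifier group has at least k rows.
    public static boolean isKAnonymous(List<String[]> rows, int[] quasiIdColumns, int k) {
        return !rows.isEmpty() && smallestGroupSize(rows, quasiIdColumns) >= k;
    }

    public static void main(String[] args) {
        // Assumed layout: column 0 = generalized ZIP, column 1 = age range, column 2 = diagnosis.
        List<String[]> generalized = Arrays.asList(
            new String[] {"581**", "20-29", "flu"},
            new String[] {"581**", "20-29", "asthma"},
            new String[] {"582**", "30-39", "flu"},
            new String[] {"582**", "30-39", "diabetes"}
        );
        int[] quasiIds = {0, 1}; // ZIP and age range act as quasi-identifiers here

        System.out.println(isKAnonymous(generalized, quasiIds, 2)); // true
        System.out.println(isKAnonymous(generalized, quasiIds, 3)); // false
    }
}
```

In a Spark setting the same check would presumably be a group-by on the quasi-identifier columns followed by a minimum over the group counts; the in-memory version is used here only to keep the example self-contained.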
Files
Original bundle
- Name: K-Anonymization Implementation Using Apache Spark.pdf
- Size: 1.48 MB
- Format: Adobe Portable Document Format
- Description: K-Anonymization Implementation Using Apache Spark