K-Anonymization Implementation Using Apache Spark

dc.contributor.authorTortikar, Pratik
dc.date.accessioned2019-04-08T16:11:05Z
dc.date.available2019-04-08T16:11:05Z
dc.date.issued2019
dc.description.abstractThis experiment attempts on data which can reveal a person’s identity to anonymize with k-1 anonymity principle. "Given person-specific field-structured data, produce a release of the data with scientific guarantees that the individuals who are the subjects of the data cannot be re-identified while the data remain practically useful”. The attempt to value the sensitivity and meaningful information with huge amount of data concerning privacy-preserving techniques are maintained to overcome fears with everyone’s delicate data. With this paper, we study the k-anonymity principle algorithm in the context of big data, and introduce a top-down k-anonymization, L-diversity and t-closeness solutions for Apache spark using Java. In the era of volumes of data, science needs more scalable and efficient methods to overcome data leakage, where there is information like public health, diagnosis, sensitive information like name, zip, race, education which leaks the information and would be against privacy of one’s data.en_US
dc.identifier.urihttps://hdl.handle.net/10365/29524
dc.publisherNorth Dakota State Universityen_US
dc.rightsNDSU Policy 190.6.2
dc.rights.urihttps://www.ndsu.edu/fileadmin/policy/190.pdf
dc.subject.lcshComputer security.
dc.subject.lcshData protection.
dc.subject.lcshPrivacy, Right of.
dc.subject.lcshBig data.
dc.subject.lcshSpark (Electronic resource : Apache Software Foundation)
dc.titleK-Anonymization Implementation Using Apache Sparken_US
dc.typeMaster's paperen_US
ndsu.advisorLudwig, Simone
ndsu.collegeEngineeringen_US
ndsu.degreeMaster of Science (MS)en_US
ndsu.departmentComputer Scienceen_US
ndsu.programSoftware Engineeringen_US

Files

Original bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
K-Anonymization Implementation Using Apache Spark.pdf
Size:
1.48 MB
Format:
Adobe Portable Document Format
Description:
K-Anonymization Implementation Using Apache Spark

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.63 KB
Format:
Item-specific license agreed to upon submission
Description: