Show simple item record

dc.contributor.authorTortikar, Pratik
dc.description.abstractThis experiment attempts on data which can reveal a person’s identity to anonymize with k-1 anonymity principle. "Given person-specific field-structured data, produce a release of the data with scientific guarantees that the individuals who are the subjects of the data cannot be re-identified while the data remain practically useful”. The attempt to value the sensitivity and meaningful information with huge amount of data concerning privacy-preserving techniques are maintained to overcome fears with everyone’s delicate data. With this paper, we study the k-anonymity principle algorithm in the context of big data, and introduce a top-down k-anonymization, L-diversity and t-closeness solutions for Apache spark using Java. In the era of volumes of data, science needs more scalable and efficient methods to overcome data leakage, where there is information like public health, diagnosis, sensitive information like name, zip, race, education which leaks the information and would be against privacy of one’s data.en_US
dc.publisherNorth Dakota State Universityen_US
dc.rightsNDSU Policy 190.6.2
dc.titleK-Anonymization Implementation Using Apache Sparken_US
dc.typeMaster's paperen_US
dc.date.accessioned2019-04-08T16:11:05Z
dc.date.available2019-04-08T16:11:05Z
dc.date.issued2019
dc.identifier.urihttps://hdl.handle.net/10365/29524
dc.subject.lcshComputer security.
dc.subject.lcshData protection.
dc.subject.lcshPrivacy, Right of.
dc.subject.lcshBig data.
dc.subject.lcshSpark (Electronic resource : Apache Software Foundation)
dc.rights.urihttps://www.ndsu.edu/fileadmin/policy/190.pdf
ndsu.degreeMaster of Science (MS)en_US
ndsu.collegeEngineeringen_US
ndsu.departmentComputer Scienceen_US
ndsu.programSoftware Engineeringen_US
ndsu.advisorLudwig, Simone


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record