The repository is currently being upgraded to DSpace 7. Temporarily, only admins can login. Submission of items and changes to existing items is prohibited until the completion of this upgrade process.
Generating t-Closed Partitions of Datasets with Multiple Sensitive Attributes
Citation
Gowda, V. T. & Bagai, R. (2023). "Generating t-Closed Partitions of Datasets with Multiple Sensitive Attributes," 7th International Conference on Cryptography, Security and Privacy (CSP), Tianjin, China, 2023, pp. 107-111, doi: 10.1109/CSP58884.2023.00024.
Abstract
The popular t-closeness privacy model requires the "distance" between the distribution of sensitive attribute values in any given raw dataset and their distribution in every equivalence class created to not exceed some privacy threshold t. While most existing methods for achieving t-closeness handle data with just a single sensitive attribute, datasets with multiple sensitive attributes are very common in the real world. Here we demonstrate a technique for creating equivalence classes from a dataset containing multiple sensitive attributes. The equivalence classes generated by our method satisfy t-closeness without taking any $t$ values as input. While generalization of quasi-identifier attributes leads to information loss, the size of generated classes is roughly identical and differs by at most one, which results in a lower information loss. Generating classes with minimum information loss for a given value of $t$ is NP-hard, the equivalence classes generated by our method takes O(r log r) time.
Description
Click on the DOI link to access this conferece paper (may not be free).