zlacker

[parent] [thread] 6 comments
1. shusso+(OP)[view] [source] 2019-07-24 10:39:25
The article title misses a bit of nuance from the paper which is specifically talking about re-identification.

e.g from the paper:

"We show that, as a male born on July 31, 1945 and living in Cambridge (02138), the information used by Latanya Sweeney at the time, William Weld was unique with a 58% likelihood (ξx = 0.58 and κx = 0.77), meaning that Latanya Sweeney’s re-identification had 77% chances of being correct. We show that, if his medical records had included number of children—5 for William Weld—, her re-identification would have had 99.8% chances of being correct!"

replies(1): >>Freak_+k2
2. Freak_+k2[view] [source] 2019-07-24 11:10:32
>>shusso+(OP)
What accounts for that remaining 2‰ of uncertainty?
replies(3): >>air7+x5 >>bsanr2+e9 >>Cynddl+w9
◧◩
3. air7+x5[view] [source] [discussion] 2019-07-24 11:44:12
>>Freak_+k2
Good question. I'd also assume that re-identification chances would always be of the form 100/k (k being the integer number of people who fit the bill)
◧◩
4. bsanr2+e9[view] [source] [discussion] 2019-07-24 12:17:26
>>Freak_+k2
Perhaps the risk that some of the data the probability is based on is wrong, e.g., "She fits the bill except they only have 4 children and the 5th is an entry error, so it's not her."
◧◩
5. Cynddl+w9[view] [source] [discussion] 2019-07-24 12:19:16
>>Freak_+k2
Co-author here. We designed a statistical model, which is never 100% sure a re-identification is correct. There is, e.g., a non-null probability that two individuals in the US share 5, 10, or even 15 demographics attribute.
replies(1): >>mnky98+yn
◧◩◪
6. mnky98+yn[view] [source] [discussion] 2019-07-24 13:56:14
>>Cynddl+w9
Can you provide a link to your paper?
replies(1): >>Cynddl+4G
◧◩◪◨
7. Cynddl+4G[view] [source] [discussion] 2019-07-24 15:49:27
>>mnky98+yn
The article is available here, in open access: https://www.nature.com/articles/s41467-019-10933-3
[go to top]