% handouts/ho07.tex
% author: Christian Urban <christian dot urban at kcl dot ac dot uk>
% Tue, 05 Jan 2016 01:37:31 +0000
\documentclass{article}
\usepackage{../style}
\usepackage{../graphics}

\begin{document}
\fnote{\copyright{} Christian Urban, 2014, 2015}

%https://nakedsecurity.sophos.com/2015/11/12/california-collects-owns-and-sells-infants-dna-samples/
%http://randomwalker.info/teaching/fall-2012-privacy-technologies/?
%https://josephhall.org/papers/NYU-MCC-1303-S2012_privacy_syllabus.pdf
%http://www.jetlaw.org/wp-content/uploads/2014/06/Bambauer_Final.pdf
%http://www.cs.cmu.edu/~yuxiangw/docs/Differential%20Privacy.pdf
%https://www.youtube.com/watch?v=Gx13lgEudtU
%https://fpf.org/wp-content/uploads/Differential-Privacy-as-a-Response-to-the-Reidentification-Threat-Klinefelter-and-Chin.pdf
%http://research.neustar.biz/2014/09/08/differential-privacy-the-basics/

\section*{Handout 7 (Privacy)}

The first motor car was invented around 1886. For ten years,
until 1896, the law in the UK (and elsewhere) required a
person to walk in front of any moving car waving a red flag.
Cars were such a novelty that most people did not know what to
make of them. The person with the red flag was intended to
warn the public, for example horse owners, about the impending
novelty---a car. In my humble opinion, we are at the same
stage of development with privacy. Nobody really knows what it
is about or what it is good for. It all seems very hazy. There
are a few laws (e.g.~cookie law, right-to-be-forgotten law)
which address problems with privacy, but even if they are
well-intentioned, they either backfire or are already obsolete
because of newer technologies. The result is that the world of
``privacy'' looks a little bit like the old Wild
West---lawless and mythical.

For example, UCAS, a charity set up to help students with
applying to universities, has a commercial unit that happily
sells your email address to anybody who forks out enough
money to bombard you with spam. Yes, you can very often opt
out of such ``schemes'', but in the case of UCAS any opt-out
will also limit legitimate emails you might actually be
interested in.\footnote{The main objectionable point, in my opinion, is
that the \emph{charity} everybody has to use for HE
applications actually has very honourable goals (e.g.~assisting
applicants in gaining access to universities), but the small
print (or rather the link ``About us'') reveals they set up
their organisation so that they can also shamelessly sell the
email addresses they ``harvest''. Everything is of course very
legal\ldots{}ethical?\ldots{}well that is in the eye of the
beholder. See:

\url{http://www.ucas.com/about-us/inside-ucas/advertising-opportunities} 
or
\url{http://www.theguardian.com/uk-news/2014/mar/12/ucas-sells-marketing-access-student-data-advertisers}}

Another example: Verizon, an ISP which is supposed to provide
you just with connectivity, has found a ``nice'' side-business
too: Even when you have enabled all privacy guards in your browser
(the few you have at your disposal), Verizon happily adds a
kind of cookie to your
HTTP-requests.\footnote{\url{http://webpolicy.org/2014/10/24/how-verizons-advertising-header-works/}}
As shown in the picture below, this cookie will be sent to
every web-site you visit. The web-sites can then forward the
cookie to advertisers who in turn pay Verizon to tell them
everything they want to know about the person who just made
this request, that is you.
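
To make the mechanism a bit more concrete, a request that has
passed through Verizon's network might look roughly like the
sketch below. The header name \texttt{X-UIDH} is the one
reported for Verizon's scheme; the host, the path and the
header value are placeholders made up purely for illustration.
Since the header is added inside the network, after the request
has left your device, clearing cookies in the browser has no
effect on it. It can, however, only be added to unencrypted
HTTP traffic.

\begin{verbatim}
GET /index.html HTTP/1.1
Host: www.example.com
X-UIDH: PLACEHOLDER-opaque-per-subscriber-identifier
\end{verbatim}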

\begin{center}
\includegraphics[scale=0.16]{../pics/verizon.png}
\end{center}

\noindent How disgusting! Even worse, Verizon is not known for
being the cheapest ISP on the planet (quite the
contrary), nor for providing the fastest
possible speeds, but rather for being among the few ISPs in
the US with a quasi-monopolistic ``market distribution''.


Well, we could go on and on\ldots{}and we have not even
started yet with all the naughty things NSA \& Friends are
up to. Why does privacy actually matter? Nobody, I think, has
a conclusive answer to this question yet. Maybe the following
four notions help to clarify the overall picture
somewhat: 

\begin{itemize}
\item \textbf{Secrecy} is the mechanism used to limit the
      number of principals with access to information (e.g.,
      cryptography or access controls). For example, I had better
      keep my password secret, otherwise people from the wrong
      side of the law might impersonate me.

\item \textbf{Confidentiality} is the obligation to protect
      the secrets of other people or organisations (secrecy
      for the benefit of an organisation). For example, as a
      staff member at King's I have access to data, even
      private data, that I am allowed to use in my work but not
      allowed to disclose to anyone else.

\item \textbf{Anonymity} is the ability to leave no evidence of
      an activity (e.g., sharing a secret). This is not the same
      as privacy---anonymity is required in many 
      circumstances, for example for whistle-blowers, 
      voting, exam marking and so on.

\item \textbf{Privacy} is the ability or right to protect your
      personal secrets (secrecy for the benefit of an
      individual). For example, in a job interview, I might
      not like to disclose that I am pregnant, if I were a
      woman, or that I am a father, lest they might not hire
      me. Similarly, I might not like to disclose my location
      data, because thieves might break into my house if they
      know I am away at work. Privacy is essentially
      everything which ``shouldn't be anybody's business''.

\end{itemize}

\noindent While this might provide us with some rough
definitions, the problem with privacy is that there is an
extremely fine line between what should stay private and what should
not. For example, since I am working in academia, I am every
so often very happy to be a digital exhibitionist: I am very
happy to disclose all `trivia' related to my work on my
personal web-page. This is a kind of bragging that is normal
in academia (at least in the field of CS), even expected if
you look for a job. I am even happy that Google maintains a
profile about all my academic papers and their citations. 

On the other hand, I would be very irritated if anybody I do
not know had too close a look at my private life---it
shouldn't be anybody's business. The reason is that knowledge
about my private life can often be used against me. As mentioned
above, public location data might mean I get robbed. If
supermarkets build a profile of my shopping habits, they will
use it to \emph{their} advantage---surely not to \emph{my}
advantage. Also, whatever might be collected about my life will
always be an incomplete, or even misleading, picture. For
example, I am pretty sure my creditworthiness score was
temporarily(?) destroyed by not having a regular income in
this country (before coming to King's I worked in Munich for
five years). To correct such incomplete or flawed credit
history data there is, since recently, a law that allows you
to check what information is held about you for determining
your creditworthiness. But this concerns only a very small
part of the data that is held about me/you. Also,
what about cases where data is wrong or outdated (but do we
need a right-to-be-forgotten)?


To see how private matters can really lead to the wrong
conclusions, take the example of Stephen Hawking: When he was
diagnosed with his disease, he was given a life expectancy of
two years. If employers had known about such problems, would
they have employed Hawking? Now, he is enjoying his 70+
birthday. Clearly personal medical data needs to stay private.

To cut a long story short, I let you ponder the two
statements which are often voiced in discussions about privacy:

\begin{itemize}
\item \textit{``You have zero privacy anyway. Get over 
it.''}\\
\mbox{}\hfill{}{\small{}(by Scott McNealy, former CEO of Sun)}

\item \textit{``If you have nothing to hide, you have nothing 
to fear.''}
\end{itemize}
 
\noindent If you would like to watch a movie which has this topic as
its main focus, I recommend \emph{Gattaca} from
1997.\footnote{\url{http://www.imdb.com/title/tt0119177/}} If
you want to read up on this topic, I can recommend the
following article that appeared in 2011 in the Chronicle of
Higher Education:

\begin{center} 
\url{http://chronicle.com/article/Why-Privacy-Matters-Even-if/127461/} 
\end{center} 

\noindent Funnily, or maybe not so funnily, the author of this
article carefully tries to construct an argument that does not
only attack the nothing-to-hide statement in cases where
governments \& co collect people's deepest secrets, or
pictures of people's naked bodies, but an argument that
also applies in cases where governments ``only'' collect data
relevant to, say, preventing terrorism. The funny thing is of course
that in 2011 we could just not imagine that respected
governments would do such infantile things as intercepting
people's nude photos. Well, since Snowden we know some people
at the NSA did exactly that and then shared such photos among
colleagues as a ``fringe benefit''.  


\subsubsection*{Re-Identification Attacks} 

Apart from philosophical musings, there are fortunately also
some real technical problems with privacy. The problem I want
to focus on in this handout is how to safely disclose datasets
containing potentially very private data, say health records.
What can go wrong with such disclosures can be illustrated
with four well-known examples:

\begin{itemize}
\item In 2006, a then young company called Netflix offered a
      \$1 million prize to anybody who could improve their movie
      rating algorithm. For this they disclosed a dataset
      containing 10\% of all Netflix users at the time
      (approx.~500,000). They removed names, but included numerical
      ratings of movies as well as times when ratings were
      uploaded, though some information was perturbed (i.e.,
      slightly modified).

      Two researchers had a closer look at this anonymised
      data and compared it with public data available from the
      Internet Movie Database (IMDb). They found that
      98\% of the entries could be re-identified in the
      Netflix dataset: either by their ratings or by the dates
      the ratings were uploaded. The result was a class-action
      suit against Netflix, which was only recently resolved,
      involving a lot of money.

\item In the 1990s, medical datasets were often made public
      for research purposes. This was done in anonymised form
      with names removed, but birth dates, gender and ZIP code
      were retained. In one case where such data about
      hospital visits of state employees in Massachusetts was
      made public, the then governor assured the public that
      the released dataset protected patient privacy by
      deleting identifiers.

      A graduate student could not resist cross-referencing
      public voter data with the released data that still
      included birth dates, gender and ZIP code. The result
      was that she could send the governor his own hospital
      record. It turns out that birth date, gender and
      ZIP code uniquely identify 87\% of people in the US.
      This work resulted in a number of laws prescribing which
      private data must not be released in such datasets.

\item In 2006, AOL published 20 million Web search queries
      collected from 650,000 users (names had been deleted).
      This was again done for research purposes. However,
      within days an old lady, Thelma Arnold, from Lilburn,
      Georgia (11,596 inhabitants), was identified as user
      No.~4417749 in this dataset. It turned out that search
      engine queries are deep windows into people's private
      lives.

\item Genome-Wide Association Studies (GWAS) was a public
      database of gene-frequency studies linked to diseases.
      It would essentially record that people who have a
      disease, say diabetes, also have certain genes. In order
      to maintain privacy, the dataset would only include
      aggregate information. In the case of DNA data this
      aggregation was achieved by mixing the DNA of many
      individuals (having a disease) into a single solution.
      Then this mixture was sequenced and included in the
      dataset. The idea was that the aggregate information
      would still be helpful to researchers, but would protect
      the DNA data of individuals.

      In 2007 a forensic computer scientist showed that
      individuals can still be identified. For this he used
      the DNA data from a comparison group (people from the
      general public) and ``subtracted'' this data from the
      published data. He was left with data that included all
      ``special'' DNA markers of the individuals present in
      the original mixture. He essentially deleted the
      ``background noise'' in the published data. The problem
      with DNA data is that it is of such a high resolution
      that even if the mixture contained maybe 100
      individuals, you can with current technology detect
      whether an individual was included in the mixture or
      not.

      This result completely changed how DNA data is nowadays
      published for research purposes. After the success of
      the human-genome project with a very open culture of
      exchanging data, it became much more difficult to
      anonymise data so that patients' privacy is preserved.
      The public GWAS database was taken offline in 2008.

\end{itemize}

\noindent There are many lessons that can be learned from
these examples. One is that when making datasets public in
anonymised form, you want to achieve \emph{forward privacy}.
This means that, no matter what other data is also available
or will be released later, the data in the original dataset
does not compromise an individual's privacy. This principle
was violated by the availability of ``outside data'' in the
Netflix and governor of Massachusetts cases. The additional
data permitted a re-identification of individuals in the
dataset. In the case of GWAS a new technique of re-identification
compromised the privacy of people in the dataset. The case of
the AOL dataset shows clearly how incomplete such data can be:
Although the queries uniquely identified the older lady, she
also looked up diseases that her friends had, which had
nothing to do with her. Any rational analysis of her query
data must therefore have concluded that the lady was on her
death bed, while she was actually very much alive and kicking.
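
\noindent The re-identification in the Massachusetts case is, at
its core, a database join on the attributes that both datasets
share. The following is a minimal sketch in Python, assuming exact
matching on birth date, gender and ZIP code. All records, all
names and the helper functions \texttt{key}, \texttt{group} and
\texttt{link} are hypothetical and invented purely for
illustration; they are not taken from the original study.

```python
from collections import defaultdict

# Attributes present in BOTH datasets (the "quasi-identifiers").
QUASI_IDS = ("birth", "gender", "zip")

# Hypothetical "anonymised" hospital records: names removed,
# quasi-identifiers retained. All values are made up.
medical = [
    {"birth": "1950-01-01", "gender": "M", "zip": "00001", "diagnosis": "condition A"},
    {"birth": "1960-02-02", "gender": "F", "zip": "00002", "diagnosis": "condition B"},
    {"birth": "1960-02-02", "gender": "F", "zip": "00002", "diagnosis": "condition C"},
]

# Hypothetical public voter roll: names plus the same attributes.
voters = [
    {"name": "Alice Example", "birth": "1960-02-02", "gender": "F", "zip": "00002"},
    {"name": "Bob Example", "birth": "1950-01-01", "gender": "M", "zip": "00001"},
    {"name": "Carol Example", "birth": "1970-03-03", "gender": "F", "zip": "00003"},
]


def key(record):
    """Project a record onto its quasi-identifiers."""
    return tuple(record[q] for q in QUASI_IDS)


def group(records):
    """Group records by their quasi-identifier key."""
    groups = defaultdict(list)
    for r in records:
        groups[key(r)].append(r)
    return groups


def link(medical, voters):
    """Map voter names to diagnoses wherever the match is unambiguous,
    i.e. exactly one voter and exactly one medical record share a key."""
    med, vot = group(medical), group(voters)
    return {
        vs[0]["name"]: med[k][0]["diagnosis"]
        for k, vs in vot.items()
        if len(vs) == 1 and len(med.get(k, [])) == 1
    }


print(link(medical, voters))  # {'Bob Example': 'condition A'}
```

\noindent In this toy data only one person is re-identified: two
medical records share the second voter's attributes, and the
third voter has no hospital record. The 87\% figure mentioned
above says that for most people in the US no such ambiguity
exists.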

\subsubsection*{Differential Privacy}

Differential privacy is one of the few methods that try to
achieve forward privacy. The basic idea is to add appropriate
noise, or errors, to the result of any query on the dataset. The intention
is to make the result of a query insensitive to individual
entries in the database. That means the results are
approximately the same no matter whether a particular individual is
in the dataset or not. The hope is that the added error does
not eliminate the ``signal'' one is looking for in the
dataset.
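
\noindent The phrase ``approximately the same'' can be made
precise. The following is the standard formalisation from the
differential-privacy literature; it is not spelled out in this
handout, and the symbols $M$, $D$, $D'$, $S$ and $\varepsilon$
are introduced here only for illustration. A randomised
query-answering mechanism $M$ is called
$\varepsilon$-differentially private if for all datasets $D$ and
$D'$ that differ in the entry of a single individual, and for all
sets $S$ of possible answers,

\[
\Pr[M(D) \in S] \;\le\; e^{\varepsilon} \cdot \Pr[M(D') \in S]
\]

\noindent The smaller $\varepsilon$ is, the less the distribution
of answers can depend on any one individual, but the more noise
is typically needed.
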

%\begin{center}
%User\;\;\;\;    
%\begin{tabular}{c}
%tell me $f(x)$ $\Rightarrow$\\
%$\Leftarrow$ $f(x) + \text{noise}$
%\end{tabular}
%\;\;\;\;\begin{tabular}{@{}c}
%Database\\
%$x_1, \ldots, x_n$
%\end{tabular}
%\end{center}
%
%\begin{center}
%\begin{tabular}{l|l}
%Staff & Salary\\\hline
%$PM$ & \pounds{107}\\
%$PF$ & \pounds{102}\\
%$LM_1$ & \pounds{101}\\
%$LF_2$ & \pounds{97}\\
%$LM_3$ & \pounds{100}\\
%$LM_4$ & \pounds{99}\\
%$LF_5$ & \pounds{98}
%\end{tabular}
%\end{center}
%
%
%\begin{center}
%\begin{tikzpicture} 
%\begin{axis}[symbolic y coords={salary},
%             ytick=data,
%             height=3cm]
%\addplot+[jump mark mid] coordinates
%{(0,salary)   (0.1,salary) 
% (0.4,salary) (0.5,salary)  
% (0.8,salary) (0.9,salary)};
%\end{axis}
%\end{tikzpicture}
%\end{center}
%
%\begin{tikzpicture}[outline/.style={draw=#1,fill=#1!20}]
%  \node [outline=red]            {red box};
%  \node [outline=blue] at (0,-1) {blue box};
%\end{tikzpicture}
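
\noindent One common way to realise the added noise is the
Laplace mechanism: the true answer of a numeric query is
perturbed with noise drawn from a Laplace distribution whose
scale is the sensitivity of the query divided by a privacy
parameter $\varepsilon$ (smaller values mean more noise and more
privacy). The following is a minimal Python sketch for a counting
query, which has sensitivity 1 because adding or removing one
person changes a count by at most 1. The toy salary table, the
function names \texttt{laplace\_noise} and \texttt{noisy\_count},
the query and the value of \texttt{epsilon} are assumptions made
for illustration only.

```python
import random

# Toy database: staff member -> salary (illustrative values only).
salaries = {
    "PM": 107, "PF": 102,
    "LM1": 101, "LF2": 97, "LM3": 100, "LM4": 99, "LF5": 98,
}


def laplace_noise(scale, rng=random):
    """Sample from a Laplace distribution with mean 0 and the given scale.
    The difference of two independent exponential samples with mean
    `scale` is Laplace-distributed with that scale."""
    return rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)


def noisy_count(db, predicate, epsilon, rng=random):
    """Answer 'how many entries satisfy predicate?' with Laplace noise.
    A count has sensitivity 1, so the noise scale is 1/epsilon."""
    true_count = sum(1 for value in db.values() if predicate(value))
    return true_count + laplace_noise(1.0 / epsilon, rng)


# The true answer is 3; each call returns 3 plus fresh random noise.
print(noisy_count(salaries, lambda s: s > 100, epsilon=0.5))
```

\noindent A single noisy answer reveals little about whether a
particular person is in the database. Note, however, that
repeating the same query many times and averaging the answers
cancels out the noise, which is why a practical system would also
have to limit how many queries it answers.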

\ldots


\subsubsection*{Further Reading}

Two cool articles: one about how somebody obtained the taxicab
dataset of New York via the Freedom of Information Law, and one
where someone else showed how easy it is to mine this dataset
for private information:

\begin{center}\small
\begin{tabular}{p{0.78\textwidth}}
\url{http://chriswhong.com/open-data/foil_nyc_taxi/}\smallskip\\
\url{http://research.neustar.biz/2014/09/15/riding-with-the-stars-passenger-privacy-in-the-nyc-taxicab-dataset}
\end{tabular}
\end{center}

\noindent 
A readable article about how supermarkets mine your shopping
habits (especially how they prey on exhausted new parents
;o) appeared in 2012 in the New York Times:

\begin{center}
\url{http://www.nytimes.com/2012/02/19/magazine/shopping-habits.html}
\end{center}

\noindent An article that analyses privacy and shopping habits 
from a more economic point of view is available from:

\begin{center}
\url{http://www.dtc.umn.edu/~odlyzko/doc/privacy.economics.pdf}
\end{center}

\noindent An attempt to untangle the web of current technology
for spying on consumers is published in:

\begin{center}
\url{http://cyberlaw.stanford.edu/files/publication/files/trackingsurvey12.pdf}
\end{center}

\noindent An article that sheds light on the paradox that
people usually worry about privacy invasions of little
significance, and overlook the privacy invasions that might
cause significant damage:

\begin{center}
\url{http://www.heinz.cmu.edu/~acquisti/papers/Acquisti-Grossklags-Chapter-Etrics.pdf}
\end{center}
\end{document}
98ee5f760a8c added hw 7
Christian Urban <christian dot urban at kcl dot ac dot uk>
parents:
diff changeset
   398
98ee5f760a8c added hw 7
Christian Urban <christian dot urban at kcl dot ac dot uk>
parents:
diff changeset
   399
http://randomwalker.info/teaching/fall-2012-privacy-technologies/?
http://chronicle.com/article/Why-Privacy-Matters-Even-if/127461/
http://repository.cmu.edu/cgi/viewcontent.cgi?article=1077&context=hcii
https://josephhall.org/papers/NYU-MCC-1303-S2012_privacy_syllabus.pdf
http://www.jetlaw.org/wp-content/uploads/2014/06/Bambauer_Final.pdf
http://www.cs.cmu.edu/~yuxiangw/docs/Differential%20Privacy.pdf
https://www.youtube.com/watch?v=Gx13lgEudtU
https://www.cs.purdue.edu/homes/ctask/pdfs/CERIAS_Presentation.pdf
http://www.futureofprivacy.org/wp-content/uploads/Differential-Privacy-as-a-Response-to-the-Reidentification-Threat-Klinefelter-and-Chin.pdf
http://www.cis.upenn.edu/~aaroth/courses/slides/Overview.pdf
http://www.cl.cam.ac.uk/~sjm217/papers/tor14design.pdf
%%% Local Variables: 
%%% mode: latex
%%% TeX-master: t
%%% End: