updated
authorChristian Urban <christian dot urban at kcl dot ac dot uk>
Fri, 15 Jan 2016 02:33:25 +0000
changeset 444 aea1d40cf1ba
parent 443 67d7d239c617
child 445 9ad6445a0354
updated
handouts/ho07.pdf
handouts/ho07.tex
Binary file handouts/ho07.pdf has changed
--- a/handouts/ho07.tex	Mon Jan 11 02:05:24 2016 +0000
+++ b/handouts/ho07.tex	Fri Jan 15 02:33:25 2016 +0000
@@ -326,6 +326,16 @@
 data must therefore have concluded, the lady is on her
 death bed, while she was actually very much alive and kicking.
 
+In 2016, Yahoo released the so far largest machine learning
+dataset to the research community. It includes approximately
+13.5 TByte of data representing around 100 Billion events from
+anonymized user-news items, collected by recording
+interactions of about 20M users from February 2015 to May
+2015. Yahoo's gracious goal is to promote independent research
+in the fields of large-scale machine learning and recommender
+systems. It remains to be seen whether this data will really
+only be used for that purpose.
+
 \subsubsection*{Differential Privacy}
 
 Differential privacy is one of the few methods that tries to