handouts/ho07.tex
changeset 444 aea1d40cf1ba
parent 443 67d7d239c617
child 448 48d0a9890adc
--- a/handouts/ho07.tex	Mon Jan 11 02:05:24 2016 +0000
+++ b/handouts/ho07.tex	Fri Jan 15 02:33:25 2016 +0000
@@ -326,6 +326,16 @@
 data must therefore have concluded, the lady is on her
 death bed, while she was actually very much alive and kicking.
 
+In 2016, Yahoo released the largest machine learning dataset
+made available to the research community so far. It comprises
+approximately 13.5 TByte of data representing around 100
+billion events: anonymized user interactions with news items,
+recorded for about 20M users from February 2015 to May 2015.
+Yahoo's stated goal is to promote independent research in the
+fields of large-scale machine learning and recommender
+systems. It remains to be seen whether this data will really
+only be used for that purpose.
+
 \subsubsection*{Differential Privacy}
 
 Differential privacy is one of the few methods that tries to