{"id":131,"date":"2013-09-26T07:00:45","date_gmt":"2013-09-26T11:00:45","guid":{"rendered":"http:\/\/datacolada.org\/?p=131"},"modified":"2016-03-20T05:45:22","modified_gmt":"2016-03-20T09:45:22","slug":"2-using-personal-listening-habits-to-identify-personal-music-preferences","status":"publish","type":"post","link":"https:\/\/datacolada.org\/2","title":{"rendered":"[2] Using Personal Listening Habits to Identify Personal Music Preferences"},"content":{"rendered":"<p>Not everything at Data Colada is as serious as fraudulent data. This post is way less serious than that. This post is about music and teaching.<\/p>\n<p>As part of their final exam, my students analyze a data set. For a few years that data set has been a collection of my personal listening data from iTunes over the previous year. The data set has about 500 rows, with each reporting a song from that year, when I purchased it, how many times I listened to it, and a handful of other pieces of information. The students predict the songs I will include on my end-of-year \u201cLeif\u2019s Favorite Songs\u201d compact disc. (Note to the youth: compact discs were physical objects that look a lot like Blu-Ray discs. We used to put them in machines to hear music.) So the students are meant to combine regressions and intuitions to make predictions. I grade them based on how many songs they correctly predict. I love this assignment.<\/p>\n<p>The downside, as my TA tells me, is that my answer key is terrible. The problem is that I am encumbered both by my (slightly) superior statistical sense and my (substantially) superior sense of my own intentions and preferences. You see, a lot goes into the construction of a good mix tape (Note to the youth: tapes were like CD\u2019s, except if you wanted to hear track 1 and then track 8 you were SOL.) I expected my students to account for that. \u201cAh look,\u201d I am picturing, \u201che listened a lot to <em>Pumped Up Kicks<\/em>. But that would be an embarrassing pick. On the other hand, he skipped this Gil Scott-Heron remix a lot, but you know that\u2019s going on there.\u201d They don\u2019t do that. They pick the songs I listen to a lot.<\/p>\n<p>But then they miss certain statistical realities. When it comes to grading, the single biggest differentiator is whether or not a student accounts for how long a song is in the playlist (see the scatterplot of 2011, below). If you don\u2019t account for it, then you think that all of my favorite songs were released in the first couple of months. A solid 50% of students think that I have a mad crush on January music. The other half try to account for it. Some calculate a \u201clistens per day\u201d metric, while others use a standardization procedure of one type or another. I personally use a method that essentially accounts for the likelihood that a song will come up, and therefore heavily discounts the very early tracks and weighs the later tracks all about the same. You may ask, \u201cwait, why are you analyzing your own data?\u201d No good explanation. I will say though, I almost certainly change my preferences based on these analyses \u2013 I change them away from what my algorithm predicts. That is bad for the assignment. I am not a perfect teacher.<\/p>\n<p>I don\u2019t think that I will use this assignment anymore since I no longer listen to iTunes. Now I use Spotify. (Note to the old: Spotify is like a musical science fiction miracle that you will never understand. I don\u2019t.)<br \/>\n<a href=\"https:\/\/datacolada.org\/wp-content\/uploads\/2013\/09\/Leifs-Song-Scatterplot.jpeg\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-133\" src=\"https:\/\/datacolada.org\/wp-content\/uploads\/2013\/09\/Leifs-Song-Scatterplot.jpeg\" alt=\"Leif's Song Scatterplot\" width=\"1202\" height=\"1594\" srcset=\"https:\/\/datacolada.org\/wp-content\/uploads\/2013\/09\/Leifs-Song-Scatterplot.jpeg 1202w, https:\/\/datacolada.org\/wp-content\/uploads\/2013\/09\/Leifs-Song-Scatterplot-226x300.jpeg 226w, https:\/\/datacolada.org\/wp-content\/uploads\/2013\/09\/Leifs-Song-Scatterplot-772x1024.jpeg 772w\" sizes=\"auto, (max-width: 1202px) 100vw, 1202px\" \/><\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Not everything at Data Colada is as serious as fraudulent data. This post is way less serious than that. This post is about music and teaching. As part of their final exam, my students analyze a data set. For a few years that data set has been a collection of my personal listening data from&#8230;<\/p>\n","protected":false},"author":5,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"inline_featured_image":false,"jetpack_post_was_ever_published":false,"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"footnotes":"","jetpack_publicize_message":"","jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":true,"jetpack_social_options":{"image_generator_settings":{"template":"highway","default_image_id":0,"font":"","enabled":false},"version":2},"_wp_rev_ctl_limit":""},"categories":[29,28],"tags":[],"class_list":["post-131","post","type-post","status-publish","format-standard","hentry","category-music","category-teaching"],"jetpack_publicize_connections":[],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"jetpack-related-posts":[],"_links":{"self":[{"href":"https:\/\/datacolada.org\/wp-json\/wp\/v2\/posts\/131","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/datacolada.org\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/datacolada.org\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/datacolada.org\/wp-json\/wp\/v2\/users\/5"}],"replies":[{"embeddable":true,"href":"https:\/\/datacolada.org\/wp-json\/wp\/v2\/comments?post=131"}],"version-history":[{"count":0,"href":"https:\/\/datacolada.org\/wp-json\/wp\/v2\/posts\/131\/revisions"}],"wp:attachment":[{"href":"https:\/\/datacolada.org\/wp-json\/wp\/v2\/media?parent=131"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/datacolada.org\/wp-json\/wp\/v2\/categories?post=131"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/datacolada.org\/wp-json\/wp\/v2\/tags?post=131"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}