I swear I’m getting old or something (we’re talking Internet old here) because it takes me a day or two to get my thoughts down from events from the past week … my real-time blogging abilities are clearly limited at best … :-)
On Tuesday of this week I had a chance to participate in a panel discussion entitled Portable Playlists and other POSH-ibilities that had myself, Lucas Gonze, Tom Conrad and Tantek Celic. This was part of the wider “Media Web Meetup” discussions going on and hosted by the Songbird folks.
We touched on quite a few topics (some of which I’ll comment on in another blog post) but the one I wanted to cover that didn’t get much airtime was (ironically) what it would take to make a portable playlist format a reality.
Before the panel both Tom and I were chatting about how there is really only one big problem that both Pandora and MyStrands face: catalog resolution. This is a huge problem that consumes quite a bit of developer time in both of our camps. Unfortunately, every playlist format out there simply punts on this problem. They point at some “resource” that is the catalog entry. Now, from a portability standpoint that’s great and I can appreciate the why’s of why you’d do that.
First off, what is this resolution problem I’m talking about? Imagine a user tells me they are listening to “These boots are made for walkin’”. I want to take that song and resolve it to some entry in my catalog. If I do a search for that I’m presented with oodles and oodles of possibles. Not only that, I may get the same song returned to me (that is the exact same waveform) that lives on a greatest hits album or a compilation of the 80’s, etc. We figure between Pandora and MyStrands we’ve got easily a couple of hundred thousand lines of code to solve the same problem. Bummer.
Just about everyone is going to have some sort of catalog entry. We have artist pages (for example, see U2) that are tied to specific catalog information that we get from several sources. We have to do resolution from ‘U2 - Beautiful Day’ to something in our catalog to give more information to the user when they click the link (or when we want to provide recommendations). It would be so much simpler if the user gave us an audio signature of that song so that we could use something like Musicbrainz to resolve what it is to our own catalog.
The other thing that has always struck me as a pain is the management of catalog data. When Radiohead’s In Rainbows came out this past Fall it took weeks for the data to propagate into sites catalogs. Wouldn’t it make sense to have something like a Wikipedia for song/track/artist information? User managed and available for free use complete with Musicbrainz signatures for all of the music? Then we could take a set of plays from iLike or Last.fm and do interesting things on MyStrands as could they. The big question is, would users want/need/be willing to update information as mundane as the track, album and artist listings for music?
I think a lot of companies don’t think of catalog management as their “core” thing they do. For us its recommendations. For Pandora and iLike its different. We all could benefit from a public commons of information that was referential across all of our sites. This kind of data negotiation is only going to get harder the longer we avoid doing it. I for one would love to see a project evolve (and not CDDB or FreeDB) that could provide this information in a timely fashion. Or is this all just Musicbrainz I’m talking about here?