Searching and filtering across 100mm+ songs

I was the sole designer on Attribution Engine - Pex's flagship music licensing product. From August 2019 to September 2020, we went from concept to a Series A raise valuing the company at over $180mm.

Our team size grew 6x in less than a year, and it was time to deliver on our promises.

Initially, I was tasked with designing the platform broadly, owning every module within AE. As the company grew, I was able to dive deeper into specific areas.

Beyond our Series A, my main area of ownership within AE was the content management system. We first designed a system that could easily scale beyond 100mm songs.

Now, we needed to figure out how to find and discover songs.

My role

Senior staff designer through the end-to-end process: discovery, user research, requirements, design, testing, and support through launch.

The team

1 product manager, 4 backend engineers, 1 frontend engineer

Timeline

December 2021 - March 2022

Discoverability at scale

We were able to get Universal, Warner, and Sony's catalogs imported in record time. This dramatically decreased their time to value with Pex as a new vendor. It was time to build tools that enabled their existing workflows.

From our early interviews with the majors and other enterprise power users, we learned a lot about their day-to-day tasks. These insights heavily influenced how I broke down the problem and looked for solutions.

Problem statement

How might we enable findability of specific songs, and discoverability of high-value songs, across millions of songs at scale?

Breakdown of the problem

Hundreds of millions of songs
The majors have large catalogs that grow on a near-daily basis. We needed to create tools that made those catalogs accessible to novices and power users alike.
Dirty data
The dirtier the data, the harder it is to index and search against. This wasn't an issue unique to our platform, and it was one the majors were already used to dealing with, but we wanted to look for ways to improve and delight.
Replicating high-value assets
The majors were struggling to determine which songs were the highest value. Beyond the obvious chart toppers, it was hard to discover which songs were heating up on social media. If we could identify those songs, labels could more effectively replicate their success.

User research to build empathy internally

Pex had historically struggled to see value in user research, and as a result, often lacked understanding of the day-to-day lives of our users.

One of my key contributions was changing the tone and reframing user research: showing by doing, and routinely bringing gold nuggets back from the field. This helped us build solutions that were more on the mark.

User interviews

Content ID power users

Content ID was Pex's biggest competitor in the space. Focusing on Content ID power users allowed us to start with enterprise-adjacent users, but not the major labels themselves. We needed to prove a bit of value internally before the sales and exec teams were comfortable with us approaching the majors.

I sat down with 5 different Content ID power users, all with slightly different use cases.

I wanted to understand:

Major labels

Similarly, I went deep with 5 different Content ID users from major and mid-level labels.

I wanted to understand:

Key insights

Some key areas and patterns began to emerge right away from very few interviews. It was time to organize our findings.

Overall, both the majors and Content ID power users saw basic search and filtering as a table-stakes feature. Fair enough.

What both groups surfaced, though, was that in these systems, and in the other internal tools they used, it was quite difficult to find or discover high-value songs.

The majors own millions of songs. Not all of them are created equal.

How might we enable discovery of high-value songs across millions at scale, so that replicating past success became more science than art?

Potential data points

Views
We held a treasure trove of data: not only what was being supplied to us, but also which songs were being viewed on social media. The view count for a song was a good indicator that it was currently getting traction across platforms.

Licenses
Similar to views, the number of times a song was being licensed by UGC creators was a good indicator of its traction and virality.

Trending
Could we look at songs that were heating up? These were less obvious picks: songs with sizable movement in, say, the last 2 weeks (see the sketch after this list).

Ownership
The average hit song of the last decade has 15+ owners across artists, writers, labels, and publishers. This means that while a song can be a huge hit, it could still be low value to a given label, depending on that label's stake in the song. Could we filter out songs where the stake is too low, or vice versa?
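
To make "heating up" concrete, here's a minimal sketch of what a trending signal could look like, assuming we track daily view counts per song. The shape, names, and two-week windows are illustrative, not Pex's actual implementation.

```ts
// Hypothetical trending signal: compare a song's views in the last
// two weeks against the two weeks prior. All names and window sizes
// here are illustrative assumptions.
interface DailyViews {
  date: string; // ISO date, e.g. "2022-01-15"
  views: number;
}

function trendingScore(history: DailyViews[], today: Date): number {
  const msPerDay = 24 * 60 * 60 * 1000;
  const daysAgo = (d: string) =>
    (today.getTime() - new Date(d).getTime()) / msPerDay;

  const sumViews = (from: number, to: number) =>
    history
      .filter((h) => daysAgo(h.date) >= from && daysAgo(h.date) < to)
      .reduce((sum, h) => sum + h.views, 0);

  const recent = sumViews(0, 14); // the last 2 weeks
  const prior = sumViews(14, 28); // the 2 weeks before that

  // A ratio above 1 means the song is heating up; guard divide-by-zero.
  return prior === 0 ? (recent > 0 ? Infinity : 0) : recent / prior;
}
```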

Market research

I spent a good deal of time researching other enterprise CMS products, learning their inner workings and identifying key areas that were worth modeling.

We had an enormous amount of data that was searchable and filterable. Most other platforms were opening the data floodgates to their users. I didn't believe that was the right approach, though.

Instead, I wanted to give users fewer choices with filters that had the highest impact and reduce feature bloat.

Determining filters

Armed with the early interviews and insights, I got to work distilling it all down. What filters had the highest impact?

I believed it would be a blend of broader filters used in combination to yield a result set - for example, configuring various filters together to discover high-value assets (a sketch at the end of this section makes this concrete).

In tandem, we also needed more direct filters. For example, entering a specific ID that yields one result.

Label

Record labels often own a multitude of sub-labels and sister labels. Being able to filter by label felt like table stakes.

In my interviews, many folks told me about having to jump between multiple systems to get to various labels' catalogs. Obviously painful.

With AE, we were pulling all of that together, which is only helpful if you can split it back out when needed.

Artist

Labels own and control many artists' catalogs. This filter let folks get a little further into the weeds. They could look for one artist or hundreds.

ISRC

An ISRC (International Standard Recording Code) is a unique code assigned to a recording at the point of release. This was how most labels identified specific songs.

In talking with labels, I learned they were often downloading big CSV files, opening Excel, finding the ISRC column, and then copy and pasting various ISRC codes into other platforms. It was pretty painful.

I opted to create a filter that supported that same workflow, but let them copy the entire column if they'd like. They could paste as many ISRC codes as they wished. Whatever results were shown were direct hits based on the codes.

Ownership percentage

The average hit song of the last decade has 15+ owners. Although certain songs could appear high-value, in practice, for a specific label, that wasn't always the case.

Filtering by ownership allowed them to narrow the catalog broadly by their % share of global ownership.

In a future iteration, I'd like to add a layer for countries. For instance, you may own 100% of a song in Norway, but that's not as high value as owning 100% of it in the US.

Policy

Policy got to the heart of the matter. Filtering by policy meant you could show only songs that were being monetized. Or maybe you wanted to investigate songs that were being blocked from licensing.
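
To make the combination of broad and direct filters concrete, here's a rough sketch of what a filter query might look like as a data structure. The field names and policy values are assumptions for illustration, not AE's actual API.

```ts
// Illustrative shape of a combined filter query. Field names and
// policy values are assumptions for this sketch.
interface CatalogFilter {
  labels?: string[];   // broad: one label, or many sub and sister labels
  artists?: string[];  // broad: one artist or hundreds
  isrcs?: string[];    // direct: exact hits on unique codes
  ownership?: {        // broad: % share of global ownership
    minPercent?: number;
    maxPercent?: number;
  };
  policy?: "monetize" | "block";
}

// e.g. "monetized songs where the label owns at least half"
const highValueQuery: CatalogFilter = {
  ownership: { minPercent: 50 },
  policy: "monetize",
};
```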

Interaction design

It was time to explore various ways we could allow users to interact with our filters.

The goal was to aim for speed and clarity. Speedy interactions are fairly obvious. Clarity is a bit harder to pin down: clear filters lead to fewer mistakes and less confusion about what's active and what isn't.

Number of auto suggested search results

I spent a great deal of time researching and exploring this. Auto-suggestion is only helpful when the suggestions are on point and not overwhelming. In researching other products, I found the opposite: suggestions were weak, and the number shown was too high.

I started exploring the number first. What was the optimal number of suggestions?

Competitor products showed around 8-10 suggestions. I found that to be quite a lot of cognitive load, even when the suggestions were of decent quality.

I created prototypes using 3, 4, 5, 6, and 8 suggestions. I asked for internal feedback in our company's Slack channel - the equivalent of the in-person hallway test.

Overall, 5 was our winner by a small margin, with 4 a strong second-place contender.

Ranking auto suggested search results

The next obstacle was ranking those suggestions. We had a lot of data to cross-reference. We also had to balance finding the perfect query with a speedy load time.

I explored ranking by:

As expected, while some of these queries provided really robust results, they were largely too expensive and time-consuming to run.

In the end, we combined the user’s search query with the number of associated songs. This was a solid predictive indicator and was a reasonable guess at where the user was trying to get to. It was also a decently inexpensive query for us to run.

This was a simple exercise in clear communication and managing tradeoffs with my engineering team.
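
Here's a minimal sketch of that blend, assuming each suggestion carries its associated song count. The scoring weights and helper names are illustrative, not the production query.

```ts
// Blend a cheap text-match signal from the user's query with the
// number of songs associated with each suggestion. Weights are
// illustrative assumptions.
interface Suggestion {
  name: string;      // e.g. a label or artist name
  songCount: number; // number of associated songs
}

function scoreSuggestion(query: string, s: Suggestion): number {
  const q = query.toLowerCase();
  const name = s.name.toLowerCase();

  // Exact prefix beats substring beats no match at all.
  const textScore = name.startsWith(q) ? 2 : name.includes(q) ? 1 : 0;
  if (textScore === 0) return 0;

  // Log-scale the song count so a huge catalog doesn't drown out
  // a strong text match.
  return textScore + Math.log10(1 + s.songCount);
}

function topSuggestions(query: string, all: Suggestion[], limit = 5): Suggestion[] {
  return all
    .map((s) => ({ s, score: scoreSuggestion(query, s) }))
    .filter((x) => x.score > 0)
    .sort((a, b) => b.score - a.score)
    .slice(0, limit) // 5, our winning suggestion count
    .map((x) => x.s);
}
```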

Bulk search pattern

Our first filter to allow bulk pasting was ISRC, given that it aligned well with labels' current workflows and behaviors. I knew that if we kept this simple, it could become a very reusable pattern down the road.

I opted to contain the pasting of bulk ISRC codes in a modal. Users could write or paste to their heart's content. All they had to do was separate codes by comma, or put one per line - something any spreadsheet tool can produce.

Longer queries would inherently take longer to run, but users expected this. We did cap it at 50,000 codes so the browser tab wouldn't crash.

Given the nature of ISRCs, the results were very high quality since the user was getting direct hits on each unique code.
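
A sketch of that parsing step, under the behavior described above (commas or newlines as separators, a 50,000-code cap); the helper name is hypothetical.

```ts
// Accept codes separated by commas or newlines, trim whitespace,
// drop duplicates, and enforce the 50,000-code cap. The helper name
// is hypothetical.
const MAX_CODES = 50_000;

function parseBulkIsrcs(input: string): string[] {
  const codes = input
    .split(/[\n,]+/) // commas or one-per-line, as pasted from a spreadsheet
    .map((c) => c.trim())
    .filter((c) => c.length > 0);

  // De-duplicate while preserving paste order, then enforce the cap
  // so a massive paste can't lock up the tab.
  return [...new Set(codes)].slice(0, MAX_CODES);
}
```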

Adding filters

I played around with many different iterations for showing the status of a filter and what had previously been selected.

Where I landed was a model that gives a clear signal as to what's been applied, while also allowing users to type into the input field to add new filters. There's no clunky clicking through modals or dropdowns. Just click into the input field and start typing away.

Removing filters

Just as important as adding is taking away. I wanted to keep this pattern lean and tight. Being able to remove filters inline felt familiar, and it passed the internal Slack / hallway test.

I also added a hard reset at the top. It's always good to have an eject button when you get too far into the weeds.
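
Taken together, adding, removing, and resetting boil down to a small piece of state. Here's a minimal sketch, with an assumed store shape and hypothetical names.

```ts
// Minimal sketch of the add / remove / reset filter state described
// above. The store shape and names are assumptions for illustration.
type FilterKind = "label" | "artist" | "isrc" | "ownership" | "policy";

interface AppliedFilter {
  kind: FilterKind;
  value: string;
}

class FilterBar {
  private applied: AppliedFilter[] = [];

  // Committing a typed value adds a filter in place: no modals or
  // dropdowns to open.
  add(filter: AppliedFilter): void {
    const exists = this.applied.some(
      (f) => f.kind === filter.kind && f.value === filter.value
    );
    if (!exists) this.applied.push(filter);
  }

  // Inline removal of a single applied filter.
  remove(filter: AppliedFilter): void {
    this.applied = this.applied.filter(
      (f) => !(f.kind === filter.kind && f.value === filter.value)
    );
  }

  // The hard reset "eject button" at the top.
  reset(): void {
    this.applied = [];
  }
}
```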

What's next?

The next thing on the roadmap is bulk select and bulk actions. Some light exploratory work has already taken place, but working out the kinks on this across millions of database rows is no easy feat.

Lots of deep thinking will go into determining what can be done in combination and what is too destructive and needs a proper safeguard.

Closing thoughts

Searching and filtering is often deemed table-stakes work: overlooked and half-baked into products.

By setting the stage early and really building empathy internally, we were able to craft lightweight solutions that not only aided labels in their daily work, but also enabled new discoveries.

Labels were able to identify new high-value songs they were previously unaware of. This could lead to more revenue within our platform, and on other lucrative platforms as well.

The details matter. Understanding the end user deeply matters.