Are any of the music sites not scraping content to train AI?

Your band, other bands, singers, songwriters, more.

Moderator: Ghost Hip

friendship
IAMILFFAMOUS
Posts: 4156
Joined: Sun Mar 10, 2013 5:22 pm

Post by friendship »

I haven't been keeping up, but I heard SoundCloud started doing this last year, and that's the one I was using most recently. Is Bandcamp doing this too?
sound journal
actualidiot wrote: 12-bit's almost analog, right?
backwardsvoyager
IAMILFFAMOUS
Posts: 4208
Joined: Wed Nov 21, 2012 4:52 am
Location: Tokyo

Post by backwardsvoyager »

Do you mean the companies themselves using user content to train AI models, or it being possible for others to scrape the data?
Bandcamp AFAIK hasn't done anything like this (yet).

It is possible to scrape any track with a preview stream from Bandcamp without paying, so theoretically anything you upload could end up as training data. But because preview streams are all 128 kbps MP3, they would not be very useful.
Companies will do any number of nefarious things to avoid actually paying for training data, so it's a real concern, but it's high-quality data (in this case high-bitrate and extensively, accurately tagged) that they're after.
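
For a rough sense of scale on the bitrate point, here's a back-of-the-envelope sketch comparing a 128 kbps stream with uncompressed CD-quality audio. The CD figures (44.1 kHz, 16-bit, stereo) are the standard ones; the function name is made up for illustration. Data rate is only a crude proxy, since lossy codecs throw away the least audible information first, so take the ratio as a ballpark and not a quality score.

```python
# Back-of-the-envelope comparison of data rates.
# pcm_kbps is a hypothetical helper name, not from any library.

def pcm_kbps(sample_rate_hz: int, bit_depth: int, channels: int) -> float:
    """Bitrate of uncompressed PCM audio in kilobits per second."""
    return sample_rate_hz * bit_depth * channels / 1000

PREVIEW_KBPS = 128  # preview stream bitrate mentioned above

cd_kbps = pcm_kbps(44_100, 16, 2)  # standard CD audio parameters
ratio = cd_kbps / PREVIEW_KBPS     # how many times larger the PCM rate is

print(f"CD-quality PCM: {cd_kbps:.1f} kbps")
print(f"128 kbps MP3 carries roughly 1/{ratio:.0f} of that data rate")
```

So a preview stream is about an eleventh of the raw data rate of the lossless files a buyer gets.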
friendship
IAMILFFAMOUS
Posts: 4156
Joined: Sun Mar 10, 2013 5:22 pm

Post by friendship »

backwardsvoyager wrote: Sat May 31, 2025 2:44 am
As in the company themselves using user content to train AI models, or it being possible for others to scrape data?
Bandcamp AFAIK hasn't done anything like this (yet).

It is possible to scrape any track with a preview stream from Bandcamp without paying, so theoretically anything you upload could end up as training data, but because preview streams are all 128kbps mp3, they would not be very useful.
Companies will do any number of nefarious things to circumvent actually paying for training data, so it's a real concern, but it's high quality (in this case high-bitrate, extensively/accurately tagged) data that they're after.

The former, companies training their AI on artist uploads. I don't make music for the money, but I also don't exactly want to voluntarily give companies free rein to make money off my work while I don't.

Bandcamp it is, I guess?
backwardsvoyager
IAMILFFAMOUS
Posts: 4208
Joined: Wed Nov 21, 2012 4:52 am
Location: Tokyo

Post by backwardsvoyager »

Right, yeah, I would give BC the benefit of the doubt. They've done all right by artists, even since the buyout.

Not sure about the profit incentive for sites that sell digital downloads, but streaming services are inherently liable to start using AI to skimp on artist royalties by generating similar content and steering users to it via playlist/recommendation algos, etc. (if they haven't already).
I've been following stuff like HarmonyCloak (https://mosis.eecs.utk.edu/harmonycloak.html), as it could well get to the point where we can't upload anything anywhere without it becoming training data, but even then it's hard to say whether poisoning filters, etc. will be a solution. :idk: