this post was submitted on 02 Aug 2024
1 points (100.0% liked)

Technology

58480 readers
3893 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


founded 1 year ago
MODERATORS
 

"Suno’s training data includes essentially all music files of reasonable quality that are accessible on the open internet."

"Rather than trying to argue that Suno was not trained on copyrighted songs, the company is instead making a Fair Use argument to say that the law should allow for AI training on copyrighted works without permission or compensation."

Archived (also bypass paywall): https://archive.ph/ivTGs

you are viewing a single comment's thread
view the rest of the comments
[–] Hildegarde@lemmy.world 0 points 2 months ago (2 children)

One of the four fair use factors is the portion of the copyrighted work that was taken. For a finding of fair use under this factor, the infringing work must only take the amount of copyrighted material needed for the infringing work's purpose.

If they ripped every single file they have access to, there's no way to be found as fair use under this factor. If they argue they were using a curated list of only the works they needed to develop their model it could be fair use, but admitting to taking every possible work in their entirety is a surefire way to fail a fair use defense.

[–] Vlyn@lemmy.zip 0 points 2 months ago (1 children)

But that's not how model training works, it doesn't simply copy and paste entire songs into its training data. It more or less "listens" to it, analyzes it and when you ask to create a rock song for example it just has an algorithm behind it what a song like that would sound like.

But you can't just ask it to generate Bohemian Rhapsody from its data, it would probably get very close depending on the training, but it would never be 100% the same (except the model was only trained on this one song).

Just like you can listen to rock songs and then make your own, that's totally valid. The problem here is of course automation and scale, but saying it's not fair use is dubious.

[–] Hildegarde@lemmy.world 0 points 2 months ago (1 children)

Fair use is a legal doctrine relating to derivitave works based on copyrighted works. An AI model's fair use determination would be judged by the same standards and all derivative works.

It doesn't matter how they used the copyrighted works. This factor is about scale not intent.

There are four factors, and no single factor is determinative. But admitting their model uses as much training as possible makes their model less likely to be fair use.

[–] Vlyn@lemmy.zip 0 points 2 months ago (1 children)

If I as a human listened to every single song of a band from start to finish, then produced a similar song in the same vein (lyrics / music genre), it would be fair use.

So why would it stop being fair use if an AI does the same thing? Just that the AI can listen to every song of this band and a million other bands, combining them.

[–] Hildegarde@lemmy.world 0 points 2 months ago (1 children)

Because fair use is an affirmative defense to copyright infringement. To use a fair use defense you have to admit your work is infringing, but argue that the infringement is justifiable.

Trying to defend the AI with fair use requires you to admit the AI itself is infringing, but justifiable, and by the doctrine of fair use, it is almost certainly not.

Only humans can hold copyrights. Your example would be a non-infringing work because it lacks direct copying. An AI doing the same would make an uncopyrightable work, with the AI itself being infinging if you tried the fair use defense.

[–] Vlyn@lemmy.zip 0 points 2 months ago (1 children)

Only humans can hold copyrights.

Yeah, no. Most copyrighted material is owned by companies, you don't have to be a natural person to hold copyrights. And if a company can hold copyrights, you can also argue it can have fair use.

[–] Hildegarde@lemmy.world 0 points 2 months ago (1 children)

Companies are run by people. The human employees create copyrighted works that become the property of their employer by the terms of their contract. That's how work for hire contracts work....

You would know this if you have ever worked in any creative field.

[–] Vlyn@lemmy.zip 0 points 2 months ago

I work in a creative field. But companies are companies. If I work for a company and create something, it doesn't belong to a natural person, it immediately goes over to the company.

Not the CEO or CTO or whoever is in management, it belongs to the legal entity. Isn't this a company owning the work I just created? If the CEO dies, the company still owns it.

[–] kromem@lemmy.world 0 points 2 months ago

Your interpretation of copyright law would be helped by reading this piece from an EFF lawyer who has actually litigated copyright cases in the past:

https://www.eff.org/deeplinks/2023/04/how-we-think-about-copyright-and-ai-art-0