There’s been this weird idea lately, even among people who used to recognize that copyright only empowers the largest gatekeepers, that in the AI world we have to magically flip the script on copyr…
“Let companies rip off your work, or else only Big Tech will be able to rip off your work”
I’ll lift a comment from techdirt:
Maybe we’re so far into a capitalist hellhole that we simply consider everything to be for sale. What about GPL work that OpenAI steals? Or personal data? With how secretive they are about the data they “scraped”, we don’t even know if they have any right at all to repackage and sell it.
Making only big companies able to “rip off your work” (not an accurate representation, but whatever) is not the solution you think it is.
The only solution is to force all models trained on public data to not be covered by copyrights by default. Any output from those models should also by default be in the commons. The solution is to avoid copyright cartels, not strengthen them.
Agreed that the interim solution should be to make all “AI” work public domain, since it treats everything it trains on as public domain. I’m for it because it would immediately stop being profitable for commercial enterprises. Then check who they ripped off and settle any financial claims and damages before moving on to establish a license for already created output.
Exactly. Make ALL output public domain. Force them to release their training sets. Force them to open source their models.
There will still be companies like Adobe and DeviantArt who will be able to work around this due to their ToS, but we have enough existing models to make them obsolete due to the power of FOSS.
Of course things are messy. I still think it’s the best option. I would say that yes, a character with AI eyes would be public domain. Treat it like the GPL. If a small part of your code is GPL, all of your code has to be GPL.
Likewise, it isn’t easy to prove. People will get away with doing it in very small quantities and sufficiently reworking it, but extravagant examples would be caught, like serial plagiarists eventually are. The resulting loss in credibility could end careers. Of course, the best approach would be to completely remove copyrights altogether; then this wouldn’t be an issue at all.
They can sell it all they want, and then the buyer should be able to share it for free. I’m OK with people selling their labor.
(I edited my comment slightly due to my scatter brain then saw you basically expanding my thought in the same way)
IMO, we need to ask: What benefits the people? or What is in the public interest?
That should be the only thing of importance. That’s probably controversial. Some will call it socialism. It is pretty much how the US Constitution sees it, though.
Maybe you agree with this. But when you talk about “models trained on public data” you are basically thinking in terms of property rights, and not in terms of the public benefit.
Well, I think that removing copyrights altogether is in the public interest, so…there you go :)
The models (i.e. the weights specifically) may not be copyrightable, anyway. There’s no copyright on the result of number crunching. Once the model is further fine-tuned, there might be copyright, but it’s still unlike anything covered by copyright in the past.
One analogy I have is a 3D engine. The engineers design the look of the typical output by setting parameters, but that does not create a specific copyright on the parameters. There’s copyright on the design documents, the code, the UI (if any), and maybe other stuff. It’s not quite the same, though.
Some jurisdictions have IP on databases. I think that would cover AI models. If I am right, then that means that any license agreements that come with models are ineffective in the US.
However, to copy these models, you first need to get your hands on them. They are still trade secrets, so don’t count on leaks.
That’s how it is now.
No, the models are not in the commons. Their training data is also not known.