Incredible, really. It would have been so easy to just… not do that.
Why? The grandparent is not saying it's coincidence. Why is it not okay to hire someone who has a voice similar to celebrity X who you intentionally want to immitate? I mean if you don't actually mislead people to believe that your immitation is actually X - which would be obviously problematic?
It’s really a shame.
I don't believe Sam Altman, but I am interested in the general “is it legal/ethical to immitate something uncopyrightable” argument.
If it was just an actor, it might be a case of inspiration gone awry. But this particular actor sued Disney in 2021 after making a lot of movies and a lot of money making movies for them.
Deliberately poking a fight with a litigation happy actor is weird. Most weird is really benign. But this is the kind of weird that forces out of court settlements. It’s reckless.
Edit - mistyped the date as 2001. Changed to 2021.
In the movie The Seed of Chucky, Britney Spears gets killed. You can watch the clip at https://www.youtube.com/watch?v=x3kCg5o0cHA. It is very clearly Britney Spears.
Except Britney Spears was not hired for the role. They hired a Britney Spears impersonator for the scene. They did everything that they could to make it look like Britney, and think it was Britney. But it really wasn't.
Do you think that Britney should have sued the Chucky franchise for that? If so, should Elvis Presly's estate also sue all of the Elvis Presly impersonators out there? Where do you draw the line? And if not, where do you draw the line between what happened in Chucky, and what happened here?
I really don't see a line between now having someone who sounded like the actress, and then tweeting the name of one of her movies, and what happened 20 years ago with Chucky killing someone who looked like Britney, then showing a license plate saying "BRITNEY1", and THEN saying, "Whoops I did it again." (The title of her most famous song at the time.) If anything, the movie was more egregious.
The new voice2voice from OpenAI allows for a conversational dialect, most prominently demonstrated in pop culture by the movie Her. Sam's tweet makes perfect sense in that context.
Sky's voice has been the default voice in voice2voice for almost a year now, and no one has made a connection to the Her voice until it started acting more conversational. It seems pretty obvious that OpenAI was looking for a more conversational assistant, likely inspired by the movie Her, and it would have been cool if the actress had helped make that happen, but she didn't, and here we are.
Also Juniper has always been the superior voice model. I just now realized that one of my custom GPTs kept having this annoying bug where the voice kept switching from Juniper to Sky, and that seems to be resolved now that Sky got removed.
The movie producers didn't produce a simulation of Britney's voice and attempt to sell access to it.
However you feel about an probably-unapproved celebrity cameo in a movie, it's not the same thing as selling the ability to impersonate that celebrity's voice to anyone willing to pay, in perpetuity.
1. The plot of "Her" (guy falls in love with synthesized voice, played by Johansson)
2. Altman's affinity for the film (the article says he's called it his "favorite movie")
Reaching out to Johansson about cloning her voice, then doing so without permission feels like Altman is creeping on her.
The sooner this bubble pops, the better.
This isn't some college kid with an idea and too much passion.
The biggest difference that I see is that technology has made the simulation cheaper and easier.
No.[1]
[1] https://www.reddit.com/r/ChatGPT/comments/177v8wz/i_have_a_r...
> This scene was included in promotional spots for the film, most specifically Seed of Chucky's trailer, but the distributing company associated with the film, Focus Features, made the decision to significantly cut the scene down and add a disclaimer. The disclaimer that ran with the promotional spot, which was altered to only show a brief glimpse of Ariqat as Spears, stated: "Britney Spears does not appear in this film."
https://screenrant.com/seed-of-chucky-movie-promos-britney-s...
The biggest difference here is obviously one of scale. I don't think ScarJo would be threatening to sue you, the individual, if you did a voice impression of her for a talent show or a friends wedding.
Generally the "Right to Publicity" laws are clear about expiring at death. It's not like copyright.
This could be kinda like the dot com bubble -- the Internet went on to become BIG, but the companies just went bust... (and the ones that strive are probably not well known)
I went to ask the Internet "best AI tools", and there's no clear consensus:
Various Redditors go on to suggest "here's 100 you might like to try".
So there's clearly a bubble, thousands of startups all trying for similar things.
I am personally looking forward to try Wolfram GPT:
Actually, you probably can.[0]
[0] https://casetext.com/case/waits-v-frito-lay-inc
Edit: Added the context for the reply
Voices…are usually not so distinctive. However, certain voices are very distinct—Tom Waits, Miley Cyrus, James Earl Jones, Matt Berry. Those voices are pretty distinctively those people and simulating their voices it would be obvious who you are simulating. Other celebrity voices are much more generic. Scarlett fits into this with a pretty generic female voice with a faint NY/NJ accent.
Open AI screwed up by taking a generic voice and making it specific to the celebrity by reference and by actually pursuing the actor for the use of their voice.
ChatGPT is currently king of the mountain. That could change, but right now that's how it is.
Google's Gemini and Facebook's Llama 3 are clearly in a tier below. The 100s of tools you are seeing are various mixed and matched technologies that also belong in this tier.
Claude (massive context) and Mistral/Mixtral (decent with no censoring/guard rails) are interesting for special cases. And if you're determined and want to put in the effort, you can experiment or self-host and perhaps come up with some capabilities that do something special that suits a use case or something you want to optimize for (although not everyone has time for that).
So I wouldn't say it's just all this one big swirl of confusion and therefore a bubble and due to come crashing down. There's wheat, there's chaff, there's rhyme and reason.
I can see a civil judge or jury being given evidence showing very few listeners think the voices match in _blind voice tests_.
Here for example you can listen to the voices side by side:
https://www.reddit.com/r/ChatGPT/comments/1cwy6wz/comment/l4...
And here is voice of another actress ( Rashida Jones ):
https://www.youtube.com/watch?v=385414AVZcA
This test is not blind but YOU tell me which you think is similar to the openAI sky voice? And what does that tell you about likely court result for Johansson? And having reached this conclusion yourself would you now think the other actress Rashida Jones is entitled to compensation based on this similarly test? Because there are no other women with similar voices?
This is completely false. Claude Opus is significantly better than GPT 4.
> Mistral/Mixtral (decent with no censoring/guard rails)
These models have been heavily censored, I'm not sure what you're talking about. Community efforts like Dolphin to fine-tune Mixtral have some success, but no, Karen is definitely still hard at work in France, ensuring that Mistral AI's models don't offend anyone's precious fee-fees.
It's nevertheless true that there is a coherent landscape of better and worse models, and Chat GPT really does have separation from the other models as I mentioned above. I even mentioned that ChatGPTs position would be subject to change. My understanding is that this most recent version of Claude has been out and about in the wild for perhaps 2 months.
I feel like with even a little bit of charitable interpretation you could read my comment in a way that accounts for the emergence of such a thing as a new and improved model of Claude. So I appreciate your correction but it's hard to see how it amounts to anything more than a drive-by cheap shot that's unrelated to the point I'm making.
(Can't for the life of me recall if she sounds anything like Johansson; just putting her forward to tease her relative here. (Who is in the wrong in his arguments above.))