A voice can be zero shot encoded to a few hundred kb vector. Timbre, prosody, lots of characteristics. That's less information than a fingerprint. And more importantly, that's something you can dial in with a few knobs by simply listening by ear.
It's why your brain can easily hear things in other people's voices. They're not hard signals to reproduce. Some people with flexible vocal ranges can even impersonate others quite easily.
I'm sure most people have gotten, "you sound like X" once or twice. Not unlike the "you look like Y" comments.
Voices really aren't that fingerprint-y.
If we really want to split hairs and argue from biology, who "owns" the voice of a set of identical twins?