zlacker

1. tomato+(OP) 2022-05-23 21:52:17
Why stop at audio? The pinnacle of this would be text-to-video, equally indistinguishable from the real thing.
replies(1): >>burles+h3
2. burles+h3 2022-05-23 22:09:59
>>tomato+(OP)
The way things look when still is much easier to fake than the way things move.

I would expect AI development to follow a similar path to digital media generally, since it's tracking the increasing difficulty and space requirements of digitally representing each medium: text < basic sounds < images < advanced audio < video.

What’s more impressive to me is how far ahead text-to-speech is, but I think the explanation is straightforward (the accessibility value has motivated us to work on that for a lot longer).
