zlacker

[parent] [thread] 2 comments
1. optimu+(OP)[view] [source] 2020-06-09 03:19:25
Hi lunixbochs!

Your prototype is amazing! The quality of transcription is definitely better than ours via Google.

After we did some legal research we wanted to avoid storing the recordings and rather solely transcription text. Giving access to a platform for humans to verify the transcriptions and in turn train the model is a great idea.

I have started working on getting some pre-trained models set up. I am trying to implement them with wav2letter, deepspeech, kaldi, vosk, etc. - I just need to be pointed in the right direction.

Raspberry Pi's were something I was considering as well - small energy footprint and powerful enough to run these models.

Do you have any advice on ML or acoustic models to avoid? I am working with the 100 hour dataset now.

Thanks!

replies(1): >>johann+d01
2. johann+d01[view] [source] 2020-06-09 14:57:53
>>optimu+(OP)
I have the same setup as Broadcastify Calls (trunkrecorder) and a site built to play each audio recording then allow the user to provide what they heard. I used it to train some public safety specific models on Kaldi and Sphinx.

I have 30ish streams and keep 6 days worth, I could keep longer if you'd like to work together on this. I reached out to some of the people above, the Broadcastify guy for example, and they are, as mentioned, ready doing their own thing so didn't really care about what I wanted to share.

replies(1): >>robota+lt2
◧◩
3. robota+lt2[view] [source] [discussion] 2020-06-10 01:17:15
>>johann+d01
This sounds awesome - If you have any documentation up on how to do this, I would love to point to it from the trunk-recorder wiki.
[go to top]