Closed captions/subtitles capturing one side's audio

DaMarin94 · December 20, 2021, 1:18pm

Hello, I'm trying to add this feature to my project by consuming an external API to which I can send audio (or microphone input in real time) and which returns the transcribed text as a string. My intention is to capture the audio from just one side of the video call and show the text on the other side (so that deaf people can use the service). I checked the documentation, but I'm not sure where I should add this code. Thanks, and sorry if my question is out of place or in the wrong category.

nazar-pc · December 20, 2021, 2:52pm

mediasoup is a library; you don’t just add a piece of code somewhere, you need to build an app with it that does what you need. In this case you’ll have to consume audio from a particular speaker and send it somehow to the service that will do the speech recognition.

DaMarin94 · December 20, 2021, 3:01pm

Thanks very much, I needed some orientation on how plausible what I was trying to do actually was.

nazar-pc · December 20, 2021, 3:18pm

This is nothing unusual. I think some recognition services even support RTP directly, so you can catch the RTP packets in Node.js and send them to the service.
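To illustrate what "catching RTP packets" could involve, here is a minimal sketch of parsing the fixed RTP header (RFC 3550) to get at the audio payload before forwarding it. It assumes the packets already arrive as raw bytes, for example on a UDP socket fed by a mediasoup PlainTransport consuming the speaker's audio Producer. The names `RtpPacket` and `parseRtpPacket` are made up for this example and are not part of mediasoup.

```typescript
// Fields of interest from one RTP packet (RFC 3550, section 5.1).
interface RtpPacket {
  payloadType: number;
  marker: boolean;
  sequenceNumber: number;
  timestamp: number;
  ssrc: number;
  payload: Uint8Array; // encoded audio (e.g. one Opus frame), still compressed
}

// Hypothetical helper: returns null when the bytes are not a valid RTP v2 packet.
function parseRtpPacket(data: Uint8Array): RtpPacket | null {
  if (data.length < 12) return null; // fixed header is 12 bytes
  const view = new DataView(data.buffer, data.byteOffset, data.byteLength);

  const first = view.getUint8(0);
  if (first >> 6 !== 2) return null; // RTP version must be 2
  const hasPadding = (first & 0x20) !== 0;
  const hasExtension = (first & 0x10) !== 0;
  const csrcCount = first & 0x0f;
  const second = view.getUint8(1);

  // Skip the CSRC list, 4 bytes per entry.
  let offset = 12 + csrcCount * 4;
  if (offset > data.length) return null;

  // Skip the optional header extension: 4-byte header plus N 32-bit words.
  if (hasExtension) {
    if (offset + 4 > data.length) return null;
    const extensionWords = view.getUint16(offset + 2);
    offset += 4 + extensionWords * 4;
    if (offset > data.length) return null;
  }

  // With the padding bit set, the last byte holds the number of padding bytes.
  let end = data.length;
  if (hasPadding) {
    const paddingLength = view.getUint8(end - 1);
    if (paddingLength === 0 || paddingLength > end - offset) return null;
    end -= paddingLength;
  }

  return {
    payloadType: second & 0x7f,
    marker: (second & 0x80) !== 0,
    sequenceNumber: view.getUint16(2),
    timestamp: view.getUint32(4),
    ssrc: view.getUint32(8),
    payload: data.subarray(offset, end),
  };
}
```

Note that the payload is still encoded audio. Whether a given recognition service accepts it as-is, or needs decoded PCM, depends on the service.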

jbaudanza · December 20, 2021, 4:26pm

It’s definitely plausible. I’ve implemented something similar using ffmpeg and Google’s speech-to-text API.
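As a rough sketch of one way an ffmpeg-based pipeline might be wired up (not necessarily how it was done in the implementation mentioned above): ffmpeg can receive RTP when given an SDP description of the stream, and can emit raw 16-bit PCM on stdout for a speech-to-text client to read. The helper names, the loopback address, the port, the Opus payload type 111, and the `input.sdp` path below are all assumptions for illustration; the real values would come from your own transport and Consumer settings.

```typescript
// Assumed description of the incoming audio RTP stream.
interface AudioRtpParams {
  ip: string;          // address ffmpeg listens on
  port: number;        // UDP port the RTP is sent to
  payloadType: number; // must match the payload type of the forwarded stream
  clockRate: number;   // 48000 for Opus
  channels: number;    // 2 in the usual Opus rtpmap
}

// Hypothetical helper: builds a minimal SDP describing one Opus audio stream.
function buildOpusSdp(p: AudioRtpParams): string {
  const lines = [
    "v=0",
    `o=- 0 0 IN IP4 ${p.ip}`,
    "s=audio-for-captions",
    `c=IN IP4 ${p.ip}`,
    "t=0 0",
    `m=audio ${p.port} RTP/AVP ${p.payloadType}`,
    `a=rtpmap:${p.payloadType} opus/${p.clockRate}/${p.channels}`,
  ];
  return lines.join("\n") + "\n";
}

// Hypothetical helper: ffmpeg arguments that read the SDP file and write
// mono signed 16-bit little-endian PCM to stdout.
function buildFfmpegArgs(sdpPath: string, outputSampleRate: number): string[] {
  return [
    "-protocol_whitelist", "file,udp,rtp", // allow reading the SDP and RTP over UDP
    "-i", sdpPath,
    "-vn",                                 // audio only
    "-ac", "1",                            // downmix to mono
    "-ar", String(outputSampleRate),       // resample, e.g. to 16000 Hz
    "-f", "s16le",                         // raw PCM, no container
    "pipe:1",                              // write to stdout
  ];
}
```

In a Node.js app, the SDP text would be written to the assumed `input.sdp` path, ffmpeg started with these arguments via `child_process.spawn`, and its stdout chunks passed to whatever streaming client the recognition service provides.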

DaMarin94 · December 20, 2021, 5:46pm

Thanks, both. I’m going that way now that I know it’s possible.
