Hello, I just found MediaSoup and I’ve been wondering if it’s a good fit for audio-only applications. Can video capabilities be disabled to allow for minimal server CPU usage?
I use it for an audio-only application and it works great. It’s sort of a clubhouse-style mobile app for language learning.
I’d like to have audio sessions with up to 30 users. Obviously not everybody speaking at once, some people on mute or push to talk, but does MediaSoup optimize for the number of producers and consumers automatically? 30 * 30 of streams sounds scary.
You can put producers into a “paused” state when the user is muted. In that state, they won’t consume as much CPU. If that’s what you mean by optimized, then yes. But it’s not automatic.
On the same note - can MediaSoup merge audio streams on the server side so that clients deal with only a single incoming stream?
No. Mediasoup just forwards RTP packets. It doesn’t do any decoding of the media streams. If you want to do that, you need to mix in ffmpeg or GStreamer. However, that is quite expensive cpu-wise and you’ll add quite a bit of latency to the conversations. This is probably not the route you want to go.