Implementing Recording In Mediasoup Conference Software

HGB467 · December 9, 2023, 4:40am

Hi, I am Looking To Implement Conference Recording In A Web Application That Uses Mediasoup. I Have Tried Using Puppeteer To Join Meeting As A Ghost Participant And Then Use Puppeteer-Stream Library To Record It But That Is Using Too Much CPU And The Results (Frame Rates) Are Not Very Good. Can Anyone Suggest Me The Best Way To Record A Mediasoup Conference. Any Help Will Be Much Appreciated!

zaidiqbal · December 9, 2023, 11:04am

Recording is going to be CPU intensive, because of the video, audio decoding and capturing stuff. How much cpu it is taking in your side? You are already using good approach which uses extensions to capture page stream rather than classic canvas based solution.

zaidiqbal · December 9, 2023, 11:20am

There is another way where you use puppeteer without headless mode and then use xvfb to display it to virtual display on server and then capture the things directly using getDisplayMedia that way no MediaRecorder is needed and no transfer for data from browser to nodejs is required that can definitely reduce CPU usage.

HGB467 · December 9, 2023, 1:37pm

Hey, Thanks For Your Reply. I Have Used The New Headless Mode (getDisplayMedia Works In That) But Was Unable To Capture Tab Audio. Also You Said Here No MediaRecorder, So How Do I Record It Without MediaRecorder?

zaidiqbal · December 9, 2023, 1:45pm

How you used it?

zaidiqbal · December 9, 2023, 1:57pm

This one I mentioned here is different, you don’t use puppeteer-stream library instead you use simple puppeteer and launch browser in headful mode and use xvfb as virtual display on server and that enables you to capture video, audio using getDisplayMedia js right from browser.

HGB467 · December 9, 2023, 2:09pm

Yes, I Had Tried That But Was Unable To Capture Tab Audio For Some Reason

HGB467 · December 9, 2023, 2:10pm

Yes, But That Will Also Use MediaRecorder, Right?

zaidiqbal · December 9, 2023, 3:31pm

No, you can capture both video, audio directly from xvfb via getDisplayMedia without using MediaRecorder.

HGB467 · December 9, 2023, 4:36pm

Using FFmpeg (X11Grab And Pulse Audio), Right?

zaidiqbal · December 9, 2023, 4:38pm

Not sure, but we used it while ago and we were able to get both video, audio with xvfb

zaidiqbal · December 11, 2023, 9:46am

@HGB467 we use xvfb, getDisplayMedia to capture both audio, video from the tab below are the options we use for getDisplayMedia:

const constraints = {
video: true,
audio: {
channelCount: 1,
sampleRate: 16000,
sampleSize: 16,
volume: 1,
echoCancellation: false,
noiseSuppression: false,
},
systemAudio: “include”,
preferCurrentTab: true,
};

These 2 parameter are the ones that make getDisplayMedia to capture audio of the tab:

systemAudio: “include”,
preferCurrentTab: true,

These are the parameters we use in puppeteer:

const puppeteerArgs = [
“–autoplay-policy=no-user-gesture-required”,
“–enable-usermedia-screen-capturing”,
“–allow-http-screen-capture”,
“–no-sandbox”,
“–auto-select-desktop-capture-source=Go-live”,
“–disable-setuid-sandbox”,
“–disable-web-security”,
“–use-gl=egl”,
“–disable-gpu”,
“–enable-webgl-image-chromium”,
“–start-maximized”,
“–start-fullscreen”,
“–enable-webgl-developer-extensions”,
“–enable-webgl-draft-extensions”,
];

dimoochka · December 12, 2023, 1:31am

Might I suggest a different approach?

It seems that your workflow has overhead that may be unnecessary (hence your CPU issues). This is how I conceptualize what you’re doing:

Step 1: (source client: raw stream -> encode -> producer ) -> RTP stream ->

Step 2: (mediasoup server: producer -> router -> consumer) -> RTP stream ->

Step 3: (puppeteer client: consumer -> decode -> raw stream -> encode)

In Step 3 you’re transcoding the same stream unnecessarily because you’re using puppeteer. I’m not sure if you can get around this with puppeteer because it’s built on libwertc which I think doesn’t give you access to the raw stream. You have other options -

If you’re also developing the mediasoup server, you can use the server-side consumer.on(‘rtp’) to capture the raw RTP stream (reference: mediasoup :: API). You’ll have to extract the encoded video from the RTP stream (no idea how to do that, but it’s probably not that hard). FFMPEG can probably do that (StreamingGuide – FFmpeg); naturally VLC too. The point is you want to avoid decoding (and especially encoding) the video since that’s where the CPU gets taxed the most.

If you’re not developing the mediasoup server, that’s going to be a bit more challenging. The hard part is figuring out the RTP stream port/IP and connecting it to something like VLC in headless mode to save the contents of the RTP stream.

Good luck!

zaidiqbal · December 12, 2023, 6:00am

This approach is even more performant but if we involve ffmpeg, gstreamer it still complicate stuff. I think people prefer headless browser because it let them have freedom over the streams and especially the design where you can easily change the appearance of the recording session you have.

HGB467 · December 12, 2023, 7:06am

Hey, Thank You For This Solution. The Problem Here Is How Do We Get That Stream In Nodejs Land Or To Save It To A File. Do We Use Mediasoup To Produce It And Then We Get Access To It Through Direct Consumer?

zaidiqbal · December 12, 2023, 7:23am

You will have this stream in browser side and these are ways to use it:

produce it to mediasoup server
or upload this stream to nodejs server using some api.
or use mediarecorder to upload the chunks to nodejs side and save it to file
or there may be way in puppeteer where you can directly access the stream in nodejs but that will require some r&d

dimoochka · December 12, 2023, 8:27pm

Not sure if you were referring to my reply, but assuming you were. On the Node server, I believe you would use a DirectTransport (mediasoup :: API - the documentation explains the process). I think the workflow is that you would make a DirectTransport on the same Router (a) where your stream is being produced (source client → producer → WebRtcTransport → Router [a] → Node server producer). Then you DirectTransport.consume() the Node server producer. Finally, use DirectTransport.on(‘rtcp’) or consumer.on(‘rtp’) to capture the packets on the Node server.

snnz · December 12, 2023, 10:05pm

But with this appoach you are recording individual streams, not the entire page with all layout and mixed sound.

dimoochka · December 12, 2023, 10:59pm

Agreed - wasn’t sure what his use case and server setup was. He was saying that he was CPU bound so I figured he might need to work around the transcoding.

HGB467 · December 13, 2023, 12:20pm

Thank You So Much For These Solutions. I Had Tried Few Of Them Before As Well. What I Have Come To Know Is That The Best Way To Capture Display Is By Using FFmpeg’s X11Grab And Pulse Audio (Or ALSA). I Am Currently Trying That And Getting Good Results

Topic		Replies	Views
Using Selenium for screen-recording Integration	18	3663	June 21, 2022
Understanding Mediasoup 3 (Part II) Integration	3	210	January 15, 2024
FFmeg as consumer Integration	4	1175	December 2, 2019
A question about desktop capture quality / framerate mediasoup libraries	2	986	December 2, 2019
Is it possible to Recording conference call? mediasoup libraries	18	5828	August 25, 2022

Implementing Recording In Mediasoup Conference Software

Related topics