Another thing you could do (haven’t tested yet), if you just need an active speaker, you could for example let ffmpeg record with rtp and let ffmpeg wait for a new connection when you just switch producer and reconnect to ffmpeg. ffmpeg has a parameter -stimeout 1000 which lets you set the timeout before it closes the rtp stream.
Then just like @ethand91 said, you can combine all the files together. I’m currently working on a tool that uses ffmpeg. With as input only the paths to the video files (of the seperately recorded users) and their start time. It then outputs a single video just like a video conference where you see multiple people coming on and off.
The layout is also customizable.
Now just adding some more support for when there is video-only, or audio-only mixed into it and also putting the most active speaker in front (based on audio or preset or something).
Still work in progress but I’ll probably release it soon.