You can use pre-built transformers in the Vonage Media Processor library or create your own custom audio or video transformer to apply to published video.
You can use the Publisher.AudioTransformers
and
Publisher.VideoTransformers
properties to apply audio and video transformers to a stream.
Important:
The Vonage Video Windows SDK includes two ways to implement transformers:
Moderate — For video, you can apply the background blur and background replacement video transformers included in the Vonage Media Library. See Applying a video transformer from the Vonage Media Library. For audio, you can apply the noise suppression audio transformer included in the Vonage Media Library. See Applying an audio transformer from the Vonage Media Library.
Advanced — You can create your own custom video transformers and custom audio transformers.
NVIDIA GPUs are recommended for optimal performance when using Vonage Media Library transformers.
Test on other configurations to check for support.
Transformers require adequate processor support. Even on supported devices, transformers may not be stable when background processes limit available processing resources. The same limitations may apply with custom media transformers in addition to transformers from the Vonage Media Library.
Windows may throttle CPU performance to conserve energy (for example, to extend battery life). This may result in suboptimal transformer performance and introduce unwanted audio or video artifacts. We recommend setting your operating system to use the best performance mode (or to not use low-power mode) in such cases.
Many video transformations (such as background blur) use segmentation to separate the speaker from the background. For best results, use proper lighting and a plain background. Insufficient lighting or complex backgrounds may cause video artifacts (for example, the speaker or a hat the speaker is wearing may get blurred along with the background).
You should perform benchmark tests on as many supported devices as possible, regardless of the transformation.
Due to significant increased size when integrating Vonage Media Library into SDK, from OpenTok SDK v2.27.2 the Media Transformers are available via the opt-in Vonage Media Library. This library needs to explicitly be added to the project.
The Vonage Media Library was initially embedded in OpenTok SDK. If your OpenTok SDK version is older than 2.27.2, move directly to Applying a video transformer from the Vonage Media Library and Applying an audio transformer from the Vonage Media Library.
A nuget package is available with name "Vonage.Client.Video.Transformers"
. Install it in your project.
If a call to VideoTransformer(string name, string properties)
or AudioTransformer(string name, string properties)
is made without loading the library, the transformer returned will be null. An exception will be raised with the following error code 0x0A000006 - OTC_MEDIA_TRANSFORMER_OPENTOK_TRANSFORMERS_LIBRARY_NOT_LOADED
.
Use the VideoTransformer(string name, string properties)
method to create a video transformer that uses a named transformer from the Vonage Media Library.
Two transformers are supported:
Background blur. For this filter, set the name
parameter to "BackgroundBlur"
.
And set a properties
parameter to a JSON string. The format of the JSON is "{"radius":"None"}".
Valid values for the radius
property are "None", "High", and "Low".
If you set the radius
property to "Custom", add a custom_radius
property to the JSON
string: "{"radius":"Custom","custom_radius":"value"}" (where custom_radius
is a positive integer
defining the blur radius).
VideoTransformer backgroundBlur = new VideoTransformer("BackgroundBlur", "{\"radius\":\"High\"}");
List<VideoTransformer> transformers = new ArrayList<VideoTransformer>
{
backgroundBlur
};
publisher.VideoTransformers = transformers;
Background replacement. For this filter, set the name
parameter to "BackgroundReplacement"
.
And set a properties
parameter to a JSON string. The format of the JSON is "{"image_file_path":"path/to/image"}", where image_file_path
is the absolute file path of a local image to use as virtual background. Supported image formats are PNG and JPEG.
VideoTransformer backgroundReplacement = new VideoTransformer("BackgroundReplacement", "{\"image_file_path\":\"path-to-image\"}");
List<VideoTransformer> transformers = new ArrayList<VideoTransformer>
{
backgroundReplacement
};
publisher.VideoTransformers = transformers;
Use the AudioTransformer(string name, string properties)
method to create an audio transformer that uses a named transformer from the Vonage Media Library.
One transformer is supported:
Noise Suppression. For this filter, set the name
parameter to "NoiseSuppression"
. Set the properties
parameter to a JSON string defining properties for the transformer. For the noise suppression transformer, this JSON does not include any property at the moment. Set it to an empty string ""
.
AudioTransformer noiseSuppression = new AudioTransformer("NoiseSuppression", "");
List<AudioTransformer> transformers = new ArrayList<AudioTransformer>
{
noiseSuppression
};
publisher.AudioTransformers = transformers;
Create a class that implements the ICustomVideoTransformer
interface. Implement the ICustomVideoTransformer.Transform()
method, applying a transformation to the VideoFrame
object passed into the method.
The ICustomVideoTransformer.Transform
method is triggered for each video frame:
public class MyCustomTransformer : IVideoTransformer
{
public void Transform(VideoFrame frame)
{
// transformer implementation
PixelFormat pixelFormat = frame.PixelFormat;
int numberOfPlanes = frame.NumberOfPlanes;
//invert U and V planes
IntPtr[] planes = new IntPtr[] { frame.GetPlane(0), frame.GetPlane(2), frame.GetPlane(1) };
int[] strides = new int[] { frame.GetPlaneStride(0), frame.GetPlaneStride(2), frame.GetPlaneStride(1) };
frame.ConvertInPlace(pixelFormat, planes, strides);
}
}
Then set the PublisherKit.VideoTransformers
property to an array that includes the object that implements the
OTCustomVideoTransformer interface:
MyCustomTransformer myCustomTransformer = new();
List<VideoTransformer> transformers = new ArrayList<VideoTransformer>
{
myCustomTransformer
};
publisher.VideoTransformers = transformers;
You can combine the Vonage Media library transformer (see the previous section) with custom transformers or apply
multiple custom transformers by adding multiple VideoTransformer objects to the ArrayList used
for the PublisherKit.VideoTransformers
property.
Create a class that implements the ICustomAudioTransformer
interface. Implement the ICustomAudioTransformer.Transform()
method, applying a transformation
to the AudioData
object passed into the method. The CustomAudioTransformer.Transform()
method
is triggered for each audio frame. The following example applies a simple amplitude limiter on
the audio:
public class MyCustomTransformer : IAudioTransformer
{
private const short CROP_LIMIT = (short)(short.MaxValue * 0.5);
public unsafe void Transform(AudioData frame)
{
// transformer implementation
short* data = (short*)frame.SampleBuffer;
int samplesXChannels = (int)frame.NumberOfSamples * (int)frame.NumberOfChannels;
for (int s = 0; s < samplesXChannels; ++s)
{
if (data[s] > CROP_LIMIT)
data[s] = CROP_LIMIT;
else if (data[s] < -CROP_LIMIT)
data[s] = -CROP_LIMIT;
}
}
}
Then set the OTPublisherKit.AudioTransformers
property to an array that includes the object that implements the
CustomAudioTransformer interface:
MyCustomTransformer myCustomTransformer = new();
List<VideoTransformer> transformers = new ArrayList<VideoTransformer>
{
myCustomTransformer
};
publisher.AudioTransformers = transformers;
You can apply multiple custom transformers by adding multiple OTPublisherKit.AudioTransformer objects to the ArrayList used
for the OTPublisherKit.AudioTransformers
property.
To clear video transformers for a publisher, set the Publisher.VideoTransformers
property to an empty array.
publisher.VideoTransformers = new ArrayList<VideoTransformer> {};
To clear audio transformers for a publisher, set the Publisher.AudioTransformers
property to an empty array.
publisher.AudioTransformers = new ArrayList<AudioTransformer> {};
See this sample at the opentok-windows-sdk-samples repo on GitHub.