Enable 2D Lip Sync Wav2Lip Pipeline with OpenVINO Runtime
Authors: Xiake Sun, Kunda Xu
1. Introduction
Lip-sync technologies are widely used in digital human applications, where they enhance the user experience in dialog scenarios.
Wav2Lip is a novel approach for generating accurate 2D lip-synced videos in the wild from just one video and one audio clip. It leverages an accurate lip-sync “expert” model and consecutive face frames to generate accurate, natural lip motion.
In this blog, we introduce how to enable and optimize the Wav2Lip pipeline with OpenVINO™.
Here is an overview of the Wav2Lip pipeline:
2. Setup Environment
Download the Wav2Lip PyTorch model from the link and move it to the checkpoints folder.
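After downloading, you can quickly verify that the checkpoint loads correctly. Here is a minimal Python sketch; the filename wav2lip_gan.pth is an assumption, so use whichever checkpoint file you downloaded:

```python
import os

import torch

# Hypothetical checkpoint filename; use the file you actually downloaded.
ckpt_path = "checkpoints/wav2lip_gan.pth"
assert os.path.exists(ckpt_path), f"Checkpoint not found: {ckpt_path}"

# Load on CPU just to confirm the file is a valid PyTorch checkpoint.
ckpt = torch.load(ckpt_path, map_location="cpu")
print(list(ckpt.keys()))  # the Wav2Lip repo stores weights under "state_dict"
```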
3. PyTorch to OpenVINO™ Model Conversion
The exported OpenVINO™ model will be saved in the checkpoints folder.
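For reference, the conversion can be done with the openvino Python API. The following is a minimal sketch, assuming the model class from the Wav2Lip repository and a checkpoints/wav2lip.pth file; the input shapes follow the original model (mel-spectrogram chunks of (1, 1, 80, 16) and stacked face frames of (1, 6, 96, 96)):

```python
import torch
import openvino as ov

from models import Wav2Lip  # model class from the Wav2Lip repository

# Load the PyTorch checkpoint (path and filename are assumptions).
model = Wav2Lip()
ckpt = torch.load("checkpoints/wav2lip.pth", map_location="cpu")
# The repo's checkpoints prefix weight names with "module." (DataParallel).
weights = {k.replace("module.", ""): v for k, v in ckpt["state_dict"].items()}
model.load_state_dict(weights)
model.eval()

# Example inputs: a mel-spectrogram chunk and six stacked face channels.
example_mel = torch.randn(1, 1, 80, 16)
example_faces = torch.randn(1, 6, 96, 96)

# Convert to an OpenVINO model and save the IR into the checkpoints folder.
ov_model = ov.convert_model(model, example_input=(example_mel, example_faces))
ov.save_model(ov_model, "checkpoints/wav2lip.xml")
```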
4. Run Pipeline Inference with OpenVINO™ Runtime
Here are the parameters with descriptions:
--face_detection_path: path of the face detection OpenVINO™ IR
--wav2lip_path: path of the Wav2Lip OpenVINO™ IR
--inference_device: device on which to run OpenVINO™ inference
--face: input video with face information
--audio: input audio with voice information
--static: set to True to run face detection on a single frame only, for faster inference
The generated video will be saved as results/result_voice.mp4
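Under the hood, the script's OpenVINO™ inference step boils down to compiling the IR and running it on mel/face tensors. Here is a minimal sketch with random inputs; the IR path, device name, and shapes are assumptions matching the conversion step above:

```python
import numpy as np
import openvino as ov

core = ov.Core()

# Compile the converted IR on the device passed via --inference_device.
compiled = core.compile_model("checkpoints/wav2lip.xml", "CPU")

# Dummy inputs with the same shapes used during conversion.
mel = np.random.randn(1, 1, 80, 16).astype(np.float32)
faces = np.random.randn(1, 6, 96, 96).astype(np.float32)

# One synchronous inference; the output is the predicted lip-synced face crop.
result = compiled([mel, faces])
pred = result[compiled.output(0)]
print(pred.shape)  # expected (1, 3, 96, 96)
```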
Here is an example comparing the original video with the video generated by the Wav2Lip pipeline:
5. Conclusion
In this blog, we introduced how to deploy the Wav2Lip pipeline with OpenVINO™ as follows:
- Convert the PyTorch model to an OpenVINO™ model.
- Run and optimize the Wav2Lip pipeline with OpenVINO™ Runtime.