Conversion samples
1. Conversion samples with benchmark
Utilize speech audio from the Source and a facial image from the Target.
(HYFace: The proposed method. FVMVC: The benchmark. Sheng
., 2023)
Sample 1 (male-to-male)
Source Speaker
|
Target Speaker
|
Converted
FVMVC HYFace |
Sample 2 (male-to-male)
Source Speaker
|
Target Speaker
|
Converted
FVMVC HYFace |
Sample 3 (female-to-female)
Source Speaker
|
Target Speaker
|
Converted
FVMVC HYFace |
Sample 4 (female-to-female)
Source Speaker
|
Target Speaker
|
Converted
FVMVC HYFace |
Sample 5 (male-to-female)
Source Speaker
|
Target Speaker
|
Converted
FVMVC HYFace |
Sample 6 (male-to-female)
Source Speaker
|
Target Speaker
|
Converted
FVMVC HYFace |
Sample 7 (female-to-male)
Source Speaker
|
Target Speaker
|
Converted
FVMVC HYFace |
Sample 8 (female-to-male)
Source Speaker
|
Target Speaker
|
Converted
FVMVC HYFace |
