AutoLipSync Pro generates lip sync animations from speech audio using offline AI voice recognition. It converts speech to text and maps phonemes to viseme shape keys or pose actions. The tool requires 13 specific shape keys or pose actions to function, and includes a reference guide to assist with setup.
The transcription process runs in the background, allowing the Blender interface to remain responsive. While optimized for English, it can process other languages by approximating phonemes to English equivalents. The addon also features automatic eye blinking, a built-in audio converter, and includes a test character with configured shape keys and pose actions.