
How to Create AI Lipsync Videos
Turn any portrait photo into a singing avatar with ShiMuv's AI lipsync tool. A step-by-step tutorial.
March 10, 2026
ShiMuv's Lipsync tool uses AI to animate a still image to match your audio. The face appears to sing along naturally — no camera, actors or editing skills needed.
Step 1: Open the Lipsync Page
Navigate to /lipsync from the sidebar.
Step 2: Choose Your Image
Upload a front-facing portrait photo, or click Library to select one you've already uploaded or generated with AI. Clear, well-lit photos with the face centered produce the best results. AI-generated portraits from Shi-Studio work great too.
Step 3: Choose Your Audio
Select an audio file from your library — a recording, AI-generated track, or uploaded file. The tool supports MP3, WAV and FLAC.
Step 4: Trim the Audio
Use the built-in audio trimmer to select the section you want to sync. The current maximum is 15 seconds per generation. Drag the start and end handles to select the best part — typically a chorus or hook works well. You can preview the trimmed clip before generating.
Step 5: Generate
Click Generate Lipsync. The AI processes your image and audio in the cloud. This typically takes 30–60 seconds. The finished video appears in the preview player.
Step 6: Save and Share
Click Save to Library to keep the video. From there you can download it, publish it to the community feed, or import it into the Edit Hub to combine multiple lipsync clips into a full music video.