Deprecated: The each() function is deprecated. This message will be suppressed on further calls in /home/zhenxiangba/zhenxiangba.com/public_html/phproxy-improved-master/index.php on line 456
Paper page - OmniAvatar: Efficient Audio-Driven Avatar Video Generation with Adaptive Body Animation
[go: Go Back, main page]

Papers
arxiv:2506.18866

OmniAvatar: Efficient Audio-Driven Avatar Video Generation with Adaptive Body Animation

Published on Jun 23, 2025
Authors:
,
,
,
,

Abstract

OmniAvatar enhances full-body audio-driven human animation with improved lip-sync and natural movements using pixel-wise audio embeddings and LoRA training.

Significant progress has been made in audio-driven human animation, while most existing methods focus mainly on facial movements, limiting their ability to create full-body animations with natural synchronization and fluidity. They also struggle with precise prompt control for fine-grained generation. To tackle these challenges, we introduce OmniAvatar, an innovative audio-driven full-body video generation model that enhances human animation with improved lip-sync accuracy and natural movements. OmniAvatar introduces a pixel-wise multi-hierarchical audio embedding strategy to better capture audio features in the latent space, enhancing lip-syncing across diverse scenes. To preserve the capability for prompt-driven control of foundation models while effectively incorporating audio features, we employ a LoRA-based training approach. Extensive experiments show that OmniAvatar surpasses existing models in both facial and semi-body video generation, offering precise text-based control for creating videos in various domains, such as podcasts, human interactions, dynamic scenes, and singing. Our project page is https://omni-avatar.github.io/.

Community

Sign up or log in to comment

Get this paper in your agent:

hf papers read 2506.18866
Don't have the latest CLI?
curl -LsSf https://hf.co/cli/install.sh | bash

Models citing this paper 5

Browse 5 models citing this paper

Datasets citing this paper 0

No dataset linking this paper

Cite arxiv.org/abs/2506.18866 in a dataset README.md to link it from this page.

Spaces citing this paper 15

Browse 15 spaces citing this paper

Collections including this paper 0

No Collection including this paper

Add this paper to a collection to link it from this page.