Representations Before Pixels: Semantics-Guided Hierarchical Video Prediction Paper • 2604.11707 • Published 6 days ago
Boosting Generative Image Modeling via Joint Image-Feature Synthesis Paper • 2504.16064 • Published Apr 22, 2025
Advancing Semantic Future Prediction through Multimodal Visual Sequence Transformers Paper • 2501.08303 • Published Jan 14, 2025