publications
publications by categories in reversed chronological order. generated by jekyll-scholar.
2026
- DisCa: Accelerating Video Diffusion Transformers with Distillation-Compatible Learnable Feature CachingIn Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2026to appear
- HiCache: Training-free Acceleration of Diffusion Models via Hermite Polynomial-based Feature CachingIn The Fourteenth International Conference on Learning Representations, 2026
- Forecast then Calibrate: Feature Caching as ODE for Efficient Diffusion TransformersIn Proceedings of the AAAI Conference on Artificial Intelligence, 2026
- Let Features Decide Their Own Solvers: Hybrid Feature Caching for Diffusion TransformersIn The Fourteenth International Conference on Learning Representations, 2026to appear
- LESA: Learnable Stage-Aware Predictors for Diffusion Model AccelerationIn Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2026to appear
- From Sketch to Fresco: Efficient Diffusion Transformer with Progressive ResolutionIn Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2026to appear
2025
- HunyuanVideo 1.5 Technical ReportTencent Hunyuan Foundation Model Team, As Core Contributors, arXiv preprint arXiv:2511.18870, 2025
- Accelerating Diffusion Transformers with Token-wise Feature CachingIn The Thirteenth International Conference on Learning Representations, 2025
- From Reusing to Forecasting: Accelerating Diffusion Models with TaylorSeersIn Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2025
- dLLM-Cache: Accelerating Diffusion Large Language Models with Adaptive CachingarXiv preprint arXiv:2506.06295, 2025
- EfficientVLA: Training-Free Acceleration and Compression for Vision-Language-Action ModelsIn Advances in Neural Information Processing Systems, 2025
- EEdit: Rethinking the Spatial and Temporal Redundancy for Efficient Image EditingIn Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2025
- Shifting AI Efficiency From Model-Centric to Data-Centric CompressionarXiv preprint arXiv:2505.19147, 2025
- Token Pruning for Caching Better: 9 Times Acceleration on Stable Diffusion for FreearXiv preprint arXiv:2501.00375, 2025
- A Survey on Cache Methods in Diffusion Models: Toward Efficient Multi-Modal GenerationarXiv preprint arXiv:2510.19755, 2025
- FreqCa: Accelerating Diffusion Models via Frequency-Aware CachingarXiv preprint arXiv:2510.08669, 2025
2024
- Accelerating Diffusion Transformers with Dual Feature CachingarXiv preprint arXiv:2412.18911, 2024