LLaPa: A Vision-Language Model Framework for Counterfactual-Aware Procedural Planning
ACM International Conference on Multimedia
Shibo Sun, Xue Li, Donglin Di, Mingjie Wei, Lanshun Nie, Weinan Zhang, Dechen Zhan, Yang Song, Lei Fan