Wanrong Zhu
An Yan
Latest
GPT-4V in Wonderland: Large Multimodal Models for Zero-Shot Smartphone GUI Navigation
Visualize Before You Write: Imagination-Guided Open-Ended Text Generation
CLIP also Understands Text: Prompting CLIP for Phrase Understanding
ImaginE: An Imagination-Based Automatic Evaluation Metric for Natural Language Generation
Multimodal Text Style Transfer for Outdoor Vision-and-Language Navigation