Wanrong Zhu

Wanrong Zhu

CS Ph.D. Candidate

University of California, Santa Barbara


Hi, I am a Ph.D. candidate in the Natural Language Processing Group at UCSB, advised by William Wang. Before joining UCSB, I received my B.S. degree in Computer Science from Peking University.


  wanrongzhu [at] cs.ucsb.edu
   Henley Hall, UCSB


Education

  • University of California, Santa Barbara

    Ph.D. in Computer Science

    Sep. 2019 - Present

  • Peking University

    B.S. in Computer Science

    Sep. 2015 - July 2019

Interests

  • Natural Language Processing
  • Language-and-Vision
  • Text Generation

Experience

  • Mosaic Team, AI2

    Research Intern. Hosts: Jack Hessel and Youngjae Yu

    June 2022 - Sep. 2022

  • Google Research

    Research Intern. Hosts: Bo Pang and Ashish Thapliyal

    June 2021 - Oct. 2021

  • AdsAI Team, Google

    Research Intern. Host: Pradyumna Narayana

    June 2020 - Oct. 2020

  • Language Technology Institution, Carnegie Mellon University

    Research Assistant. Advisor: Zhiting Hu

    July 2018 - Sep. 2018

Publications & Preprints

LayoutGPT: Compositional Visual Planning and Generation with Large Language Models

Preprint (arXiv 2305.15393)
LayoutGPT: Compositional Visual Planning and Generation with Large Language Models

Multimodal Procedural Planning via Dual Text-Image Prompting

Preprint (arXiv 2305.01795)
Multimodal Procedural Planning via Dual Text-Image Prompting

Multimodal C4: An Open, Billion-scale Corpus of Images Interleaved With Text

Preprint (arXiv 2304.06939)
Multimodal C4: An Open, Billion-scale Corpus of Images Interleaved With Text

OpenFlamingo: An Open-Source Framework for Training Vision-Language Models with In-Context Learning

Stay-tuned for the technical report!
OpenFlamingo: An Open-Source Framework for Training Vision-Language Models with In-Context Learning

Collaborative Generative AI: Integrating GPT-k for Efficient Editing in Text-to-Image Generation

Preprint (arXiv 2305.11317)
Collaborative Generative AI: Integrating GPT-k for Efficient Editing in Text-to-Image Generation

CLIP also Understands Text: Prompting CLIP for Phrase Understanding

Preprint (arXiv 2210.05836)
CLIP also Understands Text: Prompting CLIP for Phrase Understanding

Texar: A Modularized, Versatile, and Extensible Toolkit for Text Generation

The 57th Annual Meeting of the Association for Computational Linguistics:System Demonstrations (ACL 2019 System Demonstration)
Texar: A Modularized, Versatile, and Extensible Toolkit for Text Generation

Text Infilling

Preprint (arXiv 1901.00158)
Text Infilling