Tags

Vision-and-Language

In-Context Learning

Natual Language Generation

Causality

Language Planning

Dense Video Captioning

Natual Language Understanding

Automatic Evaluation

Vision-and-Language Navigation

Text Generation