Generating Natural-Language Video Descriptions using LSTM Recurrent Neural Networks
- Speaker: Raymond Mooney, University of Texas
- Date & Time: Wednesday 18 May 2016, 16:00 - 17:00
- Venue: FW26, Computer Laboratory
Abstract
We present a method for automatically generating English sentences describing short videos using deep neural networks. Specifically, we apply convolutional and Long Short-Term Memory (LSTM) recurrent networks to translate videos into English descriptions using an encoder/decoder framework. A sequence of image frames (represented using deep visual features) is first mapped to a vector encoding the full video, and this encoding is then mapped to a sequence of words. We have also explored how statistical linguistic knowledge mined from large text corpora, specifically LSTM language models and lexical embeddings, can improve the descriptions. Experimental evaluation on a corpus of short YouTube videos and movie clips annotated by the Descriptive Video Service demonstrates the capabilities of the technique by comparing its output to human-generated descriptions.
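The encoder/decoder framework sketched in the abstract can be illustrated with a toy example: an LSTM first consumes one frame-feature vector per time step, its final hidden state serves as the video encoding, and a second LSTM then emits word ids greedily, feeding each predicted word back in. The sketch below is a minimal, self-contained numpy illustration of that idea only; all names (`lstm_step`, `describe_video`), dimensions, and the random weights are hypothetical and not taken from the speaker's actual system, which uses learned weights and CNN-derived frame features.

```python
import numpy as np

def lstm_step(x, h, c, W):
    """One LSTM step. W maps the concatenated [input; hidden] vector
    to the four stacked gate pre-activations (i, f, o, g)."""
    z = W @ np.concatenate([x, h])
    i, f, o, g = np.split(z, 4)
    sigmoid = lambda a: 1.0 / (1.0 + np.exp(-a))
    c = sigmoid(f) * c + sigmoid(i) * np.tanh(g)   # new cell state
    h = sigmoid(o) * np.tanh(c)                    # new hidden state
    return h, c

def describe_video(frames, W_enc, W_dec, W_out, E, bos, eos, max_len=10):
    """Encode a sequence of frame feature vectors into one state,
    then greedily decode a sequence of word ids from it."""
    d = W_enc.shape[0] // 4
    h, c = np.zeros(d), np.zeros(d)
    for x in frames:                    # encoder: one step per frame
        h, c = lstm_step(x, h, c, W_enc)
    words, w = [], bos
    for _ in range(max_len):            # decoder: feed back previous word
        h, c = lstm_step(E[w], h, c, W_dec)
        w = int(np.argmax(W_out @ h))   # greedy choice of next word
        if w == eos:
            break
        words.append(w)
    return words
```

With random weights the output is meaningless; in the described system the weights would be trained end-to-end on video/sentence pairs, and `frames` would hold deep visual features extracted by a convolutional network.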
Series: This talk is part of the NLIP Seminar Series.
Included in Lists
- All Talks (aka the CURE list)
- bld31
- Cambridge Centre for Data-Driven Discovery (C2D3)
- Cambridge Forum of Science and Humanities
- Cambridge Language Sciences
- Cambridge talks
- Chris Davis' list
- Computer Education Research
- Computing Education Research
- Department of Computer Science and Technology talks and seminars
- FW26, Computer Laboratory
- Graduate-Seminars
- Guy Emerson's list
- Interested Talks
- Language Sciences for Graduate Students
- ndk22's list
- NLIP Seminar Series
- ob366-ai4er
- PMRFPS's
- rp587
- School of Technology
- Simon Baker's List
- Trust & Technology Initiative - interesting events
- yk449
Note: Ex-directory lists are not shown.