Visual and Embodied Dialog is a novel task that requires an AI agent to hold a meaningful dialog with humans, in natural, conversational language, about visual content in the surrounding space. To perform well on this task, the agent needs to ground each query not only in the visual content but also in the dialog history, and to build appropriate joint models of scene and dialog understanding.

This is a new area of research in Multimodal Scene Understanding and Conversational Systems that brings together researchers from Computer Vision, Dialog Systems and Deep Learning to advance the state of the art in Visually Grounded Conversational Systems. In the Anticipatory Computing Lab, we conduct research in multimodal sense-making to create compelling future AI and intelligent-system usages that require the assimilation of a variety of technologies, such as Computer Vision, Audio Understanding and Language Understanding. This project brings together some of these technologies and fuses them appropriately to enable visual dialog capability.

To help us develop state-of-the-art technologies in this area, we want to bring on board a part-time summer consultant from a reputable university. The hired candidate will work with our researchers to build models and prototypes on a suitable dataset for multimodal dialog understanding. The candidate should be a recognized expert in Multimodal Dialog Understanding and should have published state-of-the-art results at top conferences in Spoken Dialog Systems and Deep Learning.



· Help develop deep learning-based models for interesting problems in visual dialog understanding.

· Provide consulting on state-of-the-art models, practices and implementations for various multimodal architectures.

· Actively take on module development responsibilities in the project.

· Expert knowledge of Deep Learning-based multimodal architectures.

· Proficient understanding of multimodal data collection systems.

· Proficiency in one or more Deep Learning libraries (TensorFlow, Keras, PyTorch) for data processing and modeling.

