Project to transform a natural language description into an image using Generative Adversarial Networks.
-
Updated
Dec 9, 2017 - Python
Project to transform a natural language description into an image using Generative Adversarial Networks.
App to cheer you up with some awesome quotes when depressed using deep learning
A two stage multi-modal loss model along with rigid body transformations to regress 3D bounding boxes
Real-world photo sequence question answering system (MemexQA). CVPR'18 and TPAMI'19
Attention Based Multi-modal Emotion Recognition; Stanford Emotional Narratives Dataset
Deep Multi-Sensory Object Category Recognition Using Interactive Behavioral Exploration
Implementation of the paper "Stacked Attention Networks for Image Question Answering" in Tensorflow
This repository contains the code for a video captioning system inspired by Sequence to Sequence -- Video to Text. This system takes as input a video and generates a caption in English describing the video.
DeepCU: Integrating Both Common and Unique Latent Information for Multimodal Sentiment Analysis, IJCAI-19
Slip or Not? Unsupervised Learning to Understand Physical Scene Using Multimodal Variational Physics Inference Network
Deep Multiset Canonical Correlation Analysis - An extension of CCA to multiple datasets
End-to-End learning framework for circular RNA classification from other long non-coding RNA using multimodal deep learning
Meme Sentiment Analysis SemEval 2020 Task 9
Classifying multimodal health data with LSTMs
My master thesis: Siamese multi-hop attention for cross-modal retrieval.
PyTorch code for cross-modal-retrieval on Flickr8k/30k using Bert and EfficientNet
Multimodal speaker diarization using pre-trained audio-visual synchronization model
A multi-modal deep learning model trained to predict a movie's genre given the movie poster and overview as an input.
This repository contains the Pytorch implementation for our SCAI (EMNLP-2018) submission "A Knowledge-Grounded Multimodal Search-Based Conversational Agent"
Add a description, image, and links to the multimodal-deep-learning topic page so that developers can more easily learn about it.
To associate your repository with the multimodal-deep-learning topic, visit your repo's landing page and select "manage topics."