Visually informed embedding of word (VIEW) is a tool for transferring multimodal background knowledge to NLP algorithms.
Implementation of Independent Multimodal Background Subtraction, based on the paper by Bloisi and Iocchi.
A PyTorch implementation of "Multimodal Generative Models for Scalable Weakly-Supervised Learning" (https://arxiv.org/abs/1802.05335)
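The MVAE in that paper fuses unimodal Gaussian posteriors with a product of experts: precisions add, and the fused mean is the precision-weighted average. A minimal PyTorch sketch of that fusion step (tensor shapes and the unit-Gaussian prior expert follow the paper; exact variable names are illustrative):

```python
import torch

def product_of_experts(mus, logvars, eps=1e-8):
    """Fuse Gaussian experts: summed precisions, precision-weighted mean.

    mus, logvars: tensors of shape (n_experts, batch, latent_dim).
    """
    precisions = 1.0 / (torch.exp(logvars) + eps)
    mu = (mus * precisions).sum(dim=0) / precisions.sum(dim=0)
    var = 1.0 / precisions.sum(dim=0)
    return mu, torch.log(var + eps)

# Example: combine a unit-Gaussian prior expert with two modality encoders.
batch, dim = 4, 16
prior_mu, prior_lv = torch.zeros(1, batch, dim), torch.zeros(1, batch, dim)
img_mu, img_lv = torch.randn(1, batch, dim), torch.randn(1, batch, dim)
txt_mu, txt_lv = torch.randn(1, batch, dim), torch.randn(1, batch, dim)
mu, logvar = product_of_experts(
    torch.cat([prior_mu, img_mu, txt_mu]),
    torch.cat([prior_lv, img_lv, txt_lv]),
)
```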
End-to-end multimodal emotion and gender recognition with a dynamically weighted joint loss.
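One common way to weight a joint loss dynamically is homoscedastic-uncertainty weighting (Kendall et al.), where a learned log-variance per task scales its loss. Whether this repository uses that exact scheme is an assumption; the sketch below just shows the general pattern:

```python
import torch
import torch.nn as nn

class DynamicJointLoss(nn.Module):
    """Weight per-task losses with learned log-variances (Kendall-style).

    A generic sketch; the repo's actual weighting scheme may differ.
    """
    def __init__(self, n_tasks=2):
        super().__init__()
        self.log_vars = nn.Parameter(torch.zeros(n_tasks))

    def forward(self, losses):
        total = 0.0
        for loss, log_var in zip(losses, self.log_vars):
            # exp(-log_var) down-weights noisy tasks; log_var regularizes.
            total = total + torch.exp(-log_var) * loss + log_var
        return total

joint = DynamicJointLoss(n_tasks=2)
emotion_loss, gender_loss = torch.tensor(1.2), torch.tensor(0.4)
print(joint([emotion_loss, gender_loss]))
```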
HongminWu.github.io
Attention modeling for image captioning, as described in 'Show, Attend and Tell'.
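The soft attention in 'Show, Attend and Tell' scores each spatial feature against the decoder's hidden state, softmaxes the scores, and takes a convex combination of the features. A minimal PyTorch sketch (dimensions are illustrative):

```python
import torch
import torch.nn as nn

class SoftAttention(nn.Module):
    """Soft (deterministic) attention over spatial image features."""
    def __init__(self, feat_dim, hidden_dim, attn_dim):
        super().__init__()
        self.feat_proj = nn.Linear(feat_dim, attn_dim)
        self.hidden_proj = nn.Linear(hidden_dim, attn_dim)
        self.score = nn.Linear(attn_dim, 1)

    def forward(self, feats, hidden):
        # feats: (batch, n_regions, feat_dim); hidden: (batch, hidden_dim)
        e = self.score(torch.tanh(
            self.feat_proj(feats) + self.hidden_proj(hidden).unsqueeze(1)
        )).squeeze(-1)                       # (batch, n_regions)
        alpha = torch.softmax(e, dim=1)      # attention weights sum to 1
        context = (alpha.unsqueeze(-1) * feats).sum(dim=1)  # (batch, feat_dim)
        return context, alpha

attn = SoftAttention(feat_dim=512, hidden_dim=256, attn_dim=128)
context, alpha = attn(torch.randn(2, 196, 512), torch.randn(2, 256))
```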
Torch code for Visual Question Generation
My solution, achieving 0.67 accuracy.
Unsupervised localization of text in images.
A detailed description of how to extract and align text, audio, and video features at the word level.
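A common recipe for word-level alignment is to average the audio or video frames whose timestamps fall inside each word's forced-alignment interval. A minimal NumPy sketch under that assumption (the timestamp format and feature dimensions are placeholders, not this repository's exact pipeline):

```python
import numpy as np

def align_to_words(frame_feats, frame_times, word_intervals):
    """Average frame features inside each word's (start, end) interval.

    frame_feats: (n_frames, feat_dim); frame_times: (n_frames,) in seconds;
    word_intervals: list of (start, end) pairs from a forced aligner.
    """
    aligned = []
    for start, end in word_intervals:
        mask = (frame_times >= start) & (frame_times < end)
        if mask.any():
            aligned.append(frame_feats[mask].mean(axis=0))
        else:
            aligned.append(np.zeros(frame_feats.shape[1]))  # no frames in span
    return np.stack(aligned)

feats = np.random.randn(100, 40)           # e.g., 40-dim acoustic features
times = np.linspace(0.0, 2.0, 100)         # frame timestamps in seconds
words = [(0.0, 0.5), (0.5, 1.2), (1.2, 2.0)]
print(align_to_words(feats, times, words).shape)  # (3, 40)
```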
Unsupervised specificity-guided optimization of image captioning models to encourage meaningful diversity in the generated captions.
Visual Question Answering System
VoiceGAN - Hallucinating Faces from Voices
Homework for CS229 Machine Learning.
Dataset for Visually Indicated Sound Generation by Perceptually Optimized Classification
Package for Multimodal Autoencoders in TensorFlow / Keras
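A multimodal autoencoder typically encodes each modality separately, fuses the codes into a shared bottleneck, and reconstructs every modality from it. A minimal tf.keras sketch of that architecture (layer sizes and names are placeholders, not this package's API):

```python
import tensorflow as tf
from tensorflow.keras import layers, Model

def build_multimodal_autoencoder(dim_a=128, dim_b=64, latent=32):
    in_a = layers.Input((dim_a,), name="modality_a")
    in_b = layers.Input((dim_b,), name="modality_b")
    # Modality-specific encoders feed a shared bottleneck code.
    h = layers.Concatenate()([
        layers.Dense(64, activation="relu")(in_a),
        layers.Dense(64, activation="relu")(in_b),
    ])
    z = layers.Dense(latent, activation="relu", name="shared_code")(h)
    # Modality-specific decoders reconstruct each input from the shared code.
    out_a = layers.Dense(dim_a, name="recon_a")(
        layers.Dense(64, activation="relu")(z))
    out_b = layers.Dense(dim_b, name="recon_b")(
        layers.Dense(64, activation="relu")(z))
    model = Model([in_a, in_b], [out_a, out_b])
    model.compile(optimizer="adam", loss="mse")
    return model

model = build_multimodal_autoencoder()
model.summary()
```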
PyTorch implementation of HUSE: Hierarchical Universal Semantic Embeddings (https://arxiv.org/pdf/1911.05978.pdf).
Implementation of CVPR 2020 paper "MMTM: Multimodal Transfer Module for CNN Fusion"
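MMTM is a squeeze-and-excitation-style module: it squeezes each modality's feature map by global average pooling, mixes the squeezed vectors through a shared layer, and re-weights each modality's channels with per-modality gates. A minimal sketch following the paper's description (the reduction ratio and layer sizes are assumptions):

```python
import torch
import torch.nn as nn

class MMTM(nn.Module):
    """Multimodal Transfer Module: cross-modal channel recalibration."""
    def __init__(self, ch_a, ch_b, ratio=4):
        super().__init__()
        joint = (ch_a + ch_b) // ratio
        self.shared = nn.Linear(ch_a + ch_b, joint)
        self.excite_a = nn.Linear(joint, ch_a)
        self.excite_b = nn.Linear(joint, ch_b)

    def forward(self, a, b):
        # a: (batch, ch_a, H, W); b: (batch, ch_b, H', W')
        squeeze = torch.cat([a.mean(dim=(2, 3)), b.mean(dim=(2, 3))], dim=1)
        z = torch.relu(self.shared(squeeze))
        ea = 2.0 * torch.sigmoid(self.excite_a(z))   # per-channel gates
        eb = 2.0 * torch.sigmoid(self.excite_b(z))
        return a * ea[:, :, None, None], b * eb[:, :, None, None]

mmtm = MMTM(ch_a=64, ch_b=32)
out_a, out_b = mmtm(torch.randn(2, 64, 8, 8), torch.randn(2, 32, 4, 4))
```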
4th place (top 1%) solution for Shopee Code League 2020 - Product Detection