Morency, “Deep multimodal fusion for persuasiveness prediction,” in Proc. ICMI, 2016.
[2] H....Zhou, “Stacked latent attention for multimodal reasoning,” in Proc. CVPR, 2018.
[25] A....Zhang, “Multimodal residual learning for visual QA,” in Proc. NIPS, 2016.
[29] H. Noh, P....Thome, “MUTAN: Multimodal Tucker fusion for visual question answering,” in Proc....Morency, “Efficient low-rank multimodal fusion with modality-specific factors,” in Proc.