【ACM MM论文集】国际多媒体顶级会议ACM Multimedia 2017 Open Access Repository

点击上方“专知”关注获取更多AI知识!

【导读】第25届ACM国际多媒体会议(ACM International Conference on Multimedia, 简称ACMMM)于2017年10月23日至27日在美国硅谷Mountain View隆重举行。自1993年首次召开以来,ACMMM每年召开一次,已经成为多媒体领域顶级会议,也是中国计算机学会推荐的A类国际学术会议热门方向有大规模图像视频分析、社会媒体研究、多模态人机交互、计算视觉、计算图像等等。

昨天我们分享了由ACM SIGMM China Chapter准备在10月17号在北京举行ACM MM 2017 Pre-Conference,欢迎查看参加!

【学术盛宴 】多媒体顶级会议ACM Multimedia 2017 China Pre-conference论文宣讲研讨会

今天我们分享ACM Multimedia 2017 文章,欢迎查看!

大会网址:http://www.acmmm.org/2017/

Exploring Outliers in Crowdsourced Ranking for QoEQianqian Xu , Institute of Information Engineering of Chinese Academy of Sciences); Ming Yan ; Chendi Huang ; Jiechao Xiong ; Qingming Huang ; Yuan Yao

Towards forward-looking online bitrate adaptation for DASHBo Wang ; Fengyuan Ren

Weighted Sparse Representation Regularized Graph Learning for RGB-T Object TrackingChenglong Li ; Nan Zhao ; Yijuan Lu ; Chengli Zhu ; Jin Tang

Multi-Scale Cascade Network for Salient Object DetectionXin Li ; Fan Yang ; Hong Cheng ; Junyu Chen ; Yuxiao Guo ; Leiting Chen

3D CNNs on Distance Matrices for Human Action RecognitionAlejandro José Hernández Ruiz ; Lorenzo Porzi ; Samuel Rota Bulò ; Francesc Moreno Noguer

16K Cinematic VR StreamingPatrice Rondao Alface ; Maarten Aerts ; Donny Tytgat ; Sammy Lievens ; Christoph Stevens ; Nico Verzijp ; Jean-François Macq

Sync-DRAW: Automatic Video Generation using Deep Recurrent Attentive ArchitecturesGaurav Mittal ; Tanya Marwah ; Vineeth N Balasubramanian

On Server Provisioning for Cloud GamingYusen Li ; Yunhua Deng ; Xueyan Tang ; Wentong Cai ; Xiaoguang Liu ; Gang Wang

Region-based Image Retrieval Revisited by Semantic Region Specification and Spatial Relationship RecommendationRyota Hinami ; Yusuke Matsui ; Shin'Ichi Satoh

Enhancing Micro-video Understanding by Harnessing External SoundsLiqiang Nie ; Xiang Wang ; Jianglong Zhang ; Xiangnan He ; Hanwang Zhang ; Richang Hong ; Qi Tian

Semi-Relaxtion Supervised Hashing for Cross-Modal RetrievalPeng-Fei Zhang ; Chuan-Xiang Li ; Meng-Yuan Liu ; Liqiang Nie ; Xin-Shun Xu

Sketch Recognition with Deep Visual-Sequential Fusion ModelJun-Yan He ; Xiao Wu ; Yu-Gang Jiang ; Bo Zhao ; Qiang Peng

From Part to Whole: Who is Behind the Painting?Daiqian Ma ; Feng Gao ; Yan Bai ; Yihang Lou ; Shiqi Wang ; Tiejun Huang ; Ling-Yu Duan

Adversarial Cross-Modal RetrievalBokun Wang ; Yang Yang ; Xing Xu ; Alan Hanjalic ; Heng Tao Shen

Catching the Temporal Regions-of-Interest for Video CaptioningZiwei Yang ; Yahong Han ; Zheng Wang

Image quality assessment for DIBR synthesized views using elastic metricSuiyi Ling ; Patrick Le Callet

What your Facebook Profile Picture Reveals about your PersonalityCristina Segalin ; Fabio Celli ; Luca Polonio ; David Stillwell ; Michal Kosinski ; Nicu Sebe ; Marco Cristani ; Bruno Lepri

Deep Asymmetric Pairwise HashingXin Gao ; Fumin Shen ; Li Liu ; Yang Yang ; Heng Tao Shen

Real-time Monocular Dense Mapping for Augmented RealityTangli Xue ; Hongcheng Luo ; Zikang Yuan ; Xin Yang

Learning Object-Centric Transformation for Video PredictionXiongtao Chen ; Wenmin Wang ; Jinzhuo Wang ; Weimian Li

Capturing spatial and temporal patterns for distinguishing between posed and spontaneous expressionsJiajia Yang ; Shangfei Wang

Deep Low-rank Sparse Collective Factorization for Cross-Domain RecommendationShuhui Jiang ; Zhengming Ding ; Yun Fu

Detecting Temporal Proposal for Action Localization with Tree-structured Search PolicyXinyang Jiang ; Siliang Tang ; Yang Yang ; Zhou Zhao ; Fei Wu ; Yueting Zhuang

Fluency-Guided Cross-Lingual Image CaptioningWeiyu Lan ; Xirong Li ; Jianfeng Dong

Learning Non-local Image Diffusion for Image DenoisingPeng Qiao ; Yong Dou ; Wensen Feng ; Yunjin Chen

An Image-based Deep Spectrum Feature Representation for the Recognition of Emotional SpeechNicholas Cummins ; Shahin Amiriparian ; Gerhard Hagerer ; Anton Batliner ; Stefan Steidl ; Björn Schuller

FastShrinkage: Perceptually-aware Retargeting Toward Mobile PlatformsZhenguang Liu ; Luming Zhang ; Rajiv Ratn ; Yi Yang ; Xuelong Li

QUETRA: A Queuing Theory Approach to DASH Rate AdaptationPraveen Kumar Yadav ; Arash Shafiei ; Wei Tsang Ooi

ElasticPlay: Responsive Video Summarization with Dynamic Time BudgetHaojian Jin ; Yale Song ; Koji Yatani

Learning Fashion Compatibility with Bidirectional LSTMsXintong Han ; Zuxuan Wu ; Yu-Gang Jiang ; Larry Davis

H-TIME: Haptic-enabled Tele-Immersive Musculoskeletal ExaminationYuan Tian ; Suraj Raghuraman ; Thiru Annaswamy ; Aleksander Borresen ; Klara Nahrstedt ; Balakrishnan Prabhakaran

Two-Stream Attentive CNNs for Image RetrievalFei Yang ; Jia Li ; Shikui Wei ; Qinjie Zheng ; Ting Liu ; Yao Zhao

Magic-wall: Visualizing Room DecorationTing Liu ; Yunchao Wei ; Yao Zhao ; Si Liu ; Shikui Wei

Automatic Music Video Generation Based on Simultaneous Soundtrack Recommendation and Video EditingJen-Chun Lin ; Wen-Li Wei ; James Yang ; Hsin-Min Wang ; Hong-Yuan Mark Liao

DeepArt: Learning Joint Representations of Visual ArtsHui Mao ; Ming Cheung ; James She

Automatic Generation of Lyrics ParodiesLorenzo Gatti ; Gözde Özbal ; Oliviero Stock ; Carlo Strapparava

Anti-camera LED LightingXiao Shu ; Xiaolin Wu ; Qifan Gao

Mr.MAPP: Mixed Reality for MAnaging Phantom PainKanchan Bahirat ; Thiru Annaswamy ; Balakrishnan Prabhakaran

Video Captioning with Guidance of Multimodal Latent TopicsShizhe Chen ; Jia Chen ; Qin Jin ; Alexander Hauptmann

Where are the sweet spots? A systematic approach to reproducible DASH Player comparisonsDenny Stohr ; Alexander Frömmgen ; Amr Rizk ; Michael Zink ; Ralf Steinmetz ; Wolfgang Effelsberg

Cross-modal Recipe Retrieval with Rich Food AttributesJingjing Chen ; Chong-Wah Ngo ; Tat-Seng Chua

Integrated Face Analytics Networks through Cross-Dataset Hybrid TrainingJianshu Li ; Shengtao Xiao ; Fang Zhao ; Jian Zhao ; Jianan Li ; Jiashi Feng ; Shuicheng Yan ; Terence Sim

Vocktail: A Virtual Cocktail for Pairing Digital Taste, Smell, and Color SensationsNimesha Ranasinghe ; Thi Ngoc Tram Nguyen ; Yan Liangkun ; Lien-Ya Lin ; David Tolley ; Ellen Yi-Luen Do

Hashtag-centric Immersive Search on Social MediaYuqi Gao ; Jitao Sang ; Tongwei Ren ; Changsheng Xu

Affect Recognition in Ads with Application to Computational AdvertisingAbhinav Shukla ; Shruti Gullapuram ; Harish Katti ; Narasimha Karthik Yadati ; Mohan Kankanhalli ; Ramanathan Subramanian , University of Illinois at Urbana-Champaign)

Exploring the use of Time-Dependent Cross-Network Information for Personalized RecommendationsDilruk Perera ; Roger Zimmermann

Learning Multimodal Attention LSTM Networks for Video Captioning}Jun Xu ; Ting Yao ; Yongdong Zhang ; Tao Mei

Spatio-Temporal AutoEncoder for Video Anomaly DetectionYiru Zhao ; Bing Deng ; Chen Shen ; Yao Liu ; Hongtao Lu ; Xian-Sheng Hua

Deep Siamese Network with Multi-level Similarity Perception for Person Re-identificationChen Shen ; Zhongming Jin ; Yiru Zhao ; Zhihang Fu ; Rongxin Jiang ; Yaowu Chen ; Xian-Sheng Hua

Fashion World Map: Understanding Cities Through Streetwear FashionYu-Ting Chang ; Wen-Huang Cheng ; Bo Wu ; Kai-Lung Hua

Automatic Adjustment of Stereoscopic Content for Long-Range Projections in Outdoor AreasBehnam Maneshgar ; Leila Sujir ; Sudhir Mudur ; Charalambos Poullis

SketchParse: Towards Rich Descriptions For Poorly Drawn Sketches Using Multi-Task Deep NetworksRavi Kiran Sarvadevabhatla ; Isht Dwivedi ; Abhijat Biswas ; Sahil Manocha ; Venkatesh Babu R.

Place-centric Visual Urban Perception with Hierarchical Deep Multi-instance RegressionXiaobai Liu ; Qi Chen ; Yuanlu Xu ; Lei Zhu ; Xuming He

A Delicious Recipe Analysis Framework for Exploring Multi-Modal Recipes with Various AttributesWeiqing Min ; Shuqiang Jiang ; Shuhui Wang ; Jitao Sang ; Shuhuan Mei

Temporal Binary Coding for Large-Scale Video SearchKe Xia ; Yuqing Ma ; Xianglong Liu ; Yadong Mu ; Li Liu

Learning to Compose with Professional Photographs on the WebYi-Ling Chen ; Jan Klopp ; Min Sun ; Shao-Yi Chien ; Kwan-Liu Ma

StructCap: Structured Semantic Embedding for Image CaptioningFuhai Chen ; Rongrong Ji ; Jinsong Su ; Yongjian Wu ; Yunsheng Wu

Unconstrained Fashion Landmark DetectionSijie Yan ; Ziwei Liu ; Ping Luo ; Xiaogang Wang ; Xiaoou Tang

Skeleton-aided Articulated Motion GenerationYichao Yan ; Jingwei Xu ; Bingbing Ni ; Wendong Zhang ; Xiaokang Yang

One-Shot Fine-Grained Instance RetrievalHantao Yao , Chinese Academy of Sciences; University of Chinese Academy of Sciences); Shiliang Zhang ; Yongdong Zhang , Chinese Academy of Sciences); Jintao Li , Chinese Academy of Sciences); Qi Tian

GLAD: Global-Local-Alignment Descriptor for Pedestrian RetrievalLonghui Wei ; Shiliang Zhang ; Hantao Yao ; Wen Gao ; Qi Tian

Deep progressive hashing for image retrievalJiale Bai ; Bingbing Ni ; Minsi Wang ; Hanjiang Lai ; Yang Shen ; Lin Mei ; Chongyang Zhang ; Chuanping Hu

FaceCollage: A Rapidly Deployable System for Real-time Head Reconstruction for On-The-Go 3D TelepresenceFuwen Tan ; Chi-Wing Fu ; Jianfei Cai ; Teng Deng ; Tat-Jen Cham

Protest Activity Detection and Perceived Violence Estimation from Social Media ImagesDonghyeon Won ; Zachary Steinert-Threlkeld ; Jungseock Joo

LiveJack: Integrating CDNs and Edge Clouds for Live Content BroadcastingBo Yan ; Shu Shi ; Yong Liu ; Weizhe Yuan ; Haoqin He ; Rittwik Jana ; Yang Xu ; H. Jonathan Chao

Modeling the Intransitive Pairwise Image Preference from Multiple AnglesJun Chen ; Chaokun Wang ; Jianmin Wang

Fast Deep Matting for Portrait Animation on Mobile PhoneBingke Zhu ; Yingying Chen ; Si Liu ; Bo Zhang ; Jinqiao Wang ; Ming Tang

Pedestrian Path Forecasting in Crowd: A Deep Spatio-Temporal PerspectiveYuke Li

ReGLe: Spatially Regularized Graph Learning for Visual TrackingChenglong Li ; Xiaohao Wu ; Zhimin Bao ; Jin Tang

360ProbDASH: Improving QoE of 360 Video Streaming Using Tile-based HTTP Adaptive StreamingLan Xie ; Zhimin Xu ; Yixuan Ban ; Xinggong Zhang ; Zongming Guo

Deep Unsupervised Convolutional Domain AdaptationJunbao Zhuo , Institute of Computing Technology, Chinese Academy of Sciences, Beijing, 100190, China); Shuhui Wang , Institute of Computing Technology, Chinese Academy of Sciences, Beijing, 100190, China); Weigang Zhang ); Qingming Huang

PD-Survey - Supporting Audience-Centric Research through Surveys on Public Display NetworksFlorian Alt

Improving Event Extraction via Cross-Modal IntegrationTongtao Zhang ; Spencer Whitehead ; Hanwang Zhang ; Hongzhi Li ; Joseph Ellis ; Lifu Huang ; Wei Liu ; Heng Ji ; Shih-Fu Chang

Indefinite Kernel Logistic RegressionFanghui Liu ; Xiaolin Huang ; Jie Yang

Multimodal Learning for Web Information ExtractionDihong Gong ; Daisy Wang ; Yang Peng

Query-adaptive Video Summarization via Quality-aware Relevance EstimationArun Balajee Vasudevan ; Michael Gygli ; Anna Volokitin ; Luc Van Gool

Predicting Human Intentions from Motion Cues Only: A 2D+3D Fusion ApproachAndrea Zunino ; Jacopo Cavazza ; Atesh Koul ; Andrea Cavallo ; Cristina Becchio ; Vittorio Murino

RGB-D Scene Recognition with Object-to-Object RelationXinhang Song ); Chengpeng Chen ); Shuqiang Jiang )

Exploring Consistent Preferences: Discrete Hashing with Pair-Exemplar for Scalable Landmark SearchLei Zhu ; Zi Huang ; Xiaojun Chang ; Jingkuan Song ; Heng Tao Shen

Data Generation for Improving Person Re-identificationLin Chen ; Hua Yang ; Shuang Wu ; Zhiyong Gao

Fast and Accurate Pedestrian Detection using Dual-Stage Group Cost-Sensitive RealBoost with Vector Form FiltersChengju Zhou ; Meiqing Wu ; Siew-Kei Lam

Positive and Unlabeled Learning for Anomaly Detection with Multi-featuresJiaqi Zhang ; Zhenzhen Wang ; Junsong Yuan ; Yap Peng Tan

Learning Visual Emotion Distributions via Multi-Modal Features FusionSicheng Zhao ; Guiguang Ding ; Yue Gao ; Jungong Han

ShareRender: Bypass GPU Virtualization to Enable Fine-grained Resource Sharing for Cloud GamingWei Zhang ; Xiaofei Liao ; Hai Jin ; Peng Li ; Li Lin

Vivepaper: Augmented Reality Virtual Book for Immersive Reading ExperienceZhongyang Zheng ; Bo Wang ; Yakun Wang ; Catherine Yang ; Zhongqian Dong ; Tianyang Yi ; Cyrus Choi ; Edward Chang

Online Cross-Modal Scene Retrieval by Binary Representation and Semantic GraphMengshi Qi ; Yunhong Wang ; Annan Li

NeuroStylist: Neural Compatibility Modeling for Clothing MatchingXuemeng Song ; Fuli Feng ; Jinhuan Liu ; Zekun Li ; Liqiang Nie ; Jun Ma

Sports VR Content Generation from Regular Camera FeedsKiana Calagari ; Mohamed Elgharib ; Mohamed Hefeeda ; Shervin Shirmohammadi

Two Birds One Stone: On both Cold-Start and Long-Tail RecommendationJingjing Li ; Ke Lu ; Zi Huang ; Heng Tao Shen

Multi-Networks Joint Learning for Large-Scale Cross-Modal RetrievalLiang Zhang ; Bingpeng Ma ; Guorong Li ; Qingming Huang ; Qi Tian

Salient Object Detection with Chained Multi-Scale Fully Convolutional NetworkYoubao Tang ; Xiangqian Wu

Fine-grained Discriminative Localization via Saliency-guided Faster R-CNNXiangteng He ; Yuxin Peng ; Junjie Zhao

Exploiting High-Level Semantics for No-Reference Image Quality Assessment of Realistic Blur ImagesDingquan Li ; Tingting Jiang ; Ming Jiang

Learning to Recognise Unseen Classes by A Few SimilesYang Long ; Ling Shao

Deep Cross-Modality Alignment for Multi-Shot Person Re-IDentificationZhichao Song ; Bingbing Ni ; Yichao Yan ; Zhe Ren ; Yi Xu ; Xiaokang Yang

Hierarchical Recurrent Neural Network for Video SummarizationBin Zhao ; Xuelong Li ; Xiaoqiang Lu

A simplified topological representation of text for local and global contextIshrat Rahman Sami ; Katayoun Farrahi

Improved Multimodal Representation Learning with Skip ConnectionsNing Zhang ; Yu Cao ; Yan Luo ; Benyuan Liu

Modeling Image Virality with Pairwise Spatial Transformer NetsAbhimanyu Dubey ; Sumeet Agarwal

Metric-based Generative Adversarial NetworkGuoxian Dai ; Jin Xie ; Yi Fang

More Than An Answer: Neural Pivot Network for Visual Question AnsweringYiyi Zhou ; Rongrong Ji ; Jinsong Su ; Yongjian Wu ; Yunsheng Wu

Photo2Trip: Exploiting Visual Contents in Geo-tagged Photos for Personalized Tour RecommendationPengpeng Zhao ; Xiefeng Xu ; Yanchi Liu ; Victor S. Sheng ; Kai Zheng ; Hui Xiong

Deep Active Learning Through Cognitive Information ParcelsWencang Zhao ; Yu Kong ; Zhengming Ding ; Shangqian Gao ; Yun Fu

A Paralinguistic Approach To Holistic Speaker Diarisation -- Using Age, Gender, Voice Likability and Personality TraitsYue Zhang ; William McGehee ; Maximilian Schmitt ; Florian Eyben ; Björn Schuller

OpTile: Toward Optimal Tiling in 360-degree Video StreamingMengbai Xiao ; Chao Zhou ; Yao Liu ; Songqing Chen

3DensiNet: A Robust Neural Network Architecture Towards 3D Volumetric Object Prediction From 2D ImageMeng Wang ; Lingjing Wang ; Yi Fang

Towards Micro-video Understanding by Joint Sequential-Sparse ModelingMeng Liu ; Liqiang Nie ; Meng Wang ; Baoquan Chen

LEAF: Latent Extended Attribute Features Discovery for Visual ClassificationHua Zhang ; Rui Wang ; Changqing Zhang ; Xiaochun Cao

Single Shot Temporal Action DetectionTianwei Lin ; Xu Zhao ; Zheng Shou

Too Many Pixels to Perceive: Subpixel Shutoff for Display Energy Reduction on OLED SmartphonesZhisheng Yan ; Chang Wen Chen

Finding the Secret of CNN Parameter Layout under Strict Size ConstraintLiao Lixin ; Yao Zhao ; Shikui Wei ; Wang Jingdong ; Liu Ruoyu

It’s All Around You: Exploring 360° Video Viewing Experiences on Mobile DevicesMarc van den Broeck ; Fahim Kawsar ; Johannes Schöning

Visualization of Stone Trajectories in Live Curling Broadcasts using Online Machine LearningMasaki Takahashi ); Shinsuke Yokozawa ); Hideki Mitsumine ); Tomoyuki Mishina ); Yasuyuki Matsuhisa ); Sawako Muramatsu )

Exploring Domain Knowledge for Affective Video Content AnalysesTanfang Chen ; Yaxin Wang ; Shangfei Wang ; Shiyu Chen

Deep Temporal Models using Identity Skip-Connections for Speech Emotion RecognitionJaebok Kim ; Gwenn Englebienne ; Khiet Truong ; Vanessa Evers

Video Description with Spatial-Temporal AttentionYunbin Tu ; Xishan Zhang ; Bingtao Liu ; Chenggang Yan

Deep Binary Reconstruction for Cross-modal HashingXuelong Li ; Di Hu ; Feiping Nie

Pedestrian Detection via Bi-directional Multi-scale AnalysisZhenyu Duan ; Jinpeng Lan ; Yi Xu ; Bingbing Ni ; Lixue Zhuang ; Xiaokang Yang

Rethinking HTTP Adaptive Streaming with the Mobile User PerceptionChao Wu ; Wenwu Zhu ; Qiushi Li ; Yaoxue Zhang

Fine-grained Recognition via Attribute-guided Attentive Feature AggregationYichao Yan ; Bingbing Ni ; Xiaokang Yang

NormFace: $L_2$ HyperSphere Embedding for Face VerificationFeng Wang ; Xiang Xiang ; Jian Cheng ; Alan Yuille

Semi-Dense Depth Interpolation using Deep Convolutional Neural NetworksIlya Makarov ; Vladimir Aliev ; Olga Gerasimova

Occlusion-aware Video Temporal ConsistencyChun-Han Yao ; Chia-Yang Chang ; Shao-Yi Chien

Video Question Answering via Hierarchical Dual-Level Attention Network LearningZhou Zhao ; Jinghao Lin ; Xinghua Jiang ; Deng Cai ; Xiaofei He ; Yueting Zhuang

Region-based Activity Recognition Using Conditional GANXinyu Li ; Yanyi Zhang ; Jianyu Zhang ; Yueyang Chen ; Huangcan Li ; Ivan Marsic ; Randall Burd

Learning a Target Sample Re-Generator for Cross-Database Micro-Expression RecognitionYuan Zong ; Xiaohua Huang ; Wenming Zheng ; Zhen Cui ; Guoying Zhao

REQUEST: Seamless Dynamic Adaptive Streaming over HTTP for Multi-Homed Smartphone under Resource ConstraintsJonghoe Koo ; Juheon Yi ; Joongheon Kim ; Mohammad A. Hoque ; Sunghyun Choi

Cross-media retrieval by learning rich semantic embeddings of multimediaMengdi Fan ; Wenmin Wang ; Peilei Dong ; Liang Han ; Ronggang Wang ; Ge Li

Optimal Set of 360-Degree Videos for Viewport-Adaptive StreamingXavier Corbillon ; Gwendal Simon ; Alisa Devlic ; Jacob Chakareski

WebRTC Congestion Control using Forward-Error CorrectionBalázs Kreith ; Varun Singh ; Jörg Ott

Visual Sentiment Analysis for Review Images with Item-Oriented and User-Oriented CNNQuoc-Tuan Truong ; Hady Lauw

From Multimedia Logs to Personal ChroniclesHyungik Oh ; Ramesh Jain

Experimental Analysis of Bandwidth Allocation in Automated Video Surveillance SystemsSina Gholamnejad Davani ; Nabil Sarhan

Mutually Guided Image FilteringXiaojie Guo ; Yu Li ; Jiayi Ma

Learning Semantic Feature Map for Visual Content RecognitionRui-Wei Zhao ; Zuxuan Wu ; Jianguo Li ; Yu-Gang Jiang

Video Visual Relation DetectionXindi Shang ; Tongwei Ren ; Jingfan Guo ; Hanwang Zhang ; Tat-Seng Chua

Deep Location-Specific TrackingLingxiao Yang ; Risheng Liu ; David Zhang ; Lei Zhang

A Multi-Task Framework for Weather RecognitionZhigang Wang ; Xuelong Li ; Xiaoqiang Lu

From Hard to Soft: Towards more Human-like Emotion Recognition by Modelling the Perception UncertaintyJing Han ; Zixing Zhang ; Maximilian Schmitt ; Maja Pantic ; Björn Schuller

When Cloud Meets Uncertain Crowd: An Auction Approach for Crowdsourced Livecast TranscodingYifei Zhu ; Jiangchuan Liu ; Zhi Wang ; Cong Zhang

Multimedia Semantic Integrity Assessment Using Joint Embedding Of Images and TextAyush Jaiswal ; Ekraam Sabir ; Wael Abd-Almageed ; Prem Natarajan

Discriminative Training of Complex-valued Deep Recurrent Neural Network for Singing Voice SeparationYuan-Shan Lee ); Kuo Yu ); Sih-Huei Chen ); Jia-Ching Wang

Multicamera Summarization of Rehabilitation Sessions in Home EnvironmentTarek Elgamal ; Klara Nahrstedt

Adaptive Low-Rank Multi-Label Active Learning for Image ClassificationJian Wu ; Anqian Guo ; Victor S. Sheng ; Pengpeng Zhao ; Zhiming Cui

Modeling the Resource Requirements of Convolutional Neural Networks on Mobile DevicesZongqing Lu ; Swati Rallapalli ; Kevin Chan ; Thomas La Porta

Adaptively Attending to Visual Attributes and Linguistic Knowledge for CaptioningYi Bin ; Yang Yang ; Jie Zhou ; Zi Huang ; Heng Tao Shen

Efficient Binary Coding for Subspace-based Query-by-Image Video RetrievalRuicong Xu ; Yang Yang ; Fumin Shen ; Ning Xie ; Heng Tao Shen

Adaptive Audio Classification for Smartphone in Noisy Car EnvironmentMyounggyu Won ; Haitham Alsaadan ; Yongsoon Eun

Real-Time False-Contours Removal for Inverse Tone Mapped High Dynamic Range Content using Projection Onto Convex Sets theoryGonzalo Luzardo ; Jan Aelterman ; Hiep Luong ; Wilfried Philips ; Daniel Ochoa

Incremental accelerated kernel discriminant analysisNikolaos Gkalelis ; Vasileios Mezaris

Venues in Social Media: Examining Ambiance Perception Through Scene SemanticsYassir Benkhedda ; Darshan Santani ; Daniel Gatica-Perez

Pseudo label based Unsupervised Deep discriminative Hashing for image retrievalQinghao Hu ; Jiaxiang Wu ; Jian Cheng ; Hanqing Lu

Moving as a Leader: Detecting Emergent Leadership in Small Groups using Body PoseCigdem Beyan ; Vasiliki-Maria Katsageorgiou ; Vittorio Murino

A Novel System for Visual Navigation of Educational Videos Using Multimodal CuesBaoquan Zhao ; Xiaonan Luo ; Shujin Lin ; Songhua Xu ; Ruomei Wang

#VisualHashtags: Visual Summarization of Social Media Events Using Mid-Level Visual ElementsSonal Goel ; Sarthak Ahuja ; A V Subramanyam ; Ponnurangam Kumaraguru

Mulit-scale Context based Attention for Dynamic Music Emotion PredictionYe Ma ; Xinxing Li ; Mingxing Xu ; Lianhong Cai

Outlining objects for interactive segmentation on touch devicesMatthieu Pizenberg ; Axel Carlier ; Emmanuel Faure ; Vincent Charvillat

Deep Matching and Validation Network: An End-to-End Solution to Constrained Image Splicing Localization and DetectionYue Wu ; Wael Abdalmageed ; Prem Natarajan

Multi-modal localization and enhancement of multiple sound sources from a micro aerial vehicleRicardo Sanchez-Matilla ; Lin Wang ; Andrea Cavallaro

Temporally Selective Attention Model for Social and Affective State Recognition in Multimedia ContentHongliang Yu ; Liangke Gui ; Michael Madaio ; Amy Ogan ; Justine Cassell ; Louis-Philippe Morency

Adaptive 360-Degree Video Streaming using Scalable Video CodingAfshin Taghavi Nasrabadi ; Anahita Mahzari ; Joseph D. Beshay ; Ravi Prakash

Deep Supervised Quantization by Self-Organized MapMin Wang ; Wengang Zhou ; Qi Tian ; Junfu Pu ; Houqiang Li

Selective Deep Convolutional Features for Image RetrievalTuan Hoang Nguyen Anh ; Thanh-Toan Do ; Dang-Khoa Le Tan ; Ngai-Man Cheung

Quality-of-Experience of Adaptive Video Streaming: Exploring the Space of AdaptationsZhengfang Duanmu ; Kede Ma ; Zhou Wang

Statistical Inference of Gaussian-Laplace Distribution for Person VerificationZheng Wang ; Ruimin Hu ; Yi Yu ; Junjun Jiang ; Jiayi Ma ; Shin'Ichi Satoh

Beyond Human-level License Plate Super-resolution with Progressive Vehicle Search and Domain Priori GANWu Liu ; Xinchen Liu ; Huadong Ma ; Peng Cheng

Learning to Generate and Edit HairstylesWeidong Yin ; Yanwei Fu ; Yiqing Ma ; Yugang Jiang ; Tao Xiang ; Xiangyang Xue

Adaptively Weighted Multi-task Deep Network for Person Attribute ClassificationKeke He ; Zhanxiong Wang ; Yanwei Fu ; Yu-Gang Jiang ; Rui Feng ; Xiangyang Xue

Laplacian-Steered Neural Style TransferShaohua Li ; Xinxing Xu ; Liqiang Nie ; Tat-Seng Chua

Video Question Answering via Gradually Refined Attention over Appearance and MotionDejing Xu ; Zhou Zhao ; Jun Xiao ; Fei Wu ; Hanwang Zhang ; Xiangnan He ; Yueting Zhuang

Cross-Domain Image Retrieval with Attention ModelingXin Ji ; Wei Wang ; Meihui Zhang ; Yang Yang

PQk-means: Billion-scale Clustering for Product-quantized CodesYusuke Matsui ; Keisuke Ogaki ; Toshihiko Yamasaki ; Kiyoharu Aizawa

Face Aging with Contextural Generative Adversarial NetsSi Liu ; Yao Sun ; Wei Wang ; Renda Bao ; Defa Zhu ; Shuicheng Yan

Attention Transfer from Web Images for Video RecognitionJunnan Li ; Yongkang Wong ; Qi Zhao ; Mohan Kankanhalli

A Unified Personalized Video Recommendation via Dynamic Recurrent Neural NetworksJunyu Gao ; Tianzhu Zhang ; Changsheng Xu

Is Foveated Rendering Perceivable to VR Users? A Study on the Efficiency and Consistency of Subjective Assessment MethodsChih-Fan Hsu ; Anthony Chen ; Cheng-Hsin Hsu ; Chun-Ying Huang ; Chin-Laung Lei ; Kuan-Ta Chen

Wheel: Accelerating CNNs with Distributed GPUs via Hybrid Parallelism and Alternate StrategyXiaoyu Du ; Jinhui Tang ; Zechao Li ; Zhiguang Qin

Multiview and Multimodal Pervasive Indoor LocalizationZhenguang Liu ; Li Cheng ; Anan Liu ; Luming Zhang ; Xiangnan He ; Roger Zimmermann

Future-Supervised Retrieval of Unseen Queries for Live VideoSpencer Cappallo ; Cees Snoek

Deep Attribute-preserving Metric Learning for Natural Language Object RetrievalJianan Li ; Yunchao Wei ; Xiaodan Liang ; Fang Zhao ; Jianshu Li ; Tingfa Xu ; Jiashi Feng

Understanding Fashion Trends from Street Photos via Neighbor-Constrained Embedding LearningXiaoling Gu ; Yongkang Wong ; Pai Peng ; Lidan Shou ; Gang Chen ; Mohan S. Kankanhalli

Multi-Modal Knowledge Representation Learning via Webly-Supervised Relationships MiningFudong Nian ; Bingkun Bao ; Teng Li ; Changsheng Xu

The Role of Visual Attention in Sentiment PredictionShaojing Fan ; Ming Jiang ; Zhiqi Shen ; Bryan Koenig ; Mohan Kankanhalli ; Qi Zhao

Searching Personal Photos on the Phone with Instant Visual Query Suggestion and Joint Text-Image HashingZhaoyang Zeng ; Jianlong Fu ; Hongyang Chao ; Tao Mei

Robust Visual Object Tracking with Top-down ReasoningMengdan Zhang ; Jiashi Feng ; Weiming Hu

Stylized Adversarial Autoencoder for Image GenerationYiru Zhao ; Bing Deng ; Jianqiang Huang ; Hongtao Lu ; Xian-Sheng Hua

An HTTP/2-Based Adaptive Streaming Framework for 360 Virtual Reality VideosStefano Petrangeli ; Viswanathan Swaminathan ; Mohammad Hosseini ; Filip De Turck

Multimodal Fusion with Recurrent Neural Networks for Rumor Detection on MicroblogsZhiwei Jin ; Han Guo ; Juan Cao ; Yongdong Zhang ; Jiebo Luo

A Dual-Network Progressive Approach to Weakly Supervised Object DetectionXuanyi Dong ; Deyu Meng ; Fan Ma ; Yi Yang

获取更多关于ACM Multimedia 2017的资料,请登录www.zhuanzhi.ai, 搜索 ACM Multimedia 查看!

欢迎转发分享到微信群和朋友圈!

原文发布于微信公众号 - 专知(Quan_Zhuanzhi)

原文发表时间:2017-10-17

本文参与腾讯云自媒体分享计划,欢迎正在阅读的你也加入,一起分享。

发表于

我来说两句

0 条评论
登录 后参与评论

相关文章

来自专栏专知

AAAI2019论文抢鲜看!48篇自然语言处理/计算机视觉/机器学习最新接受论文!

【导读】2019人工智能开年顶级会议AAAI的录取结果已出,投稿数量高达7745篇,录取率仅为16.2%。南京大学教授周志华和密歇根大学教授 Pascal Va...

2.5K30
来自专栏专知

机器学习领域顶会ICML 2018 接受论文列表

34730
来自专栏专知

【论文推荐】最新六篇视觉问答相关论文—鲁棒性分析、虚拟意象、双曲注意力网络、R-VQA、关系推理、双线性注意力网络

【导读】专知内容组为大家推出最新六篇视觉问答(Visual Question Answering, VQA)相关论文,欢迎查看!

16440
来自专栏专知

【ICLR 2018】Google 研究盘点,76篇论文抢先看

31220
来自专栏机器之心

ACL 2017接受了哪些论文?这份可视化分析让你轻松看懂(附论文列表)

选自ACL 2017 机器之心报道 参与:蒋思源 国际计算语言学协会 (ACL,The Association for Computational Lingui...

39390
来自专栏腾讯高校合作

【犀牛鸟学问】ACL2017论文报告会

近日,自然语言处理领域国际最权威的学术会议ACL 2017公布了录用论文。为了促进国内自然语言处理相关研究的发展以及研究者之间的交流,中国中文信息学会青年工作委...

31150
来自专栏专知

【最新】机器学习顶会 NIPS 2017 Pre-Proceedings 论文列表(附pdf下载链接)

【导读】机器学习领域顶尖学术会议——神经信息处理系统进展大会(Advances in NeuralInformation Processing Systems,...

59790
来自专栏数据派THU

自然语言处理领域重要研究及资源全索引!

来源:机器之心 作者:Kyubyong Park 本文长度为3071字,建议阅读6分钟 本文为你整理自然语言处理最新深度研究成果。 自然语言处理(NLP)是人工...

29790
来自专栏专知

SIGIR 2018 信息检索领域顶级学术会议接受论文列表

59030
来自专栏专知

【论文推荐】最新6篇推荐系统(Recommendation System)相关论文—深度、注意力、安全、可解释性、评论、自编码器

【导读】专知内容组整理了最近六篇推荐系统(Recommendation System)相关文章,为大家进行介绍,欢迎查看! 1. DKN: Deep Knowl...

1.6K60

扫码关注云+社区

领取腾讯云代金券