• Skip to main content
  • Skip to primary sidebar
AAAI

AAAI

Association for the Advancement of Artificial Intelligence

    • AAAI

      AAAI

      Association for the Advancement of Artificial Intelligence

  • About AAAIAbout AAAI
    • News
    • Officers and Committees
    • Staff
    • Bylaws
    • Awards
      • Fellows Program
      • Classic Paper Award
      • Dissertation Award
      • Distinguished Service Award
      • Allen Newell Award
      • Outstanding Paper Award
      • AI for Humanity Award
      • Feigenbaum Prize
      • Patrick Henry Winston Outstanding Educator Award
      • Engelmore Award
      • AAAI ISEF Awards
      • Senior Member Status
      • Conference Awards
    • Partnerships
    • Resources
    • Mailing Lists
    • Past Presidential Addresses
    • AAAI 2025 Presidential Panel on the Future of AI Research
    • Presidential Panel on Long-Term AI Futures
    • Past Policy Reports
      • The Role of Intelligent Systems in the National Information Infrastructure (1995)
      • A Report to ARPA on Twenty-First Century Intelligent Systems (1994)
    • Logos
  • aaai-icon_ethics-diversity-line-yellowEthics & Diversity
  • Conference talk bubbleConferences & Symposia
    • AAAI Conference
    • AIES AAAI/ACM
    • AIIDE
    • EAAI
    • HCOMP
    • IAAI
    • ICWSM
    • Spring Symposia
    • Summer Symposia
    • Fall Symposia
    • Code of Conduct for Conferences and Events
  • PublicationsPublications
    • AI Magazine
    • Conference Proceedings
    • AAAI Publication Policies & Guidelines
    • Request to Reproduce Copyrighted Materials
    • Contribute
    • Order Proceedings
  • aaai-icon_ai-magazine-line-yellowAI Magazine
  • MembershipMembership
    • Member Login
    • Chapters

  • Career CenterAI Jobs
  • aaai-icon_ai-topics-line-yellowAITopics
  • aaai-icon_contact-line-yellowContact

  • Twitter
  • Facebook
  • LinkedIn
Home / Proceedings / Proceedings of the AAAI Conference on Artificial Intelligence, 34 /

Vol. 34 No. 07: AAAI-20 Technical Tracks 7

AAAI Technical Track: Vision

  • Unified Vision-Language Pre-Training for Image Captioning and VQA

    Luowei Zhou, Hamid Palangi, Lei Zhang, Houdong Hu, Jason Corso, Jianfeng Gao

    13041-13049

    PDF
  • Ladder Loss for Coherent Visual-Semantic Embedding

    Mo Zhou, Zhenxing Niu, Le Wang, Zhanning Gao, Qilin Zhang, Gang Hua

    13050-13057

    PDF
  • Generate, Segment, and Refine: Towards Generic Manipulation Segmentation

    Peng Zhou, Bor-Chun Chen, Xintong Han, Mahyar Najibi, Abhinav Shrivastava, Ser-Nam Lim, Larry Davis

    13058-13065

    PDF
  • Motion-Attentive Transition for Zero-Shot Video Object Segmentation

    Tianfei Zhou, Shunzhou Wang, Yi Zhou, Yazhou Yao, Jianwu Li, Ling Shao

    13066-13073

    PDF
  • When AWGN-Based Denoiser Meets Real Noises

    Yuqian Zhou, Jianbo Jiao, Haibin Huang, Yang Wang, Jue Wang, Honghui Shi, Thomas Huang

    13074-13081

    PDF
  • Multi-Type Self-Attention Guided Degraded Saliency Detection

    Ziqi Zhou, Zheng Wang, Huchuan Lu, Song Wang, Meijun Sun

    13082-13089

    PDF
  • Towards Omni-Supervised Face Alignment for Large Scale Unlabeled Videos

    Congcong Zhu, Hao Liu*(corresponding author), Zhenhua Yu, Xuehong Sun

    13090-13097

    PDF
  • FASTER Recurrent Networks for Efficient Video Classification

    Linchao Zhu, Du Tran, Laura Sevilla-Lara, Yi Yang, Matt Feiszli, Heng Wang

    13098-13105

    PDF
  • EEMEFN: Low-Light Image Enhancement via Edge-Enhanced Multi-Exposure Fusion Network

    Minfeng Zhu, Pingbo Pan, Wei Chen, Yi Yang

    13106-13113

    PDF
  • Viewpoint-Aware Loss with Angular Regularization for Person Re-Identification

    Zhihui Zhu, Xinyang Jiang, Feng Zheng, Xiaowei Guo, Feiyue Huang, Xing Sun, Weishi Zheng

    13114-13121

    PDF
  • iFAN: Image-Instance Full Alignment Networks for Adaptive Object Detection

    Chenfan Zhuang, Xintong Han, Weilin Huang, Matthew Scott

    13122-13129

    PDF
  • Learning Attentive Pairwise Interaction for Fine-Grained Classification

    Peiqin Zhuang, Yali Wang, Yu Qiao

    13130-13137

    PDF
  • Single Camera Training for Person Re-Identification

    Tianyu Zhang, Lingxi Xie, Longhui Wei, Yongfei Zhang, Bo Li, Qi Tian

    12878-12885

    PDF
  • Multi-Instance Multi-Label Action Recognition and Localization Based on Spatio-Temporal Pre-Trimming for Untrimmed Videos

    Xiao-Yu Zhang, Haichao Shi, Changsheng Li, Peng Li

    12886-12893

    PDF
  • FACT: Fused Attention for Clothing Transfer with Generative Adversarial Networks

    Yicheng Zhang, Lei Li, Li Song, Rong Xie, Wenjun Zhang

    12894-12901

    PDF
  • Find Objects and Focus on Highlights: Mining Object Semantics for Video Highlight Detection via Graph Neural Networks

    Yingying Zhang, Junyu Gao, Xiaoshan Yang, Chang Liu, Yan Li, Changsheng Xu

    12902-12909

    PDF
  • When Radiology Report Generation Meets Knowledge Graph

    Yixiao Zhang, Xiaosong Wang, Ziyue Xu, Qihang Yu, Alan Yuille, Daguang Xu

    12910-12917

    PDF
  • Exploiting Motion Information from Unlabeled Videos for Static Image Action Recognition

    Yiyi Zhang, Li Niu, Ziqi Pan, Meichao Luo, Jianfu Zhang, Dawei Cheng, Liqing Zhang

    12918-12925

    PDF
  • Adaptive Unimodal Cost Volume Filtering for Deep Stereo Matching

    Youmin Zhang, Yimin Chen, Xiao Bai, Suihanjin Yu, Kun Yu, Zhiwei Li, Kuiyuan Yang

    12926-12934

    PDF
  • Fully Convolutional Network for Consistent Voxel-Wise Correspondence

    Yungeng Zhang, Yuru Pei, Yuke Guo, Gengyu Ma, Tianmin Xu, Hongbin Zha

    12935-12942

    PDF
  • Zero-Shot Sketch-Based Image Retrieval via Graph Convolution Network

    Zhaolong Zhang, Yuejie Zhang, Rui Feng, Tao Zhang, Weiguo Fan

    12943-12950

    PDF
  • JSNet: Joint Instance and Semantic Segmentation of 3D Point Clouds

    Lin Zhao, Wenbing Tao

    12951-12958

    PDF
  • Spherical Criteria for Fast and Accurate 360° Object Detection

    Pengyu Zhao, Ansheng You, Yuanxing Zhang, Jiaying Liu, Kaigui Bian, Yunhai Tong

    12959-12966

    PDF
  • GTNet: Generative Transfer Network for Zero-Shot Object Detection

    Shizhen Zhao, Changxin Gao, Yuanjie Shao, Lerenhan Li, Changqian Yu, Zhong Ji, Nong Sang

    12967-12974

    PDF
  • Multi-Source Distilling Domain Adaptation

    Sicheng Zhao, Guangzhi Wang, Shanghang Zhang, Yang Gu, Yaxian Li, Zhichao Song, Pengfei Xu, Runbo Hu, Hua Chai, Kurt Keutzer

    12975-12983

    PDF
  • MemCap: Memorizing Style Knowledge for Image Captioning

    Wentian Zhao, Xinxiao Wu, Xiaoxun Zhang

    12984-12992

    PDF
  • Distance-IoU Loss: Faster and Better Learning for Bounding Box Regression

    Zhaohui Zheng, Ping Wang, Wei Liu, Jinze Li, Rongguang Ye, Dongwei Ren

    12993-13000

    PDF
  • Random Erasing Data Augmentation

    Zhun Zhong, Liang Zheng, Guoliang Kang, Shaozi Li, Yi Yang

    13001-13008

    PDF
  • Spatial-Temporal Multi-Cue Network for Continuous Sign Language Recognition

    Hao Zhou, Wengang Zhou, Yun Zhou, Houqiang Li

    13009-13016

    PDF
  • Discriminative and Robust Online Learning for Siamese Visual Tracking

    Jinghao Zhou, Peng Wang, Haoyang Sun

    13017-13024

    PDF
  • Deep Domain-Adversarial Image Generation for Domain Generalisation

    Kaiyang Zhou, Yongxin Yang, Timothy Hospedales, Tao Xiang

    13025-13032

    PDF
  • Progressive Bi-C3D Pose Grammar for Human Pose Estimation

    Lu Zhou, Yingying Chen, Jinqiao Wang, Hanqing Lu

    13033-13040

    PDF
  • Pointwise Rotation-Invariant Network with Adaptive Sampling and 3D Spherical Voxel Convolution

    Yang You, Yujing Lou, Qi Liu, Yu-Wing Tai, Lizhuang Ma, Cewu Lu, Weiming Wang

    12717-12724

    PDF
  • Cascading Convolutional Color Constancy

    Huanglin Yu, Ke Chen, Kaiqi Wang, Yanlin Qian, Zhaoxiang Zhang, Kui Jia

    12725-12732

    PDF
  • Region Normalization for Image Inpainting

    Tao Yu, Zongyu Guo, Xin Jin, Shilin Wu, Zhibo Chen, Weiping Li, Zhizheng Zhang, Sen Liu

    12733-12740

    PDF
  • Patchy Image Structure Classification Using Multi-Orientation Region Transform

    Xiaohan Yu, Yang Zhao, Yongsheng Gao, Shengwu Xiong, Xiaohui Yuan

    12741-12748

    PDF
  • Human Synthesis and Scene Compositing

    Mihai Zanfir, Elisabeta Oneata, Alin-Ionut Popa, Andrei Zanfir, Cristian Sminchisescu

    12749-12756

    PDF
  • Realistic Face Reenactment via Self-Supervised Disentangling of Identity and Pose

    Xianfang Zeng, Yusu Pan, Mengmeng Wang, Jiangning Zhang, Yong Liu

    12757-12764

    PDF
  • Reliability Does Matter: An End-to-End Weakly Supervised Semantic Segmentation Approach

    Bingfeng Zhang, Jimin Xiao, Yunchao Wei, Mingjie Sun, Kaizhu Huang

    12765-12772

    PDF
  • Shape-Oriented Convolution Neural Network for Point Cloud Analysis

    Chaoyi Zhang, Yang Song, Lina Yao, Weidong Cai

    12773-12780

    PDF
  • Web-Supervised Network with Softly Update-Drop Training for Fine-Grained Visual Classification

    Chuanyi Zhang, Yazhou Yao, Huafeng Liu, Guo-Sen Xie, Xiangbo Shu, Tianfei Zhou, Zheng Zhang, Fumin Shen, Zhenmin Tang

    12781-12788

    PDF
  • FDN: Feature Decoupling Network for Head Pose Estimation

    Hao Zhang, Mengmeng Wang, Yong Liu, Yi Yuan

    12789-12796

    PDF
  • Rethinking the Image Fusion: A Fast Unified Image Fusion Network based on Proportional Maintenance of Gradient and Intensity

    Hao Zhang, Han Xu, Yang Xiao, Xiaojie Guo, Jiayi Ma

    12797-12804

    PDF
  • Model Watermarking for Image Processing Networks

    Jie Zhang, Dongdong Chen, Jing Liao, Han Fang, Weiming Zhang, Wenbo Zhou, Hao Cui, Nenghai Yu

    12805-12812

    PDF
  • Deep Object Co-Segmentation via Spatial-Semantic Network Modulation

    Kaihua Zhang, Jin Chen, Bo Liu, Qingshan Liu

    12813-12820

    PDF
  • Pixel-Aware Deep Function-Mixture Network for Spectral Super-Resolution

    Lei Zhang, Zhiqiang Lang, Peng Wang, Wei Wei, Shengcai Liao, Ling Shao, Yanning Zhang

    12821-12828

    PDF
  • RIS-GAN: Explore Residual and Illumination with Generative Adversarial Networks for Shadow Removal

    Ling Zhang, Chengjiang Long, Xiaolong Zhang, Chunxia Xiao

    12829-12836

    PDF
  • 3D Crowd Counting via Multi-View Fusion with 3D Gaussian Kernels

    Qi Zhang, Antoni B. Chan

    12837-12844

    PDF
  • Deep Camouflage Images

    Qing Zhang, Gelin Yin, Yongwei Nie, Wei-Shi Zheng

    12845-12852

    PDF
  • AutoRemover: Automatic Object Removal for Autonomous Driving Videos

    Rong Zhang, Wei Li, Peng Wang, Chenye Guan, Jin Fang, Yuhang Song, Jinhui Yu, Baoquan Chen, Weiwei Xu, Ruigang Yang

    12853-12861

    PDF
  • Knowledge Integration Networks for Action Recognition

    Shiwen Zhang, Sheng Guo, Limin Wang, Weilin Huang, Matthew Scott

    12862-12869

    PDF
  • Learning 2D Temporal Adjacent Networks for Moment Localization with Natural Language

    Songyang Zhang, Houwen Peng, Jianlong Fu, Jiebo Luo

    12870-12877

    PDF
  • ZoomNet: Part-Aware Adaptive Zooming Neural Network for 3D Object Detection

    Zhenbo Xu, Wei Zhang, Xiaoqing Ye, Xiao Tan, Wei Yang, Shilei Wen, Errui Ding, Ajin Meng, Liusheng Huang

    12557-12564

    PDF
  • Shape-Aware Organ Segmentation by Predicting Signed Distance Maps

    Yuan Xue, Hui Tang, Zhi Qiao, Guanzhong Gong, Yong Yin, Zhen Qian, Chao Huang, Wei Fan, Xiaolei Huang

    12565-12572

    PDF
  • FAS-Net: Construct Effective Features Adaptively for Multi-Scale Object Detection

    Jiangqiao Yan, Yue Zhang, Zhonghan Chang, Tengfei Zhang, Menglong Yan, Wenhui Diao, Hongqi Wang, Xian Sun

    12573-12580

    PDF
  • Gated Convolutional Networks with Hybrid Connectivity for Image Classification

    Chuanguang Yang, Zhulin An, Hui Zhu, Xiaolong Hu, Kun Zhang, Kaiqiang Xu, Chao Li, Yongjun Xu

    12581-12588

    PDF
  • Mining on Heterogeneous Manifolds for Zero-Shot Cross-Modal Image Retrieval

    Fan Yang, Zheng Wang, Jing Xiao, Shin'ichi Satoh

    12589-12596

    PDF
  • Asymmetric Co-Teaching for Unsupervised Cross-Domain Person Re-Identification

    Fengxiang Yang, Ke Li, Zhun Zhong, Zhiming Luo, Xing Sun, Hao Cheng, Xiaowei Guo, Feiyue Huang, Rongrong Ji, Shaozi Li

    12597-12604

    PDF
  • Learning to Incorporate Structure Knowledge for Image Inpainting

    Jie Yang, Zhiquan Qi, Yong Shi

    12605-12612

    PDF
  • An Adversarial Perturbation Oriented Domain Adaptation Approach for Semantic Segmentation

    Jihan Yang, Ruijia Xu, Ruiyu Li, Xiaojuan Qi, Xiaoyong Shen, Guanbin Li, Liang Lin

    12613-12620

    PDF
  • FAN-Face: a Simple Orthogonal Improvement to Deep Face Recognition

    Jing Yang, Adrian Bulat, Georgios Tzimiropoulos

    12621-12628

    PDF
  • Towards Scale-Free Rain Streak Removal via Self-Supervised Fractal Band Learning

    Wenhan Yang, Shiqi Wang, Dejia Xu, Xiaodong Wang, Jiaying Liu

    12629-12636

    PDF
  • SOGNet: Scene Overlap Graph Network for Panoptic Segmentation

    Yibo Yang, Hongyang Li, Xia Li, Qijie Zhao, Jianlong Wu, Zhouchen Lin

    12637-12644

    PDF
  • Release the Power of Online-Training for Robust Visual Tracking

    Yifan Yang, Guorong Li, Yuankai Qi, QIngming Huang

    12645-12652

    PDF
  • Context-Transformer: Tackling Object Confusion for Few-Shot Detection

    Ze Yang, Yali Wang, Xianyu Chen, Jianzhuang Liu, Yu Qiao

    12653-12660

    PDF
  • SM-NAS: Structural-to-Modular Neural Architecture Search for Object Detection

    Lewei Yao, Hang Xu, Wei Zhang, Xiaodan Liang, Zhenguo Li

    12661-12668

    PDF
  • Deep Discriminative CNN with Temporal Ensembling for Ambiguously-Labeled Image Classification

    Yao Yao, Jiehui Deng, Xiuhua Chen, Chen Gong, Jianxin Wu, Jian Yang

    12669-12676

    PDF
  • Object-Guided Instance Segmentation for Biological Images

    Jingru Yi, Hui Tang, Pengxiang Wu, Bo Liu, Daniel J. Hoeppner, Dimitris N. Metaxas, Lianyi Han, Wei Fan

    12677-12684

    PDF
  • Leveraging Multi-View Image Sets for Unsupervised Intrinsic Image Decomposition and Highlight Separation

    Renjiao Yi, Ping Tan, Stephen Lin

    12685-12692

    PDF
  • Joint Super-Resolution and Alignment of Tiny Faces

    Yu Yin, Joseph Robinson, Yulun Zhang, Yun Fu

    12693-12700

    PDF
  • Facial Action Unit Intensity Estimation via Semantic Correspondence Learning with Dynamic Graph Convolution

    Yingruo Fan, Jacqueline Lam, Victor Li

    12701-12708

    PDF
  • Cross-Modality Attention with Semantic Graph Embedding for Multi-Label Classification

    Renchun You, Zhiyao Guo, Lei Cui, Xiang Long, Yingze Bao, Shilei Wen

    12709-12716

    PDF
  • Distraction-Aware Feature Learning for Human Attribute Recognition via Coarse-to-Fine Attention Mechanism

    Mingda Wu, Di Huang, Yuanfang Guo, Yunhong Wang

    12394-12401

    PDF
  • Patch Proposal Network for Fast Semantic Segmentation of High-Resolution Images

    Tong Wu, Zhenzhen Lei, Bingqian Lin, Cuihua Li, Yanyun Qu, Yuan Xie

    12402-12409

    PDF
  • SalSAC: A Video Saliency Prediction Model with Shuffled Attentions and Correlation-Based ConvLSTM

    Xinyi Wu, Zhenyao Wu, Jinglin Zhang, Lili Ju, Song Wang

    12410-12417

    PDF
  • Recognizing Instagram Filtered Images with Feature De-Stylization

    Zhe Wu, Zuxuan Wu, Bharat Singh, Larry Davis

    12418-12425

    PDF
  • Convolutional Hierarchical Attention Network for Query-Focused Video Summarization

    Shuwen Xiao, Zhou Zhao, Zijian Zhang, Xiaohui Yan, Min Yang

    12426-12433

    PDF
  • Adversarial Learning of Privacy-Preserving and Task-Oriented Representations

    Taihong Xiao, Yi-Hsuan Tsai, Kihyuk Sohn, Manmohan Chandraker, Ming-Hsuan Yang

    12434-12441

    PDF
  • Motion-Based Generator Model: Unsupervised Disentanglement of Appearance, Trackable and Intrackable Motions in Dynamic Patterns

    Jianwen Xie, Ruiqi Gao, Zilong Zheng, Song-Chun Zhu, Ying Nian Wu

    12442-12451

    PDF
  • Segmenting Medical MRI via Recurrent Decoding Cell

    Ying Wen, Kai Xie, Lianghua He

    12452-12459

    PDF
  • PI-RCNN: An Efficient Multi-Sensor 3D Object Detector with Point-Based Attentive Cont-Conv Fusion Module

    Liang Xie, Chao Xiang, Zhengxu Yu, Guodong Xu, Zheng Yang, Deng Cai, Xiaofei He

    12460-12467

    PDF
  • Video Face Super-Resolution with Motion-Adaptive Feedback Cell

    Jingwei Xin, Nannan Wang, Jie Li, Xinbo Gao, Zhifeng Li

    12468-12475

    PDF
  • Facial Attribute Capsules for Noise Face Super Resolution

    Jingwei Xin, Nannan Wang, Xinrui Jiang, Jie Li, Xinbo Gao, Zhifeng Li

    12476-12483

    PDF
  • FusionDN: A Unified Densely Connected Network for Image Fusion

    Han Xu, Jiayi Ma, Zhuliang Le, Junjun Jiang, Xiaojie Guo

    12484-12491

    PDF
  • Universal-RCNN: Universal Object Detector via Transferable Graph R-CNN

    Hang Xu, Linpu Fang, Xiaodan Liang, Wenxiong Kang, Zhenguo Li

    12492-12499

    PDF
  • Geometry Sharing Network for 3D Point Cloud Classification and Segmentation

    Mingye Xu, Zhipeng Zhou, Yu Qiao

    12500-12507

    PDF
  • Learning Inverse Depth Regression for Multi-View Stereo with Correlation Cost Volume

    Qingshan Xu, Wenbing Tao

    12508-12515

    PDF
  • Planar Prior Assisted PatchMatch Multi-View Stereo

    Qingshan Xu, Wenbing Tao

    12516-12523

    PDF
  • A Proposal-Based Approach for Activity Image-to-Video Retrieval

    Ruicong Xu, Li Niu, Jianfu Zhang, Liqing Zhang

    12524-12531

    PDF
  • GDFace: Gated Deformation for Multi-View Face Image Synthesis

    Xuemiao Xu, Keke Li, Cheng Xu, Shengfeng He

    12532-12540

    PDF
  • CF-LSTM: Cascaded Feature-Based Long Short-Term Networks for Predicting Pedestrian Trajectory

    Yi Xu, Jing Yang, Shaoyi Du

    12541-12548

    PDF
  • SiamFC++: Towards Robust and Accurate Visual Tracking with Target Estimation Guidelines

    Yinda Xu, Zeyu Wang, Zuoxin Li, Ye Yuan, Gang Yu

    12549-12556

    PDF
  • Consistent Video Style Transfer via Compound Regularization

    Wenjing Wang, Jizheng Xu, Li Zhang, Yue Wang, Jiaying Liu

    12233-12240

    PDF
  • Mis-Classified Vector Guided Softmax Loss for Face Recognition

    Xiaobo Wang, Shifeng Zhang, Shuo Wang, Tianyu Fu, Hailin Shi, Tao Mei

    12241-12248

    PDF
  • Symbiotic Attention with Privileged Information for Egocentric Action Recognition

    Xiaohan Wang, Yu Wu, Linchao Zhu, Yi Yang

    12249-12256

    PDF
  • Task-Aware Monocular Depth Estimation for 3D Object Detection

    Xinlong Wang, Wei Yin, Tao Kong, Yuning Jiang, Lei Li, Chunhua Shen

    12257-12264

    PDF
  • Multi-Label Classification with Label Graph Superimposing

    Ya Wang, Dongliang He, Fu Li, Xiang Long, Zhichao Zhou, Jinwen Ma, Shilei Wen

    12265-12272

    PDF
  • Pruning from Scratch

    Yulong Wang, Xiaolu Zhang, Lingxi Xie, Jun Zhou, Hang Su, Bo Zhang, Xiaolin Hu

    12273-12280

    PDF
  • Learning Diverse Stochastic Human-Action Generators by Learning Smooth Latent Transitions

    Zhenyi Wang, Ping Yu, Yang Zhao, Ruiyi Zhang, Yufan Zhou, Junsong Yuan, Changyou Chen

    12281-12288

    PDF
  • Graph-Propagation Based Correlation Learning for Weakly Supervised Fine-Grained Image Classification

    Zhuhui Wang, Shijie Wang, Haojie Li, Zhi Dou, Jianjun Li

    12289-12296

    PDF
  • Localize, Assemble, and Predicate: Contextual Object Proposal Embedding for Visual Relation Detection

    Ruihai Wu, Kehan Xu, Chenchen Liu, Nan Zhuang, Yadong Mu

    12297-12304

    PDF
  • EFANet: Exchangeable Feature Alignment Network for Arbitrary Style Transfer

    Zhijie Wu, Chunjin Song, Yang Zhou, Minglun Gong, Hui Huang

    12305-12312

    PDF
  • Adaptive Cross-Modal Embeddings for Image-Text Alignment

    Jonatas Wehrmann, Camila Kolling, Rodrigo C Barros

    12313-12320

    PDF
  • FÂłNet: Fusion, Feedback and Focus for Salient Object Detection

    Jun Wei, Shuhui Wang, Qingming Huang

    12321-12328

    PDF
  • 3D Single-Person Concurrent Activity Detection Using Stacked Relation Network

    Yi Wei, Wenbo Li, Yanbo Fan, Linghan Xu, Ming-Ching Chang, Siwei Lyu

    12329-12337

    PDF
  • Heuristic Black-Box Adversarial Attacks on Video Recognition Models

    Zhipeng Wei, Jingjing Chen, Xingxing Wei, Linxi Jiang, Tat-Seng Chua, Fengfeng Zhou, Yu-Gang Jiang

    12338-12345

    PDF
  • Efficient Querying from Weighted Binary Codes

    Zhenyu Weng, Yuesheng Zhu

    12346-12353

    PDF
  • Online Hashing with Efficient Updating of Binary Codes

    Zhenyu Weng, Yuesheng Zhu

    12354-12361

    PDF
  • Tracklet Self-Supervised Learning for Unsupervised Person Re-Identification

    Guile Wu, Xiatian Zhu, Shaogang Gong

    12362-12369

    PDF
  • CircleNet for Hip Landmark Detection

    Hai Wu, Hongtao Xie, Chuanbin Liu, Zheng-Jun Zha, Jun Sun, Yongdong Zhang

    12370-12377

    PDF
  • 3D Human Pose Estimation via Explicit Compositional Depth Maps

    Haiping Wu, Bin Xiao

    12378-12385

    PDF
  • Tree-Structured Policy Based Progressive Reinforcement Learning for Temporally Language Grounding in Video

    Jie Wu, Guanbin Li, Si Liu, Liang Lin

    12386-12393

    PDF
  • V-PROM: A Benchmark for Visual Reasoning Using Visual Progressive Matrices

    Damien Teney, Peng Wang, Jiewei Cao, Lingqiao Liu, Chunhua Shen, Anton van den Hengel

    12071-12078

    PDF
  • End-to-End Thorough Body Perception for Person Search

    Kun Tian, Houjing Huang, Yun Ye, Shiyu Li, Jinbin Lin, Guan Huang

    12079-12086

    PDF
  • Differentiable Meta-Learning Model for Few-Shot Semantic Segmentation

    Pinzhuo Tian, Zhangkai Wu, Lei Qi, Lei Wang, Yinghuan Shi, Yang Gao

    12087-12094

    PDF
  • Attention-Based View Selection Networks for Light-Field Disparity Estimation

    Yu-Ju Tsai, Yu-Lun Liu, Ming Ouhyoung, Yung-Yu Chuang

    12095-12103

    PDF
  • Image Cropping with Composition and Saliency Aware Aesthetic Score Map

    Yi Tu, Li Niu, Weijie Zhao, Dawei Cheng, Liqing Zhang

    12104-12111

    PDF
  • Optical Flow in Deep Visual Tracking

    Mikko Vihlman, Arto Visala

    12112-12119

    PDF
  • TextScanner: Reading Characters in Order for Robust Scene Text Recognition

    Zhaoyi Wan, Minghang He, Haoran Chen, Xiang Bai, Cong Yao

    12120-12127

    PDF
  • Progressive Feature Polishing Network for Salient Object Detection

    Bo Wang, Quan Chen, Min Zhou, Zhiqiang Zhang, Xiaogang Jin, Kun Gai

    12128-12135

    PDF
  • Region-Based Global Reasoning Networks

    Chuanming Wang, Huiyuan Fu, Charles X. Ling, Peilun Du, Huadong Ma

    12136-12143

    PDF
  • Cross-Modality Paired-Images Generation for RGB-Infrared Person Re-Identification

    Guan-An Wang, Tianzhu Zhang, Yang Yang, Jian Cheng, Jianlong Chang, Xu Liang, Zeng-Guang Hou

    12144-12151

    PDF
  • Context Modulated Dynamic Networks for Actor and Action Video Segmentation with Language Queries

    Hao Wang, Cheng Deng, Fan Ma, Yi Yang

    12152-12159

    PDF
  • All You Need Is Boundary: Toward Arbitrary-Shaped Text Spotting

    Hao Wang, Pu Lu, Hui Zhang, Mingkun Yang, Xiang Bai, Yongchao Xu, Mengchao He, Yongpan Wang, Wenyu Liu

    12160-12167

    PDF
  • Temporally Grounding Language Queries in Videos by Contextual Boundary-Aware Prediction

    Jingwen Wang, Lin Ma, Wenhao Jiang

    12168-12175

    PDF
  • Show, Recall, and Tell: Image Captioning with Recall Mechanism

    Li Wang, Zechen Bai, Yonghua Zhang, Hongtao Lu

    12176-12183

    PDF
  • POST: POlicy-Based Switch Tracking

    Ning Wang, Wengang Zhou, Guojun Qi, Houqiang Li

    12184-12191

    PDF
  • Sparsity-Inducing Binarized Neural Networks

    Peisong Wang, Xiangyu He, Gang Li, Tianli Zhao, Jian Cheng

    12192-12199

    PDF
  • Multi-Speaker Video Dialog with Frame-Level Temporal Localization

    Qiang Wang, Pin Jiang, Zhiyi Guo, Yahong Han, Zhou Zhao

    12200-12207

    PDF
  • RDSNet: A New Deep Architecture forReciprocal Object Detection and Instance Segmentation

    Shaoru Wang, Yongchao Gong, Junliang Xing, Lichao Huang, Chang Huang, Weiming Hu

    12208-12215

    PDF
  • Decoupled Attention Network for Text Recognition

    Tianwei Wang, Yuanzhi Zhu, Lianwen Jin, Canjie Luo, Xiaoxue Chen, Yaqiang Wu, Qianying Wang, Mingxiang Cai

    12216-12224

    PDF
  • One-Shot Learning for Long-Tail Visual Relation Detection

    Weitao Wang, Meng Wang, Sen Wang, Guodong Long, Lina Yao, Guilin Qi, Yang Chen

    12225-12232

    PDF
  • FFA-Net: Feature Fusion Attention Network for Single Image Dehazing

    Xu Qin, Zhilin Wang, Yuanchao Bai, Xiaodong Xie, Huizhu Jia

    11908-11915

    PDF
  • Learning Meta Model for Zero- and Few-Shot Face Anti-Spoofing

    Yunxiao Qin, Chenxu Zhao, Xiangyu Zhu, Zezheng Wang, Zitong Yu, Tianyu Fu, Feng Zhou, Jingping Shi, Zhen Lei

    11916-11923

    PDF
  • DGCN: Dynamic Graph Convolutional Network for Efficient Multi-Person Pose Estimation

    Zhongwei Qiu, Kai Qiu, Jianlong Fu, Dongmei Fu

    11924-11931

    PDF
  • Improved Visual-Semantic Alignment for Zero-Shot Object Detection

    Shafin Rahman, Salman Khan, Nick Barnes

    11932-11939

    PDF
  • Dynamic Graph Representation for Occlusion Handling in Biometrics

    Min Ren, Yunlong Wang, Zhenan Sun, Tieniu Tan

    11940-11947

    PDF
  • Conquering the CNN Over-Parameterization Dilemma: A Volterra Filtering Approach for Action Recognition

    Siddharth Roheda, Hamid Krim

    11948-11956

    PDF
  • Hidden Trigger Backdoor Attacks

    Aniruddha Saha, Akshayvarun Subramanya, Hamed Pirsiavash

    11957-11965

    PDF
  • Temporal Interlacing Network

    Hao Shao, Shengju Qian, Yu Liu

    11966-11973

    PDF
  • Regularized Fine-Grained Meta Face Anti-Spoofing

    Rui Shao, Xiangyuan Lan, Pong C. Yuen

    11974-11981

    PDF
  • Multimodal Interaction-Aware Trajectory Prediction in Crowded Space

    Xiaodan Shi, Xiaowei Shao, Zipei Fan, Renhe Jiang, Haoran Zhang, Zhiling Guo, Guangming Wu, Wei Yuan, Ryosuke Shibasaki

    11982-11989

    PDF
  • Optimal Feature Transport for Cross-View Image Geo-Localization

    Yujiao Shi, Xin Yu, Liu Liu, Tong Zhang, Hongdong Li

    11990-11997

    PDF
  • Identifying Model Weakness with Adversarial Examiner

    Michelle Shu, Chenxi Liu, Weichao Qiu, Alan Yuille

    11998-12006

    PDF
  • Efficient Residual Dense Block Search for Image Super-Resolution

    Dehua Song, Chang Xu, Xu Jia, Yiyi Chen, Chunjing Xu, Yunhe Wang

    12007-12014

    PDF
  • KPNet: Towards Minimal Face Detector

    Guanglu Song, Yu Liu, Yuhang Zang, Xiaogang Wang, Biao Leng, Qingsheng Yuan

    12015-12022

    PDF
  • Multi-Spectral Salient Object Detection by Adversarial Domain Adaptation

    Shaoyue Song, Hongkai Yu, Zhenjiang Miao, Jianwu Fang, Kang Zheng, Cong Ma, Song Wang

    12023-12030

    PDF
  • Stereoscopic Image Super-Resolution with Stereo Consistent Feature

    Wonil Song, Sungil Choi, Somi Jeong, Kwanghoon Sohn

    12031-12038

    PDF
  • An Efficient Framework for Dense Video Captioning

    Maitreya Suin, A. N. Rajagopalan

    12039-12046

    PDF
  • Fine-Grained Recognition: Accounting for Subtle Differences between Similar Classes

    Guolei Sun, Hisham Cholakkal, Salman Khan, Fahad Khan, Ling Shao

    12047-12054

    PDF
  • Relation-Aware Pedestrian Attribute Recognition with Graph Convolutional Networks

    Zichang Tan, Yang Yang, Jun Wan, Guodong Guo, Stan Z. Li

    12055-12062

    PDF
  • R²MRF: Defocus Blur Detection via Recurrently Refining Multi-Scale Residual Features

    Chang Tang, Xinwang Liu, Xinzhong Zhu, En Zhu, Kun Sun, Pichao Wang, Lizhe Wang, Albert Zomaya

    12063-12070

    PDF
  • Fine-Grained Fashion Similarity Learning by Attribute-Specific Embedding Network

    Zhe Ma, Jianfeng Dong, Zhongzi Long, Yao Zhang, Yuan He, Hui Xue, Shouling Ji

    11741-11748

    PDF
  • Domain Generalization Using a Mixture of Multiple Latent Domains

    Toshihiko Matsuura, Tatsuya Harada

    11749-11756

    PDF
  • High-Order Residual Network for Light Field Super-Resolution

    Nan Meng, Xiaofei Wu, Jianzhuang Liu, Edmund Lam

    11757-11764

    PDF
  • Shallow Feature Based Dense Attention Network for Crowd Counting

    Yunqi Miao, Zijia Lin, Guiguang Ding, Jungong Han

    11765-11772

    PDF
  • Learning to Follow Directions in Street View

    Karl Moritz Hermann, Mateusz Malinowski, Piotr Mirowski, Andras Banki-Horvath, Keith Anderson, Raia Hadsell

    11773-11781

    PDF
  • Pyramid Attention Aggregation Network for Semantic Segmentation of Surgical Instruments

    Zhen-Liang Ni, Gui-Bin Bian, Guan-An Wang, Xiao-Hu Zhou, Zeng-Guang Hou, Hua-Bin Chen, Xiao-Liang Xie

    11782-11790

    PDF
  • Spatial-Temporal Gaussian Scale Mixture Modeling for Foreground Estimation

    Qian Ning, Weisheng Dong, Fangfang Wu, Jinjian Wu, Jie Lin, Guangming Shi

    11791-11798

    PDF
  • Crowd Counting with Decomposed Uncertainty

    Min-hwan Oh, Peder Olsen, Karthikeyan Natesan Ramamurthy

    11799-11806

    PDF
  • Image Formation Model Guided Deep Image Super-Resolution

    Jinshan Pan, Yang Liu, Deqing Sun, Jimmy Ren, Ming-Ming Cheng, Jian Yang, Jinhui Tang

    11807-11814

    PDF
  • Adversarial Cross-Domain Action Recognition with Co-Attention

    Boxiao Pan, Zhangjie Cao, Ehsan Adeli, Juan Carlos Niebles

    11815-11822

    PDF
  • Further Understanding Videos through Adverbs: A New Video Task

    Bo Pang, Kaiwen Zha, Yifan Zhang, Cewu Lu

    11823-11830

    PDF
  • Visual Dialogue State Tracking for Question Generation

    Wei Pang, Xiaojie Wang

    11831-11838

    PDF
  • Relation Network for Person Re-Identification

    Hyunjong Park, Bumsub Ham

    11839-11847

    PDF
  • Explanation vs Attention: A Two-Player Game to Obtain Attention for VQA

    Badri Patro, Anupriy, Vinay Namboodiri

    11848-11855

    PDF
  • LCD: Learned Cross-Domain Descriptors for 2D-3D Matching

    Quang-Hieu Pham, Mikaela Angelina Uy, Binh-Son Hua, Duc Thanh Nguyen, Gemma Roig, Sai-Kit Yeung

    11856-11864

    PDF
  • Exploit and Replace: An Asymmetrical Two-Stream Architecture for Versatile Light Field Saliency Detection

    Yongri Piao, Zhengkun Rong, Miao Zhang, Huchuan Lu

    11865-11873

    PDF
  • Differentiable Grammars for Videos

    AJ Piergiovanni, Anelia Angelova, Michael S. Ryoo

    11874-11881

    PDF
  • Region-Adaptive Dense Network for Efficient Motion Deblurring

    Kuldeep Purohit, A. N. Rajagopalan

    11882-11889

    PDF
  • Visualizing Deep Networks by Optimizing with Integrated Gradients

    Zhongang Qi, Saeed Khorram, Li Fuxin

    11890-11898

    PDF
  • Text Perceptron: Towards End-to-End Arbitrary-Shaped Text Spotting

    Liang Qiao, Sanli Tang, Zhanzhan Cheng, Yunlu Xu, Yi Niu, Shiliang Pu, Fei Wu

    11899-11907

    PDF
  • Learned Video Compression via Joint Spatial-Temporal Correlation Exploration

    Haojie Liu, Han Shen, Lichao Huang, Ming Lu, Tong Chen, Zhan Ma

    11580-11587

    PDF
  • Interactive Dual Generative Adversarial Networks for Image Captioning

    Junhao Liu, Kai Wang, Chunpu Xu, Zhou Zhao, Ruifeng Xu, Ying Shen, Min Yang

    11588-11595

    PDF
  • Morphing and Sampling Network for Dense Point Cloud Completion

    Minghua Liu, Lu Sheng, Sheng Yang, Jing Shao, Shi-Min Hu

    11596-11603

    PDF
  • Multi-Task Driven Feature Models for Thermal Infrared Tracking

    Qiao Liu, Xin Li, Zhenyu He, Nana Fan, Di Yuan, Wei Liu, Yongsheng Liang

    11604-11611

    PDF
  • Progressive Boundary Refinement Network for Temporal Action Detection

    Qinying Liu, Zilei Wang

    11612-11619

    PDF
  • A Generalized Framework for Edge-Preserving and Structure-Preserving Image Smoothing

    Wei Liu, Pingping Zhang, Yinjie Lei, Xiaolin Huang, Jie Yang, Ian Reid

    11620-11628

    PDF
  • Importance-Aware Semantic Segmentation in Self-Driving with Discrete Wasserstein Training

    Xiaofeng Liu, Yuzhuo Han, Song Bai, Yi Ge, Tianxing Wang, Xu Han, Site Li, Jane You, Jun Lu

    11629-11636

    PDF
  • A New Dataset and Boundary-Attention Semantic Segmentation for Face Parsing

    Yinglu Liu, Hailin Shi, Hao Shen, Yue Si, Xiaobo Wang, Tao Mei

    11637-11644

    PDF
  • Learning Cross-Modal Context Graph for Visual Grounding

    Yongfei Liu, Bo Wan, Xiaodan Zhu, Xuming He

    11645-11652

    PDF
  • CBNet: A Novel Composite Backbone Network Architecture for Object Detection

    Yudong Liu, Yongtao Wang, Siwei Wang, Tingting Liang, Qijie Zhao, Zhi Tang, Haibin Ling

    11653-11660

    PDF
  • Separate in Latent Space: Unsupervised Single Image Layer Separation

    Yunfei Liu, Feng Lu

    11661-11668

    PDF
  • TEINet: Towards an Efficient Architecture for Video Recognition

    Zhaoyang Liu, Donghao Luo, Yabiao Wang, Limin Wang, Ying Tai, Chengjie Wang, Jilin Li, Feiyue Huang, Tong Lu

    11669-11676

    PDF
  • TANet: Robust 3D Object Detection from Point Clouds with Triple Attention

    Zhe Liu, Xin Zhao, Tengteng Huang, Ruolan Hu, Yu Zhou, Xiang Bai

    11677-11684

    PDF
  • Training-Time-Friendly Network for Real-Time Object Detection

    Zili Liu, Tu Zheng, Guodong Xu, Zheng Yang, Haifeng Liu, Deng Cai

    11685-11692

    PDF
  • Hybrid Graph Neural Networks for Crowd Counting

    Ao Luo, Fan Yang, Xin Li, Dong Nie, Zhicheng Jiao, Shangchen Zhou, Hong Cheng

    11693-11700

    PDF
  • Video Cloze Procedure for Self-Supervised Spatio-Temporal Learning

    Dezhao Luo, Chang Liu, Yu Zhou, Dongbao Yang, Can Ma, Qixiang Ye, Weiping Wang

    11701-11708

    PDF
  • Context-Aware Zero-Shot Recognition

    Ruotian Luo, Ning Zhang, Bohyung Han, Linjie Yang

    11709-11716

    PDF
  • Learning Saliency-Free Model with Generic Features for Weakly-Supervised Semantic Segmentation

    Wenfeng Luo, Meng Yang

    11717-11724

    PDF
  • An Integrated Enhancement Solution for 24-Hour Colorful Imaging

    Feifan Lv, Yinqiang Zheng, Yicheng Li, Feng Lu

    11725-11732

    PDF
  • A Variational Autoencoder with Deep Embedding Model for Generalized Zero-Shot Learning

    Peirong Ma, Xiao Hu

    11733-11740

    PDF
  • Gated Fully Fusion for Semantic Segmentation

    Xiangtai Li, Houlong Zhao, Lei Han, Yunhai Tong, Shaohua Tan, Kuiyuan Yang

    11418-11425

    PDF
  • ScaleNet – Improve CNNs through Recursively Rescaling Objects

    Xingyi Li, Zhongang Qi, Xiaoli Fern, Fuxin Li

    11426-11433

    PDF
  • Relation-Guided Spatial Attention and Temporal Refinement for Video-Based Person Re-Identification

    Xingze Li, Wengang Zhou, Yun Zhou, Houqiang Li

    11434-11441

    PDF
  • Geometry-Driven Self-Supervised Method for 3D Human Pose Estimation

    Yang Li, Kan Li, Shuai Jiang, Ziyue Zhang, Congzhentao Huang, Richard Yi Da Xu

    11442-11449

    PDF
  • Natural Image Matting via Guided Contextual Attention

    Yaoyi Li, Hongtao Lu

    11450-11457

    PDF
  • Learning Transferable Adversarial Examples via Ghost Networks

    Yingwei Li, Song Bai, Yuyin Zhou, Cihang Xie, Zhishuai Zhang, Alan Yuille

    11458-11465

    PDF
  • Finding Action Tubes with a Sparse-to-Dense Framework

    Yuxi Li, Weiyao Lin, Tao Wang, John See, Rui Qian, Ning Xu, Limin Wang, Shugong Xu

    11466-11473

    PDF
  • Real-Time Scene Text Detection with Differentiable Binarization

    Minghui Liao, Zhaoyi Wan, Cong Yao, Kai Chen, Xiang Bai

    11474-11481

    PDF
  • Object Instance Mining for Weakly Supervised Object Detection

    Chenhao Lin, Siwen Wang, Dongqi Xu, Yu Lu, Wayne Zhang

    11482-11489

    PDF
  • Multimodal Structure-Consistent Image-to-Image Translation

    Che-Tsung Lin, Yen-Yi Wu, Po-Hao Hsu, Shang-Hong Lai

    11490-11498

    PDF
  • Fast Learning of Temporal Action Proposal via Dense Boundary Generator

    Chuming Lin, Jian Li, Yabiao Wang, Ying Tai, Donghao Luo, Zhipeng Cui, Chengjie Wang, Jilin Li, Feiyue Huang, Rongrong Ji

    11499-11506

    PDF
  • Learning to Transfer: Unsupervised Domain Translation via Meta-Learning

    Jianxin Lin, Yijun Wang, Zhibo Chen, Tianyu He

    11507-11514

    PDF
  • Learning Cross-Aligned Latent Embeddings for Zero-Shot Cross-Modal Retrieval

    Kaiyi Lin, Xing Xu, Lianli Gao, Zheng Wang, Heng Tao Shen

    11515-11522

    PDF
  • Learning to Deblur Face Images via Sketch Synthesis

    Songnan Lin, Jiawei Zhang, Jinshan Pan, Yicun Liu, Yongtian Wang, Jing Chen, Jimmy Ren

    11523-11530

    PDF
  • Self-Attention ConvLSTM for Spatiotemporal Prediction

    Zhihui Lin, Maomao Li, Zhuobin Zheng, Yangyang Cheng, Chun Yuan

    11531-11538

    PDF
  • Weakly-Supervised Video Moment Retrieval via Semantic Completion Network

    Zhijie Lin, Zhou Zhao, Zhu Zhang, Qi Wang, Huasheng Liu

    11539-11546

    PDF
  • Zero-Shot Learning from Adversarial Feature Residual to Compact Visual Feature

    Bo Liu, Qiulei Dong, Zhanyi Hu

    11547-11554

    PDF
  • Filtration and Distillation: Enhancing Region Attention for Fine-Grained Visual Categorization

    Chuanbin Liu, Hongtao Xie, Zheng-Jun Zha, Lingfeng Ma, Lingyun Yu, Yongdong Zhang

    11555-11562

    PDF
  • HAL: Improved Text-Image Matching by Mitigating Visual Semantic Hubs

    Fangyu Liu, Rongtian Ye, Xun Wang, Shuaipeng Li

    11563-11571

    PDF
  • Federated Learning for Vision-and-Language Grounding Problems

    Fenglin Liu, Xian Wu, Shen Ge, Wei Fan, Yuexian Zou

    11572-11579

    PDF
  • MULE: Multimodal Universal Language Embedding

    Donghyun Kim, Kuniaki Saito, Kate Saenko, Stan Sclaroff, Bryan Plummer

    11254-11261

    PDF
  • REST: Performance Improvement of a Black Box Model via RL-Based Spatial Transformation

    Jae Myung Kim, Hyungjin Kim, Chanwoo Park, Jungwoo Lee

    11262-11269

    PDF
  • Spiking-YOLO: Spiking Neural Network for Energy-Efficient Object Detection

    Seijoon Kim, Seongsik Park, Byunggook Na, Sungroh Yoon

    11270-11277

    PDF
  • FISR: Deep Joint Frame Interpolation and Super-Resolution with a Multi-Scale Temporal Loss

    Soo Ye Kim, Jihyong Oh, Munchurl Kim

    11278-11286

    PDF
  • JSI-GAN: GAN-Based Joint Super-Resolution and Inverse Tone-Mapping with Pixel-Wise Task-Specific Filters for UHD HDR Video

    Soo Ye Kim, Jihyong Oh, Munchurl Kim

    11287-11295

    PDF
  • Unpaired Image Enhancement Featuring Reinforcement-Learning-Controlled Image Editing Software

    Satoshi Kosugi, Toshihiko Yamasaki

    11296-11303

    PDF
  • Adversary for Social Good: Protecting Familial Privacy through Joint Adversarial Attacks

    Chetan Kumar, Riazat Ryan, Ming Shao

    11304-11311

    PDF
  • Kinematic-Structure-Preserved Representation for Unsupervised 3D Human Pose Estimation

    Jogendra Nath Kundu, Siddharth Seth, Rahul M V, Mugalodi Rakesh, Venkatesh Babu Radhakrishnan, Anirban Chakraborty

    11312-11319

    PDF
  • Background Suppression Network for Weakly-Supervised Temporal Action Localization

    Pilhyeon Lee, Youngjung Uh, Hyeran Byun

    11320-11327

    PDF
  • Multi-Question Learning for Visual Question Answering

    Chenyi Lei, Lei Wu, Dong Liu, Zhao Li, Guoxin Wang, Haihong Tang, Houqiang Li

    11328-11335

    PDF
  • Unicoder-VL: A Universal Encoder for Vision and Language by Cross-Modal Pre-Training

    Gen Li, Nan Duan, Yuejian Fang, Ming Gong, Daxin Jiang

    11336-11344

    PDF
  • Multi-Spectral Vehicle Re-Identification: A Challenge

    Hongchao Li, Chenglong Li, Xianpeng Zhu, Aihua Zheng, Bin Luo

    11345-11353

    PDF
  • Simple Pose: Rethinking and Improving a Bottom-up Approach for Multi-Person Pose Estimation

    Jia Li, Wen Su, Zengfu Wang

    11354-11361

    PDF
  • Learning Part Generation and Assembly for Structure-Aware Shape Synthesis

    Jun Li, Chengjie Niu, Kai Xu

    11362-11369

    PDF
  • Hierarchical Knowledge Squeezed Adversarial Network Compression

    Peng Li, Chang Shu, Yuan Xie, Yan Qu, Hui Kong

    11370-11377

    PDF
  • Age Progression and Regression with Spatial Attention Modules

    Qi Li, Yunfan Liu, Zhenan Sun

    11378-11385

    PDF
  • Domain Conditioned Adaptation Network

    Shuang Li, Chi Liu, Qiuxia Lin, Binhui Xie, Zhengming Ding, Gao Huang, Jian Tang

    11386-11393

    PDF
  • Appearance and Motion Enhancement for Video-Based Person Re-Identification

    Shuzhao Li, Huimin Yu, Haoji Hu

    11394-11401

    PDF
  • Attention-Based Multi-Modal Fusion Network for Semantic Scene Completion

    Siqi Li, Changqing Zou, Yipeng Li, Xibin Zhao, Yue Gao

    11402-11409

    PDF
  • OVL: One-View Learning for Human Retrieval

    Wenjing Li, Zhongcheng Wu

    11410-11417

    PDF
  • ElixirNet: Relation-Aware Network Architecture Adaptation for Medical Lesion Detection

    Chenhan Jiang, Shaoju Wang, Xiaodan Liang, Hang Xu, Nong Xiao

    11093-11100

    PDF
  • Divide and Conquer: Question-Guided Spatio-Temporal Contextual Attention for Video Question Answering

    Jianwen Jiang, Ziqiang Chen, Haojie Lin, Xibin Zhao, Yue Gao

    11101-11108

    PDF
  • Reasoning with Heterogeneous Graph Alignment for Video Question Answering

    Pin Jiang, Yahong Han

    11109-11116

    PDF
  • Recurrent Nested Model for Sequence Generation

    Wenhao Jiang, Lin Ma, Wei Lu

    11117-11124

    PDF
  • DualVD: An Adaptive Dual Encoding Model for Deep Visual Understanding in Visual Dialogue

    Xiaoze Jiang, Jing Yu, Zengchang Qin, Yingying Zhuang, Xingxing Zhang, Yue Hu, Qi Wu

    11125-11132

    PDF
  • Rethinking Temporal Fusion for Video-Based Person Re-Identification on Semantic and Time Aspect

    Xinyang Jiang, Yifei Gong, Xiaowei Guo, Qize Yang, Feiyue Huang, WEI-SHI ZHENG, Feng Zheng, Xing Sun

    11133-11140

    PDF
  • Learning Light Field Angular Super-Resolution via a Geometry-Aware Network

    Jing Jin, Junhui Hou, Hui Yuan, Sam Kwong

    11141-11148

    PDF
  • EAC-Net: Efficient and Accurate Convolutional Network for Video Recognition

    Bowei Jin, Zhuo Xu

    11149-11156

    PDF
  • SSAH: Semi-Supervised Adversarial Deep Hashing with Self-Paced Hard Sample Generation

    Sheng Jin, Shangchen Zhou, Yao Liu, Chao Chen, Xiaoshuai Sun, Hongxun Yao, Xian-Sheng Hua

    11157-11164

    PDF
  • Uncertainty-Aware Multi-Shot Knowledge Distillation for Image-Based Object Re-Identification

    Xin Jin, Cuiling Lan, Wenjun Zeng, Zhibo Chen

    11165-11172

    PDF
  • Semantics-Aligned Representation Learning for Person Re-Identification

    Xin Jin, Cuiling Lan, Wenjun Zeng, Guoqiang Wei, Zhibo Chen

    11173-11180

    PDF
  • Overcoming Language Priors in VQA via Decomposed Linguistic Representations

    Chenchen Jing, Yuwei Wu, Xiaoxun Zhang, Yunde Jia, Qi Wu

    11181-11188

    PDF
  • Pose-Guided Multi-Granularity Attention Network for Text-Based Person Search

    Ya Jing, Chenyang Si, Junbo Wang, Wei Wang, Liang Wang, Tieniu Tan

    11189-11196

    PDF
  • Associative Variational Auto-Encoder with Distributed Latent Spaces and Associators

    Dae Ung Jo, ByeongJu Lee, Jongwon Choi, Haanju Yoo, Jin Young Choi

    11197-11204

    PDF
  • Real-Time Object Tracking via Meta-Learning: Efficient Model Adaptation and One-Shot Channel Pruning

    Ilchae Jung, Kihyun You, Hyeonwoo Noh, Minsu Cho, Bohyung Han

    11205-11212

    PDF
  • Hide-and-Tell: Learning to Bridge Photo Streams for Visual Storytelling

    Yunjae Jung, Dahun Kim, Sanghyun Woo, Kyungsu Kim, Sungjin Kim, In So Kweon

    11213-11220

    PDF
  • Synthetic Depth Transfer for Monocular 3D Object Pose Estimation in the Wild

    Yueying Kao, Weiming Li, Qiang Wang, Zhouchen Lin, Wooshik Kim, Sunghoon Hong

    11221-11228

    PDF
  • Group-Wise Dynamic Dropout Based on Latent Semantic Variations

    Zhiwei Ke, Zhiwei Wen, Weicheng Xie, Yi Wang, Linlin Shen

    11229-11236

    PDF
  • Deep Generative Probabilistic Graph Neural Networks for Scene Graph Generation

    Mahmoud Khademi, Oliver Schulte

    11237-11245

    PDF
  • Tell Me What They’re Holding: Weakly-Supervised Object Detection with Transferable Knowledge from Human-Object Interaction

    Daesik Kim, Gyujeong Lee, Jisoo Jeong, Nojun Kwak

    11246-11253

    PDF
  • Tensor FISTA-Net for Real-Time Snapshot Compressive Imaging

    Xiaochen Han, Bo Wu, Zheng Shou, Xiao-Yang Liu, Yimeng Zhang, Linghe Kong

    10933-10940

    PDF
  • Temporal Context Enhanced Feature Aggregation for Video Object Detection

    Fei He, Naiyu Gao, Qiaozhe Li, Senyao Du, Xin Zhao, Kaiqi Huang

    10941-10948

    PDF
  • Grapy-ML: Graph Pyramid Mutual Learning for Cross-Dataset Human Parsing

    Haoyu He, Jing Zhang, Qiming Zhang, Dacheng Tao

    10949-10956

    PDF
  • Softmax Dissection: Towards Understanding Intra- and Inter-Class Objective for Embedding Learning

    Lanqing He, Zhongdao Wang, Yali Li, Shengjin Wang

    10957-10964

    PDF
  • RoadTagger: Robust Road Attribute Inference with Graph Neural Networks

    Songtao He, Favyen Bastani, Satvat Jagwani, Edward Park, Sofiane Abbar, Mohammad Alizadeh, Hari Balakrishnan, Sanjay Chawla, Samuel Madden, Mohammad Amin Sadeghi

    10965-10972

    PDF
  • Joint Commonsense and Relation Reasoning for Image and Video Captioning

    Jingyi Hou, Xinxiao Wu, Xiaoxun Zhang, Yayun Qi, Yunde Jia, Jiebo Luo

    10973-10980

    PDF
  • Hierarchical Modes Exploring in Generative Adversarial Networks

    Mengxiao Hu, Jinlong Li, Maolin Hu, Tao Hu

    10981-10988

    PDF
  • SPSTracker: Sub-Peak Suppression of Response Map for Robust Object Tracking

    Qintao Hu, Lijun Zhou, Xiaoxiao Wang, Yao Mao, Jianlin Zhang, Qixiang Ye

    10989-10996

    PDF
  • 3D Shape Completion with Multi-View Consistent Inference

    Tao Hu, Zhizhong Han, Matthias Zwicker

    10997-11004

    PDF
  • GTC: Guided Training of CTC towards Efficient and Accurate Scene Text Recognition

    Wenyang Hu, Xiaocong Cai, Jun Hou, Shuai Yi, Zhiping Lin

    11005-11012

    PDF
  • Coarse-to-Fine Hyper-Prior Modeling for Learned Image Compression

    Yueyu Hu, Wenhan Yang, Jiaying Liu

    11013-11020

    PDF
  • Location-Aware Graph Convolutional Networks for Video Question Answering

    Deng Huang, Peihao Chen, Runhao Zeng, Qing Du, Mingkui Tan, Chuang Gan

    11021-11028

    PDF
  • Unsupervised Deep Learning via Affinity Diffusion

    Jiabo Huang, Qi Dong, Shaogang Gong, Xiatian Zhu

    11029-11036

    PDF
  • GlobalTrack: A Simple and Strong Baseline for Long-Term Tracking

    Lianghua Huang, Xin Zhao, Kaiqi Huang

    11037-11044

    PDF
  • Part-Level Graph Convolutional Network for Skeleton-Based Action Recognition

    Linjiang Huang, Yan Huang, Wanli Ouyang, Liang Wang

    11045-11052

    PDF
  • Relational Prototypical Network for Weakly Supervised Temporal Action Localization

    Linjiang Huang, Yan Huang, Wanli Ouyang, Liang Wang

    11053-11060

    PDF
  • AWR: Adaptive Weighting Regression for 3D Hand Pose Estimation

    Weiting Huang, Pengfei Ren, Jingyu Wang, Qi Qi, Haifeng Sun

    11061-11068

    PDF
  • Domain Adaptive Attention Learning for Unsupervised Person Re-Identification

    Yangru Huang, Peixi Peng, Yi Jin, Yidong Li, Junliang Xing

    11069-11076

    PDF
  • Weakly-Supervised Video Re-Localization with Multiscale Attention Model

    Yung-Han Huang, Kuang-Jui Hsu, Shyh-Kang Jeng, Yen-Yu Lin

    11077-11084

    PDF
  • SGAP-Net: Semantic-Guided Attentive Prototypes Network for Few-Shot Human-Object Interaction Recognition

    Zhong Ji, Xiyao Liu, Yanwei Pang, Xuelong Li

    11085-11092

    PDF
  • Scale-Wise Convolution for Image Restoration

    Yuchen Fan, Jiahui Yu, Ding Liu, Thomas S. Huang

    10770-10777

    PDF
  • EHSOD: CAM-Guided End-to-End Hybrid-Supervised Object Detection with Cascade Refinement

    Linpu Fang, Hang Xu, Zhili Liu, Sarah Parisot, Zhenguo Li

    10778-10785

    PDF
  • Adversarial Attack on Deep Product Quantization Network for Image Retrieval

    Yan Feng, Bin Chen, Tao Dai, Shu-Tao Xia

    10786-10793

    PDF
  • Dynamic Sampling Network for Semantic Segmentation

    Bin Fu, Junjun He, Zhengfu Zhang, Yu Qiao

    10794-10801

    PDF
  • Ultrafast Video Attention Prediction with Coupled Knowledge Distillation

    Kui Fu, Peipei Shi, Yafei Song, Shiming Ge, Xiangju Lu, Jia Li

    10802-10809

    PDF
  • Accurate Temporal Action Proposal Generation with Relation-Aware Pyramid Network

    Jialin Gao, Zhixiang Shi, Guanshuo Wang, Jiani Li, Yufeng Yuan, Shiming Ge, Xi Zhou

    10810-10817

    PDF
  • Channel Interaction Networks for Fine-Grained Image Categorization

    Yu Gao, Xintong Han, Xun Wang, Weilin Huang, Matthew Scott

    10818-10825

    PDF
  • KnowIT VQA: Answering Knowledge-Based Questions about Videos

    Noa Garcia, Mayu Otani, Chenhui Chu, Yuta Nakashima

    10826-10834

    PDF
  • Deep Reinforcement Learning for Active Human Pose Estimation

    Erik Gärtner, Aleksis Pirinen, Cristian Sminchisescu

    10835-10844

    PDF
  • Look One and More: Distilling Hybrid Order Relational Knowledge for Cross-Resolution Image Recognition

    Shiming Ge, Kangkai Zhang, Haolin Liu, Yingying Hua, Shengwei Zhao, Xin Jin, Hao Wen

    10845-10852

    PDF
  • Symmetrical Synthesis for Deep Metric Learning

    Geonmo Gu, Byungsoo Ko

    10853-10860

    PDF
  • FLNet: Landmark Driven Fetching and Learning Network for Faithful Talking Facial Animation Synthesis

    Kuangxiao Gu, Yuqian Zhou, Thomas Huang

    10861-10868

    PDF
  • Pyramid Constrained Self-Attention Network for Fast Video Salient Object Detection

    Yuchao Gu, Lijuan Wang, Ziqin Wang, Yun Liu, Ming-Ming Cheng, Shao-Ping Lu

    10869-10876

    PDF
  • Constructing Multiple Tasks for Augmentation: Improving Neural Image Classification with K-Means Features

    Tao Gui, Lizhi Qing, Qi Zhang, Jiacheng Ye, Hang Yan, Zichu Fei, Xuanjing Huang

    10877-10884

    PDF
  • Channel Pruning Guided by Classification Loss and Feature Importance

    Jinyang Guo, Wanli Ouyang, Dong Xu

    10885-10892

    PDF
  • MarioNETte: Few-Shot Face Reenactment Preserving Identity of Unseen Targets

    Sungjoo Ha, Martin Kersner, Beomsu Kim, Seokjun Seo, Dongyoung Kim

    10893-10900

    PDF
  • SADA: Semantic Adversarial Diagnostic Attacks for Autonomous Applications

    Abdullah Hamdi, Matthias Mueller, Bernard Ghanem

    10901-10908

    PDF
  • Robust Conditional GAN from Uncertainty-Aware Pairwise Comparisons

    Ligong Han, Ruijiang Gao, Mun Kim, Xin Tao, Bo Liu, Dimitris Metaxas

    10909-10916

    PDF
  • Complementary-View Multiple Human Tracking

    Ruize Han, Wei Feng, Jiewen Zhao, Zicheng Niu, Yujun Zhang, Liang Wan, Song Wang

    10917-10924

    PDF
  • Point2Node: Correlation Learning of Dynamic-Node for Point Cloud Feature Modeling

    Wenkai Han, Chenglu Wen, Cheng Wang, Xin Li, Qing Li

    10925-10932

    PDF
  • Video Frame Interpolation via Deformable Separable Convolution

    Xianhang Cheng, Zhenzhong Chen

    10607-10614

    PDF
  • CSPN++: Learning Context and Resource Aware Convolutional Spatial Propagation Networks for Depth Completion

    Xinjing Cheng, Peng Wang, Chenye Guan, Ruigang Yang

    10615-10622

    PDF
  • A Coarse-to-Fine Adaptive Network for Appearance-Based Gaze Estimation

    Yihua Cheng, Shiyao Huang, Fei Wang, Chen Qian, Feng Lu

    10623-10630

    PDF
  • 3D Human Pose Estimation Using Spatio-Temporal Networks with Explicit Occlusion Training

    Yu Cheng, Bo Yang, Bo Wang, Robby T. Tan

    10631-10638

    PDF
  • PedHunter: Occlusion Robust Pedestrian Detector in Crowded Scenes

    Cheng Chi, Shifeng Zhang, Junliang Xing, Zhen Lei, Stan Z. Li, Xudong Zou

    10639-10646

    PDF
  • Relational Learning for Joint Head and Human Detection

    Cheng Chi, Shifeng Zhang, Junliang Xing, Zhen Lei, Stan Z. Li, Xudong Zou

    10647-10654

    PDF
  • Visual Domain Adaptation by Consensus-Based Transfer to Intermediate Domain

    Jongwon Choi, Youngjoon Choi, Jihoon Kim, Jinyeop Chang, Ilhwan Kwon, Youngjune Gwon, Seungjai Min

    10655-10662

    PDF
  • Channel Attention Is All You Need for Video Frame Interpolation

    Myungsub Choi, Heewon Kim, Bohyung Han, Ning Xu, Kyoung Mu Lee

    10663-10671

    PDF
  • DASOT: A Unified Framework Integrating Data Association and Single Object Tracking for Online Multi-Object Tracking

    Qi Chu, Wanli Ouyang, Bin Liu, Feng Zhu, Nenghai Yu

    10672-10679

    PDF
  • Towards Ghost-Free Shadow Removal via Dual Hierarchical Aggregation Network and Shadow Matting GAN

    Xiaodong Cun, Chi-Man Pun, Cheng Shi

    10680-10687

    PDF
  • The Missing Data Encoder: Cross-Channel Image Completion with Hide-and-Seek Adversarial Network

    Arnaud Dapogny, Matthieu Cord, Patrick Perez

    10688-10695

    PDF
  • Spatio-Temporal Deformable Convolution for Compressed Video Quality Enhancement

    Jianing Deng, Li Wang, Shiliang Pu, Cheng Zhuo

    10696-10703

    PDF
  • Zero Shot Learning with the Isoperimetric Loss

    Shay Deutsch, Andrea Bertozzi, Stefano Soatto

    10704-10712

    PDF
  • Every Frame Counts: Joint Learning of Video Segmentation and Optical Flow

    Mingyu Ding, Zhe Wang, Bolei Zhou, Jianping Shi, Zhiwu Lu, Ping Luo

    10713-10720

    PDF
  • Cycle-CNN for Colorization towards Real Monochrome-Color Camera Systems

    Xuan Dong, Weixin Li, Xiaojie Wang, Yunhong Wang

    10721-10728

    PDF
  • FD-GAN: Generative Adversarial Networks with Fusion-Discriminator for Single Image Dehazing

    Yu Dong, Yihao Liu, He Zhang, Shifeng Chen, Yu Qiao

    10729-10736

    PDF
  • Visual Relationship Detection with Low Rank Non-Negative Tensor Decomposition

    Mohammed Haroon Dupty, Zhen Zhang, Wee Sun Lee

    10737-10744

    PDF
  • SubSpace Capsule Network

    Marzieh Edraki, Nazanin Rahnavard, Mubarak Shah

    10745-10753

    PDF
  • Person Tube Retrieval via Language Description

    Hehe Fan, Yi Yang

    10754-10761

    PDF
  • CIAN: Cross-Image Affinity Net for Weakly Supervised Semantic Segmentation

    Junsong Fan, Zhaoxiang Zhang, Tieniu Tan, Chunfeng Song, Jun Xiao

    10762-10769

    PDF
  • Ultrafast Photorealistic Style Transfer via Neural Architecture Search

    Jie An, Haoyi Xiong, Jun Huan, Jiebo Luo

    10443-10450

    PDF
  • PsyNet: Self-Supervised Approach to Object Localization Using Point Symmetric Transformation

    Kyungjune Baek, Minhyun Lee, Hyunjung Shim

    10451-10459

    PDF
  • Detecting Human-Object Interactions via Functional Generalization

    Ankan Bansal, Sai Saketh Rambhatla, Abhinav Shrivastava, Rama Chellappa

    10460-10469

    PDF
  • Incremental Multi-Domain Learning with Network Latent Tensor Factorization

    Adrian Bulat, Jean Kossaifi, Georgios Tzimiropoulos, Maja Pantic

    10470-10477

    PDF
  • Monocular 3D Object Detection with Decoupled Structured Polygon Estimation and Height-Guided Depth Estimation

    Yingjie Cai, Buyu Li, Zeyu Jiao, Hongsheng Li, Xingyu Zeng, Xiaogang Wang

    10478-10485

    PDF
  • Auto-GAN: Self-Supervised Collaborative Learning for Medical Image Synthesis

    Bing Cao, Han Zhang, Nannan Wang, Xinbo Gao, Dinggang Shen

    10486-10493

    PDF
  • Feature Deformation Meta-Networks in Image Captioning of Novel Objects

    Tingjia Cao, Ke Han, Xiaomei Wang, Lin Ma, Yanwei Fu, Yu-Gang Jiang, Xiangyang Xue

    10494-10501

    PDF
  • General Partial Label Learning via Dual Bipartite Graph Autoencoder

    Brian Chen, Bo Wu, Alireza Zareian, Hanwang Zhang, Shih-Fu Chang

    10502-10509

    PDF
  • Learning Deep Relations to Promote Saliency Detection

    Changrui Chen, Xin Sun, Yang Hua, Junyu Dong, Hongwei Xv

    10510-10517

    PDF
  • Hierarchical Online Instance Matching for Person Search

    Di Chen, Shanshan Zhang, Wanli Ouyang, Jian Yang, Bernt Schiele

    10518-10525

    PDF
  • Binarized Neural Architecture Search

    Hanlin Chen, Li'an Zhuo, Baochang Zhang, Xiawu Zheng, Jianzhuang Liu, David Doermann, Rongrong Ji

    10526-10533

    PDF
  • End-to-End Learning of Object Motion Estimation from Retinal Events for Event-Based Object Tracking

    Haosheng Chen, David Suter, Qiangqiang Wu, Hanzi Wang

    10534-10541

    PDF
  • Zero-Shot Ingredient Recognition by Multi-Relational Graph Convolutional Network

    Jingjing Chen, Liangming Pan, Zhipeng Wei, Xiang Wang, Chong-Wah Ngo, Tat-Seng Chua

    10542-10550

    PDF
  • Rethinking the Bottom-Up Framework for Query-Based Video Localization

    Long Chen, Chujie Lu, Siliang Tang, Jun Xiao, Dong Zhang, Chilie Tan, Xiaolin Li

    10551-10558

    PDF
  • Diversity Transfer Network for Few-Shot Learning

    Mengting Chen, Yuxin Fang, Xinggang Wang, Heng Luo, Yifeng Geng, Xinyu Zhang, Chang Huang, Wenyu Liu, Bo Wang

    10559-10566

    PDF
  • Structure-Aware Feature Fusion for Unsupervised Domain Adaptation

    Qingchao Chen, Yang Liu

    10567-10574

    PDF
  • Knowledge Graph Transfer Network for Few-Shot Recognition

    Riquan Chen, Tianshui Chen, Xiaolu Hui, Hefeng Wu, Guanbin Li, Liang Lin

    10575-10582

    PDF
  • Expressing Objects Just Like Words: Recurrent Visual Embedding for Image-Text Matching

    Tianlang Chen, Jiebo Luo

    10583-10590

    PDF
  • Frame-Guided Region-Aligned Representation for Video Person Re-Identification

    Zengqun Chen, Zhiheng Zhou, Junchu Huang, Pengyu Zhang, Bo Li

    10591-10598

    PDF
  • Global Context-Aware Progressive Aggregation Network for Salient Object Detection

    Zuyao Chen, Qianqian Xu, Runmin Cong, Qingming Huang

    10599-10606

    PDF
  • Learning End-to-End Scene Flow by Distilling Single Tasks Knowledge

    Filippo Aleotti, Matteo Poggi, Fabio Tosi, Stefano Mattoccia

    10435-10442

    PDF

Primary Sidebar