ISSN 0920-5691

International Journal of Computer Vision (IJCV) - April 2020, issue 4 paper list

Papers in this issue
Editorial: Special Issue on Machine Vision with Deep Learning

Learning on the Edge: Investigating Boundary Filters in CNNs

A Simple and Light-Weight Attention Module for Convolutional Neural Networks

Simultaneous Deep Stereo Matching and Dehazing with Feature Attention

Pixelated Semantic Colorization

Learning Single-Image 3D Reconstruction by Generative Modelling of Shape, Pose and Shading

Modeling Human Motion with Quaternion-Based Neural Networks

Learning Multi-human Optical Flow

Text2Sign: Towards Sign Language Production Using Neural Machine Translation and Generative Adversarial Networks

Guest Editorial: Special Issue on ACCV 2018

EdgeStereo: An Effective Multi-task Learning Network for Stereo Matching and Edge Detection

VOSTR: Video Object Segmentation via Transferable Representations

Minimal Solvers for Rectifying from Radially-Distorted Scales and Change of Scales

Editor’s Note

KS(conf): A Light-Weight Test if a Multiclass Classifier Operates Outside of Its Specifications

Correction to: KS(conf): A Light-Weight Test if a Multiclass Classifier Operates Outside of Its Specifications

End-to-End Learning of Decision Trees and Forests

3D Fluid Flow Estimation with Integrated Particle Reconstruction

Scaling up the Randomized Gradient-Free Adversarial Attack Reveals Overestimation of Robustness Using Established Attacks

Inference, Learning and Attention Mechanisms that Exploit and Preserve Sparsity in CNNs

CR-Net: A Deep Classification-Regression Network for Multimodal Apparent Personality Analysis

Necessary and Sufficient Polynomial Constraints on Compatible Triplets of Essential Matrices

Learning the Clustering of Longitudinal Shape Data Sets into a Mixture of Independent or Branching Trajectories

Transferrable Feature and Projection Learning with Class Hierarchy for Zero-Shot Learning

Gradient Shape Model

Mix and Match Networks: Cross-Modal Alignment for Zero-Pair Image-to-Image Translation

Learning the spatiotemporal variability in longitudinal shape data sets

Incorporating Side Information by Adaptive Convolution

Pix2Vox++: Multi-scale Context-aware 3D Object Reconstruction from Single and Multiple Images

Long-Short Temporal–Spatial Clues Excited Network for Robust Person Re-identification

Rooted Spanning Superpixels

Zero-Shot Object Detection: Joint Recognition and Localization of Novel Concepts

Video Based Face Recognition by Using Discriminatively Learned Convex Models

Deep Learning for Generic Object Detection: A Survey

Semantically Coherent 4D Scene Flow of Dynamic Scenes

Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization

Dual L1-Normalized Context Aware Tensor Power Iteration and Its Applications to Multi-object Tracking and Multi-graph Matching

Semantic Image Networks for Human Action Recognition

Deep Insights into Convolutional Networks for Video Recognition

Corner Detection Using Multi-directional Structure Tensor with Multiple Scales

Recognizing Profile Faces by Imagining Frontal View

Adaptive Importance Learning for Improving Lightweight Image Super-Resolution Network

Hallucinating Unaligned Face Images by Multiscale Transformative Discriminative Networks

SceneFlowFields++: Multi-frame Matching, Visibility Prediction, and Robust Interpolation for Scene Flow Estimation

Statistical Modeling of Craniofacial Shape and Texture

ARBEE: Towards Automated Recognition of Bodily Expression of Emotion in the Wild

Synchronization Problems in Computer Vision with Closed-Form Solutions

Robust Attentional Aggregation of Deep Feature Sets for Multi-view 3D Reconstruction

Temporal Action Detection with Structured Segment Networks

Tracking Persons-of-Interest via Unsupervised Representation Adaptation

Shape-From-Template with Curves

Classifier and Exemplar Synthesis for Zero-Shot Learning

Bi-Real Net: Binarizing Deep Network Towards Real-Network Performance

Predicting Intentions from Motion: The Subject-Adversarial Adaptation Approach

Single Image Dehazing via Multi-scale Convolutional Neural Networks with Holistic Edges

Exploiting Semantics for Face Image Deblurring

Is There Anything New to Say About SIFT Matching?

Deep Image Prior

Light Structure from Pin Motion: Geometric Point Light Source Calibration

MAP Inference Via \(\ell_2\)-Sphere Linear Program Reformulation

Semi-online Multi-people Tracking by Re-identification

The Open Images Dataset V4

Enhanced Balanced Min Cut

A Face Fairness Framework for 3D Meshes

GMS: Grid-Based Motion Statistics for Fast, Ultra-robust Feature Correspondence

Real-Time Multi-person Motion Capture from Multi-view Video and IMUs

Representation Learning on Unit Ball with 3D Roto-translational Equivariance

Scalable Person Re-Identification by Harmonious Attention

Fine-Grained Person Re-identification

Siamese Dense Network for Reflection Removal with Flash and No-Flash Image Pairs

Gated Fusion Network for Degraded Image Super Resolution

Discriminative Training of Conditional Random Fields with Probably Submodular Constraints

Weakly-Supervised Semantic Segmentation by Iterative Affinity Learning

Visual Social Relationship Recognition

RGB-IR Person Re-identification by Cross-Modality Similarity Preservation

Bottom-Up Scene Text Detection with Markov Clustering Networks

Multi-task Generative Adversarial Network for Detecting Small Objects in the Wild

Special Issue: Advances in Architectures and Theories for Computer Vision

Robust Fitting in Computer Vision: Easy or Hard?

Learning SO(3) Equivariant Representations with Spherical CNNs

EKLT: Asynchronous Photometric Feature Tracking Using Events and Frames

Correction to: EKLT: Asynchronous Photometric Feature Tracking Using Events and Frames

Jointly Discovering Visual Objects and Spoken Words from Raw Sensory Input

CornerNet: Detecting Objects as Paired Keypoints

DeepIM: Deep Iterative Matching for 6D Pose Estimation

Differential Scene Flow from Light Field Gradients

GANimation: One-Shot Anatomically Consistent Facial Animation

Augmented Autoencoders: Implicit 3D Orientation Learning for 6D Object Detection

Convolutional Networks with Adaptive Inference Graphs

Group Normalization

DeepTAM: Deep Tracking and Mapping with Convolutional Neural Networks

Efficient Object Annotation via Speaking and Pointing

Learning to Draw Sight Lines

Refractive Two-View Reconstruction for Underwater 3D Vision

Loss-Sensitive Generative Adversarial Networks on Lipschitz Densities

The Unmanned Aerial Vehicle Benchmark: Object Detection, Tracking and Baseline

Special Issue on Deep Learning for Robotic Vision

Learning 3D Shape Completion Under Weak Supervision

Curriculum Model Adaptation with Synthetic and Real Data for Semantic Foggy Scene Understanding

Image-Based Geo-Localization Using Satellite Imagery

Semi-supervised Semantic Mapping Through Label Propagation with Semantic Texture Meshes

Self-Supervised Model Adaptation for Multimodal Semantic Segmentation

SeDAR: Reading Floorplans Like a Human—Using Deep Learning to Enable Human-Inspired Localisation

Cognitive Mapping and Planning for Visual Navigation

Deep Multicameral Decoding for Localizing Unoccluded Object Instances from a Single RGB Image

Model-Based Robot Imitation with Future Image Similarity

Correction to: Model-Based Robot Imitation with Future Image Similarity

Special Issue on Generating Realistic Visual Data of Human Behavior

Adversarial Framework for Unsupervised Learning of Motion Dynamics in Videos

Realistic Speech-Driven Facial Animation with GANs

A Weakly Supervised Multi-task Ranking Framework for Actor–Action Semantic Segmentation

Masked Linear Regression for Learning Local Receptive Fields for Facial Expression Synthesis

Deep Neural Network Augmentation: Generating Faces for Affect Analysis

Towards High Fidelity Face Frontalization in the Wild

Generating Human Action Videos by Coupling 3D Game Engines and Probabilistic Graphical Models

DGPose: Deep Generative Models for Human Body Analysis

Guest Editorial: Generative Adversarial Networks for Computer Vision

Discriminative Region Proposal Adversarial Network for High-Quality Image-to-Image Translation

Handwritten Mathematical Expression Recognition via Paired Adversarial Learning

DRIT++: Diverse Image-to-Image Translation via Disentangled Representations

Layout2image: Image Generation from Layout

Discriminator Feature-Based Inference by Recycling the Discriminator of GANs

MimicGAN: Robust Projection onto Image Manifolds with Corruption Mimicking

Pix2Shape: Towards Unsupervised Learning of 3D Scenes from Images Using a View-Based Representation

Adversarial Confidence Learning for Medical Image Segmentation and Synthesis

Towards Image-to-Video Translation: A Structure-Aware Approach via Multi-stage Generative Adversarial Networks

3DFaceGAN: Adversarial Nets for 3D Face Representation, Generation, and Translation

High-Quality Video Generation from Static Structural Annotations

Compositional GAN: Learning Image-Conditional Binary Composition

Train Sparsely, Generate Densely: Memory-Efficient Unsupervised Training of High-Resolution Temporal GAN

Multimodal Image Synthesis with Conditional Implicit Maximum Likelihood Estimation

SliderGAN: Synthesizing Expressive Face Images by Sliding 3D Blendshape Parameters

Inferring 3D Shapes from Image Collections Using Adversarial Networks

RoCGAN: Robust Conditional GAN

Semantically Tied Paired Cycle Consistency for Any-Shot Sketch-Based Image Retrieval

Densifying Supervision for Fine-Grained Visual Comparisons

GADE: A Generative Adversarial Approach to Density Estimation and its Applications

Towards Photo-Realistic Facial Expression Manipulation

Efficient Visual Recognition

A Survey of Deep Facial Attribute Analysis

Hardware-Centric AutoML for Mixed-Precision Quantization

Spatially-Adaptive Filter Units for Compact and Efficient Deep Neural Networks

HetConv: Beyond Homogeneous Convolution Kernels for Deep CNNs

Learning an Evolutionary Embedding via Massive Knowledge Distillation

SSN: Learning Sparse Switchable Normalization via SparsestMax

Rectified Wing Loss for Efficient and Robust Facial Landmark Localisation with Convolutional Neural Networks

Multi-task Compositional Network for Visual Relationship Detection

Disentangled Representation Learning of Makeup Portraits in the Wild

Fine-Grained Multi-human Parsing

A General Framework for Deep Supervised Discrete Hashing

Learning Multifunctional Binary Codes for Personalized Image Retrieval

Unified Binary Generative Adversarial Network for Image Retrieval and Compression

Weakly-supervised Semantic Guided Hashing for Social Image Retrieval

Hadamard Matrix Guided Online Hashing

Anchor-Based Self-Ensembling for Semi-Supervised Deep Pairwise Hashing

Product Quantization Network for Fast Visual Search

Tensorized Multi-view Subspace Representation Learning