0920-5691

International Journal of Computer Vision (IJCV) - April 2022, issue 4 论文列表

本期论文列表
On Measuring and Controlling the Spectral Bias of the Deep Image Prior

Singularity Analysis for the Perspective-Four and Five-Line Problems

Globally Optimal Linear Model Fitting with Unit-Norm Constraint

Beyond Dents and Scratches: Logical Constraints in Unsupervised Anomaly Detection and Localization

Delving into the Effectiveness of Receptive Fields: Learning Scale-Transferrable Architectures for Practical Object Detection

Pre-Training Without Natural Images

SRT3D: A Sparse Region-Based 3D Object Tracking Approach for the Real World

Atmospheric Turbulence Removal in Long-Range Imaging Using a Data-Driven-Based Approach

GhostNets on Heterogeneous Devices via Cheap Operations

Open-Set Adversarial Defense with Clean-Adversarial Mutual Learning

Visual Attention Consistency for Human Attribute Recognition

Object-Based Visual Camera Pose Estimation From Ellipsoidal Model and 3D-Aware Ellipse Prediction

Weakly-Supervised Semantic Segmentation with Visual Words Learning and Hybrid Pooling

Learning JPEG Compression Artifacts for Image Manipulation Detection and Localization

H-SegMed: A Hybrid Method for Prostate Segmentation in TRUS Images via Improved Closed Principal Curve and Improved Enhanced Machine Learning

Matching Visual Features to Hierarchical Semantic Topics for Image Paragraph Captioning

Wide-Area Crowd Counting: Multi-view Fusion Networks for Counting in Large Scenes

I3CL: Intra- and Inter-Instance Collaborative Learning for Arbitrary-Shaped Scene Text Detection

3D Semantic Scene Completion: A Survey

Generative Sketch Healing

Occluded Video Instance Segmentation: A Benchmark

Learning Inverse Depth Regression for Pixelwise Visibility-Aware Multi-View Stereo Networks

Efficient Burst Raw Denoising with Variance Stabilization and Multi-frequency Denoising Network

Preface to the Special Issue on Human Pose, Motion, Activities and Shape in 3D

Towards Compact 1-bit CNNs via Bayesian Learning

AdaStereo: An Efficient Domain-Adaptive Stereo Matching Approach

Bridging Composite and Real: Towards End-to-End Deep Image Matting

CODON: On Orchestrating Cross-Domain Attentions for Depth Super-Resolution

Action2video: Generating Videos of Human 3D Actions

SensatUrban: Learning Semantics from Urban-Scale Photogrammetric Point Clouds

Personalized Convolution for Face Recognition

Estimating 3D Motion and Forces of Human–Object Interactions from Internet Videos

Real-Time Multi-Car Localization and See-Through System

AutoScale: Learning to Scale for Crowd Counting

Perspectives and Prospects on Transformer Architecture for Cross-Modal Tasks with Language and Vision

Adaptive Deep Disturbance-Disentangled Learning for Facial Expression Recognition

Robust Geodesic Regression

Instance-Aware Scene Layout Forecasting

Continuous and Diverse Image-to-Image Translation via Signed Attribute Vectors

Joint Classification and Regression for Visual Tracking with Fully Convolutional Siamese Networks

Procrustes Analysis with Deformations: A Closed-Form Solution by Eigenvalue Decomposition

Inextensible Surface Reconstruction Under Small Relative Deformations from Distributed Angle Measurements

Editorial for Special Issue on Computer Vision in the Wild

Physical Representation Learning and Parameter Identification from Video Using Differentiable Physics

Inferring Bias and Uncertainty in Camera Calibration

Rescaling Egocentric Vision: Collection, Pipeline and Challenges for EPIC-KITCHENS-100

Learning a Robust Part-Aware Monocular 3D Human Pose Estimator via Neural Architecture Search

Dual-Attention-Guided Network for Ghost-Free High Dynamic Range Imaging

Distribution-Aware Margin Calibration for Semantic Segmentation in Images

View-Invariant, Occlusion-Robust Probabilistic Embedding for Human Pose

Joint Bilateral-Resolution Identity Modeling for Cross-Resolution Person Re-Identification

Memory-Efficient Hierarchical Neural Architecture Search for Image Restoration

Semantic Edge Detection with Diverse Deep Supervision

Kyushu Decorative Tumuli Project: From e-Heritage to Cyber-Archaeology

Deep Bingham Networks: Dealing with Uncertainty and Ambiguity in Pose Estimation

Leveraging Blur Information for Plenoptic Camera Calibration

Countering Malicious DeepFakes: Survey, Battleground, and Horizon

Attribute Prototype Network for Any-Shot Learning

Beyond Monocular Deraining: Parallel Stereo Deraining Network Via Semantic Prior

Nonblind Image Deconvolution via Leveraging Model Uncertainty in An Untrained Deep Neural Network

REVISE: A Tool for Measuring and Mitigating Bias in Visual Datasets

Investigating the Role of Image Retrieval for Visual Localization

A Survey on Long-Tailed Visual Recognition

Correction to: On the Arbitrary-Oriented Object Detection: Classification Based Approaches Revisited

Shape and Albedo Recovery by Your Phone using Stereoscopic Flash and No-Flash Photography

Scaling Up Sign Spotting Through Sign Language Dictionaries

Dual Convolutional Neural Networks for Low-Level Vision

Sparse Black-Box Video Attack with Reinforcement Learning

3D Shape Analysis Through a Quantum Lens: the Average Mixing Kernel Signature

Facial Kinship Verification: A Comprehensive Review and Outlook

Curriculum Learning: A Survey

Pose Measurement at Small Scale by Spectral Analysis of Periodic Patterns

4D Temporally Coherent Multi-Person Semantic Reconstruction and Segmentation

Correction to: Learning Contrastive Representation for Semantic Correspondence

RePCD-Net: Feature-Aware Recurrent Point Cloud Denoising Network

Learning 3D Semantic Scene Graphs with Instance Embeddings

CMSNet: Deep Color and Monochrome Stereo

Learning Scene Dynamics from Point Cloud Sequences

Multi-frame Motion Segmentation by Combining Two-Frame Results

Learning to Detect Instance-Level Salient Objects Using Complementary Image Labels

Wide-Angle Image Rectification: A Survey

A Unified B-Spline Framework for Scale-Invariant Keypoint Detection

From Individual to Whole: Reducing Intra-class Variance by Feature Aggregation

Network Adjustment: Channel and Block Search Guided by Resource Utilization Ratio

A Survey on Intrinsic Images: Delving Deep into Lambert and Beyond

Consensus-Based Optimization for 3D Human Pose Estimation in Camera Coordinates

Correction to: Instance-Aware Scene Layout Forecasting

Correction to: AdaStereo: An Efficient Domain-Adaptive Stereo Matching Approach

SoftPool++: An Encoder–Decoder Network for Point Cloud Completion

iMoCap: Motion Capture from Internet Videos

Learning Self-supervised Low-Rank Network for Single-Stage Weakly and Semi-supervised Semantic Segmentation

Displacement-Invariant Cost Computation for Stereo Matching

Unsupervised Multi-View CNN for Salient View Selection and 3D Interest Point Detection

RIConv++: Effective Rotation Invariant Convolutions for 3D Point Clouds Deep Learning

Weakly Supervised Moment Localization with Decoupled Consistent Concept Prediction

Disentangled Inference for GANs With Latently Invertible Autoencoder

Interpreting Face Inference Models Using Hierarchical Network Dissection

Learning Contrastive Representation for Semantic Correspondence

Eliminating Temporal Illumination Variations in Whisk-broom Hyperspectral Imaging

Spatially-Consistent Feature Matching and Learning for Heritage Image Analysis

On the Arbitrary-Oriented Object Detection: Classification Based Approaches Revisited

Human Action Recognition and Prediction: A Survey

Information-Theoretic Odometry Learning

Improving Image Segmentation with Boundary Patch Refinement

A Deep Learning Approach to Clustering Visual Arts

Semantic Contrastive Embedding for Generalized Zero-Shot Learning

PageNet: Towards End-to-End Weakly Supervised Page-Level Handwritten Chinese Text Recognition

Artificial Intelligence for Dunhuang Cultural Heritage Protection: The Project and the Dataset

Efficient Joint-Dimensional Search with Solution Space Regularization for Real-Time Semantic Segmentation

Dynamical Deep Generative Latent Modeling of 3D Skeletal Motion

Snowvision: Segmenting, Identifying, and Discovering Stamped Curve Patterns from Fragments of Pottery

Cartoon Image Processing: A Survey

Learning Degradation-Invariant Representation for Robust Real-World Person Re-Identification

Edge-Aware Graph Matching Network for Part-Based Semantic Segmentation

Learnable Depth-Sensitive Attention for Deep RGB-D Saliency Detection with Multi-modal Fusion Architecture Search

Cross-Domain Gated Learning for Domain Generalization

Finite Aperture Stereo

Guided Hyperspectral Image Denoising with Realistic Data

Weakly-Supervised Action Localization, and Action Recognition Using Global–Local Attention of 3D CNN

Zero-Shot Learning on 3D Point Cloud Objects and Beyond

DnS: Distill-and-Select for Efficient and Accurate Video Indexing and Retrieval

Exploring the Semi-Supervised Video Object Segmentation Problem from a Cyclic Perspective

Explainability of Deep Vision-Based Autonomous Driving Systems: Review and Challenges

EAN: Event Adaptive Network for Enhanced Action Recognition

One-Shot Object Affordance Detection in the Wild

Understanding Synonymous Referring Expressions via Contrastive Features

Class-Difficulty Based Methods for Long-Tailed Visual Recognition

Learning Sequence Representations by Non-local Recurrent Neural Memory

Structured Binary Neural Networks for Image Recognition

Deep Image Deblurring: A Survey

Learning to Detect Semantic Boundaries with Image-Level Class Labels

Data-Driven Restoration of Digital Archaeological Pottery with Point Cloud Analysis

Multispectral Photometric Stereo for Spatially-Varying Spectral Reflectances

Pairwise Alignment of Archaeological Fragments Through Morphological Characterization of Fracture Surfaces

Twin Contrastive Learning for Online Clustering

Surgical Tool Datasets for Machine Learning Research: A Survey

Feature Matching via Motion-Consistency Driven Probabilistic Graphical Model

Self-Supervised Monocular Depth and Motion Learning in Dynamic Scenes: Semantic Prior to Rescue

ISHIGAKI Retrieval System Using 3D Shape Matching and Combinatorial Optimization

Learning Cooperative Neural Modules for Stylized Image Captioning

3DPointCaps++: Learning 3D Representations with Capsule Networks

Learning to Prompt for Vision-Language Models