On Measuring and Controlling the Spectral Bias of the Deep Image Prior
Singularity Analysis for the Perspective-Four and Five-Line Problems
Globally Optimal Linear Model Fitting with Unit-Norm Constraint
Beyond Dents and Scratches: Logical Constraints in Unsupervised Anomaly Detection and Localization
Delving into the Effectiveness of Receptive Fields: Learning Scale-Transferrable Architectures for Practical Object Detection
Pre-Training Without Natural Images
SRT3D: A Sparse Region-Based 3D Object Tracking Approach for the Real World
Atmospheric Turbulence Removal in Long-Range Imaging Using a Data-Driven-Based Approach
GhostNets on Heterogeneous Devices via Cheap Operations
Open-Set Adversarial Defense with Clean-Adversarial Mutual Learning
Visual Attention Consistency for Human Attribute Recognition
Object-Based Visual Camera Pose Estimation From Ellipsoidal Model and 3D-Aware Ellipse Prediction
Weakly-Supervised Semantic Segmentation with Visual Words Learning and Hybrid Pooling
Learning JPEG Compression Artifacts for Image Manipulation Detection and Localization
H-SegMed: A Hybrid Method for Prostate Segmentation in TRUS Images via Improved Closed Principal Curve and Improved Enhanced Machine Learning
Matching Visual Features to Hierarchical Semantic Topics for Image Paragraph Captioning
Wide-Area Crowd Counting: Multi-view Fusion Networks for Counting in Large Scenes
I3CL: Intra- and Inter-Instance Collaborative Learning for Arbitrary-Shaped Scene Text Detection
3D Semantic Scene Completion: A Survey
Generative Sketch Healing
Occluded Video Instance Segmentation: A Benchmark
Learning Inverse Depth Regression for Pixelwise Visibility-Aware Multi-View Stereo Networks
Efficient Burst Raw Denoising with Variance Stabilization and Multi-frequency Denoising Network
Preface to the Special Issue on Human Pose, Motion, Activities and Shape in 3D
Towards Compact 1-bit CNNs via Bayesian Learning
AdaStereo: An Efficient Domain-Adaptive Stereo Matching Approach
Bridging Composite and Real: Towards End-to-End Deep Image Matting
CODON: On Orchestrating Cross-Domain Attentions for Depth Super-Resolution
Action2video: Generating Videos of Human 3D Actions
SensatUrban: Learning Semantics from Urban-Scale Photogrammetric Point Clouds
Personalized Convolution for Face Recognition
Estimating 3D Motion and Forces of Human–Object Interactions from Internet Videos
Real-Time Multi-Car Localization and See-Through System
AutoScale: Learning to Scale for Crowd Counting
Perspectives and Prospects on Transformer Architecture for Cross-Modal Tasks with Language and Vision
Adaptive Deep Disturbance-Disentangled Learning for Facial Expression Recognition
Robust Geodesic Regression
Instance-Aware Scene Layout Forecasting
Continuous and Diverse Image-to-Image Translation via Signed Attribute Vectors
Joint Classification and Regression for Visual Tracking with Fully Convolutional Siamese Networks
Procrustes Analysis with Deformations: A Closed-Form Solution by Eigenvalue Decomposition
Inextensible Surface Reconstruction Under Small Relative Deformations from Distributed Angle Measurements
Editorial for Special Issue on Computer Vision in the Wild
Physical Representation Learning and Parameter Identification from Video Using Differentiable Physics
Inferring Bias and Uncertainty in Camera Calibration
Rescaling Egocentric Vision: Collection, Pipeline and Challenges for EPIC-KITCHENS-100
Learning a Robust Part-Aware Monocular 3D Human Pose Estimator via Neural Architecture Search
Dual-Attention-Guided Network for Ghost-Free High Dynamic Range Imaging
Distribution-Aware Margin Calibration for Semantic Segmentation in Images
View-Invariant, Occlusion-Robust Probabilistic Embedding for Human Pose
Joint Bilateral-Resolution Identity Modeling for Cross-Resolution Person Re-Identification
Memory-Efficient Hierarchical Neural Architecture Search for Image Restoration
Semantic Edge Detection with Diverse Deep Supervision
Kyushu Decorative Tumuli Project: From e-Heritage to Cyber-Archaeology
Deep Bingham Networks: Dealing with Uncertainty and Ambiguity in Pose Estimation
Leveraging Blur Information for Plenoptic Camera Calibration
Countering Malicious DeepFakes: Survey, Battleground, and Horizon
Attribute Prototype Network for Any-Shot Learning
Beyond Monocular Deraining: Parallel Stereo Deraining Network Via Semantic Prior
Nonblind Image Deconvolution via Leveraging Model Uncertainty in An Untrained Deep Neural Network
REVISE: A Tool for Measuring and Mitigating Bias in Visual Datasets
Investigating the Role of Image Retrieval for Visual Localization
A Survey on Long-Tailed Visual Recognition
Correction to: On the Arbitrary-Oriented Object Detection: Classification Based Approaches Revisited
Shape and Albedo Recovery by Your Phone using Stereoscopic Flash and No-Flash Photography
Scaling Up Sign Spotting Through Sign Language Dictionaries
Dual Convolutional Neural Networks for Low-Level Vision
Sparse Black-Box Video Attack with Reinforcement Learning
3D Shape Analysis Through a Quantum Lens: the Average Mixing Kernel Signature
Facial Kinship Verification: A Comprehensive Review and Outlook
Curriculum Learning: A Survey
Pose Measurement at Small Scale by Spectral Analysis of Periodic Patterns
4D Temporally Coherent Multi-Person Semantic Reconstruction and Segmentation
Correction to: Learning Contrastive Representation for Semantic Correspondence
RePCD-Net: Feature-Aware Recurrent Point Cloud Denoising Network
Learning 3D Semantic Scene Graphs with Instance Embeddings
CMSNet: Deep Color and Monochrome Stereo
Learning Scene Dynamics from Point Cloud Sequences
Multi-frame Motion Segmentation by Combining Two-Frame Results
Learning to Detect Instance-Level Salient Objects Using Complementary Image Labels
Wide-Angle Image Rectification: A Survey
A Unified B-Spline Framework for Scale-Invariant Keypoint Detection
From Individual to Whole: Reducing Intra-class Variance by Feature Aggregation
Network Adjustment: Channel and Block Search Guided by Resource Utilization Ratio
A Survey on Intrinsic Images: Delving Deep into Lambert and Beyond
Consensus-Based Optimization for 3D Human Pose Estimation in Camera Coordinates
Correction to: Instance-Aware Scene Layout Forecasting
Correction to: AdaStereo: An Efficient Domain-Adaptive Stereo Matching Approach
SoftPool++: An Encoder–Decoder Network for Point Cloud Completion
iMoCap: Motion Capture from Internet Videos
Learning Self-supervised Low-Rank Network for Single-Stage Weakly and Semi-supervised Semantic Segmentation
Displacement-Invariant Cost Computation for Stereo Matching
Unsupervised Multi-View CNN for Salient View Selection and 3D Interest Point Detection
RIConv++: Effective Rotation Invariant Convolutions for 3D Point Clouds Deep Learning
Weakly Supervised Moment Localization with Decoupled Consistent Concept Prediction
Disentangled Inference for GANs With Latently Invertible Autoencoder
Interpreting Face Inference Models Using Hierarchical Network Dissection
Learning Contrastive Representation for Semantic Correspondence
Eliminating Temporal Illumination Variations in Whisk-broom Hyperspectral Imaging
Spatially-Consistent Feature Matching and Learning for Heritage Image Analysis
On the Arbitrary-Oriented Object Detection: Classification Based Approaches Revisited
Human Action Recognition and Prediction: A Survey
Information-Theoretic Odometry Learning
Improving Image Segmentation with Boundary Patch Refinement
A Deep Learning Approach to Clustering Visual Arts
Semantic Contrastive Embedding for Generalized Zero-Shot Learning
PageNet: Towards End-to-End Weakly Supervised Page-Level Handwritten Chinese Text Recognition
Artificial Intelligence for Dunhuang Cultural Heritage Protection: The Project and the Dataset
Efficient Joint-Dimensional Search with Solution Space Regularization for Real-Time Semantic Segmentation
Dynamical Deep Generative Latent Modeling of 3D Skeletal Motion
Snowvision: Segmenting, Identifying, and Discovering Stamped Curve Patterns from Fragments of Pottery
Cartoon Image Processing: A Survey
Learning Degradation-Invariant Representation for Robust Real-World Person Re-Identification
Edge-Aware Graph Matching Network for Part-Based Semantic Segmentation
Learnable Depth-Sensitive Attention for Deep RGB-D Saliency Detection with Multi-modal Fusion Architecture Search
Cross-Domain Gated Learning for Domain Generalization
Finite Aperture Stereo
Guided Hyperspectral Image Denoising with Realistic Data
Weakly-Supervised Action Localization, and Action Recognition Using Global–Local Attention of 3D CNN
Zero-Shot Learning on 3D Point Cloud Objects and Beyond
DnS: Distill-and-Select for Efficient and Accurate Video Indexing and Retrieval
Exploring the Semi-Supervised Video Object Segmentation Problem from a Cyclic Perspective
Explainability of Deep Vision-Based Autonomous Driving Systems: Review and Challenges
EAN: Event Adaptive Network for Enhanced Action Recognition
One-Shot Object Affordance Detection in the Wild
Understanding Synonymous Referring Expressions via Contrastive Features
Class-Difficulty Based Methods for Long-Tailed Visual Recognition
Learning Sequence Representations by Non-local Recurrent Neural Memory
Structured Binary Neural Networks for Image Recognition
Deep Image Deblurring: A Survey
Learning to Detect Semantic Boundaries with Image-Level Class Labels
Data-Driven Restoration of Digital Archaeological Pottery with Point Cloud Analysis
Multispectral Photometric Stereo for Spatially-Varying Spectral Reflectances
Pairwise Alignment of Archaeological Fragments Through Morphological Characterization of Fracture Surfaces
Twin Contrastive Learning for Online Clustering
Surgical Tool Datasets for Machine Learning Research: A Survey
Feature Matching via Motion-Consistency Driven Probabilistic Graphical Model
Self-Supervised Monocular Depth and Motion Learning in Dynamic Scenes: Semantic Prior to Rescue
ISHIGAKI Retrieval System Using 3D Shape Matching and Combinatorial Optimization
Learning Cooperative Neural Modules for Stylized Image Captioning
3DPointCaps++: Learning 3D Representations with Capsule Networks
Learning to Prompt for Vision-Language Models