A collection of open-source projects for CVPR 2020 papers. Issues sharing additional CVPR 2020 open-source projects are welcome.
- CNN
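- Image Classification
- Object Detection
- 3D Object Detection
- Video Object Detection
- Object Tracking
- Semantic Segmentation
- Instance Segmentation
- Panoptic Segmentation
- Video Object Segmentation
- Superpixel Segmentation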
- NAS
- GAN
- Re-ID
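- 3D Point Clouds (including semantic segmentation, etc.)
- Face Recognition
- Face Detection
- Face Anti-Spoofing
- Facial Expression Recognition
- Face Frontalization
- Human Pose Estimation
- Scene Text Detection
- Scene Text Recognition
- Super-Resolution
- Model Compression
- Model Pruning
- Video Understanding / Action Recognition
- Crowd Counting
- Depth Estimation
- 6D Object Pose Estimation
- Hand Pose Estimation
- Denoising
- Deblurring
- Feature Point Detection and Description
- Visual Question Answering
- Vision-and-Language Navigation
- Video Compression
- Video Frame Interpolation
- Style Transfer
- Human-Object Interaction (HOI) Detection
- Trajectory Prediction
- Motion Prediction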
- HDR
- Datasets
- Others
- Acceptance Unconfirmed
Rethinking Depthwise Separable Convolutions: How Intra-Kernel Correlations Lead to Improved MobileNets
Spatially Attentive Output Layer for Image Classification
Instance-aware, Context-focused, and Memory-efficient Weakly Supervised Object Detection
Bridging the Gap Between Anchor-based and Anchor-free Detection via Adaptive Training Sample Selection
BiDet: An Efficient Binarized Object Detector
Harmonizing Transferability and Discriminability for Adapting Object Detectors
CentripetalNet: Pursuing High-quality Keypoint Pairs for Object Detection
Hit-Detector: Hierarchical Trinity Architecture Search for Object Detection
EfficientDet: Scalable and Efficient Object Detection
3DSSD: Point-based 3D Single Stage Object Detector
- CVPR 2020 Oral
Disp R-CNN: Stereo 3D Object Detection via Shape Prior Guided Instance Disparity Estimation
End-to-End Pseudo-LiDAR for Image-Based 3D Object Detection
DSGN: Deep Stereo Geometry Network for 3D Object Detection
LiDAR-based Online 3D Video Object Detection with Graph-based Message Passing and Spatiotemporal Transformer Attention
PV-RCNN: Point-Voxel Feature Set Abstraction for 3D Object Detection
Point-GNN: Graph Neural Network for 3D Object Detection in a Point Cloud
Memory Enhanced Global-Local Aggregation for Video Object Detection
- Paper: https://arxiv.org/abs/2003.12063
- Code: https://github.com/Scalsol/mega.pytorch
Siam R-CNN: Visual Tracking by Re-Detection
- Homepage: https://www.vision.rwth-aachen.de/page/siamrcnn
- Paper: https://arxiv.org/abs/1911.12836
- Paper 2: https://www.vision.rwth-aachen.de/media/papers/192/siamrcnn.pdf
- Code: https://github.com/VisualComputingInstitute/SiamR-CNN
Cooling-Shrinking Attack: Blinding the Tracker with Imperceptible Noises
High-Performance Long-Term Tracking with Meta-Updater
AutoTrack: Towards High-Performance Visual Tracking for UAV with Automatic Spatio-Temporal Regularization
Probabilistic Regression for Visual Tracking
MAST: A Memory-Augmented Self-supervised Tracker
Siamese Box Adaptive Network for Visual Tracking
Self-supervised Equivariant Attention Mechanism for Weakly Supervised Semantic Segmentation
Temporally Distributed Networks for Fast Video Segmentation
Context Prior for Scene Segmentation
Strip Pooling: Rethinking Spatial Pooling for Scene Parsing
Cars Can't Fly up in the Sky: Improving Urban-Scene Segmentation via Height-driven Attention Networks
Learning Dynamic Routing for Semantic Segmentation
PolarMask: Single Shot Instance Segmentation with Polar Representation
- Paper: https://arxiv.org/abs/1909.13226
- Code: https://github.com/xieenze/PolarMask
- Write-up: https://zhuanlan.zhihu.com/p/84890413
CenterMask : Real-Time Anchor-Free Instance Segmentation
Deep Snake for Real-Time Instance Segmentation
Mask Encoding for Single Shot Instance Segmentation
Pixel Consensus Voting for Panoptic Segmentation
- Paper: https://arxiv.org/abs/2004.01849
- Code: not yet released
BANet: Bidirectional Aggregation Network with Occlusion Handling for Panoptic Segmentation
- Paper: https://arxiv.org/abs/2003.14031
- Code: https://github.com/Mooonside/BANet
State-Aware Tracker for Real-Time Video Object Segmentation
Learning Fast and Robust Target Models for Video Object Segmentation
Learning Video Object Segmentation from Unlabeled Videos
Superpixel Segmentation with Fully Convolutional Networks
Neural Architecture Search for Lightweight Non-Local Networks
Rethinking Performance Estimation in Neural Architecture Search
- Paper: in preparation
- Code: https://github.com/zhengxiawu/rethinking_performance_estimation_in_NAS
- Write-up: https://www.zhihu.com/question/372070853/answer/1035234510
CARS: Continuous Evolution for Efficient Neural Architecture Search
Learning to Cartoonize Using White-box Cartoon Representations
- Paper: https://github.com/SystemErrorWang/White-box-Cartoonization/blob/master/paper/06791.pdf
- Homepage: https://systemerrorwang.github.io/White-box-Cartoonization/
- Code: https://github.com/SystemErrorWang/White-box-Cartoonization
GAN Compression: Efficient Architectures for Interactive Conditional GANs
Watch your Up-Convolution: CNN Based Generative Deep Neural Networks are Failing to Reproduce Spectral Distributions
Pose-guided Visible Part Matching for Occluded Person ReID
Weakly supervised discriminative feature learning with state information for person identification
Grid-GCN for Fast and Scalable Point Cloud Learning
FPConv: Learning Local Flattening for Point Convolution
Weakly Supervised Semantic Point Cloud Segmentation: Towards 10X Fewer Labels
PolarNet: An Improved Grid Representation for Online LiDAR Point Clouds Semantic Segmentation
Learning to Segment 3D Point Clouds in 2D Image Space
D3Feat: Joint Learning of Dense Detection and Description of 3D Local Features
RPM-Net: Robust Point Matching using Learned Features
Cascaded Refinement Network for Point Cloud Completion
CurricularFace: Adaptive Curriculum Learning Loss for Deep Face Recognition
Learning Meta Face Recognition in Unseen Domains
- Paper: https://arxiv.org/abs/2003.07733
- Code: https://github.com/cleardusk/MFR
- Write-up: https://mp.weixin.qq.com/s/YZoEnjpnlvb90qSI3xdJqQ
Searching Central Difference Convolutional Networks for Face Anti-Spoofing
Suppressing Uncertainties for Large-Scale Facial Expression Recognition
Rotate-and-Render: Unsupervised Photorealistic Face Rotation from Single-View Images
HigherHRNet: Scale-Aware Representation Learning for Bottom-Up Human Pose Estimation
The Devil is in the Details: Delving into Unbiased Data Processing for Human Pose Estimation
- Paper: https://arxiv.org/abs/1911.07524
- Code: https://github.com/HuangJunJie2017/UDP-Pose
- Write-up: https://zhuanlan.zhihu.com/p/92525039
Distribution-Aware Coordinate Representation for Human Pose Estimation
Self-Supervised 3D Human Pose Estimation via Part Guided Novel Image Synthesis
Compressed Volumetric Heatmaps for Multi-Person 3D Pose Estimation
VIBE: Video Inference for Human Body Pose and Shape Estimation
Back to the Future: Joint Aware Temporal Deep Learning 3D Human Pose Estimation
Cross-View Tracking for Multi-Human 3D Pose Estimation at over 100 FPS
- Paper: https://arxiv.org/abs/2003.03972
- Dataset: not yet available
PointAugment: an Auto-Augmentation Framework for Point Cloud Classification
UnrealText: Synthesizing Realistic Scene Text Images from the Unreal World
ABCNet: Real-time Scene Text Spotting with Adaptive Bezier-Curve Network
- Paper: https://arxiv.org/abs/2002.10200
- Code (to be released): https://github.com/Yuliang-Liu/bezier_curve_text_spotting
- Code (to be released): https://github.com/aim-uofa/adet
Deep Relational Reasoning Graph Network for Arbitrary Shape Text Detection
UnrealText: Synthesizing Realistic Scene Text Images from the Unreal World
ABCNet: Real-time Scene Text Spotting with Adaptive Bezier-Curve Network
Learn to Augment: Joint Data Augmentation and Network Optimization for Text Recognition
Structure-Preserving Super Resolution with Gradient Guidance
Rethinking Data Augmentation for Image Super-resolution: A Comprehensive Analysis and a New Strategy
- Paper: https://arxiv.org/abs/2004.00448
- Code: https://github.com/clovaai/cutblur
Zooming Slow-Mo: Fast and Accurate One-Stage Space-Time Video Super-Resolution
GAN Compression: Efficient Architectures for Interactive Conditional GANs
Group Sparsity: The Hinge Between Filter Pruning and Decomposition for Network Compression
HRank: Filter Pruning using High-Rank Feature Map
TEA: Temporal Excitation and Aggregation for Action Recognition
X3D: Expanding Architectures for Efficient Video Recognition
Temporal Pyramid Network for Action Recognition
Disentangling and Unifying Graph Convolutions for Skeleton-Based Action Recognition
Towards Better Generalization: Joint Depth-Pose Learning without PoseNet
3D Packing for Self-Supervised Monocular Depth Estimation
- Paper: https://arxiv.org/abs/1905.02693
- Code: https://arxiv.org/abs/1905.02693
- Demo video: https://www.bilibili.com/video/av70562892/
Domain Decluttering: Simplifying Images to Mitigate Synthetic-Real Domain Shift and Improve Depth Estimation
EPOS: Estimating 6D Pose of Objects with Symmetries
- Homepage: http://cmp.felk.cvut.cz/epos
- Paper: https://arxiv.org/abs/2004.00605
G2L-Net: Global to Local Network for Real-time 6D Pose Estimation with Embedding Vector Features
HOPE-Net: A Graph-based Model for Hand-Object Pose Estimation
Monocular Real-time Hand Shape and Motion Capture using Multi-modal Data
A Physics-based Noise Formation Model for Extreme Low-light Raw Denoising
CycleISP: Real Image Restoration via Improved Data Synthesis
Multi-Scale Progressive Fusion Network for Single Image Deraining
Cascaded Deep Video Deblurring Using Temporal Sharpness Prior
- Homepage: https://csbhr.github.io/projects/cdvd-tsp/index.html
- Paper: https://arxiv.org/abs/2004.02501
- Code: https://github.com/csbhr/CDVD-TSP
ASLFeat: Learning Local Features of Accurate Shape and Localization
VC R-CNN: Visual Commonsense R-CNN
Towards Learning a Generic Agent for Vision-and-Language Navigation via Pre-training
Learning for Video Compression with Hierarchical Quality and Recurrent Enhancement
Scene-Adaptive Video Frame Interpolation via Meta-Learning
Softmax Splatting for Video Frame Interpolation
- Homepage: http://sniklaus.com/papers/softsplat
- Paper: https://arxiv.org/abs/2003.05534
- Code: https://github.com/sniklaus/softmax-splatting
Collaborative Distillation for Ultra-Resolution Universal Style Transfer
Cascaded Human-Object Interaction Recognition
VSGNet: Spatial Attention Network for Detecting Human Object Interactions Using Graph Convolutions
Social-STGCNN: A Social Spatio-Temporal Graph Convolutional Neural Network for Human Trajectory Prediction
Collaborative Motion Prediction via Neural Motion Message Passing
MotionNet: Joint Perception and Motion Prediction for Autonomous Driving Based on Bird's Eye View Maps
Single-Image HDR Reconstruction by Learning to Reverse the Camera Pipeline
A Local-to-Global Approach to Multi-modal Movie Scene Segmentation
Deep Homography Estimation for Dynamic Scenes
Assessing Image Quality Issues for Real-World Problems
UnrealText: Synthesizing Realistic Scene Text Images from the Unreal World
PANDA: A Gigapixel-level Human-centric Video Dataset
IntrA: 3D Intracranial Aneurysm Dataset for Deep Learning
Cross-View Tracking for Multi-Human 3D Pose Estimation at over 100 FPS
- Paper: https://arxiv.org/abs/2003.03972
- Dataset: not yet available
Self-Supervised Monocular Scene Flow Estimation
Quasi-Newton Solver for Robust Non-Rigid Registration
A Local-to-Global Approach to Multi-modal Movie Scene Segmentation
DeepFLASH: An Efficient Network for Learning-based Medical Image Registration
Self-Supervised Scene De-occlusion
- Homepage: https://xiaohangzhan.github.io/projects/deocclusion/
- Paper: https://arxiv.org/abs/2004.02788
- Code: https://github.com/XiaohangZhan/deocclusion
Polarized Reflection Removal with Perfect Alignment in the Wild
- Homepage: https://leichenyang.weebly.com/project-polarized.html
- Code: https://github.com/ChenyangLEI/CVPR2020-Polarized-Reflection-Removal-with-Perfect-Alignment
Background Matting: The World is Your Green Screen
What Deep CNNs Benefit from Global Covariance Pooling: An Optimization Perspective
Look-into-Object: Self-supervised Structure Modeling for Object Recognition
- Paper: not yet available
- Code: https://github.com/JDAI-CV/LIO
Video Object Grounding using Semantic Roles in Language Description
Dynamic Hierarchical Mimicking Towards Consistent Optimization Objectives
SDFDiff: Differentiable Rendering of Signed Distance Fields for 3D Shape Optimization
- Paper: http://www.cs.umd.edu/~yuejiang/papers/SDFDiff.pdf
- Code: https://github.com/YueJiang-nj/CVPR2020-SDFDiff
On Translation Invariance in CNNs: Convolutional Layers can Exploit Absolute Spatial Location
GhostNet: More Features from Cheap Operations
AdderNet: Do We Really Need Multiplications in Deep Learning?
Deep Image Harmonization via Domain Verification
Blurry Video Frame Interpolation
Extremely Dense Point Correspondences using a Learned Feature Descriptor
- Paper: https://arxiv.org/abs/2003.00619
- Code: https://github.com/lppllppl920/DenseDescriptorLearning-Pytorch
Filter Grafting for Deep Neural Networks
- Paper: https://arxiv.org/abs/2001.05868
- Code: https://github.com/fxmeng/filter-grafting
- Paper write-up: https://www.zhihu.com/question/372070853/answer/1041569335
Action Segmentation with Joint Self-Supervised Temporal Domain Adaptation
Detecting Attended Visual Targets in Video
Deep Image Spatial Transformation for Person Image Generation
Rethinking Zero-shot Video Classification: End-to-end Training for Realistic Applications
https://github.com/charlesCXK/3D-SketchAware-SSC
https://github.com/Anonymous20192020/Anonymous_CVPR5767
https://github.com/avirambh/ScopeFlow
https://github.com/csbhr/CDVD-TSP
https://github.com/ymcidence/TBH
https://github.com/yaoyao-liu/mnemonics
https://github.com/meder411/Tangent-Images
https://github.com/KaihuaTang/Scene-Graph-Benchmark.pytorch
https://github.com/sjmoran/deep_local_parametric_filters
https://github.com/bermanmaxim/AOWS
https://github.com/dc3ea9f/look-into-object
FADNet: A Fast and Accurate Network for Disparity Estimation
- Paper: not yet released
- Code: https://github.com/HKBU-HPML/FADNet
https://github.com/rFID-submit/RandomFID (acceptance unconfirmed)
https://github.com/JackSyu/AE-MSR (acceptance unconfirmed)
https://github.com/fastconvnets/cvpr2020 (acceptance unconfirmed)
https://github.com/aimagelab/meshed-memory-transformer (acceptance unconfirmed)
https://github.com/TWSFar/CRGNet (acceptance unconfirmed)
https://github.com/CVPR-2020/CDARTS (acceptance unconfirmed)
https://github.com/anucvml/ddn-cvprw2020 (acceptance unconfirmed)
https://github.com/dl-model-recommend/model-trust (acceptance unconfirmed)
https://github.com/apratimbhattacharyya18/CVPR-2020-Corr-Prior (acceptance unconfirmed)
https://github.com/onetcvpr/O-Net (acceptance unconfirmed)
https://github.com/502463708/Microcalcification_Detection (acceptance unconfirmed)
https://github.com/anonymous-for-review/cvpr-2020-deep-smoke-machine (acceptance unconfirmed)
https://github.com/anonymous-for-review/cvpr-2020-smoke-recognition-dataset (acceptance unconfirmed)
https://github.com/cvpr-nonrigid/dataset (acceptance unconfirmed)
https://github.com/theFool32/PPBA (acceptance unconfirmed)
https://github.com/Realtime-Action-Recognition/Realtime-Action-Recognition