Visual Attention Network
VAN
NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis
CBAM: Convolutional Block Attention Module
Multimodal Deep Learning for Robust RGB-D Object Recognition
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows Review
Learning Transferable Visual Models From Natural Language Supervision Review
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale Review
End-to-End Object Detection with Transformers Review
EfficientDet: Scalable and Efficient Object Detection Review
You Only Look Once: Unified, Real-Time Object Detection Review
Mask R-CNN Review
Video Retargeting Techniques
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
Fast R-CNN
Rich feature hierarchies for accurate object detection and semantic segmentation
ImageNet Classification with Deep Convolutional Neural Networks
End-to-End Object Detection with Transformers