Recent posts

CLIP

Review: Learning Transferable Visual Models From Natural Language Supervision

ViT

Review: An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale

DETR

Review: End-to-End Object Detection with Transformers

EfficientDet

Review: EfficientDet: Scalable and Efficient Object Detection