Tony Shin
VeRA: Vector-based Random Matrix Adaptation
1:16
Tony Shin
PaLI-3 Vision Language Models: Smaller, Faster, Stronger
0:50
Tony Shin
HyperAttention: Long-context Attention in Near-Linear Time
1:25
Tony Shin
Fast Feedforward Networks
1:15
Tony Shin
Nougat: Neural Optical Understanding for Academic Documents
1:14
Tony Shin
Retentive Network: A Successor to Transformer for Large Language Models
1:05
Tony Shin
LLaVA: Visual Instruction Tuning
1:09
Tony Shin
DeepReader Live Stream
Tony Shin
BloombergGPT: A Large Language Model for Finance
1:56
Tony Shin
ImageBind: One Embedding Space To Bind Them All
3:02
Tony Shin
Segment Anything
2:00
Tony Shin
Are Emergent Abilities of Large Language Models a Mirage?
2:17
Tony Shin
Synthetic Data Boosts ImageNet Classification
1:12
Tony Shin
Unlimiformer: Long-Range Transformers with Unlimited Length Input
0:47
Tony Shin
[Tutorial] Image Super Resolution without Photoshop
23:34
Tony Shin
YOLO9000: Better, Faster, Stronger
10:32
Tony Shin
NÜWA: Visual Synthesis Pre-training for Neural visUal World creAtion
15:10
Tony Shin
Florence: A New Foundation Model for Computer Vision
10:27
Tony Shin
DSSD: Deconvolutional Single Shot Detector
8:03
Tony Shin
MAE: Masked Autoencoders Are Scalable Vision Learners
8:02
Tony Shin
PVANet: Deep but Lightweight Neural Networks for Real-time Object Detection
5:01
Tony Shin
SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers
5:36
Tony Shin
R-FCN: Object Detection via Region-based Fully Convolutional Networks
6:32
Tony Shin
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
5:28
Tony Shin
Pix2Seq: A Language Modeling Framework for Object Detection
7:33
Tony Shin
Improved Regularization of Convolutional Neural Networks with Cutout
2:41
Tony Shin
VICReg: Variance-Invariance-Covariance Regularization for Self-Supervised Learning
7:13
Tony Shin
SSD: Single Shot MultiBox Detector
3:23
Tony Shin
Barlow Twins: Self-Supervised Learning via Redundancy Reduction
4:33
Tony Shin
MLP-Mixer: An all-MLP Architecture for Vision
5:22
Tony Shin
YOLO: Unified, Real-Time Object Detection
4:09
Tony Shin
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
4:26
Tony Shin
OHEM: Training Region-based Object Detectors with Online Hard Example Mining
3:41
Tony Shin
Swin Transformer Object Detection Demo
6:31
Tony Shin
Faster R-CNN
6:30
Tony Shin
Fast R-CNN
5:27
Tony Shin
AttentionNet: Aggregating Weak Directions for Accurate Object Detection
6:33
Tony Shin
DeepBox: Learning Objectness with Convolutional Networks
8:59
Tony Shin
MR-CNN: Object detection via a multi-region & semantic segmentation-aware CNN model
8:09
Tony Shin
[DeepReader] SPP-Net: Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition
6:07
Tony Shin
[DeepReader] MultiBox: Scalable Object Detection using Deep Neural Networks
4:17
Tony Shin
[DeepReader] OverFeat: Integrated Recognition, Localization and Detection using Conv. Networks
5:15
Tony Shin
[DeepReader] R-CNN
4:50
Tony Shin
[Tutorial] Training End-to-end Object Detection with Transformer (DETR) model on custom dataset
25:16
Tony Shin
[DeepReader] Informative Dropout for Robust Representation Learning: A Shape-bias Perspective
6:36
Tony Shin
[DeepReader] DeLighT: Very Deep and Light-weight Transformer
6:45
Tony Shin
[DeepReader] Contrastive Learning for Unpaired Image-to-Image Translation
4:40
Tony Shin
[DeepReader] Big Bird: Transformers for Longer Sequences
6:03
Tony Shin
[DeepReader] MiCo: Mixup Co-Training for Semi-Supervised Domain Adaptation
6:34
Tony Shin
[DeepReader] PP-YOLO: An Effective and Efficient Implementation of Object Detector
7:12
Tony Shin
[DeepReader] DeepSVG: A Hierarchical Generative Network for Vector Graphics Animation
7:10
Tony Shin
[DeepReader] Transformers are RNNs
5:30
Tony Shin
RepPoints: Point Set Representation for Object Detection
5:14
Tony Shin
RetrieveGAN: Image Synthesis via Differentiable Patch Retrieval
5:52
Tony Shin
PointRend: Image Segmentation as Rendering
6:04
Tony Shin
Neural Architecture Design for GPU-Efficient Networks
9:11
Tony Shin
Locally Masked Convolution for Auto-regressive Models
4:43
Tony Shin
Rethinking the Truly Unsupervised Image-to-Image Translation
6:43
Tony Shin
Generative Pretraining from Pixels
6:13
Tony Shin
Disentangled Non-local Neural Networks
8:13
Tony Shin
DETR: End-to-End Object Detection with Transformers
6:14
Tony Shin
CornerNet: Detecting Objects as Paired Keypoints
5:03
Tony Shin
EfficientDet: Scalable and Efficient Object Detection
7:24