Tony Shin
Pix2Seq: A Language Modeling Framework for Object Detection
3 years ago - 7:33
SDtCS
Abhinav Rao - Pix2Seq: A Language Modeling Framework for Object Detection (PRS 1.1)
3 years ago - 31:40
Machine Learning Group
2022/06/05 - (paper) Pix2Seq: A New Language Interface for Object Detection
2 years ago - 47:22
ThinkNotClearzh
【点论文】295 Pix2seq: A Language Modeling Framework for Object Detection
2 years ago - 16:51
만끽 MaanGeek
PR-348: Pix2seq: A Language Modeling Framework for Object Detection
3 years ago - 31:13
DataMListic
Object Detection Part 7: Detection Transformers (DETR), Object Queries
1 year ago - 4:28
딥러닝논문읽기모임
[2022 ICLR] Pix2Seq
2 years ago - 21:41
Machine Learning Group
In this channel we present all the virtual meetings recorded. Each video is about one session of the explanation of one topic of ...
@machinelearninggroup3450 subscribers
Harry C Blum
nanogpt for Speaker Diarization
1 year ago - 24:59
SDtCS
We are a bunch of undergrads united by one mission, to help push innovation toward a better future. And we hold a firm belief in ...
@sdtcs subscribers
만끽 MaanGeek
@maangeek subscribers
Microsoft Research
MDETR: Modulated Detection for End-to-End Multi-Modal Understanding
3 years ago - 1:13:28
Cerebras Systems
ICLR 2023 Workshop on Sparsity in Neural Networks - Introduction
2 years ago - 4:44
OpenDriveLab
ICLR23 SR4AD-01 Introduction and opening remarks(Li Chen)
2 years ago - 7:35
Alex Novak - AI Guide
❓ How to use AI Agents - 13 WAYS to GET STABLE PROFIT with AI | AI Agents Tutorial | AI Tutorial
-
Yi-Ting Chen
Multimodal Object Detection via Probabilistic Ensembling
3 years ago - 1:28
ML in PL
Lucas Beyer - Computer Vision in the Age of LLMs | ML in PL 2024
6 months ago - 49:53
ayanCV
[CVPR 2021] "Semi-Supervised Learning for Fine-Grained Sketch Based Image Retrieval", CVPR 2021.
4 years ago - 4:57
Edward Hu
ICLR 2023 Spotlight: Planning Goals for Exploration
2 years ago - 10:52
Yi-Ting Chen
Multimodal Object detection via Bayesian Fusion
4 years ago - 1:28
Artificial Images
RunwayML and Google Colab: Week 6 (Next Frame Prediction and Style Transfer)
4 years ago - 1:01:40
Nicolai Nielsen
Open-Source AI Camera System – Pose, Segmentation, Detection & Camera Management
1 day ago - 1:00
AI Engineer
Design like Karpathy is watching — Zeke Sikelianos, Replicate
6 days ago - 19:26
Prateek Keserwani
Quadbox: Quadrilateral Bounding Box Based Scene Text Detection Using Vector Regression
3 years ago - 1:28
Munachiso N.
Intuitions on Lifelong Machine Learning and Open World Object Detection
4 years ago - 23:28
Rose E Wang
ICLR 2022 Oral: Language modeling via stochastic processes
3 years ago - 12:43
Shunli Wang
ACM-MM'21 (Oral) TSA-Net: Tube Self-Attention Network for Action Quality Assessment
3 years ago - 11:57
AIRLab
[2D Perception]Demo - Pick and Place system with 6-DOF Pose Estimation using DOPE
2 years ago - 1:49
Edan Meyer
This Embodied LLM is...
2 years ago - 32:50
shil111
Medical Imaging - my approach how to tackle an object detection task
Intro ...
3 years ago - 15:46
Hello User
Ting Chen - Mesenchymal Niche Regulated RegionalEpithelial Regeneration
3 years ago - 28:06
JoonHo LEE
PR-347: Semi-Supervised Learning of Visual Features by Non-Parametrically Predicting... (PAWS)
3 years ago - 46:06
Microsoft Research
[VLP Tutorial @ CVPR 2022] Image-Text Pre-training Part II
3 years ago - 40:58
Scene the Ella
PR-350: Encoding in Style: a StyleGAN Encoder for Image-to-Image Translation (CVPR 2021, 한국어 리뷰)
3 years ago - 18:59
Vlad Golyanik (4DQV)
[ICLR 2023 Oral] QuAnt: Quantum Annealing with Learnt Couplings
2 years ago - 9:39
Pysource
Build a real-time multi camera tracking system | with Python
9 days ago - 17:42
Fahd Mirza
AI on EKS (AIoEKS) with vLLM and RayServe
5 days ago - 8:45
박성남
PR-335: Self-supervised Learning for Large-scale Item Recommendations
3 years ago - 32:32
Digital Spaceport
Local Ai Qwen3 Coder 480B 1MILLION CTX Runs Like ????
17 hours ago - 0:24
Wessel Bruinsma
Autoregressive Conditional Neural Processes (ICLR 2023)
2 years ago - 5:05
Scene the Ella
PR-342: Playable Video Generation (CVPR 2021 (Oral), 한국어 리뷰)
3 years ago - 20:31
Patrick Grady
[ECCV 2022 Oral] PressureVision: Estimating Hand Pressure from a Single RGB Image
2 years ago - 11:10
Scene the Ella
PR-354: Data-driven Interior Plan Generation for Residential Buildings & Graph2Plan
3 years ago - 29:58
DS Talks Siberia
21.11.2023 InstructDiffusion: A Generalist Modeling Interface for Vision Tasks
1 year ago - 59:39
Artificial Images
RunwayML and Google Colab: Week 6 (Next Frame Prediction and Style Transfer)
4 years ago - 1:01:40
Sunghoon Joo
PR-351: Adaptive Aggregation Networks for Class-Incremental Learning
3 years ago - 30:26
딥러닝논문읽기모임
[2022 ICLR] MobileViT : Light-weight, general-purpose, and Mobile-friendly Vision Transformer
3 years ago - 16:42
Zeta Alpha
ICLR 2023 Impressions from Kigali
2 years ago - 3:24
Tony Shin
[DeepReader] R-CNN
4 years ago - 4:50
Jaejun Yoo
PR-349: Adversarial Generation of Continuous Images
3 years ago - 42:23
SupportVectors
[Paper Reading] PaliGemma
Streamed 11 months ago - 1:06:10
딥러닝논문읽기모임
A Quantitative Analysis of Statistical and Graph-Based Term Weighting Schemes for Keyword Extraction
2 years ago - 16:00
Doyup Lee
PR-352: ImageBART: Bidirectional Context with Multinomial Diffusion for AR Image Synthesis
3 years ago - 45:43
Jeon Eddie
PR-346: Super Tickets in Pre-Trained Language Models
3 years ago - 32:38
yunssun
PR-345: LayerCAM: Exploring Hierarchical Class Activation Maps for Localization
3 years ago - 34:03
딥러닝논문읽기모임
Self-Supervised Learning based on Heat Equation
2 years ago - 8:51
Pyresearch
YOLOv12 for Meat Freshness Detection: A Step-by-Step Guide
6 days ago - 19:45
Sunghoon Joo
PR-339: Maintaining discrimination and fairness in class incremental learning
3 years ago - 29:47
Doyup Lee
PR-341: Involution: Inverting the Inherence of Convolution for Visual Recognition
3 years ago - 33:33
Hesham Asem
NLP 7.4.1 Tree Neural Network الجزء الأول
3 years ago - 10:17
딥러닝논문읽기모임
[Google Research] Minerva - Solving Quantitative Reasoning Problems with Language Models
2 years ago - 14:43
딥러닝논문읽기모임
[CVPR2023] ImageBind One Embedding Space To Bind Them All
1 year ago - 15:19
Weights & Biases
Unified stack of Kubernetes, Ray, PyTorch, and vLLM.
1 day ago - 1:22
何冠穎
Localizing piglets in pig farm with oriented bounding box
4 years ago - 0:14
Sungchul Kim
PR-343: Semi-Supervised Semantic Segmentation with Cross Pseudo Supervision
3 years ago - 28:06
JinWon Lee (DeepTube)
PR-344: A Battle of Network Structures: An Empirical Study of CNN, Transformer, and MLP
3 years ago - 29:34
DS Talks Siberia
19.12.23 Sequential Modeling Enables Scalable Learning for Large Vision Models
1 year ago - 46:05
조경진
[유투브 딥러닝 논문읽기 모임] NeurlIPS Hard Negative Mixing for Contrastive learning
2 years ago - 9:22
A Data Odyssey
Occlusion in Practice with Python and Captum | XAI for Computer Vision
11 days ago - 15:13
AISchool
멀티모달 LLM - PaliGemma 모델을 활용해서 물체 검출(Object Detection) 하기
9 months ago - 15:46