Computer Vision and Pattern Recognition

StreamBridge: Turning Your Offline Video Large Language Model into a
  Proactive Streaming Assistant

StreamBridge: Turning Your Offline Video Large...

Computer Vision and Pattern Recognition
Avatar
librarian
19 views
Flow-GRPO: Training Flow Matching Models via Online RL

Flow-GRPO: Training Flow Matching Models via O...

Computer Vision and Pattern Recognition
Avatar
Jie Liu
13 views
DEIM: DETR with Improved Matching for Fast Convergence

DEIM: DETR with Improved Matching for Fast Con...

Computer Vision and Pattern Recognition
Avatar
huang shihua
56 views
DEIM: DETR with Improved Matching for Fast Convergence

DEIM: DETR with Improved Matching for Fast Con...

Computer Vision and Pattern Recognition
Avatar
huang shihua
56 views
HelloMeme: Integrating Spatial Knitting Attentions to Embed High-Level
  and Fidelity-Rich Conditions in Diffusion Models

HelloMeme: Integrating Spatial Knitting Attent...

Computer Vision and Pattern Recognition
Avatar
Songkey Z
96 views
Chat-Edit-3D: Interactive 3D Scene Editing via Text Prompts

Chat-Edit-3D: Interactive 3D Scene Editing via...

Computer Vision and Pattern Recognition
Avatar
shuangkang fang
109 views
Kvasir-VQA: A Text-Image Pair GI Tract Dataset

Kvasir-VQA: A Text-Image Pair GI Tract Dataset

Computer Vision and Pattern Recognition
Avatar
Sushant Gautam
132 views
3D modelling of survey scene from images enhanced with a multi-exposure
  fusion

3D modelling of survey scene from images enhan...

Computer Vision and Pattern Recognition
Avatar
DIEGO FRANCISCO GARCIA MOLINA
203 views
High-level camera-LiDAR fusion for 3D object detection with machine
  learning

High-level camera-LiDAR fusion for 3D object d...

Computer Vision and Pattern Recognition
Avatar
DIEGO FRANCISCO GARCIA MOLINA
184 views
Complete End-To-End Low Cost Solution To a 3D Scanning System with
  Integrated Turntable

Complete End-To-End Low Cost Solution To a 3D ...

Computer Vision and Pattern Recognition
Avatar
DIEGO FRANCISCO GARCIA MOLINA
183 views
3D Reconstruction Using a Linear Laser Scanner and a Camera

3D Reconstruction Using a Linear Laser Scanner...

Computer Vision and Pattern Recognition
Avatar
DIEGO FRANCISCO GARCIA MOLINA
183 views
3D Scanning: A Comprehensive Survey

3D Scanning: A Comprehensive Survey

Computer Vision and Pattern Recognition
Avatar
DIEGO FRANCISCO GARCIA MOLINA
178 views
Survey on 3D face reconstruction from uncalibrated images

Survey on 3D face reconstruction from uncalibr...

Computer Vision and Pattern Recognition
Avatar
DIEGO FRANCISCO GARCIA MOLINA
168 views
Towards high-throughput 3D insect capture for species discovery and
  diagnostics

Towards high-throughput 3D insect capture for ...

Computer Vision and Pattern Recognition
Avatar
DIEGO FRANCISCO GARCIA MOLINA
163 views
Dual-Hybrid Attention Network for Specular Highlight Removal

Dual-Hybrid Attention Network for Specular Hig...

Computer Vision and Pattern Recognition
Avatar
绪行 陈
182 views
Pilgrims Face Recognition Dataset -- HUFRD

Pilgrims Face Recognition Dataset -- HUFRD

Computer Vision and Pattern Recognition
Avatar
muhammedheebboo
205 views
ARCH2S: Dataset, Benchmark and Challenges for Learning Exterior
  Architectural Structures from Point Clouds

ARCH2S: Dataset, Benchmark and Challenges for ...

Computer Vision and Pattern Recognition
Avatar
Daniel Cheung
191 views
ARCH2S: Dataset, Benchmark and Challenges for Learning Exterior
  Architectural Structures from Point Clouds

ARCH2S: Dataset, Benchmark and Challenges for ...

Computer Vision and Pattern Recognition
Avatar
Daniel Cheung
147 views
Benchmarking Detection Transfer Learning with Vision Transformers

Benchmarking Detection Transfer Learning with ...

Computer Vision and Pattern Recognition
Avatar
wa su
201 views
Rethinking Event-based Optical Flow: Iterative Deblurring as an
  Alternative to Correlation Volumes

Rethinking Event-based Optical Flow: Iterative...

Computer Vision and Pattern Recognition
Avatar
Yilun Wu
253 views
6-DOF GraspNet: Variational Grasp Generation for Object Manipulation

6-DOF GraspNet: Variational Grasp Generation f...

Computer Vision and Pattern Recognition
Avatar
Mark nth
209 views
Mapping industrial poultry operations at scale with deep learning and
  aerial imagery

Mapping industrial poultry operations at scale...

Computer Vision and Pattern Recognition
Avatar
Andrew Jiranek
212 views
Brain-inspired algorithms for processing of visual data

Brain-inspired algorithms for processing of vi...

Computer Vision and Pattern Recognition
Avatar
Isaak Bruno
204 views
High-Quality Facial Geometry and Appearance Capture at Home

High-Quality Facial Geometry and Appearance Ca...

Computer Vision and Pattern Recognition
Avatar
hello-2
205 views
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows

Swin Transformer: Hierarchical Vision Transfor...

Computer Vision and Pattern Recognition
Avatar
sven-prevrhal
246 views
Always Clear Days: Degradation Type and Severity Aware All-In-One
  Adverse Weather Removal

Always Clear Days: Degradation Type and Severi...

Computer Vision and Pattern Recognition
Avatar
Yu-Wei Chen
218 views
Diversifying Spatial-Temporal Perception for Video Domain Generalization

Diversifying Spatial-Temporal Perception for V...

Computer Vision and Pattern Recognition
Avatar
Linky Unknown
181 views
Instance Segmentation under Occlusions via Location-aware Copy-Paste
  Data Augmentation

Instance Segmentation under Occlusions via Loc...

Computer Vision and Pattern Recognition
Avatar
Son Nguyen
223 views
Understanding Parameter Saliency via Extreme Value Theory

Understanding Parameter Saliency via Extreme V...

Computer Vision and Pattern Recognition
Avatar
Shuo Wang
205 views
Shape-centered Representation Learning for Visible-Infrared Person
  Re-identification

Shape-centered Representation Learning for Vis...

Computer Vision and Pattern Recognition
Avatar
Shuang Li
200 views
Qilin-Med-VL: Towards Chinese Large Vision-Language Model for General
  Healthcare

Qilin-Med-VL: Towards Chinese Large Vision-Lan...

Computer Vision and Pattern Recognition
Avatar
Junling Liu
228 views
FaultSeg Swin-UNETR: Transformer-Based Self-Supervised Pretraining Model
  for Fault Recognition

FaultSeg Swin-UNETR: Transformer-Based Self-Su...

Computer Vision and Pattern Recognition
Avatar
Zeren Zhang
253 views