Browse Subject Headings
Computer Vision - ECCV 2024 : 18th European Conference, Milan, Italy, September 29-October 4, 2024, Proceedings, Part LXXVI
Computer Vision - ECCV 2024 : 18th European Conference, Milan, Italy, September 29-October 4, 2024, Proceedings, Part LXXVI
Click to enlarge
ISBN No.: 9783031731150
Pages: lxxxv, 487
Year: 202410
Format: Trade Paper
Price: $ 110.39
Dispatch delay: Dispatched between 7 to 15 days
Status: Available

WTS: A Pedestrian-Centric Traffic Video Dataset for Fine-grained Spatial-Temporal Understanding.- Spiking Wavelet Transformer.- WAVE: Warping DDIM Inversion Features for Zero-shot Text-to-Video Editing.- PDT Uav Target Detection Dataset for Pests and Diseases Tree.- Hypernetworks for Generalizable BRDF Representation.- Photon Inhibition for Energy-Efficient Single-Photon Imaging.- COD: Learning Conditional Invariant Representation for Domain Adaptation Regression.- RANRAC: Robust Neural Scene Representations via Random Ray Consensus.


- LayerDiff: Exploring Text-guided Multi-layered Composable Image Synthesis via Layer-Collaborative Diffusion Model.- Characterizing Model Robustness via Natural Input Gradients.- UpFusion: Novel View Diffusion from Unposed Sparse View Observations.- Four Ways to Improve Verbo-visual Fusion for Dense 3D Visual Grounding.- SIMBA: Split Inference - Mechanisms, Benchmarks and Attacks.- Tuning-Free Image Customization with Image and Text Guidance.- FairDomain: Achieving Fairness in Cross-Domain Medical Image Segmentation and Classification.- Emerging Property of Masked Token for Effective Pre-training.


- DQ-DETR: DETR with Dynamic Query for Tiny Object Detection.- Track2Act: Predicting Point Tracks from Internet Videos enables Generalizable Robot Manipulation.- SWAG: Splatting in the Wild images with Appearance-conditioned Gaussians.- Gaussian in the wild: 3D Gaussian Splatting for Unconstrained Image Collections.- Few-shot Defect Image Generation based on Consistency Modeling.- Taming CLIP for Fine-grained and Structured Visual Understanding of Museum Exhibits.- CLIP-DPO: Vision-Language Models as a Source of Preference for Fixing Hallucinations in LVLMs.- Masked Motion Prediction with Semantic Contrast for Point Cloud Sequence Learning.


- Prompt-Based Test-Time Real Image Dehazing: A Novel Pipeline.- Video Editing via Factorized Diffusion Distillation.- Trackastra: Transformer-based cell tracking for live-cell microscopy.


To be able to view the table of contents for this publication then please subscribe by clicking the button below...
To be able to view the full description for this publication then please subscribe by clicking the button below...
Browse Subject Headings