Kernel Diffusion: An Alternate Approach to Blind Deconvolution.- MUSES: The Multi-Sensor Semantic Perception Dataset for Driving under Uncertainty.- Discovering Novel Actions from Open World Egocentric Videos with Object-Grounded Visual Commonsense Reasoning.- Bidirectional Progressive Transformer for Interaction Intention Anticipation.- Reinforcement Learning Meets Visual Odometry.- Bucketed Ranking-based Losses for Efficient Training of Object Detectors.- Robustness Tokens: Towards Adversarial Robustness of Transformers.- RSL-BA: Rolling Shutter Line Bundle Adjustment.
- DecentNeRFs: Decentralized Neural Radiance Fields from Crowdsourced Images.- DreamMesh: Jointly Manipulating and Texturing Triangle Meshes for Text-to-3D Generation.- Unveiling Typographic Deceptions: Insights of the Typographic Vulnerability in Large Vision-Language Models.- N2F2: Hierarchical Scene Understanding with Nested Neural Feature Fields.- ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept Extraction.- PairingNet: A Learning-based Pair-searching and -matching Network for Image Fragments.- Skeleton-based Group Activity Recognition via Spatial-Temporal Panoramic Graph.- Towards Multimodal Open-Set Domain Generalization and Adaptation through Self-supervision.
- ReCON: Training-Free Acceleration for Text-to-Image Synthesis with Retrieval of Concept Prompt Trajectories.- AMES: Asymmetric and Memory-Efficient Similarity Estimation for Instance-level Retrieval.- TCAN: Animating Human Images with Temporally Consistent Pose Guidance using Diffusion Models.- 3D Hand Sequence Recovery from Real Blurry Images and Event Stream.- GlobalPointer: Large-Scale Plane Adjustment with Bi-Convex Relaxation.- Dissolving Is Amplifying: Towards Fine-Grained Anomaly Detection.- StyleCity: Large-Scale 3D Urban Scenes Stylization.- ViG-Bias: Visually Grounded Bias Discovery and Mitigation.
- DiffBIR: Toward Blind Image Restoration with Generative Diffusion Prior.- Assessing Sample Quality via the Latent Space of Generative Models.- Relightable Neural Actor with Intrinsic Decomposition and Pose Control.