Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation.- Hierarchical Vision-Language Retrieval of Educational Metaverse Content in Agriculture.- Enhancing Testicular Ultrasound Image Classification Through Synthetic Data and Pretraining Strategies.- Classification of Spatial Patterns of Lymphocyte Infiltration in Gliomas from Whole Slide Imaging.- CoLoR-GAN: Continual Few-Shot Learning with Low-Rank Adaptation in Generative Adversarial Networks.- MECAD: A Multi-Expert Architecture for Continual Anomaly Detection.- RAW-Mix: Region-Aware Mixing for Unsupervised Domain Adaptation in Object Detection.- Multiple Sclerosis Classification via Random Forest Distances Robust to Missing Data.
- Revisiting Dictionaries of Key Poses for Action Representation.- Segment Anything for Satellite Imagery: A Strong Baseline and a Regional Dataset for Automatic Field Delineation.- A Handful of Data: Evaluating Few-Shot Incremental Landmark Detection.- ExDD: Explicit Dual Distribution Learning for Surface Defect Detection via Diffusion Synthesis.- When Does Pruning Benefit Vision Representations?- WARD: Weather-Aware Road Surface Condition Monitoring Dataset.- Skeleton-Based Action Recognition for the Biomechanical Risk Condition Assessment.- Truth or Lie: An Audio-Visual Approach to Deception Detection.- Momaku: A Retinal Image Annotation Platform.
- ReMAR-DS: Recalibrated Feature Learning for Metal Artifact Reduction and CT Domain Transformation.- A Study on Multimodal Foundation Models for Affective Video Prediction.- Leveraging Multi View Weak Supervision for Occlusion-Aware Multi-Human Parsing.- T-EVO: Tracking in Egovision for Online Visual Episodic Memory.- Diffusion-Detect: Synthetic and Real Data Towards a Robust Fire and Smoke Detection.- Augmented Reality in Cultural Heritage: A Dual-Model Pipeline for 3D Artwork Reconstruction.- An Investigation of Ear-EEG Signals for a Novel Biometric Authentication System.- How to Train Your Metamorphic Deep Neural Network.
- Fault Tolerant Multi-Object Tracking via Temporal Consistency Filtering.- Ego-Exo Object Correspondence by SAM2 and Cross-View Prompting.- Deep Multi-Band EEG Learning for Motor Imagery Classification with Dry Electrodes.- A Semantically-Aware Relevance Measure for Content-Based Medical Image Retrieval Evaluation.- Multimodal Deepfake Detection with Large Vision-Language Models: The State of the Art.- Ego and Exo Views for an Object-Level Human Behavior Analysis and Understanding through Tracking in Retail Spaces.- UAV-Based Photovoltaic Farm Maintenance System.- Beyond Linear Bottlenecks: Spline-Based Knowledge Distillation for Culturally Diverse Art Style Classification.
- Pre-NeRF: Evaluating Preprocessing Approaches to Mitigate Real-World Corruptions in NeRF Reconstruction.- Single-Path Precision: Differentiable Bit-Width Numeric-Format Learning for FPGA-Efficient Neural Networks.- Diversified In-Domain Synthesis with Efficient Fine Tuning for Few-Shot Classification.- Inpainting of Ancient Chinese Character Rubbings via Generative Adversarial Network.- Estimation of Particle Growth Kinetics via Physics-Informed Neural Networks.- Graph-Based Evaluation of Visual Brain Decoding from fMRI Data.- A SAM-Based Automated Schistocyte Detection Pipeline in Peripheral Blood Smear Images.- MORE: A Framework for Stable White Blood Cell Morphological Classification and Report Generation.
- How Far Can Off-the-Shelf Multimodal Large Language Models Go in Online Episodic Memory Question Answering?- Deep Transductive Learning for Person Re-Identification.- Efficient Oblique Stripe Noise Detection and Removal Using Hough Transform and Guided Filter.- Multimodal Emotion Recognition via Multilevel Fusion of Visual, Audio, and Textual Data.- Business Activity Classification Extraction from Commercial Footage: A Multimodal LLM Approach Based on CNAE (Comparable to NACE and NAICS).- Evaluating Attribute Confusion in Fashion Text-to-Image Generation.- Multi-Swimmer Drowning Detection Using a Custom Annotated Underwater Dataset and Real-Time AI.- Unravelling Neurodivergent Gaze Behaviour through Visual Attention Causal Graphs.- NeSyLAD: A Neuro Symbolic Approach for Unsupervised Logical Anomaly Detection.
- Multimodal Distillation for Video-Based Sleep Behavior Analysis.- HoSVD-NSST Framework for Secure Dual Medical ImageWatermarking Using Deep Learning.