Computer Vision - ECCV 2024

Computer Vision - ECCV 2024

18th European Conference, Milan, Italy, September 29-October 4, 2024, Proceedings, Part XXIV

Roth, Stefan; Varol, Guel; Sattler, Torsten; Leonardis, Ales; Russakovsky, Olga; Ricci, Elisa

Springer International Publishing AG

12/2024

501

Mole

9783031726903

Pré-lançamento - envio 15 a 20 dias após a sua edição

Descrição não disponível.
Elevating All Zero-Shot Sketch-Based Image Retrieval Through Multimodal Prompt Learning.- Improving Knowledge Distillation via Regularizing Feature Direction and Norm.- 3DFG-PIFu: 3D Feature Grids for Human Digitization from Sparse Views.- Lazy Diffusion Transformer for Interactive Image Editing.- Non-parametric Sensor Noise Modeling and Synthesis.- Stripe Observation Guided Inference Cost-free Attention Mechanism.- The Nerfect Match: Exploring NeRF Features for Visual Localization.- ComboVerse: Compositional 3D Assets Creation Using Spatially-Aware Diffusion Guidance.- Robust Calibration of Large Vision-Language Adapters.- Leveraging Hierarchical Feature Sharing for Efficient Dataset Condensation.- Improving Domain Generalization in Self-Supervised Monocular Depth Estimation via Stabilized Adversarial Training.- milliFlow: Scene Flow Estimation on mmWave Radar Point Cloud for Human Motion Sensing.- denoiSplit: a method for joint microscopy image splitting and unsupervised denoising.- AugDETR: Improving Multi-scale Learning for Detection Transformer.- Spherical World-Locking for Audio-Visual Localization in Egocentric Videos.- SPIN: Hierarchical Segmentation with Subpart Granularity in Natural Images.- SIGMA: Sinkhorn-Guided Masked Video Modeling.- Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis.- Distribution Alignment for Fully Test-Time Adaptation with Dynamic Online Data Streams.- Divide and Fuse: Body Part Mesh Recovery from Partially Visible Human Images.- Understanding Physical Dynamics with Counterfactual World Modeling.- MIGS: Multi-Identity Gaussian Splatting via Tensor Decomposition.- 4Diff: 3D-Aware Diffusion Model for Third-to-First Viewpoint Translation.- Improving Point-based Crowd Counting and Localization Based on Auxiliary Point Guidance.- Nymeria: A Massive Collection of Egocentric Multi-modal Human Motion in the Wild.- DreamStruct: Understanding Slides and User Interfaces via Synthetic Data Generation.- SemTrack: A Large-scale Dataset for Semantic Tracking in the Wild.
artificial intelligence;computer networks;computer systems;computer vision;education;Human-Computer Interaction (HCI);image analysis;image coding;image processing;image reconstruction;image segmentation;learning;machine learning;object recognition;pattern recognition;reconstruction;signal processing;software engineering