Computer Vision - ECCV 2024

Computer Vision - ECCV 2024

18th European Conference, Milan, Italy, September 29-October 4, 2024, Proceedings, Part XIX

Leonardis, Ales; Ricci, Elisa; Varol, Guel; Roth, Stefan; Russakovsky, Olga; Sattler, Torsten

Springer International Publishing AG

01/2025

445

Mole

9783031726545

Pré-lançamento - envio 15 a 20 dias após a sua edição

Descrição não disponível.
NeuSDFusion: A Spatial-Aware Generative Model for 3D Shape Completion, Reconstruction, and Generation.- AID-AppEAL: Automatic Image Dataset and Algorithm for Content Appeal Enhancement and Assessment Labeling.- SEDiff: Structure Extraction for Domain Adaptive Depth Estimation via Denoising Diffusion Models.- Quantized Prompt for Efficient Generalization of Vision-Language Models.- Online Temporal Action Localization with Memory-Augmented Transformer.- Efficient Cascaded Multiscale Adaptive Network for Image Restoration.- MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model.- Occlusion-Aware Seamless Segmentation.- OpenKD: Opening Prompt Diversity for Zero- and Few-shot Keypoint Detection.- Referring Atomic Video Action Recognition.- Agent3D-Zero: An Agent for Zero-shot 3D Understanding.- Stream Query Denoising for Vectorized HD-Map Construction.- SAGS: Structure-Aware 3D Gaussian Splatting.- Spherical Linear Interpolation and Text-Anchoring for Zero-shot Composed Image Retrieval.- OneRestore: A Universal Restoration Framework for Composite Degradation.- Beat-It: Beat-Synchronized Multi-Condition 3D Dance Generation.- SkyMask: Attack-agnostic Robust Federated Learning with Fine-grained Learnable Masks.- Bag of Tricks to Boost Adversarial Transferability.- RePOSE: 3D Human Pose Estimation via Spatio-Temporal Depth Relational Consistency.- Pixel-GS Density Control with Pixel-aware Gradient for 3D Gaussian Splatting.- WorldPose: A World Cup Dataset for Global 3D Human Pose Estimation.- A Unified Framework for Gradient-based Saliency Map Generation of Black-box Models.- Language-Driven 6-DoF Grasp Detection Using Negative Prompt Guidance.- COIN-Matting: Confounder Intervention for Image Matting.- SHINE: Saliency-aware HIerarchical NEgative Ranking for Compositional Temporal Grounding.- Audio-driven Talking Face Generation with Stabilized Synchronization Loss.- Propose, Assess, Search: Harnessing LLMs for Goal-Oriented Planning in Instructional Videos.
artificial intelligence;computer networks;computer systems;computer vision;education;Human-Computer Interaction (HCI);image analysis;image coding;image processing;image reconstruction;image segmentation;learning;machine learning;object recognition;pattern recognition;reconstruction;signal processing;software engineering