Computer Vision - ECCV 2024

Computer Vision - ECCV 2024

18th European Conference, Milan, Italy, September 29-October 4, 2024, Proceedings, Part XXVII

Ricci, Elisa; Roth, Stefan; Varol, Guel; Russakovsky, Olga; Sattler, Torsten; Leonardis, Ales

Springer International Publishing AG

12/2024

482

Mole

9783031733826

Pré-lançamento - envio 15 a 20 dias após a sua edição

Descrição não disponível.
SLAck: Semantic, Location, and Appearance Aware Open-Vocabulary Tracking.- Tensorial template matching for fast cross-correlation with rotations and its application for tomography.- FreeAugment: Data Augmentation Search Across All Degrees of Freedom.- Learning Representations of Satellite Images From Metadata Supervision.- I2-SLAM: Inverting Imaging Process for Robust Photorealistic Dense SLAM.- FlashTex: Fast Relightable Mesh Texturing with LightControlNet.- GS-Pose: Category-Level Object Pose Estimation via Geometric and Semantic Correspondence.- ArtVLM: Attribute Recognition Through Vision-Based Prefix Language Modeling.- PanoFree: Tuning-Free Holistic Multi-view Image Generation with Cross-view Self-Guidance.- SOS: Segment Object System for Open-World Instance Segmentation With Object Priors.- Lagrangian Hashing for Compressed Neural Field Representations.- EDformer: Transformer-Based Event Denoising Across Varied Noise Levels.- Foster Adaptivity and Balance in Learning with Noisy Labels.- MetaAug: Meta-Data Augmentation for Post-Training Quantization.- Thermal3D-GS: Physics-induced 3D Gaussians for Thermal Infrared Novel-view Synthesis.- Cross-Platform Video Person ReID: A New Benchmark Dataset and Adaptation Approach.- Unleashing the Power of Prompt-driven Nucleus Instance Segmentation.- Gaze Target Detection Based on Head-Local-Global Coordination.- 3DSA:Multi-View 3D Human Pose Estimation With 3D Space Attention Mechanisms.- Toward Tiny and High-quality Facial Makeup with Data Amplify Learning.- An Economic Framework for 6-DoF Grasp Detection.- GaussianFormer: Scene as Gaussians for Vision-Based 3D Semantic Occupancy Prediction.- Powerful and Flexible: Personalized Text-to-Image Generation via Reinforcement Learning.- AdaLog: Post-Training Quantization for Vision Transformers with Adaptive Logarithm Quantizer.- Multi-Label Cluster Discrimination for Visual Representation Learning.- Plan, Posture and Go: Towards Open-vocabulary Text-to-Motion Generation.- DAMSDet: Dynamic Adaptive Multispectral Detection Transformer with Competitive Query Selection and Adaptive Feature Fusion.
artificial intelligence;computer networks;computer systems;computer vision;education;Human-Computer Interaction (HCI);image analysis;image coding;image processing;image reconstruction;image segmentation;learning;machine learning;object recognition;pattern recognition;reconstruction;signal processing;software engineering