Computer Vision - ECCV 2024

Computer Vision - ECCV 2024

18th European Conference, Milan, Italy, September 29-October 4, 2024, Proceedings, Part XXXVIII

Ricci, Elisa; Russakovsky, Olga; Leonardis, Ales; Sattler, Torsten; Roth, Stefan; Varol, Guel

Springer International Publishing AG

10/2024

499

Mole

9783031729195

15 a 20 dias

Descrição não disponível.
Tri^{2}-plane: Thinking Head Avatar via Feature Pyramid.- ControlCap: Controllable Region-level Captioning.- Free Lunch for Gait Recognition: A Novel Relation Descriptor.- SegVG: Transferring Object Bounding Box to Segmentation for Visual Grounding.- Adaptive Correspondence Scoring for Unsupervised Medical Image Registration.- MaxFusion: Plug&Play Multi-Modal Generation in Text-to-Image Diffusion Models.- Watch Your Steps: Local Image and Scene Editing by Text Instructions.- Forget More to Learn More: Domain-specific Feature Unlearning for Semi-supervised and Unsupervised Domain Adaptation.- 3x2: 3D Object Part Segmentation by 2D Semantic Correspondences.- Idea2Img: Iterative Self-Refinement with GPT-4V for Automatic Image Design and Generation.- Human-in-the-Loop Visual Re-ID for Population Size Estimation.- SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation.- PointNeRF++: A multi-scale, point-based Neural Radiance Field.- A Semantic Space is Worth 256 Language Descriptions: Make Stronger Segmentation Models with Descriptive Properties.- UMG-CLIP: A Unified Multi-Granularity Vision Generalist for Open-World Understanding.- Fast View Synthesis of Casual Videos with Soup-of-Planes.- Adaptive Human Trajectory Prediction via Latent Corridors.- Video Question Answering with Procedural Programs.- DGR-MIL: Exploring Diverse Global Representation in Multiple Instance Learning for Whole Slide Image Classification.- TexGen: Text-Guided 3D Texture Generation with Multi-view Sampling and Resampling.- C2C: Component-to-Composition Learning for Zero-Shot Compositional Action Recognition.- LLMGA: Multimodal Large Language Model based Generation Assistant.- Put Myself in Your Shoes: Lifting the Egocentric Perspective from Exocentric Videos.- Shape from Heat Conduction.- An Adaptive Screen-Space Meshing Approach for Normal Integration.- Parrot: Pareto-optimal Multi-Reward Reinforcement Learning Framework for Text-to-Image Generation.- HandDGP: Camera-Space Hand Mesh Prediction with Differentiable Global Positioning.
artificial intelligence;computer networks;computer systems;computer vision;education;Human-Computer Interaction (HCI);image analysis;image coding;image processing;image reconstruction;image segmentation;learning;machine learning;object recognition;pattern recognition;reconstruction;signal processing;software engineering