Research Intelligence

Live arXiv AI Data Stream

cs.CV

Apr 2, 2026

EventHub: Data Factory for Generalizable Event-Based Stereo Networks without Active Sensors

We propose EventHub, a novel framework for training deep event stereo networks without ground-truth annotations from costly active sensors, relying instead on standard color images. From these images, we derive either both proxy annotations and proxy events, through state-of-the-art novel view synthesis techniques, or only proxy annotations when the images are already paired with event data. Using the training set generated by our data factory, we repurpose state-of-the-art stereo models from the RGB literature to process event data, obtaining new event stereo models with unprecedented generalization capabilities. Experiments on widely used event stereo datasets support the effectiveness of EventHub and show how the same data distillation mechanism can improve the accuracy of RGB stereo foundation models in challenging conditions such as nighttime scenes.

Luca Bartolomei, Fabio Tosi, Matteo Poggi, +2 more

ActionParty: Multi-Subject Action Binding in Generative Video Games

Generative World Renderer

Modulate-and-Map: Crossmodal Feature Mapping with Cross-View Modulation for 3D Anomaly Detection

Steerable Visual Representations

Grounded Token Initialization for New Vocabulary in LMs for Generative Recommendation

Beyond Referring Expressions: Scenario Comprehension Visual Grounding

Batched Contextual Reinforcement: A Task-Scaling Law for Efficient Reasoning

Large-scale Codec Avatars: The Unreasonable Effectiveness of Large-scale Avatar Pretraining

Stop Wandering: Efficient Vision-Language Navigation via Metacognitive Reasoning

A Simple Baseline for Streaming Video Understanding

Beyond the Assistant Turn: User Turn Generation as a Probe of Interaction Awareness in Language Models

Topological Effects in Neural Network Field Theory

go-$m$HC: Direct Parameterization of Manifold-Constrained Hyper-Connections via Generalized Orthostochastic Matrices

VOID: Video Object and Interaction Deletion

Taming the Exponential: A Fast Softmax Surrogate for Integer-Native Edge Inference

AdamFlow: Adam-based Wasserstein Gradient Flows for Surface Registration in Medical Imaging

Omni123: Exploring 3D Native Foundation Models with Limited 3D Data by Unifying Text to 2D and 3D Generation

Unifying Group-Relative and Self-Distillation Policy Optimization via Sample Routing

Deep Neural Network Based Roadwork Detection for Autonomous Driving

Novel Memory Forgetting Techniques for Autonomous AI Agents: Balancing Relevance and Efficiency

The Self Driving Portfolio: Agentic Architecture for Institutional Asset Management

De Jure: Iterative LLM Self-Refinement for Structured Extraction of Regulatory Rules

Crystalite: A Lightweight Transformer for Efficient Crystal Modeling

SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization

Modular Energy Steering for Safe Text-to-Image Generation with Foundation Models

Model-Based Reinforcement Learning for Control under Time-Varying Dynamics

Retrieval-Augmented Question Answering over Scientific Literature for the Electron-Ion Collider

Best-Arm Identification with Noisy Actuation

SPAR: Single-Pass Any-Resolution ViT for Open-vocabulary Segmentation

Smoothing the Landscape: Causal Structure Learning via Diffusion Denoising Objectives

BVFLMSP: Bayesian Vertical Federated Learning for Multimodal Survival with Privacy

(PAC-)Learning state machines from data streams: A generic strategy and an improved heuristic (Extended version)

UAV-Track VLA: Embodied Aerial Tracking via Vision-Language-Action Models

Generative AI Spotlights the Human Core of Data Science: Implications for Education

Do Emotions in Prompts Matter? Effects of Emotional Framing on Large Language Models