Spherical Vision Transformers for Audio-Visual Saliency Prediction in 360° Videos

Omnidirectional videos (ODVs) are redefining viewer experiences in virtual reality (VR) by offering an unprecedented full field-of-view (FOV). This study extends the domain of saliency prediction to 360° environments, addressing the complexities of spherical distortion and the integration of...

Bibliographic Details
Published in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2026-01, Vol. 48 (1), p. 329-345
Main Authors: Cokelek, Mert; Ozsoy, Halit; Imamoglu, Nevrez; Ozcinar, Cagri; Ayhan, Inci; Erdem, Erkut; Erdem, Aykut
Format: Article
Language: English