360 Scene Dataset
Overview
The QoEVAVE Scenes Database provides an initial audiovisual database consisting of 12 scenes capturing real-life nature and urban settings. The maximum video resolution is 7680x3840 (8K) at 60 frames per second, with 4th-order Ambisonics spatial audio (4OA). All video sequences are recorded with a target duration of 60 seconds and are designed to represent real-life settings for systematically evaluating various dimensions of uni-/multimodal perception, cognition, behavior, and quality of experience (QoE) in a controlled virtual environment. This database serves as high-quality reference material for the QoE community, with an equal focus on auditory and visual sensory information. For more information, please see the publication below.
Recorded Scenes
You can download individual scenes from each of the available scene pages, or view the download list below. On each scene page, you will find the following information:
- Location information of recording.
- Scene version notes describing variations in multiple 'takes'.
- Preview link to YouTube (uses 1st-order Ambisonics).
- Download links for 4th-order Ambisonics audio, 8K video file, or muxed audiovisual file with 1st-order Ambisonics.
- Spatial / Temporal indexing plots on video data.
Capture
| Media | Device | Description |
|---|---|---|
| Audio | mh acoustics Eigenmike | 32-channel spherical microphone array capable of 4th-order Ambisonics output. |
| Video | Insta360 Pro 2 | 360° video camera capable of 8K video output. Comprises six F2.4 fisheye lenses, each capturing 4K video at up to 120 Mbps. |
Specifications
| Information | Description |
|---|---|
| Video Encoding | ffvhuff |
| Video Resolution | 7680x3840 |
| Projection Map | Equirectangular |
| Video FPS | 59.94 |
| Audio Sample Rate and Format | 48,000 Hz; 24-bit PCM |
| 4th-Order Ambisonics | Individual .wav files of 4th-order Ambisonics in ACN channel ordering with SN3D normalization. The scene representation has a -90° rotational offset relative to the video files, for playback with Unity. Visit Help for more information. Labelled as ambiX4 in file names. |
| 1st-Order Ambisonics | 1st-Order Ambisonics in ACN channel ordering with SN3D normalization. Encoded using AAC and muxed with the video files into an .MP4 container. Pre-processed with Google's spatial media metadata injector for uploading to YouTube. Labelled as ambiX1 in file names. |
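As a rough illustration of the ACN channel ordering named in the table above, the sketch below computes how many channels a full-sphere Ambisonics signal of a given order carries, and where a spherical-harmonic component sits in ACN order. The function names are hypothetical and not part of any database tooling; the formulas are the standard ACN definitions, under which a 4th-order file would be expected to hold 25 channels and a 1st-order file 4.

```python
import math


def ambisonic_channel_count(order: int) -> int:
    """Number of channels in a full-sphere Ambisonics signal of this order."""
    if order < 0:
        raise ValueError("order must be non-negative")
    return (order + 1) ** 2


def acn_index(degree: int, index: int) -> int:
    """ACN channel number for spherical-harmonic degree l and index m."""
    if degree < 0 or abs(index) > degree:
        raise ValueError("require degree >= 0 and -degree <= index <= degree")
    return degree * degree + degree + index


def acn_to_degree_index(acn: int) -> tuple[int, int]:
    """Inverse of acn_index: recover (degree, index) from an ACN channel number."""
    if acn < 0:
        raise ValueError("ACN channel number must be non-negative")
    degree = math.isqrt(acn)
    return degree, acn - degree * degree - degree


if __name__ == "__main__":
    print(ambisonic_channel_count(4))  # 25 channels for 4th order (ambiX4)
    print(ambisonic_channel_count(1))  # 4 channels for 1st order (ambiX1)
    print(acn_to_degree_index(24))     # (4, 4): last channel of a 4th-order signal
```

A quick sanity check when loading a downloaded .wav file is to compare its channel count against `ambisonic_channel_count(4)`.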
Audio post-production — all files are processed with IEM VST plugins:
- 500 ms fade-in and fade-out
- 60 Hz high-pass filter
- 10 kHz −3 dB notch filter
- Omnidirectional compression for make-up gain (max +5 dB)
- Ambisonic B-Format rotation for audiovisual alignment
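The rotation step above, and the -90° offset noted for the 4th-order files, both amount to a rotation of the sound field about the vertical axis. The following is a minimal sketch of such a yaw rotation for one ACN-ordered sample frame, written in plain Python. It is not the IEM plugin's implementation. The function name is hypothetical, and the sign convention (a positive angle moves a frontal source towards the left, i.e. towards positive azimuth) is an assumption; whether +90° or -90° aligns the audio with a given video player should be verified by listening.

```python
import math


def rotate_yaw(frame: list[float], angle_deg: float) -> list[float]:
    """Rotate one ACN-ordered Ambisonics frame about the vertical (z) axis.

    Components (l, +m) and (l, -m) share the same SN3D factor, so each pair
    rotates like a 2-D vector by the angle m * alpha, independent of order.
    """
    order = math.isqrt(len(frame)) - 1
    if (order + 1) ** 2 != len(frame):
        raise ValueError("frame length must be (order + 1) ** 2")
    alpha = math.radians(angle_deg)
    rotated = list(frame)  # m == 0 components are unchanged by a yaw rotation
    for degree in range(1, order + 1):
        center = degree * degree + degree  # ACN index of the m == 0 component
        for m in range(1, degree + 1):
            c, s = math.cos(m * alpha), math.sin(m * alpha)
            cos_part = frame[center + m]  # component varying as cos(m * azimuth)
            sin_part = frame[center - m]  # component varying as sin(m * azimuth)
            rotated[center + m] = c * cos_part - s * sin_part
            rotated[center - m] = s * cos_part + c * sin_part
    return rotated


if __name__ == "__main__":
    # First-order frame (W, Y, Z, X) for a unit source straight ahead.
    front = [1.0, 0.0, 0.0, 1.0]
    print(rotate_yaw(front, 90.0))  # energy moves from X (index 3) to Y (index 1)
```

Applying the function to every sample frame of a file rotates the whole recording; rotating by an angle and then by its negative returns the original signal up to floating-point error.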
Download
Use the lists below to download audio, video, or muxed files for all scenes at once. For scene-specific notes and version history, visit the individual scene pages.
4th-order Ambisonics (4OA) in ambiX format (ACN, SN3D)
- Badminton01: Badminton01_A_ambiX4_24bit.wav
- Badminton03: Badminton03_A_ambiX4_24bit.wav
- Badminton04: Badminton04_A_ambiX4_24bit.wav
- Badminton05: Badminton05_A_ambiX4_24bit.wav
- BuskingCity01: BuskingCity01_A_ambiX4_24bit.wav
- BuskingCity02: BuskingCity02_A_ambiX4_24bit.wav
- BuskingUnderpass01: BuskingUnderpass01_A_ambiX4_24bit.wav
- BuskingUnderpass02: BuskingUnderpass02_A_ambiX4_24bit.wav
- BuskingUnderpass03: BuskingUnderpass03_A_ambiX4_24bit.wav
- BuskingUnderpass04: BuskingUnderpass04_A_ambiX4_24bit.wav
- BuskingUnderpass05: BuskingUnderpass05_A_ambiX4_24bit.wav
- Cheerleading01: Cheerleading01_A_ambiX4_24bit.wav
- ConferenceCenter01: Conference01_A_ambiX4_24bit.wav
- ConferenceParticipant02: Conference02_A_ambiX4_24bit.wav
- ConferenceParticipant03: Conference03_A_ambiX4_24bit.wav
- ForestWalk01: ForestWalk01_A_ambiX4_24bit.wav
- ForestWalk02: ForestWalk02_A_ambiX4_24bit.wav
- ForestWalk03: ForestWalk03_A_ambiX4_24bit.wav
- Lake01: Lake01_A_ambiX4_24bit.wav
- ParkFountains01: ParkFountains01_A_ambiX4_24bit.wav
- River01: River01_A_ambiX4_24bit.wav
- Skateboarding01: Skateboarding01_A_ambiX4_24bit.wav
- Skateboarding03: Skateboarding03_A_ambiX4_24bit.wav
- Skateboarding04: Skateboarding04_A_ambiX4_24bit.wav
- Skateboarding05: Skateboarding05_A_ambiX4_24bit.wav
- Train01: Train01_A_ambiX4_24bit.wav
Raw 8K resolution video files.
- Badminton01: Badminton01_V_7680x3840_60fps_60s.mkv
- Badminton03: Badminton03_V_7680x3840_60fps_60s.mkv
- Badminton04: Badminton04_V_7680x3840_60fps_60s.mkv
- Badminton05: Badminton05_V_7680x3840_60fps_64s.mkv
- BuskingCity01: BuskingCity01_V_7680x3840_60fps_51s.mkv
- BuskingCity02: BuskingCity02_V_7680x3840_60fps_63s.mkv
- BuskingUnderpass01: BuskingUnderPass01_V_7680x3840_60fps_59s.mkv
- BuskingUnderpass02: BuskingUnderPass02_V_7680x3840_60fps_63s.mkv
- BuskingUnderpass03: BuskingUnderPass03_V_7680x3840_60fps_63s.mkv
- BuskingUnderpass04: BuskingUnderPass04_V_7680x3840_60fps_61s.mkv
- BuskingUnderpass05: BuskingUnderPass05_V_7680x3840_60fps_61s.mkv
- Cheerleading01: CheerLeading01_V_7680x3840_60fps_68s.mkv
- ConferenceCenter01: Conference01_V_7680x3840_60fps_60s.mkv
- ConferenceParticipant02: Conference02_V_7680x3840_60fps_80s.mkv
- ConferenceParticipant03: Conference03_V_7680x3840_60fps_83s.mkv
- ForestWalk01: ForestWalk01_V_7680x3840_60fps_60s.mkv
- ForestWalk02: ForestWalk02_V_7680x3840_60fps_60s.mkv
- ForestWalk03: ForestWalk03_V_7680x3840_60fps_60s.mkv
- Lake01: Lake01_V_7680x3840_60fps_60s.mkv
- ParkFountains01: ParkFountains01_V_7680x3840_60fps_60s.mkv
- River01: River01_V_7680x3840_60fps_60s.mkv
- Skateboarding01: Skateboarding01_V_7680x3840_60fps_60s.mkv
- Skateboarding03: Skateboarding03_V_7680x3840_60fps_60s.mkv
- Skateboarding04: Skateboarding04_V_7680x3840_60fps_60s.mkv
- Skateboarding05: Skateboarding05_V_7680x3840_60fps_60s.mkv
- Train01: Train01_V_7680x3840_60fps_60s.mkv
Encoded audiovisual files uploaded to YouTube. Uses 1st-order Ambisonics (1OA) with AAC encoding and 8K resolution video.
- Badminton01: Badminton01_AV_7680x3840_60fps_60s_25Mbps_ambiX1_24bit.mp4
- Badminton03: Badminton03_AV_7680x3840_60fps_60s_25Mbps_ambiX1_24bit.mp4
- Badminton04: Badminton04_AV_7680x3840_60fps_60s_25Mbps_ambiX1_24bit.mp4
- Badminton05: Badminton05_AV_7680x3840_60fps_64s_25Mbps_ambiX1_24bit.mp4
- BuskingCity01: BuskingCity01_AV_7680x3840_60fps_51s_25Mbps_ambiX1_24bit.mp4
- BuskingCity02: BuskingCity02_AV_7680x3840_60fps_63s_25Mbps_ambiX1_24bit.mp4
- BuskingUnderpass01: BuskingUnderpass01_AV_7680x3840_60fps_59s_25Mbps_ambiX1_24bit.mp4
- BuskingUnderpass02: BuskingUnderpass02_AV_7680x3840_60fps_63s_25Mbps_ambiX1_24bit.mp4
- BuskingUnderpass03: BuskingUnderpass03_AV_7680x3840_60fps_63s_25Mbps_ambiX1_24bit.mp4
- BuskingUnderpass04: BuskingUnderpass04_AV_7680x3840_60fps_61s_25Mbps_ambiX1_24bit.mp4
- BuskingUnderpass05: BuskingUnderpass05_AV_7680x3840_60fps_61s_25Mbps_ambiX1_24bit.mp4
- Cheerleading01: Cheerleading01_AV_7680x3840_60fps_68s_25Mbps_ambiX1_24bit.mp4
- ConferenceCenter01: Conference01_AV_7680x3840_60fps_60s_25Mbps_ambiX1_24bit.mp4
- ConferenceParticipant02: Conference02_AV_7680x3840_60fps_80s_25Mbps_ambiX1_24bit.mp4
- ConferenceParticipant03: Conference03_AV_7680x3840_60fps_83s_25Mbps_ambiX1_24bit.mp4
- ForestWalk01: ForestWalk01_AV_7680x3840_60fps_60s_25Mbps_ambiX1_24bit.mp4
- ForestWalk02: ForestWalk02_AV_7680x3840_60fps_60s_25Mbps_ambiX1_24bit.mp4
- ForestWalk03: ForestWalk03_AV_7680x3840_60fps_60s_25Mbps_ambiX1_24bit.mp4
- Lake01: Lake01_AV_7680x3840_60fps_60s_25Mbps_ambiX1_24bit.mp4
- ParkFountains01: ParkFountains01_AV_7680x3840_60fps_60s_25Mbps_ambiX1_24bit.mp4
- River01: River01_AV_7680x3840_60fps_60s_25Mbps_ambiX1_24bit.mp4
- Skateboarding01: Skateboarding01_AV_7680x3840_60fps_60s_25Mbps_ambiX1_24bit.mp4
- Skateboarding03: Skateboarding03_AV_7680x3840_60fps_60s_25Mbps_ambiX1_24bit.mp4
- Skateboarding04: Skateboarding04_AV_7680x3840_60fps_60s_25Mbps_ambiX1_24bit.mp4
- Skateboarding05: Skateboarding05_AV_7680x3840_60fps_63s_25Mbps_ambiX1_24bit.mp4
- Train01: Train01_AV_7680x3840_60fps_60s_25Mbps_ambiX1_24bit.mp4
File Naming Conventions
Download filenames follow a consistent pattern encoding the key properties of each file.
| Format | Pattern |
|---|---|
| 4OA Audio (.wav) | SceneName+Version · A · AmbisonicsFormat+Order · Bit-depth |
| 8K Video (.mkv) | SceneName+Version · V · Resolution · FPS · Duration |
| Muxed AV (.mp4) | SceneName+Version · AV · Resolution · FPS · Duration · Bitrate · AmbisonicsFormat+Order · Bit-depth |
Example: Badminton01_A_ambiX4_24bit.wav — Scene Badminton, version 01, audio-only (A), 4th-order AmbiX format (ambiX4), 24-bit depth.
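The naming pattern above can be parsed mechanically, which may help when batch-processing downloads. The sketch below is one possible parser, inferred only from the filenames listed on this page; the function name `parse_scene_filename` and the returned field names are hypothetical, and files that deviate from the pattern are rejected with an error.

```python
import re

# Fields are separated by underscores; the video and audio groups are optional
# so that the same expression covers .wav, .mkv and .mp4 names.
_FILENAME_PATTERN = re.compile(
    r"^(?P<scene>[A-Za-z]+)(?P<version>\d{2})"
    r"_(?P<media>AV|A|V)"
    r"(?:_(?P<width>\d+)x(?P<height>\d+)_(?P<fps>\d+)fps_(?P<duration_s>\d+)s)?"
    r"(?:_(?P<bitrate_mbps>\d+)Mbps)?"
    r"(?:_ambiX(?P<ambisonics_order>\d+)_(?P<bit_depth>\d+)bit)?"
    r"\.(?P<extension>wav|mkv|mp4)$"
)

_TEXT_FIELDS = {"scene", "version", "media", "extension"}


def parse_scene_filename(filename: str) -> dict:
    """Split a download filename into its fields; absent fields are None."""
    match = _FILENAME_PATTERN.match(filename)
    if match is None:
        raise ValueError(f"unrecognised filename: {filename!r}")
    fields = {}
    for key, value in match.groupdict().items():
        if value is None or key in _TEXT_FIELDS:
            fields[key] = value  # keep text fields (and missing ones) as-is
        else:
            fields[key] = int(value)  # numeric fields become integers
    return fields


if __name__ == "__main__":
    print(parse_scene_filename("Badminton01_A_ambiX4_24bit.wav"))
    print(parse_scene_filename("Badminton05_V_7680x3840_60fps_64s.mkv"))
```

The version is kept as a string so that the leading zero in values such as `01` is preserved.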
Publication
When using the QoEVAVE Scenes Database, please cite the following work:
@inproceedings{robotham2022,
title = {Audiovisual Database with 360° Video and Higher-Order Ambisonics Audio for Perception, Cognition, Behavior, and QoE Evaluation Research},
author = {Robotham, Thomas and Singla, Ashutosh and Rummukainen, Olli S. and Raake, Alexander and Habets, Emanuël A. P.},
year = {2022},
booktitle = {14th International Conference on Quality of Multimedia Experience},
address={Lippstadt, Germany},
pages={1--6},
doi={10.1109/QoMEX55416.2022.9900893}
}