Skip to main content

360 Scene Database

Overview

The QoEVAVE Scenes Database provides an initial audiovisual database consiting of 12 scenes capturing real-life nature and urban scenes. The maximum video resolution is 7680x3840 (8k) at 60 frames-per-second, with 4th-order Ambisonics spatial audio (4OA). All video sequences are recorded with a target duration of 60 seconds and designed to represent real-life settings for systematically evaluating various dimensions of uni-/multimodal perception, cognition, behavior, and quality of experience (QoE) in a controlled virtual environment. This database serves as high-quality reference material with an equal focus on auditory and visual sensory information within the QoE community. For more information, please see the publication below.

Recorded Scenes

You can download individual scenes on each of the available scene pages, or view the download list list below. On each scene page, you will find the following information:

  • Location information of recording.
  • Scene version notes describing variations in multiple 'takes'.
  • Preview link to YouTube (uses 1st-order Ambisonics).
  • Download links for 4th-audio Ambisonics audio, 8k video file, or muxed audiovideo file with 1st-order Ambisonics.
  • Spatial / Temporal indexing plots on video data.

Capture

MediaDeviceDescription
Audiomhacoustics Eigenmike32 channel spherical microphone array capable of 4th order higher-order ambisonics output.
VideoInsta360 Pro 2360 video camera capable of 8K video output. Comprised of 6 F2.4 fisheye lenses each capturing 4K video resolution up to 120Mbps.

Specifications

                                                                Information                                                                Description
Video Encodingffvhuff
Video Resolution7680x3840
Projection MapEquirectangular
Video FPS59.94
Audio Sample-rate48,000; 24-bit PCM
4th-Order AmbisonicsIndividual .wav files of 4th-order Ambisonics in ACN channel ordering with SN3D normalization. Scene representation has a -90° rotational offset against video files for playback with Unity. Visit Help for more information. Labelled as ambiX4 in file names.
1st-Order Ambisonics1st-Order Ambisonics in ACN channel ordering with SN3D normalization. Encoded using AAC and muxed with the video files into an .MP4 container. Pre-processed with Google's spatial media metadata injector for uploading to YouTube. Labelled as ambiX1 in file names.
Audio Post-productionAll audio files are provided with a 500ms fade-in/out. Further post-processing includes EQ (60 Hz high-pass and 10 kHz -3dB notch filter), Omnidirection-compression for make-up gain (max +5 dB), and Ambisonic B-Format scene roation for audiovisual alignment. All post-processing was applied with IEM VSTs

Download

Download links for all individual scenes can be found on the each scene page. There you can download audio (4OA) video (8K) and the muxed audiovideo files (YouTube 1OA) for any available versions of the scene. Comments are provided to note slight changes in scene composition between any available versions.

Alternatively, you can expand the drop down menu below to see all download links for audio, video, and audiovideo files.

Download list

Publication

When using the QoEVAVE Scenes Database, please cite the following works.

Robotham, T., Singla, A., Rummukainen, O. S., Raake, A., Habets, E. A. P. Sept, 2022. Audiovisual Database with 360° Video and Higher-Order Ambisonics Audio for Perception, Cognition, Behavior, and QoE Evaluation Research. In Proc, 14th International Conference on Quality of Multimedia Experience (QoMEX). Lippstadt, Germany. DOI: 10.1109/QoMEX55416.2022.9900893

Open access versison available at arXiv:2212.13442v1

@inproceedings{robotham2022,
title = {Audiovisual Database with 360° Video and Higher-Order Ambisonics Audio for Perception, Cognition, Behavior, and QoE Evaluation Research},
author = {Robotham, Thomas and Singla, Ashutosh and Rummukainen, Olli S. and Raake, Alexander and Habets, Emanuël A. P.},
year = {2022},
booktitle = {14th International Conference on Quality of Multimedia Experience},
address={Lippstadt, Germany},
pages={1--6},
doi={10.1109/QoMEX55416.2022.9900893}
}