Audiovisual quality fusion based on relative multimodal complexity

Junyong You; Jari Korhonen; Ulrich Reiter

doi:10.1109/ICIP.2011.6116386

Audiovisual quality fusion based on relative multimodal complexity

Junyong You^*, Jari Korhonen, Ulrich Reiter

^*Corresponding author for this work

Computing Science

Research output: Chapter in Book/Report/Conference proceeding › Published conference contribution

1 Citation (Scopus)

Abstract

In multimodal presentations the perceived audiovisual quality assessment is significantly influenced by the content of both the audio and visual tracks. Based on our earlier subjective quality test for finding the optimal trade-off between audio and video quality, this paper proposes a novel method for relative multimodal complexity analysis to derive the fusion parameter in objective audiovisual quality metrics. Audio and video qualities are first estimated separately using advanced quality models, and then they are combined into the overall audiovisual quality using a linear fusion. Based on carefully designed auditory and visual features, the relative complexity analysis model across sensory modalities is proposed for deriving the fusion parameter. Experimental results have demonstrated that the content adaptive fusion parameter can improve the prediction accuracy of objective audiovisual quality metrics, compared to the fusion parameters obtained from the subjective quality tests using other known optimization methods.

Original language	English
Title of host publication	ICIP 2011
Subtitle of host publication	2011 18th IEEE International Conference on Image Processing
Pages	3337-3340
Number of pages	4
DOIs	https://doi.org/10.1109/ICIP.2011.6116386
Publication status	Published - 2011
Event	2011 18th IEEE International Conference on Image Processing, ICIP 2011 - Brussels, Belgium Duration: 11 Sept 2011 → 14 Sept 2011

Conference

Conference	2011 18th IEEE International Conference on Image Processing, ICIP 2011
Country/Territory	Belgium
City	Brussels
Period	11/09/11 → 14/09/11

Keywords

Audiovisual quality assessment
content analysis
multimodal complexity
quality fusion

Access to Document

10.1109/ICIP.2011.6116386

Cite this

@inproceedings{ce77cc3901164438aff704d9db6a0b18,

title = "Audiovisual quality fusion based on relative multimodal complexity",

abstract = "In multimodal presentations the perceived audiovisual quality assessment is significantly influenced by the content of both the audio and visual tracks. Based on our earlier subjective quality test for finding the optimal trade-off between audio and video quality, this paper proposes a novel method for relative multimodal complexity analysis to derive the fusion parameter in objective audiovisual quality metrics. Audio and video qualities are first estimated separately using advanced quality models, and then they are combined into the overall audiovisual quality using a linear fusion. Based on carefully designed auditory and visual features, the relative complexity analysis model across sensory modalities is proposed for deriving the fusion parameter. Experimental results have demonstrated that the content adaptive fusion parameter can improve the prediction accuracy of objective audiovisual quality metrics, compared to the fusion parameters obtained from the subjective quality tests using other known optimization methods.",

keywords = "Audiovisual quality assessment, content analysis, multimodal complexity, quality fusion",

author = "Junyong You and Jari Korhonen and Ulrich Reiter",

year = "2011",

doi = "10.1109/ICIP.2011.6116386",

language = "English",

isbn = "9781457713033",

pages = "3337--3340",

booktitle = "ICIP 2011",

note = "2011 18th IEEE International Conference on Image Processing, ICIP 2011 ; Conference date: 11-09-2011 Through 14-09-2011",

}

TY - GEN

T1 - Audiovisual quality fusion based on relative multimodal complexity

AU - You, Junyong

AU - Korhonen, Jari

AU - Reiter, Ulrich

PY - 2011

Y1 - 2011

N2 - In multimodal presentations the perceived audiovisual quality assessment is significantly influenced by the content of both the audio and visual tracks. Based on our earlier subjective quality test for finding the optimal trade-off between audio and video quality, this paper proposes a novel method for relative multimodal complexity analysis to derive the fusion parameter in objective audiovisual quality metrics. Audio and video qualities are first estimated separately using advanced quality models, and then they are combined into the overall audiovisual quality using a linear fusion. Based on carefully designed auditory and visual features, the relative complexity analysis model across sensory modalities is proposed for deriving the fusion parameter. Experimental results have demonstrated that the content adaptive fusion parameter can improve the prediction accuracy of objective audiovisual quality metrics, compared to the fusion parameters obtained from the subjective quality tests using other known optimization methods.

AB - In multimodal presentations the perceived audiovisual quality assessment is significantly influenced by the content of both the audio and visual tracks. Based on our earlier subjective quality test for finding the optimal trade-off between audio and video quality, this paper proposes a novel method for relative multimodal complexity analysis to derive the fusion parameter in objective audiovisual quality metrics. Audio and video qualities are first estimated separately using advanced quality models, and then they are combined into the overall audiovisual quality using a linear fusion. Based on carefully designed auditory and visual features, the relative complexity analysis model across sensory modalities is proposed for deriving the fusion parameter. Experimental results have demonstrated that the content adaptive fusion parameter can improve the prediction accuracy of objective audiovisual quality metrics, compared to the fusion parameters obtained from the subjective quality tests using other known optimization methods.

KW - Audiovisual quality assessment

KW - content analysis

KW - multimodal complexity

KW - quality fusion

UR - http://www.scopus.com/inward/record.url?scp=84856242090&partnerID=8YFLogxK

U2 - 10.1109/ICIP.2011.6116386

DO - 10.1109/ICIP.2011.6116386

M3 - Published conference contribution

AN - SCOPUS:84856242090

SN - 9781457713033

SP - 3337

EP - 3340

BT - ICIP 2011

T2 - 2011 18th IEEE International Conference on Image Processing, ICIP 2011

Y2 - 11 September 2011 through 14 September 2011

ER -

Audiovisual quality fusion based on relative multimodal complexity

Abstract

Conference

Keywords

Access to Document

Other files and links

Fingerprint

Cite this