Abstract
In multimodal presentations, perceived audiovisual quality is significantly influenced by the content of both the audio and visual tracks. Building on our earlier subjective quality test for finding the optimal trade-off between audio and video quality, this paper proposes a novel method for relative multimodal complexity analysis to derive the fusion parameter in objective audiovisual quality metrics. Audio and video qualities are first estimated separately using advanced quality models and then combined into an overall audiovisual quality through a linear fusion. Based on carefully designed auditory and visual features, a relative complexity analysis model across the sensory modalities is proposed for deriving the fusion parameter. Experimental results demonstrate that the content-adaptive fusion parameter improves the prediction accuracy of objective audiovisual quality metrics compared to fusion parameters obtained from subjective quality tests using other known optimization methods.
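The linear fusion described in the abstract can be illustrated with a minimal sketch. The function name, the specific complexity-based weighting rule, and the example values below are assumptions for illustration only; the paper derives its actual fusion parameter from its own auditory and visual features and models.

```python
# Hypothetical sketch of a content-adaptive linear audiovisual quality fusion.
# The weighting scheme shown here is an assumption, not the paper's exact model.

def fuse_audiovisual_quality(q_audio: float, q_video: float,
                             audio_complexity: float,
                             video_complexity: float) -> float:
    """Combine separately estimated audio and video quality scores.

    The fusion weight w is derived from the relative complexity of the two
    modalities: the more complex (and presumably more perceptually dominant)
    modality receives a larger share of the overall quality.
    """
    total = audio_complexity + video_complexity
    # Guard against degenerate content with no measured complexity.
    w = audio_complexity / total if total > 0 else 0.5
    return w * q_audio + (1.0 - w) * q_video


# Example: video-dominated content (e.g. a high-motion clip with simple audio).
print(fuse_audiovisual_quality(q_audio=4.2, q_video=3.1,
                               audio_complexity=0.3, video_complexity=0.7))
```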
Original language | English |
---|---|
Title of host publication | ICIP 2011 |
Subtitle of host publication | 2011 18th IEEE International Conference on Image Processing |
Pages | 3337-3340 |
Number of pages | 4 |
DOIs | |
Publication status | Published - 2011 |
Event | 2011 18th IEEE International Conference on Image Processing, ICIP 2011 - Brussels, Belgium Duration: 11 Sep 2011 → 14 Sep 2011 |
Conference
Conference | 2011 18th IEEE International Conference on Image Processing, ICIP 2011 |
---|---|
Country/Territory | Belgium |
City | Brussels |
Period | 11/09/11 → 14/09/11 |
Keywords
- Audiovisual quality assessment
- content analysis
- multimodal complexity
- quality fusion