This paper provides an overview of the Joint Contest on Multimedia Challenges Beyond Visual Analysis. We organized an academic competition that focused on four problems that require effective processing of multimodal information in order to be solved. Two tracks were devoted to gesture spotting and recognition from RGB-D video, two fundamental problems for human-computer interaction. Another track was devoted to a second round of the first impressions challenge, whose goal was to develop methods to recognize personality traits from short video clips. For this second round we adopted a novel collaborative-competitive (i.e., coopetition) setting. The fourth track was dedicated to the problem of video recommendation for improving user experience. The challenge was open for about 45 days and received outstanding participation: almost 200 participants registered for the contest, and 20 teams submitted predictions in the final stage. The main goals of the challenge were fulfilled: the state of the art was advanced considerably in the four tracks, with novel solutions to the proposed problems (mostly relying on deep learning). However, further research is still required. The data of the four tracks will be available to allow researchers to keep making progress in the four tracks.