These expressions are produced at two levels of emotional intensities (regular and strong) except for the neutral emotion that only contains regular intensity.
Dataset containing open-ended questions about images. These questions require an understanding of vision, language and commonsense knowledge to answer.