Inspec keywords: text analysis; image sequences; neural net architecture; image coding; question answering (information retrieval)

Other keywords: language information; Oracle task; object sequences; GuessWhat dataset; spatial object information encoding; VQA task; visual information encoding; text-based question; yes-no visual question answering task; visual features; language-based features; categorical object information encoding; neural network architecture

Subjects: Natural language interfaces; Image and video coding; Document processing and analysis techniques; Neural computing techniques; Computer vision and image processing techniques; Information retrieval techniques