I work at DeepMind as a research scientist with a broad interest in building responsive machines that understand natural language, the surrounding environment, and human intentions, all of which are necessary for human-like communication. Prior to DeepMind, I worked at the Max Planck Institute for Informatics, where I pursued a PhD in Computer Vision. During my PhD studies, I pioneered the task of the Visual Turing Test (also known as Visual Question Answering), which has since been widely followed up by the research community. In this task, I studied the problem of answering questions about real-world images and proposed various architectures, such as an LSTM+CNN model termed 'Ask Your Neurons' and a logic-based approach that relies on a semantic parser. Besides the Visual Turing Test, I also studied a retrieval problem and learnable spatial representations.