How do we decode the images that we see? For example, when we look at an image, how do we distinguish an object from the space surrounding it? How do we know whether the image depicts a part or a whole? How do we infer its temporality? By what means do we interpret its scale?