Real-world scene perception

Undoubtedly the context in which we perform a task plays a crucial role in the way we process our environment. But how does the actual interaction with our surroundings play into that? Is our perception contingent on the action we perform and how is this relationship modulated by the availability of semantic information?