Approaches to Cognitive Science

Lecture 3: Approaches to Vision

References

Basic Reading

Green & others (course textbook), Chapter 4

Taking it further

Sharples, M. et al., Computers and Thought (MIT Press, 1989), Chapter 9

Bruce, V., Green, P. & Georgeson, M., Visual Perception: physiology, psychology and ecology (Psychology Press, 1996)

Seminal classics

Gibson, J.J., The Senses Considered as Perceptual Systems (Allen & Unwin, 1968; Waveland Press, 1983)

Marr, D., Vision: A computational investigation into the human representation and processing of visual information (Freeman, 1982)

Vision and the study of mind

Vision mediates most of our interactions with

Visual representations are

Visual processing

Questions about vision

What do we see?

How is visual processing organised?

Are our visual processes modular?

Are our visual processes specialised or general?

How can we investigate vision?

Can we make machines that see?

How do you cross the road?

You are at the side of a busy, two-way road, with no pedestrian crossing. You need to get across, and you're in a hurry. What does your visual system do?

Maybe some of the following ...

The static environment

Where is the kerb? Where are the parked cars, the road markings, junctions? Which direction takes me straight across the road?

To help answer these questions, your visual system might have to

The dynamic environment

Should I cross now, or wait? Your visual system needs to predict whether you have time to safely get to the other side of the road before a car comes.

Your visual system might

but it might just

Looming and time to collision - a study in visual information pickup

The image of an approaching object expands. A description of the dynamic properties of the image is called the optic flow field.

Approaching vehicle and expanding optic flow

You can show that for an object on a collision course, the rate of image expansion specifies directly the time to collision. There is no need to know the size, distance, or speed of the approaching object!

There is thus a computational theory for at least part of the visual control of road crossing. (More information is here.)

There is also some experimental evidence that the visual system makes use of this kind of information:

But what of the intentions of the drivers? Is this part of visual perception?

J.J. Gibson and "nouvelle AI"

Gibson's work on human perception, especially visual perception, had many strands. Some important aspects:

In recent years, Artificial Intelligence's approach to vision has converged to some extent with Gibson's outlook, moving away from the construction of elaborate 3-D representations underpinning complex reasoning, towards a more action-centred approach, in which vision is part of a perceptuo-motor cycle, in which feedback from actions plays a major role.

However, practical image understanding technologies exploit many of the tools developed in computer vision over the past 4 decades.

Conclusion

Understanding vision remains one of central challenges in cognitive science.

Some processes are understood; but the overall architecture and functioning of the human visual system remains largely uncertain.

The challenge can be met only with a wide variety of approaches: computational, psychological and philosophical; experimental and theoretical.

Maintained by: David Young