Back Return to Beginning Next

Invariance to Rotation, Translation, & Scale

The issue of invariance is even more challenging in visual perception, where simple shapes are recognized independent of their rotation, translation, and scale, as well as of elastic deformations. If spatial patterns are detected by any kind of spatial template, then that templates would have to be tested at every point across the visual field. In computer simulations this is usually done by scanning the template at every location cross the image. In neural network models the template is generally replicated at every location across the image, to perform the search in parallel.

If recognition is to be invariant to rotation, then the template must be scanned (or replicated) at every orientation at every spatial location.

And if recognition is to be invariant to scale, and elastic deformation, the template must be replicated at every location, every orientation, every scale, and with every possible deformation. And to extend the recognition into full volumetric 3-D would require volumetric spatial templates at every volumetric location in 3-D, at every orientation in 3-D at every location, through a range of spatial scales, and for every possible deformation. Clearly this is not promising as a general principle of perceptual representation!

Back Return to Beginning Next