Brightness, Lightness, & Edge Detection

 

Learning Outcomes

1. What are the units for quantifying light, and the ways of describing our experience of it?

2. How does the visual system arrive at lightness constancy?

3. How do natural constraints in brightness perception relate to the detection of edges?

4. What are convolution and reconstitution?

5. What low-, mid-, and high-level factors contribute to contrast illusions?

6. What is Marr’s two-stage approach?

7. How is the raw primal sketch obtained, and what is the result of this processing?

8. How does this approach make up for the shortcomings in feature-template “theory”?

 


 

Photometry

 

- the measurement of visible radiation from light sources

 

Photometric units:

________: radiant power from a light source

- unit: lumen = light produced by a standard candle (“candela”, a.k.a. “___________”)

- candela = 1 lumen per steradian (“_______ _______,” or unit of solid angle that is conical in shape)

steradians

e.g., 1 lm = 1.46 mW

 

___________: amount of light falling on a surface

- unit: lux = 1 lumen per square metre of area (lm/m2)

e.g., daylight = 10,000 lx, full moon = 0.1 lx, office = 250 lx

 

_________: amount of light reflected from a surface

- unit: nit = 1 candela per square metre of area (cd/m2)

e.g., typical active-matrix LCD panel = 200-300 nits; typical CRT monitor = 50-125 nits

 

___________: proportion of light reflected from a matte (non-glossy) surface

- unit: percent (%) or “albedo” = (luminance/illuminance) × 100

e.g., typically, white paper = 90%,  black paper = 10%

 


 

Lightness Constancy

 

__________: perceptual impression of intensity of light source

 

_________: perceptual impression of surface “greyness”; psychological counterpart to reflectance of a surface

 

illumination

×

reflectance

=

_______ _________

(light falling on

a surface)

 

(proportion of light

reflected from

a surface)

 

(amount of light

on retina)

 

Burzlaff (1931):

- presented two sets of 48 grey squares: one by a window (organized by lightness) another in a ___ corner (randomly arranged)

- selected one square from the window set as the standard

- asked observers which comparison square from the dim set matched it

- performance was _________

e.g., standard had 78% reflectance, comparison had 72%

- but standard was under 20× more illumination!

 

intensity

reflectance

amount of light

standard

200

78%

= 156

comparison

 10

72%

=    7

 

- _____ principle: the percentage of light reflected determines perception of lightness

- lightness _________: ability to perceive the true reflectance properties of an object--no matter what the illumination is

 

Rock (1983):

- consider white and black paper on a partially lit desktop

- the black paper in the light has greater luminance than the white paper in the dark--yet we still maintain lightness constancy!

- is this due to “______ ______”?

- nope: same result obtains for unfamiliar stimuli

 

Lightness and Edge Detection

 

The visual system must be able to determine reflectance, despite “contamination” by differences in _________.

- how can we magically filter out all but the correct interpretation?

- by using properties that are generally true of the world

- Land & McCann (1971): _______ theory (retina + cortex), based on two natural constraints:

• variations in ____________ are gradual within an object

• changes in ___________ are abrupt between objects--this defines an edge

 

How is lightness detected?

___________: matching a certain receptive field (or feature detector) with an image

receptor mosaic/optimal stimulus

- this does two things: 1. smooths out any noise or small differences in illumination, and 2. detects edges

 

• feature detectors act like _______:

e.g., move a spot of light across a receptive field:

centre-surround output

- detector has “_________” the point of light across several positions in the visual field

 

- if there is a _______ change in luminance, it is not detected as an edge:

stimulus:

intensity step

luminance profile:

luminance profile

edge detection:

output

 

- a gradual change in luminance (likely due to differences in illumination, rather than in reflectance or edges) is not encoded as an ____

 

- as a boundary is approached, luminance changes are greatest:

stimulus:

intensity step

luminance profile:

luminance profile

edge detection:

detector cell output

 

- abrupt change in luminance is detected as an edge

 

How are the edges filled in?

reconstitution (“to construct anew”) via _____________: activity is spread from an edge, smoothing out all small changes in illumination (or reflectance)

 

stimulus:

intensity step

luminance profile:

luminance profile

edge detection (via convolution):

edge detection

reconstitution (via deconvolution)

→ perception of lightness:

reconstitution

 


 

Contrast Illusions

 

- result in _________ lightness perception (breakdown of lightness constancy)

 

Simultaneous Contrast:

 

stimuli:

simultaneous contrast

luminance profile:

luminance profiles

edge detection:

convolution

reconstitution:

deconvolution

 

- how does the convolution and deconvolution take place?

 

Chevreul illusion (1861):

 

stimulus:

Chevreul illusion

luminance profile:

luminance profile

 

stimulus:

Chevreul illusion

luminance profile:

luminance profile

processing (_______ __________):

lateral inhibition

output:

output

 

- the above mechanism can explain the effect

- knowing the explanation does not eliminate the effect (not cognitively penetrable)

- ergo, this mechanism is potentially the __________ ____________ underlying the effect

 

Craik-O’Brien-Cornsweet Effect (COCE):

 

stimulus:

COCE

luminance profile:

luminance profile

edge detection:

edge detection

reconstitution:

reconstitution

percept:

percept

 

- this illusion is caused by a “_________________”--the edge is detected, then reconstituted into a percept that doesn’t match the stimulus

- how does the reconstitution/filling-in process work?

- propagation of lateral __________ connections in V1 or V2 is possible (Davey, Maddess, & Srinivasan, 1998)

 

Problem: exceptions to _______ __________ explanations:

 

Benary cross (1924):

 

Benary cross

- both triangles should receive equal lateral inhibition, but seem different shades

 

White’s illusion (1979):

 

White’s illusion

- rectangle A should receive ______ lateral inhibition (and seem lighter), but it seems darker

- rectangle B should receive a ___ of lateral inhibition (and seem darker), but it seems lighter

 

Solutions:

- explained in terms of “_____________”: appearance of an area is influenced by the surroundings to which it seems to belong (Gilchrist et al., 1999)

- triangle B _______ to the dark cross, which makes it seem lighter in contrast

- rectangle B _______ to the black bars, which makes it seem lighter in contrast

- suggests ______-_____ processing instead of retinal mechanism

 

Adelson (2000):

- an explanation of lightness perception requires consideration of ________ levels of processing:

• ___-_____ vision: at the retina; includes light adaptation, and centre-surround receptive fields

• ____-_____ vision: cognitive processes; includes knowledge about objects, materials, and scenes

• ___-_____ vision: ill-defined; involves surfaces, contours, grouping, etc.

 

- low-level models can explain lightness perception for (2-D) “_________” (named for Piet Mondrian, Dutch Neoplasticist painter, 1872-1944)

- but cannot account for a world more complex than Mondrians

e.g., Knill & Kersten’s illusion (1991): both figures contain _________ COCE ramps, but are interpreted very differently

Knill & Kersten illusion

 

- mid-level explanations: the _______ approach:

e.g., Koffka’s ring (1935): appears uniform

Koffka ring

e.g., when split, the rings appear different shades of grey:

split Koffka ring

e.g., when shifted, different __________ ____________ occurs:

shifted Koffka ring

(spatial configuration affects simultaneous contrast)

 

- the problem of lightness constancy: luminance image is the product of the ___________ image (due to paint) and the ___________ image (due to illumination/shading)

reflectance/illuminance images

 

- different _________ (place where two or more contours come together) provide information on reflectance and shading

junctions

- junction configuration and grey levels forming the junction provides cues about surface shading and reflectance

• Ψ-junctions: vertical spine appears to be a ________ with different illuminances on each side; angled arms appear to represent a reflectance edge that crosses the dihedral

e.g., the __________ _____ depend on Ψ-junctions:

impossible steps

• X-junctions account for shifted Koffka rings

• T-junctions account for White’s illusion

- these illusions reveal the __________ made by the visual system about illumination and surfaces

 


 

The Raw Primal Sketch

 

According to Retinex theory, in general:

• gradations in luminance: attributed to variations in ____________ across the surface of an object

• abrupt luminance differences: attributed to differences in ___________ between different objects

How can these _______ ___________ be applied to edge detection?

 

Marr’s two-stage approach (1976):

1) raw primal sketch: detects edge segments

- representation of an image, such that intensity differences are made ________ (→ global structures)

- also, representations of __________ (features) and _____ ______ (their locations) are determined (→ contours that may belong together)

- ignores ______: doesn’t matter; likely is a separate system anyway (i.e., magno vs. parvo)

 

2) full primal sketch: represents higher-level boundaries

- boundaries and regions found by ________ primitive elements together

 

Problem: luminance at any point is not constant--it fluctuates imperceptibly (“______ _____”)

 

Solution: _______ intensity between a given spot and its neighbours (i.e., over a region of space)

 

What size region is chosen to average over?

• large region:

☑ get general shape

☒ but lose ____ details

• small region:

☑ pick up lots of detail

☒ but lose overall form

 

Averaging process is like passing the image through a spatial ______:

• ___-____ filter (preserves gross features)

• band-pass filter (preserves intermediate details)

• ____-____ filter (preserves fine details)

 

To create the best raw primal sketch:

- obtain ________ representations in parallel

- each is a product of a different spatial filter

- result: get fine details, gross features, and everything in between

 

Marr & Hildreth’s (1980) algorithm:

1) smooth out small, unimportant variations that are due to differences in illumination

- take average around a given spot

spot

- but: the farther from the spot, the less informative the data

- so, apply a weighting function emphasizing the importance of the area at the centre while downplaying the contribution of regions farther away

- How? Use ________ (G ) function (“normal curve”):

G (x, y ) = exp (

_  x 2 + y 2

   2πσ2

)

Gaussian functions

• points closer to the centre have a greater weighting in the average

• at the periphery, points contribute the least

• to get different spatial filters, use different size Gaussian functions (measured by ________ _________)

 

2) extract ____ information from the representations

- an edge is defined as an abrupt change between a dark area and a light area--but how to detect this mathematically?

- _______________: rate of change of one variable with another

e.g. falling stone: differentiate distance vs. time

first derivative = ________

second derivative = ____________

- apply this differentiation transformation to filtered representations of the image

- e.g., differentiate intensity vs. space: an edge will produce a peak (or trough) in the first derivative, or a zero-crossing in the second derivative

- ____-________: function that crosses the baseline, produced by an edge

 

second order derivative: zero-crossing

- obtain second differential over a two-dimensional area

- use a _________ operator (∇2), for all of the multiple representations:

2

=

  δ2  

δx 2

 +

  δ2  

δy 2

 

- ∇2G is a band-pass filter

- repeat with different-sized filters

 

Result:

- ________ representations of the same input image--each one picks out variations of different spatial detail (or different intensities)

- raw primal sketch is comprised of the ___________ of all these representations--you don’t pick out the "best" one

- rationale: important boundaries should appear at multiple levels of detail

 

Combined how?

- match zero-crossings (borders) from representations filtered by similar ____ Gaussians

- this constructs several elements in the raw primal sketch, primitives:

features

 

What is the output of the “raw primal sketch”?

• a representation of significant _________ of light intensity in a given image

 

How does the visual system do Laplacian transforms of Gaussian-filtered images (∇2G  or LoG)?

- Gaussian filters: ______-________ receptive fields

centre-surroundoutput

 

- this __________-of-Gaussian (DoG) approximates a Laplacian transform of a Gaussian filtered image:

DoGs

 

According to feature template theory, a bar is detected by one simple cell, which receives connections from a number of similar-sized centre-surround receptive fields:

simple cell

- this leads to ambiguity; it’s _____

 

Marr-Hildreth algorithm:

Marr/Hildreth algorithm

Marr/Hildreth algorithm

(circles represent centres of centre-surround receptive fields)

- feature detection depends on ____ detection

- an edge may be detected by taking the activity of _______ simple cells of differing sizes

- they must agree amongst themselves

- there must also be agreement between “light-side” (on-centre/off-surround) and “dark-side” (off-centre/on-surround) detector groups

 

Pros & Cons:

☑ supported by simulations/image processing

☑ visual system has multiple spatial frequency ________

☒ Schiller (1982): turned off light-side detectors in monkeys, but still got sensitivity to ___________

☒ algorithm may generate “false edge”

☒ has problems with curved edges

 

Other algorithms: Pearson & Robinson (1985):

- picked out “_______” in luminance separately from “_____”

valley vs. step

- explains why ________ images are harder to identify

- maybe visual system operates on “_______” (e.g., black lines on white background), not simply zero-crossings/edges