scholarly journals A Model of V4 Shape Selectivity and Invariance

2007 ◽  
Vol 98 (3) ◽  
pp. 1733-1750 ◽  
Author(s):  
Charles Cadieu ◽  
Minjoon Kouh ◽  
Anitha Pasupathy ◽  
Charles E. Connor ◽  
Maximilian Riesenhuber ◽  
...  

Object recognition in primates is mediated by the ventral visual pathway and is classically described as a feedforward hierarchy of increasingly sophisticated representations. Neurons in macaque monkey area V4, an intermediate stage along the ventral pathway, have been shown to exhibit selectivity to complex boundary conformation and invariance to spatial translation. How could such a representation be derived from the signals in lower visual areas such as V1? We show that a quantitative model of hierarchical processing, which is part of a larger model of object recognition in the ventral pathway, provides a plausible mechanism for the translation-invariant shape representation observed in area V4. Simulated model neurons successfully reproduce V4 selectivity and invariance through a nonlinear, translation-invariant combination of locally selective subunits, suggesting that a similar transformation may occur or culminate in area V4. Specifically, this mechanism models the selectivity of individual V4 neurons to boundary conformation stimuli, exhibits the same degree of translation invariance observed in V4, and produces observed V4 population responses to bars and non-Cartesian gratings. This work provides a quantitative model of the widely described shape selectivity and invariance properties of area V4 and points toward a possible canonical mechanism operating throughout the ventral pathway.

2021 ◽  
Vol 15 ◽  
Author(s):  
Jarrod Hollis ◽  
Glyn W. Humphreys ◽  
Peter M. Allen

Evidence is presented for intermediate, wholistic visual representations of objects and non-objects that are computed online and independent of visual attention. Short-term visual priming was examined between visually similar shapes, with targets either falling at the (valid) location cued by primes or at another (invalid) location. Object decision latencies were facilitated when the overall shapes of the stimuli were similar irrespective of whether the location of the prime was valid or invalid, with the effects being equally large for object and non-object targets. In addition, the effects were based on the overall outlines of the stimuli and low spatial frequency components, not on local parts. In conclusion, wholistic shape representations based on outline form, are rapidly computed online during object recognition. Moreover, activation of common wholistic shape representations prime the processing of subsequent objects and non-objects irrespective of whether they appear at attended or unattended locations. Rapid derivation of wholistic form provides a key intermediate stage of object recognition.


1999 ◽  
Vol 82 (5) ◽  
pp. 2490-2502 ◽  
Author(s):  
Anitha Pasupathy ◽  
Charles E. Connor

The ventral pathway in visual cortex is responsible for the perception of shape. Area V4 is an important intermediate stage in this pathway, and provides the major input to the final stages in inferotemporal cortex. The role of V4 in processing shape information is not yet clear. We studied V4 responses to contour features (angles and curves), which many theorists have proposed as intermediate shape primitives. We used a large parametric set of contour features to test the responses of 152 V4 cells in two awake macaque monkeys. Most cells responded better to contour features than to edges or bars, and about one-third exhibited systematic tuning for contour features. In particular, many cells were selective for contour feature orientation, responding to angles and curves pointing in a particular direction. There was a strong bias toward convex (as opposed to concave) features, implying a neural basis for the well-known perceptual dominance of convexity. Our results suggest that V4 processes information about contour features as a step toward complex shape recognition.


eLife ◽  
2021 ◽  
Vol 10 ◽  
Author(s):  
Rundong Jiang ◽  
Ian Max Andolina ◽  
Ming Li ◽  
Shiming Tang

The ventral visual pathway is crucially involved in integrating low-level visual features into complex representations for objects and scenes. At an intermediate stage of the ventral visual pathway, V4 plays a crucial role in supporting this transformation. Many V4 neurons are selective for shape segments like curves and corners, however it remains unclear whether these neurons are organized into clustered functional domains, a structural motif common across other visual cortices. Using two-photon calcium imaging in awake macaques, we confirmed and localized cortical domains selective for curves or corners in V4. Single-cell resolution imaging confirmed that curve or corner selective neurons were spatially clustered into such domains. When tested with hexagonal-segment stimuli, we find that stimulus smoothness is the cardinal difference between curve and corner selectivity in V4. Combining cortical population responses with single neuron analysis, our results reveal that curves and corners are encoded by neurons clustered into functional domains in V4. This functionally-specific population architecture bridges the gap between the early and late cortices of the ventral pathway and may serve to facilitate complex object recognition.


2019 ◽  
Author(s):  
Rundong Jiang ◽  
Ian M. Andolina ◽  
Ming Li ◽  
Shiming Tang

AbstractThe ventral visual pathway is crucially involved in integrating low-level visual features into complex representations for objects and scenes. At an intermediate stage of the ventral visual pathway, V4 plays a crucial role in supporting this transformation. Many V4 neurons are selective for shape segments like curves and corners, however it remains unclear whether these neurons are organized into clustered functional domains, a structural motif common across other visual cortices. Using two-photon calcium imaging in awake macaques, we confirmed and localized cortical domains selective for curves or corners in V4. Single-cell resolution imaging confirmed that curve or corner selective neurons were spatially clustered into such domains. When tested with hexagonal-segment stimuli, we find that stimulus smoothness is the cardinal difference between curve and corner selectivity in V4. Combining cortical population responses with single neuron analysis, our results reveal that curves and corners are encoded by neurons clustered into functional domains in V4. This functionally-specific population architecture bridges the gap between the early and late cortices of the ventral pathway and may serve to facilitate complex object recognition.


2014 ◽  
Vol 112 (9) ◽  
pp. 2114-2122 ◽  
Author(s):  
Timothy D. Oleskiw ◽  
Anitha Pasupathy ◽  
Wyeth Bair

The midlevel visual cortical area V4 in the primate is thought to be critical for the neural representation of visual shape. Several studies agree that V4 neurons respond to contour features, e.g., convexities and concavities along a shape boundary, that are more complex than the oriented segments encoded by neurons in the primary visual cortex. Here we compare two distinct approaches to modeling V4 shape selectivity: one based on a spectral receptive field (SRF) map in the orientation and spatial frequency domain and the other based on a map in an object-centered angular position and contour curvature space. We test the ability of these two characterizations to account for the responses of V4 neurons to a set of parametrically designed two-dimensional shapes recorded previously in the awake macaque. We report two lines of evidence suggesting that the SRF model does not capture the contour sensitivity of V4 neurons. First, the SRF model discards spatial phase information, which is inconsistent with the neuronal data. Second, the amount of variance explained by the SRF model was significantly less than that explained by the contour curvature model. Notably, cells best fit by the curvature model were poorly fit by the SRF model, the latter being appropriate for a subset of V4 neurons that appear to be orientation tuned. These limitations of the SRF model suggest that a full understanding of midlevel shape representation requires more complicated models that preserve phase information and perhaps deal with object segmentation.


2004 ◽  
Vol 1269 ◽  
pp. 23-25
Author(s):  
Charles E. Connor ◽  
Anitha Pasupathy

Author(s):  
Özgür Evren ◽  
Farhad Hüsseinov

Consider a dominance relation (a preorder) ≿ on a topological space X, such as the greater than or equal to relation on a function space or a stochastic dominance relation on a space of probability measures. Given a compact set K ⊆ X, we study when a continuous real function on K that is strictly monotonic with respect to ≿ can be extended to X without violating the continuity and monotonicity conditions. We show that such extensions exist for translation invariant dominance relations on a large class of topological vector spaces. Translation invariance or a vector structure are no longer needed when X is locally compact and second countable. In decision theoretic exercises, our extension theorems help construct monotonic utility functions on the universal space X starting from compact subsets. To illustrate, we prove several representation theorems for revealed or exogenously given preferences that are monotonic with respect to a dominance relation.


2005 ◽  
Vol 94 (4) ◽  
pp. 2726-2737 ◽  
Author(s):  
David A. Hinkle ◽  
Charles E. Connor

We performed a quantitative characterization of binocular disparity-tuning functions in the ventral (object-processing) pathway of the macaque visual cortex. We measured responses of 452 area V4 neurons to stimuli with disparities ranging from −1.0 to +1.0°. Asymmetric Gaussian functions fit the raw data best (median R = 0.90), capturing both the modal components (local peaks in the −1.0 to +1.0° range) and the monotonic components (linear or sigmoidal dependency on disparity) of the tuning patterns. Values derived from the asymmetric Gaussian fits were used to characterize neurons on a modal × monotonic tuning domain. Points along the modal tuning axis correspond to classic tuned excitatory and inhibitory patterns; points along the monotonic axis correspond to classic near and far patterns. The distribution on this domain was continuous, with the majority of neurons exhibiting a mixed modal/monotonic tuning pattern. The distribution in the modal dimension was shifted toward excitatory patterns, consistent with previous results in other areas. The distribution in the monotonic dimension was shifted toward tuning for crossed disparities (corresponding to stimuli nearer than the fixation plane). This could reflect a perceptual emphasis on objects or object parts closer to the observer. We also found that disparity-tuning strength was positively correlated with orientation-tuning strength and color-tuning strength, and negatively correlated with receptive field eccentricity.


1998 ◽  
Vol 111 (1) ◽  
pp. 7-37 ◽  
Author(s):  
S. Nikonov ◽  
N. Engheta ◽  
E.N. Pugh

The kinetics of the dark-adapted salamander rod photocurrent response to flashes producing from 10 to 105 photoisomerizations (Φ) were investigated in normal Ringer's solution, and in a choline solution that clamps calcium near its resting level. For saturating intensities ranging from ∼102 to 104 Φ, the recovery phases of the responses in choline were nearly invariant in form. Responses in Ringer's were similarly invariant for saturating intensities from ∼103 to 104 Φ. In both solutions, recoveries to flashes in these intensity ranges translated on the time axis a constant amount (τc) per e-fold increment in flash intensity, and exhibited exponentially decaying “tail phases” with time constant τc. The difference in recovery half-times for responses in choline and Ringer's to the same saturating flash was 5–7 s. Above ∼104 Φ, recoveries in both solutions were systematically slower, and translation invariance broke down. Theoretical analysis of the translation-invariant responses established that τc must represent the time constant of inactivation of the disc-associated cascade intermediate (R*, G*, or PDE*) having the longest lifetime, and that the cGMP hydrolysis and cGMP-channel activation reactions are such as to conserve this time constant. Theoretical analysis also demonstrated that the 5–7-s shift in recovery half-times between responses in Ringer's and in choline is largely (4–6 s) accounted for by the calcium-dependent activation of guanylyl cyclase, with the residual (1–2 s) likely caused by an effect of calcium on an intermediate with a nondominant time constant. Analytical expressions for the dim-flash response in calcium clamp and Ringer's are derived, and it is shown that the difference in the responses under the two conditions can be accounted for quantitatively by cyclase activation. Application of these expressions yields an estimate of the calcium buffering capacity of the rod at rest of ∼20, much lower than previous estimates.


Sign in / Sign up

Export Citation Format

Share Document