Return to Kechen Zhang homepage

Kechen Zhang

Papers available online

  • Quadratic analysis

    Christopher DiMattina, Michael J. Anderson, Eric D. Young and Kechen Zhang: Optimal stimuli and invariant stimuli in hierarchical sensory networks: A quadratic analysis. (submitted)

  • Equivalent networks

    Christopher DiMattina and Kechen Zhang: How to modify a neural network gradually without changing its input-output functionality. Neural Computation (accepted for publication).

  • Optimal stimulus

    Christopher DiMattina and Kechen Zhang (2008): How optimal stimuli for sensory neurons are constrained by network architecture. Neural Computation 20: 668-708.

    Abstract: Identifying the optimal stimuli for a sensory neuron is often a difficult process involving trial and error. By analyzing the relationship between stimuli and responses in feedforward and stable recurrent neural network models, we find that the stimulus yielding themaximum firing rate response always lies on the topological boundary of the collection of all allowable stimuli, provided that individual neurons have increasing input-output relations or gain functions and that the synaptic connections are convergent between layers with nondegenerate weight matrices. This result suggests that in neurophysiological experiments under these conditions, only stimuli on the boundary need to be tested in order to maximize the response, thereby potentially reducing the number of trials needed for finding the most effective stimuli. Even when the gain functions allow firing rate cutoff or saturation, a peak still cannot exist in the stimulus-response relation in the sense that moving away from the optimum stimulus always reduces the response.We further demonstrate that the condition for nondegenerate synaptic connections also implies that proper stimuli can independently perturb the activities of all neurons in the same layer. One example of this type of manipulation is changing the activity of a single neuron in a given processing layer while keeping that of all others constant. Such stimulus perturbations might help experimentally isolate the interactions of selected neurons within a network.

    Download reprint PDF file (nc-boundary.pdf).

  • From grid cells to place cells

    Huge T. Blair, Kishan Gupta, and Kechen Zhang (2008): Conversion of a phase-coded to a rate-coded position signal by a three-stage model of theta cells, grid cells, and place cells. Hippocampus, 18:1239-1255.

    Abstract: As a rat navigates through a familiar environment, its position in space is encoded by firing rates of place cells and grid cells. Oscillatory interference models propose that this positional firing rate code is derived from a phase code, which stores the rat's position as a pattern of phase angles between velocity-modulated theta oscillations. Here we describe a three-stage network model, which formalizes the computational steps that are necessary for converting phase-coded position signals (represented by theta oscillations) into rate-coded position signals (represented by grid cells and place cells). The first stage of the model proposes that the phase-coded position signal is stored and updated by a bank of ring attractors, like those that have previously been hypothesized to perform angular path integration in the head-direction cell system. We show analytically how ring attractors can serve as central pattern generators for producing velocity-modulated theta oscillations, and we propose that such ring attractors may reside in subcortical areas where hippocampal theta rhythm is known to originate. In the second stage of the model, grid fields are formed by oscillatory interference between theta cells residing in different (but not the same) ring attractors. The model's third stage assumes that hippocampal neurons generate Gaussian place fields by computing weighted sums of inputs from a basis set of many grid fields. Here we show that under this assumption, the spatial frequency spectrum of the Gaussian place field defines the vertex spacings of grid cells that must provide input to the place cell. This analysis generates a testable prediction that grid cells with large vertex spacings should send projections to the entire hippocampus, whereas grid cells with smaller vertex spacings may project more selectively to the dorsal hippocampus, where place fields are smallest.

    Download reprint PDF file (hipp-3stage.pdf).

  • Moire grid

    Hugh T. Blair, Adam C. Welday, and Kechen Zhang (2007): Moire interference between grid fields that produce theta oscillations: A computational model. Jounral of Neuroscience 27: 3211-3229.

    Abstract: The dorsomedial entorhinal cortex (dMEC) of the rat brain contains a remarkable population of spatially tuned neurons called grid cells (Hafting et al., 2005). Each grid cell fires selectively at multiple spatial locations, which are geometrically arranged to form a hexagonal lattice that tiles the surface of the rat's environment. Here, we show that grid fields can combine with one another to form moire interference patterns, referred to as "moire grids", that replicate the hexagonal lattice over an infinite range of spatial scales.Wepropose that dMEC grids are actually moire grids formed by interference between much smaller "theta grids," which are hypothesized to be the primary source of movement-related theta rhythm in the rat brain. The formation of moire grids from theta grids obeys two scaling laws, referred to as the length and rotational scaling rules. The length scaling rule appears to account for firing properties of grid cells in layer II of dMEC, whereas the rotational scaling rule can better explain properties of layer III grid cells. Moire grids built from theta grids can be combined to form yet larger grids and can also be used as basis functions to construct memory representations of spatial locations (place cells) or visual images. Memory representations built from moire grids are automatically endowed with size invariance by the scaling properties of the moire grids. We therefore propose that moire interference between grid fields may constitute an important principle of neural computation underlying the construction of scale-invariant memory representations.

    Download reprint PDF file (jns-moire.pdf).

  • Cortical scaling

    Kechen Zhang and Terrence J. Sejnowski (2000): A universal scaling law between gray matter and white matter of cerebral cortex. Proceedings of the National Academy of Sciences USA 97: 5621-5626.

    Abstract: Neocortex, a new and rapidly evolving brain structure in mammals, has a similar layered architecture in species over a wide range of brain sizes. Larger brains require longer fibers to communicate between distant cortical areas; the volume of the white matter that contains long axons increases disproportionally faster than the volume of the gray matter that contains cell bodies, dendrites, and axons for local information processing, according to a power law. The theoretical analysis presented here shows how this remarkable anatomical regularity might arise naturally as a consequence of the local uniformity of the cortex and the requirement for compact arrangement of long axonal fibers. The predicted power law with an exponent of 4/3 minus a small correction for the thickness of the cortex accurately accounts for empirical data spanning several orders of magnitude in brain sizes for various mammalian species including human and non-human primates.

    Download reprint PDF file (pnas-brains.pdf).



  • Cosine tuning for 3-D object

    Kechen Zhang and Terrence J. Sejnowski (1999): A theory of geometric constraints on neural activity for natural three-dimensional movement. Journal of Neuroscience 19: 3122-2145.

    Abstract: Although the orientation of an arm in space or the static view of an object may be represented by a population of neurons in complex ways, how these variables change with movement often follows simple linear rules, reflecting the underlying geometric constraints in the physical world. A theoretical analysis is presented for how such constraints affect the average firing rates of sensory and motor neurons during natural movements with low degrees of freedom, such as a limb movement and rigid object motion. When applied to non-rigid reaching arm movements, the linear theory accounts for cosine directional tuning with linear speed modulation, predicts a curl-free spatial distribution of preferred directions, and also explains why the instantaneous motion of the hand can be recovered from the neural population activity. For three-dimensional motion of a rigid object, the theory predicts that, to a first approximation, the response of a sensory neuron should have a preferred translational direction and a preferred rotation axis in space, both with cosine tuning functions modulated multiplicatively by speed and angular speed, respectively. Some known tuning properties of motion-sensitive neurons follow as special cases. Acceleration tuning and nonlinear speed modulation are considered in an extension of the linear theory. This general approach provides a principled method to derive mechanism-insensitive neuronal properties by exploiting the inherently low dimensionality of natural movements.

    Download reprint PDF file (jns-object.pdf).



  • Tuning width

    Kechen Zhang and Terrence J. Sejnowski (1999): Neuronal tuning: To sharpen or broaden? Neural Computation 11: 75-84.

    Abstract: Sensory and motor variables are typically represented by a population of broadly tuned neurons. A coarser representation with broader tuning can often improve coding accuracy, but sometimes the accuracy may also improve with sharper tuning. The theoretical analysis here shows that the relationship between tuning width and accuracy depends crucially on the dimension of the encoded variable. A general rule is derived for how the Fisher information scales with the tuning width, regardless of the exact shape of the tuning function, the probability distribution of spikes, and allowing some correlated noise between neurons. These results demonstrate a universal dimensionality effect in neural population coding.

    Download reprint PDF file (nc-tuning.pdf).



  • Maximum likelihood by recurrent network

    Alexandre Pouget, Kechen Zhang, Sophie Deneve and Peter E. Latham (1998): Statistically efficient estimation using population code. Neural Computation 10: 373-401.

    Abstract: Coarse codes are widely used throughout the brain to encode sensory and motor variables. Methods designed to interpret these codes, such as population vector analysis, are either inefficient (the variance of the estimate is much larger than the smallest possible variance) or biologically implausible, like maximum likelihood. Moreover, these methods attempt to compute a scalar or vector estimate of the encoded variable. Neurons are faced with a similar estimation problem. They must read out the responses of the presynaptic neurons, but, by contrast, they typically encode the variable with a further population code rather than as a scalar. We show how a nonlinear recurrent network can be used to perform estimation in a near-optimal way while keeping the estimate in a coarse code format. This work suggests that lateral connections in the cortex may be involved in cleaning up uncorrelated noise among neurons representing similar variables.

    Download reprint PDF file (nc-ml.pdf).

    Link to Alex Pouget's homepage.


  • Place cell reconstruction

    Kechen Zhang, Iris Ginzburg, Bruce L. McNaughton, and Terrence J. Sejnowski (1998): Interpreting neuronal population activity by reconstruction: Unified framework with application to hippocampal place cells. Journal of Neurophysiology 79: 1017-1044.

    Abstract: Physical variables such as the orientation of a line in the visual field or the location of the body in space are coded as activity levels in populations of neurons. Reconstruction or decoding is an inverse problem in which the physical variables are estimated from observed neural activity. Reconstruction is useful first in quantifying how much information about the physical variables is present in the population, and second, in providing insight into how the brain might use distributed representations in solving related computational problems such as visual object recognition and spatial navigation. Two classes of reconstruction methods, namely, probabilistic or Bayesian methods and basis function methods, are discussed. They include important existing methods as special cases, such as population vector coding, optimal linear estimation and template matching. As a representative example for the reconstruction problem, different methods were applied to multi-electrode spike train data from hippocampal place cells in freely moving rats. The reconstruction accuracy of the trajectories of the rats was compared for the different methods. Bayesian methods were especially accurate when a continuity constraint was enforced, and the best errors were within a factor of two of the the information-theoretic limit on how accurate any reconstruction can be, which were comparable with the intrinsic experimental errors in position tracking. In addition, the reconstruction analysis uncovered some interesting aspects of place cell activity, such as the tendency for erratic jumps of the reconstructed trajectory when the animal stopped running. In general, the theoretical values of the minimal achievable reconstruction errors quantify how accurately a physical variable is encoded in the neuronal population in the sense of mean square error, regardless of the method used for reading out the information. One related result is that the theoretical accuracy is independent of the width of the Gaussian tuning function only in two dimensions. Finally, all the reconstruction methods considered in this paper can be implemented by a unified neural network architecture, which the brain could feasibly use to solve related problems.

    Download reprint PDF file (jnp-reconst.pdf).


  • Contineous attractor dynamics: head-direction cell model

    Kechen Zhang (1996): Representation of spatial orientation by the intrinsic dynamics of the head-direction cell ensemble: A theory. Journal of Neuroscience 16: 2112-2126.

    Abstract: The head-direction (HD) cells found in the limbic system in freely moving rats represent the instantaneous head direction of the animal in the horizontal plane regardless of the location of the animal. The internal direction represented by these cells uses both self-motion information for inertially based updating and familiar visual landmarks for calibration. Here, a model of the dynamics of the HD cell ensemble is presented. The stability of a localized static activity profile in the network and a dynamic shift mechanism are explained naturally by synaptic weight distribution components with even and odd symmetry, respectively. Under symmetric weights or symmetric reciprocal connections, a stable activity profile close to the known directional tuning curves will emerge. By adding a slight asymmetry to the weights, the activity profile will shift continuously without disturbances to its shape, and the shift speed can be accurately controlled by the strength of the odd-weight component. The generic formulation of the shift mechanism is determined uniquely within the current theoretical framework. The attractor dynamics of the system ensures modality-independence of the internal representation and facilitates the correction for cumulative error by the putative local-view detectors. The model offers a specific one-dimensional example of a computational mechanism in which a truly world-centered representation can be derived from observer-centered sensory inputs by integrating self-motion information.

    Download reprint PDF file (jns-hd.pdf).


  • Optic flow: position-invariance in area MST

    Kechen Zhang, Martin I. Sereno and Margaret E. Sereno (1993): Emergence of position-independent detectors of sense of rotation and dilation with Hebbian learning: An analysis. Neural Computation 5: 597-612.

    Abstract: We previously demonstrated that it is possible to learn position-independent responses to rotation and dilation by filtering rotations and dilations with different centers through an input layer with MT-like speed and direction tuning curves and connecting them to an MSTlike layer with simple Hebbian synapses (Sereno and Sereno 1991). By analyzing an idealized version of the network with broader, sinusoidal direction-tuning and linear speed-tuning, we show analytically that a Hebb rule trained with arbitrary rotation, dilation/contraction, and translation velocity fields yields units with weight fields that are a rotation plus a dilation or contraction field, and whose responses to a rotating or dilating/contracting disk are exactly position independent. Differences between the performance of this idealized model and our original model (and real MST neurons) are discussed.

    Clip here to see a key figure (in Marty Sereno's homepage).

    Download reprint PDF file (nc-mst.pdf).



  • Maxwell's demon: explicit models

    Kechen Zhang and Kezhao Zhang (1992): Mechanical models of Maxwell's demon with noninvariant phase volume. Physical Review A 46: 4598-4605.

    General background: Maxwell's demon is a hypothetical intelligent being originally proposed by Maxwell to demonstrate that the Second Law of Thermodynamics---which, roughly speaking, describes a universal tendency for an isolated system to degenerate into a state of maximum disorder---is statistical in nature and can be breached by intelligence. Historically, the studies (or exorcisement) of the demon have led to a number of interesting results, including the close relation between information and entropy (Szilard), the limitations on the perceptual abilities of the demon (Brillouin), and more recently, the cost of energy dissipation in the computational processes due to the disregard of unwanted information to refresh the memory of the demon (Landauer and Bennett). Following a dynamical approach, we have arrived at a new conclusion: An essential requirement for the demon, regardless of its level of intelligence, is a special dynamics with contracting phase-space volume, and it is possible to construct explicit models of the demon without violating any known physical laws except the Second Law itself.

    Abstract: This paper is concerned with the dynamical basis of Maxwell's demon within the framework of classical mechanics. The authors show that the operation of the demon, whose effect is equivalent to exerting a velocity-dependent force on the gas molecules, can be modeled as a suitable force field without disobeying any laws in classical mechanics. An essential requirement for the models is that the phase-space volume should be noninvariant during time evolution. The necessity of the requirement can be established under general conditions by showing that (1) a mechanical device is able to violate the second law of thermodynamics if and only if it can be used to generate and sustain a robust momentum flow inside an isolated system, and (2) no systems with invariant phase volume are able to support such a flow. The invariance of phase volume appears as an independent factor responsible for the validity of the second law of thermodynamics. When this requirement is removed, explicit mechanical models of Maxwell's demon can exist.

    Download reprint PDF file (pra-demon.pdf).



  • Intrinsic probability in deterministic systems

    Kechen Zhang (1990): Uniform distribution of initial states: The physical basis of probability. Physical Review A 41: 1893-1900.

    Abstract: For repetitive experiments performed on a deterministic system with initial states restricted to a certain region in phase space, the relative frequency of an event has a definite value insensitive to the preparation of the experiments only if the initial states leading to that event are distributed uniformly in the prescribed region. Mechanical models of coin tossing and roulette spinning and the equal a priori probability hypothesis in statistical mechanics are considered in the light of this principle. Probabilities that have arisen from uniform distributions of initial states do not necessarily submit to Kolmogorov's (1956) axioms of probability. In the finite-dimensional case, a uniform distribution in phase space either in the coarse-grained sense or in the limit sense can be formulated in a unified way.

    Download reprint PDF file (pra-prob.pdf).




  • Return to Kechen Zhang homepage