Source code for brainpy_state._nest_neuron.rate_transformer_node

# Copyright 2026 BrainX Ecosystem Limited. All Rights Reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
#     http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
# ==============================================================================

# -*- coding: utf-8 -*-


from typing import Callable

import brainstate
import braintools
import brainunit as u
import jax.numpy as jnp
import numpy as np
from brainstate.typing import Size

from brainpy_state._nest_base.base import NESTNeuron

__all__ = [
    'rate_transformer_node',
]


class rate_transformer_node(NESTNeuron):
    r"""NEST-compatible ``rate_transformer_node`` template model.

    A stateless rate-based processing node that aggregates weighted incoming rate signals
    and applies a configurable input nonlinearity. Serves as an intermediary transformation
    stage in rate-based neural networks, mimicking NEST's ``rate_transformer_node<TNonlinearities>``
    template.

    **1. Mathematical Model**

    The model implements the transformation:

    .. math::

       X_i(t) = \phi\!\left(\sum_j w_{ij}\,\psi\!\left(X_j(t-d_{ij})\right)\right)

    where:

    - :math:`X_i(t)` is the output rate of node :math:`i` at time :math:`t`
    - :math:`X_j(t-d_{ij})` are incoming rates from presynaptic nodes :math:`j` with delay :math:`d_{ij}`
    - :math:`w_{ij}` are connection weights
    - :math:`\phi` is the input nonlinearity, applied to the summed input (identity when ``linear_summation=False``)
    - :math:`\psi` is the output nonlinearity, applied per-event before summation (identity when ``linear_summation=True``)

    The model has **no intrinsic dynamics**: no differential equations, no noise, no membrane
    time constant. It acts purely as a feedforward transformation stage.

    **2. Nonlinearity Application Modes**

    The ``linear_summation`` parameter controls where the nonlinearity is applied, matching
    NEST's event handler semantics:

    **Mode A: ``linear_summation=True`` (default, recommended)**
      - Event handlers store weighted rates **without** applying nonlinearity
      - Nonlinearity :math:`\phi` is applied **once** to the summed input during ``update()``
      - Efficient when many inputs converge to one node
      - Math: :math:`X_i = \phi\!\left(\sum_j w_{ij} X_j(t-d_{ij})\right)`

    **Mode B: ``linear_summation=False``**
      - Nonlinearity :math:`\psi` is applied **per-event** before summation
      - Event handlers pre-transform each incoming rate
      - Useful for models where nonlinearity operates on individual inputs
      - Math: :math:`X_i = \sum_j w_{ij}\,\psi\!\left(X_j(t-d_{ij})\right)`

    **3. Default Nonlinearity**

    If no custom ``input_nonlinearity`` is provided, the model uses a simple gain function:

    .. math::

       \phi(h) = g \cdot h

    where :math:`g` is the ``g`` parameter (default 1.0).

    **4. Update Algorithm**

    The update sequence (matching NEST's ``rate_transformer_node_impl.h``) is:

    1. **Store delayed output**: Copy current ``rate`` → ``delayed_rate`` for outgoing delayed connections
    2. **Drain delayed queue**: Retrieve events scheduled for current timestep
    3. **Process delayed events**: Handle ``delayed_rate_events`` with explicit delay specifications
    4. **Process instant events**: Handle ``instant_rate_events`` (zero-delay)
    5. **Sum contributions**: Aggregate all weighted inputs into a single value
    6. **Apply nonlinearity** (if ``linear_summation=True``): Transform summed input
    7. **Update state**: Store new ``rate`` and ``instant_rate``
    8. **Increment step counter**: Advance internal timestep

    **5. Event Handling**

    The model accepts two types of runtime events in ``update()``:

    **Instant events (``instant_rate_events``)**:
      - Applied in the **current** timestep (zero delay)
      - Must not specify non-zero ``delay_steps`` (raises ``ValueError``)
      - Typical use: direct feedforward connections

    **Delayed events (``delayed_rate_events``)**:
      - Stored in internal queue and applied after ``delay_steps`` timesteps
      - Default delay is 1 step if not specified
      - Negative delays raise ``ValueError``

    **Event format** (flexible tuple/dict):
      - 2-tuple: ``(rate, weight)`` → uses default delay
      - 3-tuple: ``(rate, weight, delay_steps)``
      - 4-tuple: ``(rate, weight, delay_steps, multiplicity)``
      - dict: ``{'rate': ..., 'weight': ..., 'delay_steps': ..., 'multiplicity': ...}``
      - Scalar: interpreted as ``rate`` with ``weight=1.0, delay=default, multiplicity=1.0``

    **6. Latency Considerations**

    As in NEST, inserting a transformer node introduces **one simulation step of latency**
    compared to direct instantaneous connections. This is because:

    - Output ``instant_rate`` is updated at the **end** of the current timestep
    - Downstream nodes read this value in the **next** timestep
    - Even "instantaneous" connections have this inherent discrete-time delay

    **7. Computational Complexity**

    - **Time complexity**: :math:`O(E)` per timestep, where :math:`E` is the number of events
    - **Space complexity**: :math:`O(D \times N)` for delayed queue, where :math:`D` is max delay and :math:`N` is population size
    - **Nonlinearity cost**: :math:`O(N)` per call (applied once if ``linear_summation=True``, or :math:`E` times otherwise)

    Parameters
    ----------
    in_size : int, tuple of int, or Size
        Shape of the node population. Can be a scalar (1D population), tuple (multi-dimensional),
        or ``brainstate.Size`` object. Determines the shape of ``rate`` state variable.
    linear_summation : bool, optional
        Controls where the input nonlinearity is applied:

        - ``True`` (default): Apply nonlinearity **once** to summed input (efficient, recommended)
        - ``False``: Apply nonlinearity **per-event** before summation (mathematically different)

        See "Nonlinearity Application Modes" above for detailed semantics.
    g : float, array_like, optional
        Gain parameter for the default linear nonlinearity :math:`\phi(h) = g \cdot h`.
        Can be a scalar (shared across population) or array with shape matching ``in_size``.
        Ignored if custom ``input_nonlinearity`` is provided. Default: ``1.0`` (identity transform).
    input_nonlinearity : callable, optional
        Custom input nonlinearity function replacing the default :math:`g \cdot h`. Must accept:

        - Signature 1: ``f(h)`` where ``h`` is ndarray of summed inputs (shape ``in_size``)
        - Signature 2: ``f(model, h)`` where ``model`` is ``self`` (for accessing parameters)

        The node first calls ``f(model, h)`` and falls back to ``f(h)`` if that call raises
        ``TypeError``. The callable must be JAX-expressible so the step can run under ``jit``.
        If ``None`` (default), uses ``g * h``. Return value must broadcast to ``in_size``.
    rate_initializer : callable, optional
        Initializer for the ``rate`` state variable. Must be a callable accepting ``(shape, batch_size)``
        and returning an array. Common choices:

        - ``braintools.init.Constant(0.0)`` (default): Initialize to zero
        - ``braintools.init.Normal(0.0, 0.1)``: Gaussian initialization
        - ``braintools.init.Uniform(0.0, 1.0)``: Uniform random

        Default: ``Constant(0.0)`` (all rates start at zero).
    name : str, optional
        Unique identifier for this module instance. Used for logging, debugging, and visualization.
        If ``None``, an auto-generated name is assigned. Default: ``None``.

    Parameter Mapping
    -----------------

    This table maps brainpy.state parameters to NEST's template instantiation:

    ================================  ================================  ================================
    brainpy.state parameter           NEST equivalent                   Notes
    ================================  ================================  ================================
    ``in_size``                       (implicit in node creation)       Population size
    ``linear_summation=True``         ``linear_summation=true``         Default mode in NEST
    ``linear_summation=False``        ``linear_summation=false``        Per-event nonlinearity
    ``g``                             (template parameter)              Gain in default nonlinearity
    ``input_nonlinearity``            ``TNonlinearities::input``        Custom :math:`\phi` or :math:`\psi`
    ``rate_initializer``              (no direct equiv)                 NEST defaults to 0.0
    ================================  ================================  ================================

    State Variables
    ---------------
    rate : ndarray, shape (in_size,)
        **Primary output rate** of the node. Updated at the end of each timestep after applying
        nonlinearity. This is the value read by downstream connections in the **next** timestep.
        Type: ``ShortTermState`` (persists between timesteps but not saved in checkpoints).
    instant_rate : ndarray, shape (in_size,)
        **Instantaneous output rate**, identical to ``rate`` after update. Provided for clarity
        in models that distinguish between instant and delayed outputs. Equals ``rate`` at the
        end of each timestep.
    delayed_rate : ndarray, shape (in_size,)
        **Previous timestep's output rate**, used for delayed outgoing connections. Equals
        ``rate`` from the **previous** timestep. Allows modeling connection delays explicitly.
    _step_count : int64 scalar
        Internal timestep counter used for delayed event scheduling. Increments by 1 each update.
        Not intended for external access.

    Receptor Types
    --------------
    The model exposes a single receptor type:

    - ``'RATE': 0`` — accepts rate-valued inputs (both instant and delayed)

    Use this in connection specifications to route rate signals to the transformer node.

    Recordables
    -----------
    The following state variables can be monitored during simulation:

    - ``'rate'`` — primary output rate (most commonly recorded)

    Notes
    -----
    **Comparison with NEST:**
      - Fully replicates NEST's ``rate_transformer_node_impl.h`` event handling logic
      - Supports all NEST template instantiations via custom ``input_nonlinearity``
      - Uses NumPy-based event queue instead of NEST's C++ ring buffer (functionally identical)
      - Batch processing is supported via ``init_state(batch_size=...)``

    **When to use this model:**
      - **Layered rate networks**: Inserting nonlinear transformations between rate-neuron populations
      - **Gain modulation**: Implementing attention or gating via spatially varying ``g``
      - **Modular architectures**: Separating rate dynamics (in rate neurons) from transformations (in transformer nodes)

    **Failure modes:**
      - Passing non-zero ``delay_steps`` in ``instant_rate_events`` raises ``ValueError``
      - Negative ``delay_steps`` in ``delayed_rate_events`` raises ``ValueError``
      - Custom ``input_nonlinearity`` returning wrong shape causes broadcasting errors

    **Performance tips:**
      - Use ``linear_summation=True`` (default) for better efficiency when many inputs converge
      - Avoid unnecessary delayed events if connections are instantaneous
      - For very long delays, consider pruning old queue entries manually if memory is constrained

    References
    ----------
    .. [1] Hahne, J., Dahmen, D., Schuecker, J., Frommer, A., Bolten, M., Helias, M.,
           & Diesmann, M. (2017). Integration of continuous-time dynamics in a spiking
           neural network simulator. *Frontiers in Neuroinformatics*, 11, 34.
           https://doi.org/10.3389/fninf.2017.00034

    .. [2] NEST Simulator. ``rate_transformer_node`` documentation.
           https://nest-simulator.readthedocs.io/en/stable/models/rate_transformer_node.html

    .. [3] NEST source code: ``rate_transformer_node_impl.h``
           https://github.com/nest/nest-simulator

    Examples
    --------
    **Example 1: Basic usage with default linear gain**

    .. code-block:: python

        >>> from brainpy import state as bst
        >>> import brainunit as u
        >>> import brainstate
        >>> # Create a transformer node with 10 units
        >>> transformer = bst.rate_transformer_node(in_size=10, g=2.0)
        >>> # Initialize state
        >>> transformer.init_state()
        >>> # Send instant rate events (rate=0.5, weight=1.0)
        >>> with brainstate.environ.context(dt=0.1 * u.ms):
        ...     output = transformer.update(instant_rate_events=(0.5, 1.0))
        >>> print(output)  # doctest: +SKIP
        [1.0, 1.0, ..., 1.0]  # g * rate * weight = 2.0 * 0.5 * 1.0

    **Example 2: Custom sigmoid nonlinearity**

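    .. code-block:: python

        >>> import jax.numpy as jnp
        >>> import brainunit as u
        >>> import brainstate
        >>> from brainpy import state as bst
        >>> # Define a sigmoid activation (written with jax.numpy so it is JAX-expressible)
        >>> def sigmoid(h):
        ...     return 1.0 / (1.0 + jnp.exp(-h))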
        >>> transformer = bst.rate_transformer_node(
        ...     in_size=5,
        ...     linear_summation=True,
        ...     input_nonlinearity=sigmoid
        ... )
        >>> transformer.init_state()
        >>> # High input drives to saturation
        >>> with brainstate.environ.context(dt=0.1 * u.ms):
        ...     output = transformer.update(instant_rate_events=(10.0, 1.0))
        >>> print(output)  # doctest: +SKIP
        [0.9999..., 0.9999..., ...]  # sigmoid(10) ≈ 1.0

    **Example 3: Delayed event scheduling**

    .. code-block:: python

        >>> from brainpy import state as bst
        >>> import brainunit as u
        >>> import brainstate
        >>> transformer = bst.rate_transformer_node(in_size=3)
        >>> transformer.init_state()
        >>> # Send delayed event (rate=1.0, weight=0.5, delay=2 steps)
        >>> with brainstate.environ.context(dt=1.0 * u.ms):
        ...     out_t0 = transformer.update(delayed_rate_events=(1.0, 0.5, 2))
        ...     out_t1 = transformer.update()  # Delayed event still in queue
        ...     out_t2 = transformer.update()  # Delayed event arrives now
        >>> print(out_t0, out_t1, out_t2)  # doctest: +SKIP
        [0., 0., 0.] [0., 0., 0.] [0.5, 0.5, 0.5]  # Event arrives at t+2

    **Example 4: Per-event nonlinearity (linear_summation=False)**

    .. code-block:: python

        >>> import jax.numpy as jnp
        >>> import brainunit as u
        >>> import brainstate
        >>> from brainpy import state as bst
        >>> # ReLU applied per-event before summation
        >>> relu = lambda h: jnp.maximum(0, h)
        >>> transformer = bst.rate_transformer_node(
        ...     in_size=1,
        ...     linear_summation=False,
        ...     input_nonlinearity=relu
        ... )
        >>> transformer.init_state()
        >>> # Two events: one positive, one negative
        >>> events = [
        ...     (2.0, 1.0),   # ReLU(2.0) * 1.0 = 2.0
        ...     (-1.0, 1.0)   # ReLU(-1.0) * 1.0 = 0.0
        ... ]
        >>> with brainstate.environ.context(dt=0.1 * u.ms):
        ...     output = transformer.update(instant_rate_events=events)
        >>> print(output)  # doctest: +SKIP
        [2.0]  # Sum of per-event transformed values

    See Also
    --------
    rate_neuron_ipn : Rate neuron with intrinsic dynamics and input nonlinearity
    rate_neuron_opn : Rate neuron with output nonlinearity
    lin_rate : Linear rate neuron with dynamics
    sigmoid_rate : Sigmoid rate neuron
    """

    __module__ = 'brainpy.state'

    #: This node emits a continuous graded value every step (seam-(H)); the
    #: substrate routes ``weight * <emission>`` into downstream rate targets.
    _emission_continuous = True

    def __init__(
        self,
        in_size: Size,
        linear_summation: bool = True,
        g: float = 1.0,
        input_nonlinearity: Callable | None = None,
        rate_initializer: Callable = braintools.init.Constant(0.0),
        name: str | None = None,
    ):
        super().__init__(in_size=in_size, name=name)

        self.linear_summation = bool(linear_summation)
        self.g = braintools.init.param(g, self.varshape)
        self.input_nonlinearity = input_nonlinearity
        self.rate_initializer = rate_initializer

        # Seam-(H) continuous graded emission: the substrate captures this node's
        # output each step and deposits ``weight * <emission>`` into downstream
        # rate targets. ``linear_summation`` selects which value rides the seam --
        # the raw rate (post applies phi to the summed input) or the pre-applied
        # ``phi_rate`` (post sums phi-of-rate directly).
        self._emission_attr = 'rate' if self.linear_summation else 'phi_rate'

    @property
    def recordables(self):
        return ['rate']

    @property
    def receptor_types(self):
        return {'RATE': 0}

    @staticmethod
    def _to_numpy(x):
        dftype = brainstate.environ.dftype()
        return np.asarray(u.math.asarray(x), dtype=dftype)

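    def _call_nl(self, fn: Callable, x):
        r"""Invoke a user nonlinearity, trying ``fn(self, x)`` first and ``fn(x)`` as a fallback.

        If ``fn(self, x)`` raises ``TypeError``, the call is retried as ``fn(x)``; if
        that also raises ``TypeError``, the first error is re-raised. The argument
        ``x`` is a JAX value (a tracer under ``for_loop`` / ``jit``), so the supplied
        callable must be JAX-expressible.
        """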
        try:
            return fn(self, x)
        except TypeError as first_error:
            try:
                return fn(x)
            except TypeError:
                raise first_error

    def _activation(self, h):
        r"""Input nonlinearity :math:`\phi(h)` (JAX; reads ``self``).

        Uses the user-supplied ``input_nonlinearity`` when provided (tried as
        ``fn(self, h)``, falling back to ``fn(h)``), otherwise the default linear gain
        :math:`\phi(h)=g\,h`. Must be JAX-expressible so the step lowers under
        ``brainstate.transform.for_loop`` / ``jit``.
        """
        if self.input_nonlinearity is None:
            return u.get_mantissa(self.g) * h
        return self._call_nl(self.input_nonlinearity, h)

    def _alloc_phi_rate(self, rate_np):
        r"""Allocate the ``phi_rate`` emission holder (``linear_summation`` False only).

        When ``linear_summation`` is False the seam emits :math:`\phi(\text{rate})`
        so downstream targets sum the already-transformed value; this allocates the
        holder so the substrate can capture it.
        """
        if not self.linear_summation:
            dftype = brainstate.environ.dftype()
            phi0 = np.asarray(u.get_mantissa(self._activation(jnp.asarray(rate_np))), dtype=dftype)
            self.phi_rate = brainstate.ShortTermState(phi0)

    def _store_phi_rate(self, rate_new):
        r"""Refresh the ``phi_rate`` emission holder (``linear_summation`` False only)."""
        if not self.linear_summation:
            self.phi_rate.value = self._activation(rate_new)


    def init_state(self, **kwargs):
        r"""Initialize the rate state variables.

        Allocates ``rate``, ``instant_rate``, and ``delayed_rate``, plus the ``phi_rate``
        emission holder when ``linear_summation=False``. Must be called before
        the first ``update()`` invocation.

        Parameters
        ----------
        **kwargs
            Unused compatibility parameters accepted by the base-state API.

        Notes
        -----
        ``rate``, ``instant_rate``, and ``delayed_rate`` are initialized from the
        ``rate_initializer`` provided during construction. When allocated, ``phi_rate``
        starts as the input nonlinearity applied to the initial rate.

        This method is typically called automatically by ``brainstate`` infrastructure, but can
        be invoked manually to reset the model to initial conditions.
        """
        dftype = brainstate.environ.dftype()
        rate = braintools.init.param(self.rate_initializer, self.varshape)
        rate_np = self._to_numpy(rate)

        self.rate = brainstate.ShortTermState(rate_np)
        self.instant_rate = brainstate.ShortTermState(np.array(rate_np, dtype=dftype, copy=True))
        self.delayed_rate = brainstate.ShortTermState(np.array(rate_np, dtype=dftype, copy=True))
        self._alloc_phi_rate(rate_np)



    def update(self, x=0.0):
        r"""Apply the input nonlinearity to the aggregated network input.

        Network coupling arrives continuously through the substrate's delta
        channel (seam-(H)): :math:`h=\sum_j w_{ij}\,\psi(X_j)` is read from
        ``sum_delta_inputs(0.0)``. With ``linear_summation=True`` the gain is
        applied to that sum, :math:`X_i=\phi(h)`; with ``linear_summation=False``
        the presynaptic gain has already been applied per connection (each source
        emits ``phi_rate``) and the node forwards the sum directly,
        :math:`X_i=h`. The whole step is JAX-expressible so it lowers under
        ``brainstate.transform.for_loop`` / ``jit``.

        Parameters
        ----------
        x : ArrayLike, optional
            Ignored; accepted for ``Dynamics`` API compatibility (the NEST rate
            transformer has no intrinsic current-driven dynamics).

        Returns
        -------
        rate_new : ArrayLike
            Updated output rate :math:`X_i` (shape ``self.rate.value.shape``).
        """
        del x  # NEST rate transformer has no intrinsic current input.
        dftype = brainstate.environ.dftype()
        state_shape = self.rate.value.shape

        h = u.get_mantissa(self.sum_delta_inputs(0.0))
        h = jnp.broadcast_to(jnp.asarray(h, dtype=dftype), state_shape)

        rate_prev = jnp.broadcast_to(jnp.asarray(self.rate.value, dtype=dftype), state_shape)
        rate_new = self._activation(h) if self.linear_summation else h

        self.rate.value = rate_new
        self.delayed_rate.value = rate_prev
        self.instant_rate.value = rate_new
        self._store_phi_rate(rate_new)
        return rate_new
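
The two ``linear_summation`` modes described in the class docstring can be illustrated outside the library. The following is a minimal NumPy sketch, not the node's implementation: ``transform`` is a hypothetical helper that assumes a single target node and ignores delays, state variables, and the substrate's delta channel.

```python
import numpy as np


def transform(rates, weights, phi, linear_summation=True):
    # Hypothetical helper (not part of brainpy.state): combines the
    # presynaptic rates arriving at one target node.
    rates = np.asarray(rates, dtype=float)
    weights = np.asarray(weights, dtype=float)
    if linear_summation:
        # Mode A: sum the weighted rates first, then apply phi once.
        return phi(np.sum(weights * rates))
    # Mode B: apply the nonlinearity to each rate, then take the weighted sum.
    return np.sum(weights * phi(rates))


def relu(h):
    # Rectifier used only for this illustration.
    return np.maximum(0.0, h)


rates = [2.0, -1.0]    # one positive and one negative presynaptic rate
weights = [1.0, 1.0]

mode_a = transform(rates, weights, relu, linear_summation=True)
mode_b = transform(rates, weights, relu, linear_summation=False)
print(mode_a, mode_b)  # 1.0 2.0
```

With these inputs, mode A rectifies the net input (2.0 - 1.0 = 1.0), while mode B rectifies each rate before summing (2.0 + 0.0 = 2.0). The two modes coincide only when the nonlinearity is linear, as with the default gain ``g * h``.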