ETraceGraphExecutor

ETraceGraphExecutor#

class braintrace.ETraceGraphExecutor(model, include_recurrent_mixing=False, control_flow=None)#

The eligibility trace graph executor.

This class is used for computing the weight spatial gradients and the hidden state residuals. It is the most foundational class for the ETrace algorithms.

It is important to note that the graph is built no matter whether the model is batched or not. This means that this graph can be applied to any kind of models. However, the compilation is sensitive to the shape of hidden states.

Parameters:

model (Module) – The model to build the eligibility trace graph. The models should only define the one-step behavior.
include_recurrent_mixing (bool) – Hidden-group grouping mode for the hidden-to-hidden transition; see compile_etrace_graph(..., include_recurrent_mixing=...).
control_flow (ControlFlowPolicy | None) – Policy governing control-flow canonicalization (cond if-conversion, scan unrolling, structured scan descent, …) during graph compilation. None (default) uses DEFAULT_CONTROL_FLOW_POLICY.

compile_graph(*args)[source]#

Build the eligibility trace graph for the model based on the provided inputs.

This method is crucial for constructing the graph used in the eligibility trace algorithm, which is essential for calculating weight spatial gradients and the hidden state Jacobian.

Parameters:: *args (Any) – Positional arguments for the model, which may include inputs, parameters, or other necessary data required for graph compilation.
Returns:: This method does not return any value. It initializes the compiled graph attribute of the instance.
Return type:: None

property graph: ETraceGraph#

Retrieve the compiled eligibility trace graph for the model.

This property provides access to the compiled graph, which is a crucial data structure for the eligibility trace algorithm. It contains various attributes that describe the relationships between the model’s variables, states, and operations.

Returns:: The compiled graph for the model. This graph includes detailed information about the model’s structure, such as output variables, state variables, hidden-to-hidden variable relationships, and more.
Return type:: ETraceGraph
Raises:: ValueError – If the graph has not been compiled yet. Ensure to call the compile_graph() method before accessing this property.

property path_to_states: FlattedDict#

The path to the states.

Returns:: The path to the states.
Return type:: brainstate.util.FlattedDict[Path, brainstate.State]

show_graph(verbose=True, return_msg=False)[source]#

Display the graph illustrating weights, operators, and hidden states.

Renders via braintrace.CompilationReport, the single source of truth for the structural summary.

Parameters:

verbose (bool) – If True (default), print the summary to stdout.
return_msg (bool) – If True, also return the summary string. Default False.

Returns:

The summary string if return_msg is True, else None.

Return type:

None | str

solve_h2w_h2h_jacobian(*args)[source]#

Compute the hidden-to-weight and hidden-to-hidden Jacobian matrices.

This function is designed to calculate the forward propagation of the hidden-to-weight Jacobian and the hidden-to-hidden Jacobian based on the provided inputs and parameters. It is a crucial part of the eligibility trace algorithm, which helps in understanding the influence of weights and previous hidden states on the current hidden state.

Parameters:

*args (Any) – Positional arguments for the model, which may include inputs, parameters, or other necessary data required for the computation of the Jacobians.

Returns:

A tuple containing the following elements:

The function output (e.g., model predictions).
The updated hidden states after the current computation step.
Other states that may be relevant to the model’s operation.
The spatial gradients of the weights, represented by the hidden-to-weight Jacobian.

Return type:

Any

Raises:

NotImplementedError – This method must be implemented by subclasses.

Notes

For the state transition function \(y, h^t = f(h^{t-1}, \theta, x)\), this function aims to solve:

The function output \(y\).
The updated hidden states \(h^t\).
The Jacobian matrix of hidden-to-weight, i.e., \(\partial h^t / \partial \theta^t\).
The Jacobian matrix of hidden-to-hidden, i.e., \(\partial h^t / \partial h^{t-1}\).

solve_h2w_h2h_l2h_jacobian(*args)[source]#

Compute the hidden-to-weight and hidden-to-hidden Jacobian matrices, along with the VJP transformed loss-to-hidden gradients based on the provided inputs.

This function is designed to calculate both the forward propagation of the hidden-to-weight Jacobian and the loss-to-hidden gradients at the current time-step. It is essential for understanding the influence of weights and previous hidden states on the current hidden state, as well as the impact of the loss on the hidden states.

Parameters:

*args (Any) – Positional arguments for the model, which may include inputs, parameters, or other necessary data required for the computation of the Jacobians and gradients.

Returns:

A tuple containing the following elements:

The function output (e.g., model predictions).
The updated hidden states after the current computation step.
Other states that may be relevant to the model’s operation.
The spatial gradients of the weights, represented by the hidden-to-weight Jacobian.
The residuals, which are the partial gradients of the loss with respect to the hidden states.

Return type:

Any

Raises:

NotImplementedError – This method must be implemented by subclasses.

Notes

Particularly, this function aims to solve:

The Jacobian matrix of hidden-to-weight. That is, \(\partial h / \partial w\), where \(h\) is the hidden state and \(w\) is the weight.
The Jacobian matrix of hidden-to-hidden. That is, \(\partial h / \partial h\), where \(h\) is the hidden state.
The partial gradients of the loss with respect to the hidden states. That is, \(\partial L / \partial h\), where \(L\) is the loss and \(h\) is the hidden state.

property state_id_to_path: Dict[int, Tuple[str, ...]]#

The state id to the path.

Returns:: The mapping from state id to the path.
Return type:: Dict[int, Path]

property states: FlattedDict#

The states for the model.

Returns:: The states for the model.
Return type:: brainstate.util.FlattedDict[Path, brainstate.State]

ETraceGraphExecutor

Contents

ETraceGraphExecutor#

Modeling

Infrastructure

Compilation