Frequently Asked Questions

Frequently Asked Questions#

Do I need a GPU to use `brainevent`?#

No. Event-driven array and sparse-matrix operations run on CPU, GPU, and TPU. A GPU (and a host C++ compiler) is only required when you compile custom C++/CUDA kernels. See Installation.

Do I need to install the CUDA Toolkit separately?#

No. Installing jax[cuda12] or jax[cuda13] pulls in the nvidia-* pip packages, which already bundle nvcc, ptxas, and the CUDA runtime and headers. You still need the NVIDIA driver and a host C++ compiler (g++/clang++). Details in Installation.

How does the event-driven optimization actually work?#

When you multiply a BinaryArray by a connectivity structure, brainevent dispatches a kernel that iterates only over the active spike indices and accumulates their contributions. Work scales with the number of spikes, not the size of the matrix. See What is event-driven computation?.

Which connectivity format should I use?#

Explicit, reusable sparsity → CSR / CSC.
Large random connectivity → JITC (memory independent of synapse count).
Fixed number of connections per neuron → FixedPreNumConn / FixedPostNumConn.

See Choose a connectivity format and Sparse format trade-offs.

Numba (CPU) / Numba-CUDA, Warp (GPU) — convenient, decorator-based, no separate compiler step.
Raw C++/CUDA — maximum control, or to reuse existing native code.

For raw CUDA kernels, brainevent compiles your source via nvcc, registers XLA FFI targets, and caches compiled artifacts on disk for fast reloads. See The custom-kernel architecture and Compile a raw CUDA/C++ kernel.

How do I ship a custom CUDA kernel with my project?#

Place the kernel in a co-located .cu file and load it at import time:

# my_module/my_kernels.py
from pathlib import Path
from brainevent import load_cuda_file

_module = load_cuda_file(
    Path(__file__).parent / "my_kernels.cu",
    target_prefix="my_module.my_kernels",
)

Annotate each entry point in the .cu file with // @BE:

// @BE my_kernel arg arg ret stream
void my_kernel(const BE::Tensor& input,
               const BE::Tensor& weights,
               BE::Tensor& output,
               int64_t stream) {
    // kernel launch code
}

load_cuda_file compiles the kernel on first use, caches the .so to disk, and registers it as a JAX FFI target. Subsequent imports skip recompilation (see Caching).

Frequently Asked Questions

Contents

Frequently Asked Questions#

Do I need a GPU to use `brainevent`?#

Do I need to install the CUDA Toolkit separately?#

How does the event-driven optimization actually work?#

Which connectivity format should I use?#

Can I learn or inspect individual JITC weights?#

Are computations reproducible?#

Can I attach physical units to weights?#

Which custom-kernel backend should I use?#

How do I ship a custom CUDA kernel with my project?#