Standard Regularizations#
Classical regularization methods for parameter penalization and constraint enforcement.
These regularizations add penalty terms to the loss function to encourage desired
properties like sparsity (L1), smoothness (L2), or structural constraints (orthogonality,
spectral norms). Use with Param to automatically include regularization losses in
training objectives.
Abstract base class for parameter regularization. |
|
L1 (Lasso) regularization. |
|
L2 (Ridge) regularization. |
|
Elastic Net regularization (combination of L1 and L2). |
|
Huber regularization (robust regularization). |
|
Group Lasso regularization. |
|
Total Variation regularization. |
|
Max Norm regularization (soft constraint). |
|
Entropy regularization. |
|
Orthogonal regularization. |
|
Spectral Norm regularization. |
|
Composite regularization that chains multiple regularizations together. |
Prior Distribution-Based Regularizations#
Probabilistic regularizations based on prior distributions for Bayesian-inspired
parameter estimation. These regularizations encode domain knowledge or assumptions
about parameter distributions (Gaussian, heavy-tailed, bounded, etc.). Particularly
useful for variational inference, maximum a posteriori (MAP) estimation, and
uncertainty quantification. Each regularization implements loss(), sample_init(),
and reset_value() for prior-based parameter initialization.
Gaussian prior regularization. |
|
Student's t-distribution prior regularization. |
|
Cauchy prior regularization. |
|
Uniform prior regularization (soft bounded constraint). |
|
Beta prior regularization (for parameters in [0, 1]). |
|
Log-normal prior regularization (for positive parameters). |
|
Exponential prior regularization (for positive parameters). |
|
Gamma prior regularization (for positive parameters). |
|
Inverse-Gamma prior regularization (for variance parameters). |
|
Log-uniform (Jeffreys) prior regularization (scale-invariant). |
|
Horseshoe prior regularization (strong sparsity with heavy tails). |
|
Spike-and-slab prior regularization (variable selection). |
|
Dirichlet prior regularization (for probability simplexes). |