Regularizers
Below we enumerate the regularizers implemented by ERM, and provide their mathematical definition.
Mathematical definitions
name | ERM Regularizer | mathematical definition | notes |
---|---|---|---|
L2 ($\ell_2$) | L2Reg() | $r(\theta) = \|\theta\|_2 = \left(\sum_{i=1}^n (\theta_i)^{2}\right)^{\frac{1}{2}}$ | convex |
L1 ($\ell_1$) | L1Reg() | $r(\theta) = \|\theta\|_1 = \sum_{i=1}^n|\theta_i|$ | convex, sparsifying |
Square root ($\ell_{0.5}$) | SqrtReg() | $r(\theta) = \left(\sum_{i=1}^n |\theta_i|^{1/2} \right)^{2}$ | non-convex, sparsifying |
Nonnegative | NonnegReg() | $r(\theta) = \begin{cases} 0 & \theta_i \geq 0 \text{ for all i} \\\\ +\infty & \text{else} \end{cases}$ | convex |
A good reference for regularizers are the EE104 lecture slides. In particular, the lecture on non-quadratic regularizers is helpful.