
We know that optimization techniques search the space of all possible parameters for a parameter set that minimizes the model's cost function. The most well-known loss functions, such as MSE or Categorical Cross Entropy, have a global minimum value of zero in the ideal case.

For example, Gradient Descent, $\theta_j \leftarrow \theta_j - \alpha \frac{\partial}{\partial \theta_j}J(\theta)$, updates the parameters based on the derivative of the computed cost value, $J(\theta)$.
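For concreteness, here is a minimal NumPy sketch of that update rule applied to a simple one-parameter quadratic cost (the cost function, learning rate, and iteration count are illustrative choices, not part of the question):

```python
import numpy as np

# Illustrative cost: J(theta) = (theta - 3)^2, minimised at theta = 3 with J = 0.
def J(theta):
    return (theta - 3.0) ** 2

def dJ(theta):
    return 2.0 * (theta - 3.0)

theta = 0.0   # initial parameter
alpha = 0.1   # learning rate

for _ in range(100):
    theta = theta - alpha * dJ(theta)   # theta_j <- theta_j - alpha * dJ/dtheta_j

print(theta, J(theta))   # theta ~ 3.0, J(theta) ~ 0.0
```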

I was wondering what would happen if we designed a cost function whose global minimum in the ideal case is non-zero. Would it make a difference, e.g. in the convergence rate or other aspects of the optimization process, or not?


1 Answer


Saying that the well-known loss functions, like MSE or Categorical Cross Entropy, have a global minimum value equal to zero is flawed. The idea behind a loss function is to measure how close the model's predictions are to the actuals (in the case of regression). Ideally, you would want your model to predict exactly the actual values; only in that case is the loss zero. Otherwise, the loss is non-zero almost all the time. Recall the loss function for a linear regression setting:

$$J(\theta) = \frac{1}{2m}\sum_{i=1}^{m}\left(h_\theta(x^{(i)}) - y^{(i)}\right)^2$$

We need to minimise $J(\theta)$ so that the predictions are as close to the actuals as possible. For that, the derivative of $J(\theta)$ should be zero; it does not matter whether the minimum value of $J(\theta)$ itself is zero or non-zero. Graphically, for a typical convex cost curve, you want to reach the point where the derivative is zero, wherever that minimum value sits on the vertical axis.
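As a quick sanity check of this point, the following sketch (using an illustrative quadratic cost, not one from the question) compares two costs that differ only by a constant offset: the gradients, the updates, and the converged parameter are identical, and only the reported minimum value shifts.

```python
# Two costs that differ only by a constant: J1 has minimum 0, J2 has minimum 5.
def J1(theta):
    return (theta - 3.0) ** 2

def J2(theta):
    return (theta - 3.0) ** 2 + 5.0

def grad(theta):
    # The constant vanishes under differentiation, so both costs share this gradient.
    return 2.0 * (theta - 3.0)

theta1 = theta2 = 0.0
alpha = 0.1
for _ in range(100):
    theta1 -= alpha * grad(theta1)
    theta2 -= alpha * grad(theta2)

print(theta1, J1(theta1))   # ~3.0, ~0.0
print(theta2, J2(theta2))   # ~3.0, ~5.0  (same parameter, shifted minimum value)
```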

