Skip to main content

Questions tagged [loss]

3 votes
0 answers
44 views

I am trying to fine-tune a transformer/encoder based pose estimation model available here at: https://huggingface.co/docs/transformers/en/model_doc/vitpose When passing "labels" attribute to ...
Soham Bhaumik's user avatar
1 vote
1 answer
47 views

My dataset consists of 5625 Arabic examples and 5625 synsets, and my model is CNN followed by a sigmoid classification layer. I constructed this 5625 synsets to 5625 classes, and my predicted output ...
rahma touzi's user avatar
1 vote
1 answer
41 views

If I would do loss = loss/10 before calculating the gradient would that change the amount of change applied to the model parameters during back propagation? Or is ...
GreedyGroot's user avatar
0 votes
1 answer
86 views

The below function is applied as a filtering procedure for a set of clients that are represented by accuracy values. where accuracy is used to measure the model’s performance. So, my question is: If ...
aam's user avatar
  • 1
1 vote
0 answers
35 views

Hi Community and thanks in advance for the help. I am working on transfer learning - specifically GoogLeNet model with the Food101 Dataset. Code is below. I think everything is in order from data ...
James's user avatar
  • 21
0 votes
0 answers
25 views

In my Machine Learning intro course I made several scatter plots of predictions for the california housing data set. Here is the most complete of them (created by a pipe using a sklearn StandardScaler ...
S3k's user avatar
  • 1
0 votes
0 answers
48 views

if we have a loss like this plot, is it kind of underfitting or goodfitting? Error results: Training Process:
stack offer's user avatar
4 votes
0 answers
343 views

Reading the InstructGPT paper(which seems to be what ChatGPT was built off of), I found this equation for the reward function. However, I'm struggling to understand how this equation is used to ...
itisyeetimetoday's user avatar
1 vote
0 answers
161 views

I am trying to understand Perplexity within Natural Language Processing as a metric more fully. And I am doing so by creating manual examples to understand all the component parts. Is the following ...
Piskator's user avatar
  • 135
1 vote
1 answer
500 views

I have a problem. I have trained a model. And as you can see, there is a zigzag in the loss. In addition, the validation loss is increasing. What does this mean if you only look at the training curve? ...
Test's user avatar
  • 89
0 votes
1 answer
68 views

As the title I asked. For example: a model that predicts the probability of a stock price rising/falling. Let's say this is a triple-classification problem. If it predicts "RISING", while ...
EvilRoach's user avatar
  • 163
0 votes
2 answers
112 views

I've created an LSTM model to predict 1 output value from 8 features. My loss constantly decreases and my val loss also decreases from the start, however it begins to increase after so many epochs. ...
ahy's user avatar
  • 1
0 votes
0 answers
51 views

I'm reading this paper and I don't understand why the squared L2-norm is also multiplied by 0.5 in the loss. They want a loss that measures the distance between two feature maps. Why don't they use ...
Alessandro Polidori's user avatar
0 votes
0 answers
260 views

I am working on the classification problem where by I am having a hinge loss function + other loss terms to optimize for which the input is the output from tanh layer at the end. But I can't reveal ...
POOJA GUPTA's user avatar

15 30 50 per page