Questions tagged [methodology]
The methodology tag has no summary.
48 questions
2 votes
2 answers
63 views
How to combine scores from two skill areas to reward individuals with high scores in both?
I want to evaluate individuals on a scale of 1 to 10 based on their skills in two areas, Skill A and Skill B. Our goal is to reward those who possess skills in both areas by giving them a higher ...
-1 votes
1 answer
35 views
machine learning - data science - data analysis
I have a research project in the machine learning area. In this study, dataset contains more than 4000 numbers categorized in four columns. I am going to find or predict a possible relation between ...
0 votes
0 answers
50 views
Can one have good understanding on a method without having direct experience with it?
This question is in line of these previous questions on other sites: Is it possible to conduct scientific research without actually getting close to the sample/specimen? in Biology SE Is it possible ...
2 votes
1 answer
73 views
Is it methodologically correct to use the data to be used for finetuning in the pretrain phase of the BERT model?
Let us assume the training of a BERT model. An initial pre-train is performed with a large data set A. Subsequently a finetuning is performed with a dataset B which is part of A, but now with labels ...
0 votes
2 answers
302 views
Why shouldn't we try to balance the test set?
Most advice I have found online is that we must not balance the test set. The test set should remain to be unseen. However, I failed to see how balancing the test set will cause us to leak knowledge ...
0 votes
1 answer
39 views
How to represent varying reliability of ratios calculations in a dataset?
I want to predict whether the client will renew his/her subscription based on groceries consumption patterns. Suppose an order contain only one type of grocery. I have a DataFrame containing ratios of ...
0 votes
1 answer
47 views
Classification for choice data
It is essentially a choice modelling problem, but hopefully can be addressed by classification. Suppose one needs to choose a route to drive to work among many candidates in his mind. These candidates ...
1 vote
0 answers
32 views
Supervised recommender system design feedback
I am facing a challenge that I am not quite sure how to solve and would like to hear feedback. Basically, I have to implement a recomendation system for certain courses to be recommended to users of ...
2 votes
2 answers
172 views
Are all problems solvable using machine learning?
I am confronted with a relatively original problem which consists in predicting on which floor of a building audio recordings have been made. I have tried many machine learning approaches but none of ...
8 votes
1 answer
447 views
How can I learn and apply the scientific method in machine learning?
Rigor Theory. I wish to learn the scientific method and how to apply it in machine learning. Specifically, how to verify that a model captured the pattern in data; how to rigorously reach conclusions ...
1 vote
1 answer
33 views
Organization method on sharing research within a company
Currently we are trying to organize a methodology how different teams can share theirs projects with other team. These projects can be papers, code, pptx, views on everything. Is there a known scheme ;...
1 vote
1 answer
125 views
Handling gaps in regression model
I'm facing a regression problem where I'm supposed to predict the delay of some trains. There's some peculiar particularity, however: a train is not considered delayed until it has more than 10 mins ...
0 votes
0 answers
155 views
Splitting sentiment analysis training data into x-train and y-train for a RNN?
Suppose I have a dataset of comments from users, around multiple websites, such that in each row, there are two comments, and one is considered more 'negative' and one more 'positive' indicated by the ...
0 votes
1 answer
264 views
Explaining the logic behind the pipe_line method for cross-validation of imbalance datasets
Reading the following article: https://kiwidamien.github.io/how-to-do-cross-validation-when-upsampling-data.html There is an explanation of how to use ...
0 votes
0 answers
24 views
Regressing over tiny floats with Neural Networks
I am trying to regress over very small floats - of the magnitude [1e-2, 9e-3]. They're mostly in this range. Using simple ...