Questions tagged [unsupervised-learning]
Finding hidden (statistical) structure in unlabelled data, including clustering and feature extraction for dimensionality reduction.
440 questions
1 vote
0 answers
6 views
Feature selection for unsupervised learning with a One-Class SVM
I am trying to build a solution to detect a particular sound against all possible other sounds occurring in nature. My approach is to train a One-Class SVM only on my class of interest, hoping it will ...
1 vote
0 answers
36 views
How to identify and quantify main tendencies across participants from cluster membership heatmaps?
I'd appreciate your thoughts on the following problem. I've created a heatmap plot (attached) showing the cluster membership ratio for each participant (in separate subplots) and condition (η). Now, I'...
2 votes
1 answer
32 views
Reinforcement Learning vs Unsupervised Learning
I am trying to understand what the main differences are between reinforcement learning and unsupervised learning when training ML models. I know the main difference is that reinforcement learning aims ...
2 votes
1 answer
54 views
Unsupervised anomaly vibration detection for time series
I'm working with a dataset consisting of multiple CSV files, each representing time series data of accelerations (x, y, z) captured during vibration events. For each event, a sensor records data for ...
0 votes
0 answers
18 views
GNN-Based Unsupervised Anomaly Detection for Heterogeneous Graphs
I am working on a project where I am doing unsupervised anomaly detection on employee expenses on HCP Transfer of Value. I am trying to use a Graph Neural Network to detect anomalies with proper ...
1 vote
0 answers
51 views
Using a differentiable Self-Organizing Map loss in a CNN
I've been trying to aggregate a normal CNN loss with a loss that quantifies how well we can cluster the second-to-last layer embeddings (i.e. feed the embeddings to a 2D Self-Organizing Map (SOM) and ...
2 votes
0 answers
63 views
Determine best hyperparameters in GridSearch - Isolation Forest
I have implemented an Isolation Forest algorithm for anomaly detection (unsupervised learning), where I divided my dataset into 1000 subsets, and for each subset, there is one isolation tree. This ...
1 vote
0 answers
65 views
What Are the Strategies for Anomaly Detection in Sparse Datasets?
I’m working on a large dataset (300+ columns, 500k+ rows) and have been asked to build an anomaly detection algorithm, but I’m unsure how to define or approach these anomalies in a meaningful way. ...
0 votes
0 answers
43 views
Finding dependencies between arbitrary features automatically
Given a rank-3 tensor with dimensions $x, y, z$, where: $x$: number of graphs (number of samples) $y$: number of nodes/vectors/features (let's say $5$: $a, b, c, d,$ and $e$) $z$: embedding dimension (...
2 votes
1 answer
55 views
Calculating LOF for big data
I have a big dataset (hundreds of millions of records, dozens of GB in size) and I would like to run LOF for anomaly detection (testing different methods for academic purposes) ...
1 vote
0 answers
53 views
How to Interpret Laplacian Scores for Feature Importance Ranking in Unsupervised Feature Clustering?
I am currently working on unsupervised feature importance ranking using graph clustering methods, specifically focusing on the Laplacian score as a metric. However, I am struggling to clarify the ...
0 votes
0 answers
142 views
Machine learning approach for bot detection
I am working on a project that tries to determine if users are bots or not. Currently, the labels that the dataset contains are not reliable, but I have found some trends/features that are solid for ...
3 votes
1 answer
252 views
Does Including Contamination Turn Isolation Forest into Supervised?
In unsupervised anomaly detection, does specifying the contamination percentage turn Isolation Forest into a supervised method rather than an unsupervised one when I fit the data afterwards?
0 votes
2 answers
108 views
Doing unsupervised anomaly detection on a dataset without any labels and without variable descriptions
I am trying to do unsupervised anomaly detection on a dataset with a dozen variables. None of them have descriptions, and the dataset doesn't have any labels or class variable. I have tried using a ...
1 vote
1 answer
223 views
Topic modeling evaluation
I'm working on topic modeling and I have generated clusters with two different methods. How can I evaluate which method performs better?