1
$\begingroup$

I am doing my first cluster analysis with Orange (which I recently discovered and looks promising for this iterative and interactive process).

Apparently, there are several methods of creating clusters based on distance algorithm:

  • Single linkage (which computes the distance between the closest elements of the two clusters)
  • Average linkage (which computes the average distance between elements of the two clusters)
  • Complete linkage (which computes the distance between the clusters’ most distant elements)
  • Weighted linkage
  • Ward

Since I have several columns, and some of them are more important than others in terms of defining clusters, it seems to me that using weighted linkage method may be what I am looking for. Unfortunately, I don't know how to do that, since I didn't find a way to assign a weight to each column.

To make things worse, I have only found an explanation about the first three on this Orange's blog post but nothing about Weighted linkage (nor Ward, which may be a recent addition, since it is not even mentioned on widget's help).

Am I on the right path to achieve what I am looking for? Is there any way to make some columns more or less important/definitory when calculating the distances?

$\endgroup$

1 Answer 1

2
$\begingroup$

Weighted linkage probably does not mean you get to specify weights of features (build the distance matrix yourself!)

Instead this most likely refers to the well-known weighted group average strategy you will find in most textbooks often called WPGMA. There are two different definitions of "average", so this is likely simply the "other" average linkage.

$\endgroup$
2
  • $\begingroup$ Oh, I see I misunderstood. If I am not wrong, from your answer I should build the distance matrix in order to express if a variable is more important than another, but honestly I don't know how should I do that in orange. Do you have any clue about that? $\endgroup$ Commented Mar 14, 2019 at 17:48
  • $\begingroup$ I don't use orange. Isn't it little more than a GUI frontend for sklearn by now? $\endgroup$ Commented Mar 14, 2019 at 22:49

Start asking to get answers

Find the answer to your question by asking.

Ask question

Explore related questions

See similar questions with these tags.