Skip to main content

Questions tagged [linear-algebra]

A field of mathematics concerned with the study of finite dimensional vector spaces, including matrices and their manipulation, which are important in statistics.

2 votes
1 answer
57 views

I'm watching Lecture 4 ("Curse of Dimensionality / Perceptron") from Cornell’s CS4780 (Spring 2017) by Prof. Kilian Weinberger on YouTube. In this lecture, he applies the bias trick to ...
xF6's user avatar
  • 23
1 vote
0 answers
53 views

Suppose we are given two classes class-1 and class-2 and the mean of the two classes are $μ_1$ and $μ_2$ respectively before projection. Both of their variance-covariance matrix is also provided ...
Suman Debnath's user avatar
1 vote
1 answer
67 views

I got this slide from the class lecture. My questions are: Q1. Why "there are enough vectors" are required for the linear span of vectors to satisfy 1st condition of basis? And why "...
David's user avatar
  • 28
6 votes
2 answers
199 views

Apology for the ambiguous title, I do not know the term. I have data of some products which a few variables: origin, weight, brand. For example: Product A = "China, 100g, Brand X" Product B ...
lpounng's user avatar
  • 1,345
1 vote
0 answers
48 views

I am trying to detect outliers with sklearn.covariance.EllipticEnvelope for a single variable, but it throws an unexpected error. Here is an example the reproduces ...
Maya's user avatar
  • 11
2 votes
2 answers
167 views

I have two sets of 2D data $A$ and $B$ (representing 2D positions on a 2D plane $x,y$) which are related (the first pair of $x,y$ of $A$ is related to the first pair of $x,y$ of $B$ for instance). I ...
patjol's user avatar
  • 21
1 vote
2 answers
264 views

So, in the decoder layer of transfomer, suppose I have predicted 3 words till now, including the start token then the last decoder layer will produce 3 vectors of size d-model, and only the last ...
Nishan Poudel's user avatar
1 vote
1 answer
3k views

Context: I am a pure mathematician trying to understand machine learning. I am studying it from various sources, now focusing on NLP and word embeddings. My question: What is the weight matrix for a ...
Tereza Tizkova's user avatar
0 votes
1 answer
57 views

Scikit learn has a make_regression data generator. Can someone explain it to me like I'm 5 what is meant in the help docs by "The input set can either be well ...
Snehal Patel's user avatar
0 votes
1 answer
132 views

I was reading about Maximal Margin Classifiers in "Introduction to Statistical Learning" and could not understand how is the perpendicular distance of an observation (which is a vector) from ...
Circuit_Breaker0.7's user avatar
1 vote
1 answer
86 views

I have a dataset of drugs represented as a graph, each of which is described by three non-square matrices: edge index (A), an 2xe matrix, where e are the bonds of the molecule, the first line ...
Gianmarco Luchetti 's user avatar
0 votes
1 answer
109 views

I have been trying to understand the math behind Linear classifier for images and I'm hitting a roadblock to understanding this image below: I can to some extent agree that we stretch the pixels into ...
joesan's user avatar
  • 219
3 votes
0 answers
309 views

I have a high-dimensional space, say $\mathbb{R}^{1000}$, and I have samples $y_1, \ldots , y_n \in \mathbb{R}^{1000}$ and $\hat{y}_1, \ldots , \hat{y}_n \in \mathbb{R}^{1000}$. Would $$ R^2 = 1 - \...
AspiringToAspire's user avatar
0 votes
1 answer
47 views

Setting the initial weights as all zeros will have the output dependent on the bias and setting the weights of all the neurons of a layer as same, will update the gradients in same way thus removing ...
Deshwal's user avatar
  • 323
1 vote
0 answers
32 views

I am trying to perform Face recognition using PCA (eigenfaces). I have a set of N training images (of dimensions M=wxh), which I have pre-processed into a vertical ...
zr0gravity7's user avatar

15 30 50 per page
1
2 3 4 5 6