Skip to main content

Questions tagged [embeddings]

0 votes
0 answers
9 views

I am working on some natural language stuff for fun, basically a rhyming dictionary, trying to figure it out. Trying next to figure out how to properly/decently capture the basic consonants + vowels ...
Lance Pollard's user avatar
0 votes
0 answers
9 views

I am currently studying GloVe paper about word embeddings. link In Section 3 The GloVe Model this model is derived from several desiderata, one of which confuses me. It is around Equation 3 which ...
Nourless's user avatar
  • 203
0 votes
1 answer
18 views

I have a large set of document embeddings, and I would like to sample a subset where the median or average pairwise distance is maximized. The idea here is to get a more balanced sample set where long ...
Layman's user avatar
  • 173
1 vote
0 answers
36 views

How can I visualise a hierarchical ontology of items in embedding space, combining text embeddings with the graphical structure? (Something similar to the example below) I have a hierarchical ...
baked goods's user avatar
3 votes
1 answer
50 views

I am trying to determine if using multiple instances of nn.Embedding() has any value over using a single instance in training a model. As an example, let's say I ...
CCranney's user avatar
6 votes
1 answer
147 views

I've been working in data science for a long time, but very rarely have I been called upon to implement an ML algorithm; I've just ran other people's libraries. I'm trying to pick up the skill. I'm ...
Zorgoth's user avatar
  • 305
1 vote
0 answers
29 views

I am working on a project of integrating the preferences of the workers into a schedule, I mean we won’t only satisfy the systematic constraints but also users preferences as constraints, so we are ...
Tchikss's user avatar
  • 11
2 votes
1 answer
37 views

I'm trying to locally replicate the pair classification task of MMTEB/MTEB. However, I didn't find train/dev sets for all datasets in this task. Table 2 in the original MTEB paper (Mueninghoff et al, ...
Jonathan's user avatar
  • 5,625
5 votes
2 answers
436 views

We have ~30 audio snippets, of which around 50% are from the same speaker, who is our target speaker, and the rest are from various different speakers. We want to extract all audio snippets from our ...
Yes's user avatar
  • 181
1 vote
2 answers
132 views

If I have multiple categorical features, each which has its own unique cardinality, and I want to use an embedding layer to reduce the dimensions fed to an MLP. Should I have one big embedding matrix ...
dule arnaux's user avatar
0 votes
0 answers
23 views

I have two integer encoded categorical variables, one indexed from 0 and another indexed from 1. What is the best way of embedding unique tuples (Category A, Category B), taking into account that ...
Yandle's user avatar
  • 231
1 vote
1 answer
74 views

I’m building a recommendation system where each user interacts with sessions (topics with a title and description). I want to represent each user using their last 5 session interactions by creating a ...
Swastik's user avatar
  • 111
1 vote
0 answers
31 views

Background I manage a property portfolio on platforms like Airbnb, handling customer support through the entire guest journey (pre-booking to post-stay). I'm building a RAG system to help automate ...
dowjones123's user avatar
0 votes
1 answer
85 views

Let's say that I have some lists of texts such as : ...
Leon's user avatar
  • 1
1 vote
0 answers
149 views

The model I am talking about is hosted here: From the documentation: We simplify usage of prompts, providing two prompts for most general tasks, one is for s2p, another one is for s2s.Prompt of s2p ...
figs_and_nuts's user avatar

15 30 50 per page
1
2 3 4 5
13