I’m building a recommendation system where each user interacts with sessions (topics with a title and description). I want to represent each user using their last 5 session interactions by creating a user embedding.
My idea is to use a pretrained model to generate embeddings for the session titles and descriptions, then combine the embeddings of the last 5 sessions to create a user representation.
I’d like advice on:
- How to combine multiple session embeddings effectively. Which pretrained models are best suited for this kind of text encoding.
I'm using titan-embedding-v2 and open for any other embedding models as well
Looking forward to your suggestions. Thank you!