Questions tagged [language-model]

Question 1

I am working on evaluating an explainability method for a text classification model that predicts whether a given text sequence contains hate speech or not. The method outputs token-level importance ...

Question 2

https://openai.com/index/learning-to-reason-with-llms/ OpenAI o1 also add more data than the last version of LLM.

Question 3

This might be an odd question, but why is there two codes for the class BaseCallbackHandler? https://api.python.langchain.com/en/latest/_modules/langchain_core/callbacks/base.html#BaseCallbackHandler ...

Question 4

Which languages llama2 supports? I looked at the docs and huggingface but I couldn't find a list. Just it says usage in other languages than English as out-of-scope.

Question 5

Is there any place I can get the list of pre-trained large language models in a neat way? Despite the most common ones like gpt, BARD, llama2, which llm do you suggest that can be used for RAG and ...

Question 6

How to check if a large language model has a license allowing to fine tune the model and then publish it publicly? How can I be sure that I can use and fine-tune a large language model without ...

Question 7

I started to work with LLMs lately and want to know how people choose their pre-trained models in their fine-tuning tasks? What is the criteria to choose the base model and which factors affect?

Question 8

I recently went through some litterature about knowledge-enhanced language models and found connections with the Machine Reading Comprehension (MRC) task. However, I couldn't find papers more recent ...

Question 9

I am new to data science and trying to find possibilities of using datascience in tasks. I have a set of logs which I want to convert to json. The logs are more or less of same format and I can write ...

Question 10

Given an email thread, I am trying to extract the body of the most recent email. I used to do that with rules. Now I am testing Large Language Models (LLM) to see if I they provide a less ad hoc ...

Question 11

I’m looking for an open-source LLM for a new project. I want to use it for instructions and to fine-tune the model to a specific domain like legal and rights. Some LLMs are open-source, but they didn’...

Question 12

I'm trying to understand how encoder-decoder architectures are used, or if they are used at all, for generative tasks that do not require an explicit prompt (ie. machine translation, summarization, ...

Question 13

I have recently read through a lot of documentation and articles about Large Language Models (LLMs), and I have come to the conclusion that 0.7 is, most of the time, the default value for the ...

Question 14

I have dataset with two column: one with faulty addresses, and other with correct addresses. I want to train a model such that, I can use it later for correcting all the incoming faulty addresses. I ...

Question 15

I have a CSV file, and I am using langchain to read it into the vector store FAISS. My question is, since I have a CSV file, is RecursiveTextSplitter required? Put differently, consider the following ...

Stack Exchange Network

Questions tagged [language-model]

Evaluation of token importance attribution based on human rationales

How much improvement does OpenAI o1 achieve from the chain of thought?

Callback handlers in Langchain

What languages llama2 supports?

How can I get the list of pretrained large language models?

How to check the license of a LLM for specific use?

How to choose ideal pretrained model for fine-tuning?

Is Machine Reading Comprehension (MRC) outdated?

How can I leverage machine learning for log analysis?

Purely extractive Language Model

Open-Source Large Language Models (LLM): Your experience and recommendation

What is the input to an encoder-decoder transformer in next word prediction task?

Why is 0.7, in general, the default value of temperature for LLMs?

TFRobertaSequenceClassification for Address Normalization task

How to read CSV File into Vector Store

Hot Network Questions