Questions tagged [text-classification]
For questions about text classification, the task of assigning predefined categories (or classes) to free-text documents.
273 questions
0 votes
0 answers
18 views
Document Classification Task for Review Paper References
Hi everyone! For the first time, I'm researching how well LLMs and DL models can structure scattered data through NER and RE. We are using a review paper on a domain that has no ontologies or ...
2 votes
0 answers
51 views
Do you need paired data to train a multimodal model?
I have video, audio, and text data. The intent is to use a multimodal model for binary classification. However, the data is not paired (i.e., the audio and text are not from the same video recording). I've ...
5 votes
1 answer
119 views
Variable Length Input: How to handle longer input sequences during model testing
As part of my college project on RNNs, I'm working on a text classification task using TensorFlow. During training, I used pad_sequences with a max_length of 100, so all training examples ...
1 vote
0 answers
21 views
LinkedIn post copies - text analysis tools
I want to use data from my work: 1000+ different copies with full metadata from LinkedIn. We have a SaaS platform for brand ambassadors and give them a tool to create these copies and plan them. So ...
0 votes
1 answer
58 views
Handle text column with PyTorch
I'm new to ML, so the question may be stupid. I have a dataset with multiple numeric columns and one text column. The text is just one sentence. I want to use all the available data for classification. But I don'...
1 vote
0 answers
51 views
NER with custom tags and no training data: help with a zero-shot approach
I am building a "field tagger" for documents. Basically, a document, in my case something like a proposal or sales quote, would have a bunch of entities scattered throughout it, and we want ...
4 votes
1 answer
89 views
How to Incorporate Label Titles and Parent Folder Information in a Multiclass Classification Model Using Keras?
I'm building a multiclass classification system using Keras. I am working with a dataset that includes text data and its metadata. Both the text and the metadata are sequences of words. The output of ...
0 votes
1 answer
198 views
SMOTE Oversampling for Text Classification with Multiple Input Features
I have a text classification problem where the input has two features, a text and a language. The text is a string variable. The ...
3 votes
0 answers
94 views
Weird behaviour when using RoBERTa for text classification
I have a dataset with around 70 classes, and it is largely balanced, with ~150 samples per class. I am fine-tuning RoBERTa-base for 4 epochs with a ...
0 votes
0 answers
50 views
Getting nearly 100% accuracy with binary classification in TensorFlow, but badly wrong predictions for email messages
I'm creating a Chrome extension that reads user emails via Gmail's API and passes them to a trained Keras model served by Flask to determine whether each email was written by an AI or a human, ...
2 votes
1 answer
64 views
Text Classification with unlimited labels, Text Extraction?
I'm looking to use ML to read in a blob of text and extract a name from it. (The blob is an OCR result from an iPhone.) The text blob varies in size, but the name is always present in ...
2 votes
1 answer
73 views
Is it methodologically correct to use the fine-tuning data in the pretraining phase of a BERT model?
Consider the training of a BERT model. Initial pretraining is performed on a large dataset A. Subsequently, fine-tuning is performed on a dataset B, which is part of A but now with labels ...
1 vote
1 answer
60 views
Which specific problems, in which domains and fields, need rule-based text classification?
I wrote a rule-based keyword detection and classification program specialized for my language (Vietnamese) and would like to know where this app would be useful. Here is how the program works: First you input ...
2 votes
2 answers
220 views
Improve text classification accuracy
Task: I am building a text classifier for salary prediction for data science jobs. I want to achieve at least 70 percent accuracy. Data: Features: Consists of job descriptions of data science, ...
0 votes
1 answer
55 views
Recommendation system NLP ideas
The problem: Suppose we have a clustering problem with, let's say, x groups, and each group has a document describing it, let's say 3 pages long. Then we have n observations, each with a smaller piece of text ...