I have a JSON file (tweets.json) that contains tweets (sentences) along with the name of the author.

Objective 1: Get the most frequent entities from the tweets.
Objective 2: Find out the sentiment/polarity of each author towards each of the entities.

Sample input: assume we have only 3 tweets:

Tweet1 by Author1: Pink Pearl Apples are tasty but Empire Apples are not.
Tweet2 by Author2: Empire Apples are very tasty.
Tweet3 by Author3: Pink Pearl Apples are not tasty.

Sample …
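A minimal sketch of both objectives, assuming tweets.json is a list of objects with "author" and "text" fields (the field names are assumptions), using spaCy noun chunks as a rough entity proxy and TextBlob for per-sentence polarity:

    import json
    from collections import Counter, defaultdict
    import spacy
    from textblob import TextBlob

    nlp = spacy.load("en_core_web_sm")
    with open("tweets.json") as f:
        tweets = json.load(f)  # assumed: [{"author": "Author1", "text": "..."}, ...]

    entity_counts = Counter()
    author_entity_scores = defaultdict(list)

    for tweet in tweets:
        doc = nlp(tweet["text"])
        for sent in doc.sents:
            polarity = TextBlob(sent.text).sentiment.polarity  # crude per-sentence score
            for chunk in sent.noun_chunks:  # noun chunks stand in for entities
                entity = chunk.text.lower()
                entity_counts[entity] += 1
                author_entity_scores[(tweet["author"], entity)].append(polarity)

    print(entity_counts.most_common(5))               # Objective 1
    for key, scores in author_entity_scores.items():  # Objective 2
        print(key, sum(scores) / len(scores))

Note the limitation: this assigns the whole sentence's polarity to every entity in it, so "Pink Pearl Apples are tasty but Empire Apples are not" gives both entities the same score; a clause- or dependency-level split is needed to separate them.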
I stumbled upon different sources that state that each sentence starts with a CLS token when passed to BERT. I'm passing text documents with multiple sentences to BERT. This would mean that for each sentence, I would have one CLS token. The pooled output, however, returns only a single vector of hidden-state size. Does this mean that all CLS tokens are somehow compressed into one (by averaging?), or does my text document contain only one single CLS token for the whole …
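A quick check with the Hugging Face tokenizer settles this empirically: special tokens are added once per encoded sequence, not once per sentence, so a multi-sentence document gets exactly one [CLS], and the pooled output is derived from that single token's hidden state rather than an average over several:

    from transformers import BertTokenizer

    tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
    doc = "First sentence. Second sentence."
    ids = tokenizer(doc)["input_ids"]
    print(tokenizer.convert_ids_to_tokens(ids))
    # ['[CLS]', 'first', 'sentence', '.', 'second', 'sentence', '.', '[SEP]']
    # one [CLS] for the whole document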
I am working on a Sentiment Analysis model. The dataset I have has three labels: positive, negative, and neutral. But the problem is that the data is not equally distributed across labels. Say out of 100K: 75K are neutral, 15K positive, and 10K negative. I wanted to know whether it is necessary to choose an equal distribution of labels while training, or whether I can go ahead with unequal data, and if so, to what extent? Are there any ways to deal with …
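Unequal data is workable up to a point; a common first step is to re-weight classes instead of discarding data. A sketch with scikit-learn, using the label counts from the question:

    import numpy as np
    from sklearn.linear_model import LogisticRegression
    from sklearn.utils.class_weight import compute_class_weight

    y = np.array(["neutral"] * 75_000 + ["positive"] * 15_000 + ["negative"] * 10_000)
    classes = np.unique(y)
    weights = compute_class_weight(class_weight="balanced", classes=classes, y=y)
    print(dict(zip(classes, weights)))  # rare classes get proportionally larger weights

    # Most estimators accept this directly, with no manual resampling:
    clf = LogisticRegression(class_weight="balanced", max_iter=1000)

Oversampling the minority classes (e.g. with imbalanced-learn's SMOTE) and evaluating with macro-F1 rather than plain accuracy are the other standard levers.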
I want to do sentiment analysis for an entity that was found, like Google NLP does. The entity should have a magnitude and a score. Please share possible research papers with me. P.S. Please do not propose computing sentiment for the sentence where the entity is located and then assigning it to the entity from that sentence.
I have trained a classifier for a sentiment analysis model which classifies reviews scraped off Amazon as Positive or Negative. Now, for each class, I want to get the keywords from the review, i.e., the reason for the positive or negative review. For example, the review "the quality of the shirt is the worst!" should return the keyword "quality". Similarly, "Really liked the fitting of the shirt" should return "fitting" as the keyword. …
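One lightweight approach is dependency parsing: the aspect noun is typically the grammatical subject or object of the opinionated clause. A sketch with spaCy (parses vary by model version, so the dependency labels used here are an assumption, not a guarantee):

    import spacy

    nlp = spacy.load("en_core_web_sm")

    def opinion_keywords(review):
        """Nouns acting as subject or direct object of a clause --
        a rough proxy for the aspect the review is about."""
        doc = nlp(review)
        return [tok.text for tok in doc
                if tok.pos_ == "NOUN" and tok.dep_ in ("nsubj", "dobj")]

    print(opinion_keywords("The quality of the shirt is the worst!"))  # ['quality']
    print(opinion_keywords("Really liked the fitting of the shirt."))  # ['fitting']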
I would like to build a simple sentiment analysis classifier using logistic regression. I downloaded a list of positive and negative words from cs.uic.edu; there are more than 6000 words, both positive and negative. A linear classifier has the form (Wikipedia reference): $$\sum_j w_j x_j$$ where $w_j$ is the weight assigned to feature $x_j$. So, for example, if the weight of the word awesome is 3, then in the following sentence: "Food is awesome and music is awesome.", according to the formula, it …
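Concretely, $x_j$ is the count of word $j$ in the sentence, so a word that appears twice contributes its weight twice. A toy version with the assumed weight of 3 for awesome:

    # Score = sum over the vocabulary of weight_j * count_j(sentence).
    weights = {"awesome": 3, "worst": -3}  # toy weights a classifier might learn

    def score(sentence):
        tokens = sentence.lower().rstrip(".!").split()
        return sum(weights.get(tok, 0) for tok in tokens)

    print(score("Food is awesome and music is awesome."))  # 3 * 2 = 6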
I'm trying to implement the unsupervised k-means algorithm for sentiment analysis of the IMDB movie dataset created by Stanford. The steps that I followed are: 1) load the comments; 2) apply tokenization and stemming, and use TF-IDF to create the TF-IDF matrix; 3) use k-means to divide the data into 2 clusters. My problem is how to validate the clusters: I have labeled test data, and I want to check if all the negative examples go in one cluster …
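A sketch of the validation step, assuming cluster_ids comes from KMeans(n_clusters=2).fit_predict(...) on the TF-IDF matrix and y_true holds the gold 0/1 labels of the same documents (toy arrays used here so the snippet runs standalone):

    import numpy as np
    from sklearn.metrics import adjusted_rand_score, confusion_matrix

    cluster_ids = np.array([0, 0, 1, 1, 1, 0])
    y_true      = np.array([0, 0, 1, 1, 0, 0])

    print(confusion_matrix(y_true, cluster_ids))
    # With 2 clusters the cluster ids are arbitrary, so try both mappings:
    acc = max((cluster_ids == y_true).mean(), (cluster_ids != y_true).mean())
    print("accuracy under best mapping:", acc)
    # Label-permutation-invariant alternative:
    print("adjusted Rand index:", adjusted_rand_score(y_true, cluster_ids))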
I am trying to find a model or way to classify text which falls into a category and whether it is positive or negative feedback. For example, we have three columns:

Review: Camera's not good, battery backup is not very good. Ok ok product, camera's not very good and battery backup is not very good.
Rating: 2
Topic: ['Camera (Neutral)', 'Battery (Neutral)']

My whole dataset is like the above, and Topic is not a standard one; the Topic value is based …
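Since the Topic column packs several (aspect, sentiment) pairs into one string, a natural first step is to explode it into one target per aspect. A sketch of that parsing, with the string format taken from the example above:

    import re

    def parse_topics(topic_cell):
        """Split "['Camera (Neutral)', 'Battery (Neutral)']" into
        (aspect, sentiment) pairs, one training target per aspect."""
        return re.findall(r"'([^(']+?)\s*\((\w+)\)'", topic_cell)

    print(parse_topics("['Camera (Neutral)', 'Battery (Neutral)']"))
    # [('Camera', 'Neutral'), ('Battery', 'Neutral')]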
I have a dataset of sentences from news articles which I need to classify by their sentiment. For that goal, I'm planning to use a model that was fine-tuned on different datasets, for example various comments from forums, reviews, and tweets. However, news articles are presumably quite different from those datasets, as they are usually more neutral. I understand that the correct way to approach this issue would be to train a model on my own labeled dataset; however …
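Before committing to labeling, one cheap sanity check is to hand-label a few dozen news sentences and score an off-the-shelf fine-tuned model on them; if it holds up, full retraining may be unnecessary. A sketch (the model choice and the two example sentences with their expected labels are purely illustrative):

    from transformers import pipeline

    clf = pipeline("sentiment-analysis",
                   model="distilbert-base-uncased-finetuned-sst-2-english")
    sample = [
        ("Shares surged after the company beat earnings estimates.", "POSITIVE"),
        ("The regulator fined the bank over compliance failures.", "NEGATIVE"),
    ]
    for sentence, expected in sample:
        print(clf(sentence)[0], "| expected:", expected)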
I have a sentiment analysis dataset that is labeled in three categories: positive, negative, and neutral. I also have a list of words (mostly nouns), for which I want to calculate the sentiment value, to understand "how" (positively or negatively) these entities were talked about in the dataset. I have read some online resources like blogs and thought about a couple of approaches for calculating the sentiment score for a particular word X. Calculate how many data instances (sentences) which …
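One plausible reading of that counting approach, sketched on toy data: score word X by the balance of positive versus negative labeled sentences that mention it.

    from collections import Counter

    # assumed format: (sentence, label) pairs, labels in {"positive", "negative", "neutral"}
    data = [
        ("the camera is great", "positive"),
        ("the camera broke in a week", "negative"),
        ("the battery lasts forever", "positive"),
    ]

    def word_sentiment(word, data):
        """Count labeled sentences mentioning the word; map to a score in [-1, 1]."""
        counts = Counter(label for sent, label in data if word in sent.split())
        pos, neg = counts["positive"], counts["negative"]
        return (pos - neg) / (pos + neg) if pos + neg else 0.0

    print(word_sentiment("camera", data))   # 0.0 -> equally pos and neg mentions
    print(word_sentiment("battery", data))  # 1.0 -> only positive mentions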
I was reading articles on sentiment analysis and NLP, and there is something I can't quite understand. One of the methods to label a dataset is to use something like TextBlob with a polarity dictionary, which counts words from a positive and a negative dictionary and gives a score based on them. Then the dataset is used to train a classification algorithm. My question is: why do we bother with ML at all when we have a rule-based labeling method …
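The short answer is that rule-based scores are noisy labels, and a trained classifier can generalize past the dictionary to slang, sarcasm, and domain terms the lexicon never listed. The gap is easy to probe:

    from textblob import TextBlob

    # Lexicon lookups are fast but brittle: they score words, not intent,
    # which is why rule-scored labels are only a noisy starting point.
    for text in ["This movie was sick!",             # slang: positive intent
                 "Great, another delayed flight."]:  # sarcasm: negative intent
        print(text, "->", TextBlob(text).sentiment.polarity)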
X_train has only one column that contains all the tweets.

    xlnet_model = 'xlnet-large-cased'
    xlnet_tokenizer = XLNetTokenizer.from_pretrained(xlnet_model)
    # Likely cause of the error below: in older transformers versions,
    # from_pretrained silently returns None when the sentencepiece package
    # (required by XLNetTokenizer) is not installed. `pip install sentencepiece`,
    # re-instantiate, and assert the tokenizer is not None before using it.

    def get_inputs(tweets, tokenizer, max_len=120):
        """Gets tensors from text using the tokenizer provided."""
        inps = [tokenizer.encode_plus(t, max_length=max_len,
                                      pad_to_max_length=True,
                                      add_special_tokens=True) for t in tweets]
        inp_tok = np.array([a['input_ids'] for a in inps])
        ids = np.array([a['attention_mask'] for a in inps])
        segments = np.array([a['token_type_ids'] for a in inps])
        return inp_tok, ids, segments

    inp_tok, ids, segments = get_inputs(X_train, xlnet_tokenizer)

    AttributeError: 'NoneType' object has no attribute 'encode_plus'
I'm doing sentiment analysis of tweets related to the recent acquisition of Twitter by Elon Musk. I have a corpus of 10,000 tweets and I'd like to use machine learning methods with models like SVM and Logistic Regression. My question is: when I want to train the models, do I have to manually tag a big portion of those 10,000 collected tweets with either the positive or negative class to train the model correctly, or can I use some other dataset …
Hi! I want to train a model that predicts the sentiment of news headlines. I have multiple unordered news headlines per day, but only one sentiment score per day. What is a convenient solution to overcome the not-1:1 issue? I could:

- Concatenate all headlines into one string, but that feels a bit wrong, as an LSTM or CNN would exploit cross-sentence word relations that don't exist.
- Predict one score per headline (1:1) and take the average in the application (see the sketch below). But that might …
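A sketch of the second option, with a placeholder in place of a real headline-level model:

    import numpy as np

    # `headline_model` is a stand-in: any callable mapping a headline to a score.
    headline_model = lambda h: 0.1 * len(h) % 1.0

    def daily_score(headlines, model=headline_model):
        """Score each headline independently, then mean-pool to one daily score.
        Weighting by headline prominence or recency is an obvious refinement."""
        return float(np.mean([model(h) for h in headlines]))

    print(daily_score(["Stocks rally", "Oil falls", "Fed holds rates"]))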
I am using an OWL ontology for semantic analysis in an emotional sentiment analysis project. I am trying to navigate the ontology to check a concept and its relations. My ontology has classes like this:

    <!-- http://purl.obolibrary.org/obo/MFOEM_000011 -->
    <owl:Class rdf:about="http://purl.obolibrary.org/obo/MFOEM_000011">
        <rdfs:subClassOf rdf:resource="http://purl.obolibrary.org/obo/MFOEM_000001" />
        <rdfs:subClassOf>
            <owl:Restriction>
                <owl:onProperty rdf:resource="http://purl.obolibrary.org/obo/BFO_0000117" />
                <owl:someValuesFrom rdf:resource="http://purl.obolibrary.org/obo/MFOEM_000208" />
            </owl:Restriction>
        </rdfs:subClassOf>
        <obo:IAO_0000115>An unpleasant emotion closely related to anger but lower in intensity and without the moral dimension of blame and seriousness that is implicated in anger. [Source: …
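A minimal navigation sketch with rdflib, assuming the ontology is saved locally as mfoem.owl (the filename is an assumption):

    from rdflib import Graph, RDFS, URIRef

    g = Graph()
    g.parse("mfoem.owl")  # format inferred from the .owl (RDF/XML) extension

    cls = URIRef("http://purl.obolibrary.org/obo/MFOEM_000011")
    for parent in g.objects(cls, RDFS.subClassOf):
        print("subClassOf:", parent)  # named parents plus blank-node restrictions

    # The textual definition sits in the obo:IAO_0000115 annotation:
    definition = URIRef("http://purl.obolibrary.org/obo/IAO_0000115")
    for d in g.objects(cls, definition):
        print("definition:", d)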
My data includes women's comments on X and Y, and men's comments on X and Y. Each comment is of equal length. I want to calculate how different the word choice is between men and women when commenting on X. How can I do this?
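A sketch of one standard approach: compare smoothed relative word frequencies (log-odds) between the two groups' comments on X, shown here on invented toy comments:

    import numpy as np
    from sklearn.feature_extraction.text import CountVectorizer

    women_x = ["loved the design and colours", "the design feels premium"]
    men_x   = ["battery life is poor", "poor battery, decent design"]

    vec = CountVectorizer()
    counts = vec.fit_transform(women_x + men_x).toarray()
    w = counts[:len(women_x)].sum(axis=0) + 1  # +1 smoothing avoids log(0)
    m = counts[len(women_x):].sum(axis=0) + 1
    log_odds = np.log(w / w.sum()) - np.log(m / m.sum())

    vocab = np.array(vec.get_feature_names_out())
    order = np.argsort(log_odds)
    print("more typical of men's comments:", vocab[order[:3]])
    print("more typical of women's comments:", vocab[order[-3:]])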
This Keras article/tutorial here does perform text standardization, i.e., removing HTML elements, punctuation, etc., from the text dataset; however, there is a distinct lack of any stemming or lemmatization before the vectorization step. I have a bit of experience in deep learning but I am very new to NLP, and I just learned (from a different tutorial on Udemy, which, BTW, was using Bag of Words) that using either a stemmer or a lemmatizer helps in …
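For Bag-of-Words models the folding genuinely shrinks the vocabulary; a sketch with scikit-learn's CountVectorizer (used here because the Udemy tutorial was Bag of Words) and NLTK's WordNet lemmatizer. For deep models with learned embeddings, as in the Keras tutorial, this collapsing is usually unnecessary and can even discard useful morphology:

    import nltk
    from nltk.stem import WordNetLemmatizer
    from sklearn.feature_extraction.text import CountVectorizer

    nltk.download("wordnet", quiet=True)
    lemmatizer = WordNetLemmatizer()

    def lemma_tokenizer(text):
        # "movies" and "movie" end up in the same Bag-of-Words column
        return [lemmatizer.lemmatize(tok) for tok in text.lower().split()]

    vec = CountVectorizer(tokenizer=lemma_tokenizer)
    print(vec.fit_transform(["the movies were great", "a great movie"]).toarray())
    print(vec.get_feature_names_out())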
I want to do classification of comments categorized into 4 areas (X, Y, Z, M): categorizing the product as good or bad based on the comments in the fields X, Y, Z, and M. How can I go about measuring the effect of these 4 areas on the result? For example:

    Id  X           Y         Z           M                Result
    1   The prod..  I fell..  Very bad..  lost of time..   0(bad)

Using this data, the model will be given comments in the x, y, z,
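One way to measure each field's effect is to give every field its own TF-IDF block, then compare the coefficient magnitudes per block, or retrain dropping one field at a time (ablation) and compare accuracy. A sketch on invented toy rows, with the column names taken from the example:

    import pandas as pd
    from sklearn.compose import ColumnTransformer
    from sklearn.feature_extraction.text import TfidfVectorizer
    from sklearn.linear_model import LogisticRegression
    from sklearn.pipeline import make_pipeline

    df = pd.DataFrame({"X": ["the product is great", "broken on arrival"],
                       "Y": ["i feel happy", "very bad experience"],
                       "Z": ["fast shipping", "lost my time"],
                       "M": ["would buy again", "never again"],
                       "Result": [1, 0]})

    # One TF-IDF block per field keeps each field's weights separable.
    features = ColumnTransformer(
        [(col, TfidfVectorizer(), col) for col in ["X", "Y", "Z", "M"]])
    model = make_pipeline(features, LogisticRegression())
    model.fit(df[["X", "Y", "Z", "M"]], df["Result"])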
I'm doing sentiment analysis on a Twitter dataset (problem link). I have extracted the POS tags from the tweets, created TF-IDF vectors from the POS tags, and used them as a feature (got an accuracy of 65%). But I think we can achieve a lot more with POS tags, since they help distinguish how a word is being used within the scope of a phrase. The model I'm training is MultinomialNB(). The problem I'm trying to solve is to …
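One way to get more out of the POS tags is to combine them with word features instead of using them alone, e.g. a FeatureUnion of word TF-IDF and POS-tag n-gram TF-IDF. A sketch, with NLTK as one possible tagger (newer NLTK releases name the resources punkt_tab / averaged_perceptron_tagger_eng):

    import nltk
    from sklearn.feature_extraction.text import TfidfVectorizer
    from sklearn.naive_bayes import MultinomialNB
    from sklearn.pipeline import FeatureUnion, Pipeline

    nltk.download("punkt", quiet=True)
    nltk.download("averaged_perceptron_tagger", quiet=True)

    def pos_string(text):
        """Replace each token by its POS tag so tag n-grams become features."""
        return " ".join(tag for _, tag in nltk.pos_tag(nltk.word_tokenize(text)))

    features = FeatureUnion([
        ("words", TfidfVectorizer()),
        ("pos", TfidfVectorizer(preprocessor=pos_string, ngram_range=(1, 3))),
    ])
    clf = Pipeline([("feats", features), ("nb", MultinomialNB())])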
I'm using review data and trying to apply a classifier model and get predictions. Here is the code I'm trying:

    import pandas as pd
    from sklearn.feature_extraction.text import CountVectorizer, TfidfTransformer

    dataset = pd.read_csv('Scraping reviews.csv')

    count_vect = CountVectorizer()
    # Fit on the text column, not the whole DataFrame: iterating a DataFrame
    # yields its column names, which is why the shape came out as (2, 2).
    # 'Review' is an assumed column name -- substitute the real one.
    X_train_counts = count_vect.fit_transform(dataset['Review'])
    X_train_counts.shape

    tf_transformer = TfidfTransformer(use_idf=False).fit(X_train_counts)
    X_train_tf = tf_transformer.transform(X_train_counts)
    X_train_tf.shape

    tfidf_transformer = TfidfTransformer()
    X_train_tfidf = tfidf_transformer.fit_transform(X_train_counts)
    X_train_tfidf.shape …