Docsity
Docsity

Prepare for your exams
Prepare for your exams

Study with the several resources on Docsity


Earn points to download
Earn points to download

Earn points by helping other students or get them with a premium plan


Guidelines and tips
Guidelines and tips

Sentiment Analysis of Twitter Data: Nike vs. Adidas, Study Guides, Projects, Research of Social Management

Twitter Sentiment Analysis: A Case Study for Apparel Brands.

Typology: Study Guides, Projects, Research

2021/2022

Uploaded on 01/21/2022

loveu
loveu 🇺🇸

4.5

(20)

297 documents

1 / 6

Toggle sidebar

This page cannot be seen from the preview

Don't miss anything!

bg1
1
Twitter Sentiment Analysis: A Case Study for Apparel Brands
Abdur Rasool1,*, Ran Tao1, Kamyab Marjan1, Tayyab Naveed2
1School of Computer Science & Technology, Donghua University, Shanghai, China.
2College of Textile Engineering, Donghua University, Shanghai 201620, China.
*Corresponding author e-mail: 317089@mail.dhu.edu.cn
Abstract. Social media especially Twitter is providing a space for expression and
opinions, where users discuss various events, services, and brands. Entrepreneurs are
in continuous need of the feedback about their services to improve the quality and
quantity. However, due to the bulk amount of data, it’s difficult to detect the
consumer’s opinions. This article deliberates the problems about the Twitter data for
the sentiment analysis. Furthermore, it implements the text mining and document-
based sentiment on the preprocessed Twitter data through the machine learning
techniques, Naïve Bayes and lexicon dictionary. Our case study is to find the public
opinion about the top two apparel international brands and compare the positive and
negative attitude of common users about each brand. We found that positive reviews
of Adidas are more than the Nike while there is the slight difference in negative
reviews. It founds that people want to discuss the other brands as comparisons when
they are talking about just one brand.
1. Introduction
In the current century, the world is a global village due to the internet availability. According to [1]
54.4% of the world population was using the internet until Dec 2017. Now a day’s social media
(Facebook, YouTube, Twitter, and, content sharing sites, etc.) has efficiently and effectively shared
useful information. In the learning resources, it is statistically [2] verified that 71% of the internet has
been used through social media by the consumers. Studies [3] show that more than half customers
prefer to read the other’s comments about that products before purchasing. Thus public opinions are
the best source of feedback for business stakeholders about their products and services which enables
them to redesign the quality factor and disclose the opportunity of a new business [4]. The social
network like Twitter and Facebook provided the important marketing, selling, branding, and
promotional chances to the brands [5].
In the last decades, a hot research gets popular among the researchers to get the useful information
from all these social media especially from Twitter. It’s a mathematically procedural study of people’
thoughts and opinions which can be positive or negative about any product or event through the
natural language processing namely as ‘sentiment analysis’. Sentiment analysis is correlated with text
mining or data mining. The basic purpose of sentiment analysis is to assure the polarity of natural
language by performing supervised and/or unsupervised classification. Recently available sentiment
analysis techniques are useful for [6] political predictions, marketing strategy, e-commerce, and brand
reputation management.
In one study, Sharma et al. have performed an apparel brand study, in which he found the trust
level using regression tool for the Facebook data [5] but this study implements the sentiment for
pf3
pf4
pf5

Partial preview of the text

Download Sentiment Analysis of Twitter Data: Nike vs. Adidas and more Study Guides, Projects, Research Social Management in PDF only on Docsity!

Twitter Sentiment Analysis: A Case Study for Apparel Brands

Abdur Rasool1,, Ran Tao*^1 , Kamyab Marjan^1 , Tayyab Naveed^2 (^1) School of Computer Science & Technology, Donghua University, Shanghai, China. (^2) College of Textile Engineering, Donghua University, Shanghai 201620, China.

*Corresponding author e-mail: 317089@mail.dhu.edu.cn

Abstract. Social media especially Twitter is providing a space for expression and opinions, where users discuss various events, services, and brands. Entrepreneurs are in continuous need of the feedback about their services to improve the quality and quantity. However, due to the bulk amount of data, it’s difficult to detect the consumer’s opinions. This article deliberates the problems about the Twitter data for the sentiment analysis. Furthermore, it implements the text mining and document- based sentiment on the preprocessed Twitter data through the machine learning techniques, Naïve Bayes and lexicon dictionary. Our case study is to find the public opinion about the top two apparel international brands and compare the positive and negative attitude of common users about each brand. We found that positive reviews of Adidas are more than the Nike while there is the slight difference in negative reviews. It founds that people want to discuss the other brands as comparisons when they are talking about just one brand.

1. Introduction In the current century, the world is a global village due to the internet availability. According to [1] 54.4% of the world population was using the internet until Dec 2017. Now a day’s social media (Facebook, YouTube, Twitter, and, content sharing sites, etc.) has efficiently and effectively shared useful information. In the learning resources, it is statistically [2] verified that 71% of the internet has been used through social media by the consumers. Studies [3] show that more than half customers prefer to read the other’s comments about that products before purchasing. Thus public opinions are the best source of feedback for business stakeholders about their products and services which enables them to redesign the quality factor and disclose the opportunity of a new business [4]. The social network like Twitter and Facebook provided the important marketing, selling, branding, and promotional chances to the brands [5]. In the last decades, a hot research gets popular among the researchers to get the useful information from all these social media especially from Twitter. It’s a mathematically procedural study of people’ thoughts and opinions which can be positive or negative about any product or event through the natural language processing namely as ‘sentiment analysis’. Sentiment analysis is correlated with text mining or data mining. The basic purpose of sentiment analysis is to assure the polarity of natural language by performing supervised and/or unsupervised classification. Recently available sentiment analysis techniques are useful for [6] political predictions, marketing strategy, e-commerce, and brand reputation management. In one study, Sharma et al. have performed an apparel brand study, in which he found the trust level using regression tool for the Facebook data [5] but this study implements the sentiment for

IOP Conf. Series: Journal of Physics: Conf. Series 1176 (2019) 022015 doi:10.1088/1742-6596/1176/2/

Twitter apparel data. Another study was acquainted with apparel brands based on web text data in order to extract the customer emotion by implementing the sentiment techniques [1]. The purpose of our research is to apply the sentiment classification methodology to the Twitter apparel datasets. It will illustrate the relationship between the consumers and the apparel enterprise. In our work study, preprocessing step has been taken to achieve the better analysis results. The Bernoulli Naïve Bayes algorithm has been used with the lexicon dictionaries, e.g. VEDER dictionary. Furthermore, this study is a document-based approach and extracts the polarity from the tweets. This comparative study will assist the new researcher to analyze the social media for the sentiment detection. The main contribution of this paper:

  • To conduct a comparative analysis of two online international apparel brands.
  • To find the answer, which apparel brand is most popular among the online viewers at the base of sentiment analysis? The rest of the paper is: section-2 describe the related work, section-3 explain the case study which will discuss the preprocessing of data and will implement the sentiment techniques at our proposed case study and disclose the experimental results. Finally, section-4 clarify conclusion and future work. 2. SECTION 2: Related Work The power of social media marketing is influencing the consumer and companies as well by spreading useful information and exchange of positive or negative values. Companies are learning the customer views and discussion to support their own mission and performance goals. Now, a direct link among consumers and companies about satisfaction and loyalty level has been found [7]. Twitter is an important source of information and data analysis for numerous sentiment analysis application by using machine learning techniques recently (Asghar et al. 2017) [8]. Shukri and Yaghi et al (2015) implement the sentiment and text mining techniques to analyze the unstructured data of Twitter to find the polarity and emotion classification about the automotive industry (such as Mercedes, Audi and BMB) and concluded that positive polarity of BMB (72%) is more than other brands [9]. However, this work will implement for the apparel brands comparison. Sharma and Alavi et al. (2017) did a comparative study of 5 local apparel brands by proposing 3 hypothesis, based on the volume of posts, number of reaction on that posts, and number of comments. It decided [5] that 3 brands Craftsvilla, indiaRush, and Stalkbuylove maintained the above-mentioned hypothesis by positive coefficient while other 2 brands have negative beta coefficients for comments and number of posts. In [1] the author proposes a study of Chinese based apparel brands by testing the web text, NPL, domain dictionary, sentiment analyses and visual techniques to extract the subjective customer emotions. It observed that T-shirts and coats have the highest emotions than the pants and, sales and emotions are directly proportions to each other as well as price and sales are directly influencing. Gautam et al. (2014) engaged the probabilistic classifier, Naïve Bayes (NB) approaches which is supervised learning. Supervised learning is based on the labeled dataset which provided the model during the process. These labeled datasets are trained to achieve rational outcomes when encountered decision- making [10]. Jitendra et al (2018) [11] used both machine learning method, supervised and unsupervised approaches on different datasets obtained from Twitter. It achieved 80.68% accuracy on tweets for an unsupervised approach. For supervised, they combined unigram, bigram, and part-of- speech as features and achieved an accuracy of 67%. According to Go et al. (2009) used emoticons as positive and negative in the test of tweet dataset [12]. It compared the accuracy of three machine learning algorithms including Naïve Bayes (NB), maximum entropy (MaxE) and support vector machines (SVM). And found that Naive Bayes’s accuracy is better than other algorithms. Parikh and Movassate (2009) test the classification of the sentiment of two algorithms, Naïve Bayes Bigram and at Maximum Entropy model. It provides that the Naïve Bayes classifiers worked much better than the Maximum Entropy model [13].

IOP Conf. Series: Journal of Physics: Conf. Series 1176 (2019) 022015 doi:10.1088/1742-6596/1176/2/

Figure 1. Sentiment Process Diagram on Twitter Data

4. Results After the implementation of sentiment classification, we got the values of sentiment distribution of Nike and Adidas’ Figure 2 display the Nike sentiment distribution which shows the positive, negative and neutral views 24.5, 11.9%, and 63.6% respectively. Similarly, figure 3 is the distribution of Adidas sentiment with positive (27.2%), negative (11.7%) and neutral (61.1%). It was found that positive reviews of Adidas are more than the Nike. While the neutral values record the satisfaction level among the online Twitter’ users for both brands which is more than 60% of total reviews.

Figure 2. Sentiment distribution of Nike

Figure 3. Sentiment distribution of Adidas

IOP Conf. Series: Journal of Physics: Conf. Series 1176 (2019) 022015 doi:10.1088/1742-6596/1176/2/

However, the negative opinions are more valuable than the positive for the shareholders and investors of both brands. These stakeholders could evaluate the negative reviews to detect the common users’ requirements from the Twitter. For example in the word cloud of Adidas keywords (as shown in figure 4) has some negative words e.g. low. Similarly, in the word cloud of Nike (as shown in figure 5), containing the word e.g. los, indicate the bad comment. It was observed that people discuss the other brands as well when they want to talk about any single brand product. It’s concluded, online consumers always try to compare the same product of different brands before making a purchase decision. For instance, in both word clouds of Adidas and Nike, people are discussing about third apparel brand, GUCCI.

Figure 4. Word Cloud of Adidas Figure 5. Word cloud of Nike Figure 6 has shown as a comparison of both brands for sentiment distributions about the online Twitter users. It is indicated the sentiment of each brand, Adidas and Nike, and total distributions of sentiments for our apparel dataset. Positive reviews were found greater than the negative and the peak of neutral reviews in both brands is a representation of most popular brands on the online users. As compared to the positive amount of comments, the neutral comments of Nike are better than the Adidas. While the more negative views of Nike is alarming for its market value as compare to Adidas.

Figure 6. Sentiments distribution of each brands and total dataset.

5. Discussion Nowadays, online shopping is replacing the regular or custom shopping. An apparel shopping is the popular one among the online users on social media. With the technologies advancement, new methods for customer satisfaction has been introduced by using the big data, text mining and sentiment analysis. Before implementing the sentiment, we apply the preprocess step at our Twitter