Professional Documents
Culture Documents
ISSN No:-2456-2165
Abstract:- Customer service centers are most crucial part and dislikes of customers about the company. It helps
of any company. They represent the company and companies evaluate the social sentiment of their
communicate with the customers on its behalf. These brand.Sentiment Analysis is crucial and should be
centers also impart valuable information through treatedseriously as customer feedback contains a plethora of
customer feedback. As they form the bridge between the useful information which if properly harnesses can help
company and its customers, it is important they convey companies achieve next to impossible goals. But, just
information timely and effectively. Emails are the most knowing what customers are talking about it is not enough.
popular means of business communication. In order to Companies must also understand how they feel. Sentiment
efficiently and effectively utilize time, the customer analysis is one way to discover these feelings. The customer
service centers need to extract the relevant information service center employees can thus determine the sentiment of
from these emails. Then they need organizing it and the email and prioritize or respond appropriately.
promptly respond to the customers accordingly. In this
paper, we propose categorizing emails based on the Massive amounts of data are collected on daily basis in
customer reviews. The polarity (positive or negative) of so many companies. The huge amount of information makes
the customer emails is determined along with its it difficult to understand the data or to find what we are
probability and also determine the topic of the email. searching. We have to employ different methods to organize
After preliminary data pre-processing, we use the Naive- the data systematically and extract relevant information from
Bayes algorithm based approach to classify the emails it.Topic modeling is one such powerful technique which
and then the topic modeling is performed. This way, the allows us to not only organize but also summarize large
project helps to classify emails and determine their amount of textual data. It is helpful for determining different
subjects to save valuable time for the employees. topical patterns present in the dataset which otherwise remain
invisible.Topic modelling is a method for finding subjects
Keywords:- Emails, Customer Care, Categorization, Topic (i.e. topics) from a data that best describe the content present
Modeling. in the document. There are many methods which can be used
to obtain topic models. For our project, we will be using Bag
I. INTRODUCTION of Words technique to find the topics of the emails at the
customer service centers. This will help the employees to
Emails are convenient and the most popular means of summarize the content of the emails.
communication for customers. They help in avoiding long
waiting hours for telephonic conversation as well as in Today, the algorithm-based sentiment analysis tools can
preserving the record of the communication that is happening handle huge volumes of customer feedback consistently and
between the customer service center and the customer. It is accurately. Paired with topic modeling, sentiment analysis
crucial that these emails be properly organized at the reveals the customers opinion about topics regarding
customer service so as to spontaneously and appropriately different products and services. In our research, we provide
reply. Also, if properly utilized, these emails can yield with various features for extraction for Naive Bayes.
invaluable data for the companies to access the needs and
demands of the customers.As technology advances, the Algorithm based classifier for the project and the
customers have more interactive and dynamic relation with subject of the email through topic modeling. This
the company. The amount of the emails received by classification helps the customer service employees to
customers at the customer service center is increasing prioritize emails. It also assists them to assess the customer
rapidly. Handling this enormous amount of email manually responses to particular products and formulate market
can be a very time-consuming and complex task. The strategies in accordance. Furthermore, the customers inputs
problem can be solved if emails are automatically classified are useful as they inadvertently provide the businesses with
on the basis of their content. insightful information regarding the current trends and
requires of customers.
Sentiment Analysis, also known as opinion mining
sometimes, is a method to assess written or spoken language
to determine whether the expression is a positive, negative,or
neutral, and to what extent. It is also the most common
classification tool for textual analysis.Sentiment analysis is
important because it helps businesses understand the likes
VI. EXPERIMENT
A. Sentiment Module
Fig 1:- Overview of the process This module helps to determine the polarity of the
incoming customer emails with the help of Naive Bayes
IV. ALGORITHM classifier.Firstly, we need to find a method to train our
Naive-Bayes classifier to classify our mails as positive or
A. Naive Bayes negative. For this purpose, we use the words in the positive
Naive Bayes algorithm is one of the Bayesian theorems and negative reviews as their features. The classifier, after
and a supervised machine learning technique. It can be used training, has thus learned to associate the words from positive
for binary as well as multiclass classification problems. It is reviews as features for positive sentiment and similarly those
named Naive because it simplifies the probabilities for all from negative reviews as features for negative sentiment.The
hypothesis to make their corresponding calculation easy. The feature set, which contains the features extracted after
algorithm works by finding out the probability of the training the classifier, is then loaded onto the module.The
different attributes of data being associated with the certain Naive Bayes algorithm based classifier is then used to
class. Naive Bayes classifier works under the assumption that determine the sentiment of the emails based on the content
the presence of one feature in a class is independent of the from the customers.The function sentiment() is then defined
presence of other features. The classier selects the most likely where the new text is finally classified.
classification for a given set of the attribute values.
B. Topic Module
This module contains two distinct functions; clean()
and topic().In the clean function, the data is pre-processed so
that only relevant words are retained for topic determination.
Pre-processing involves processes such as tokenization stop-
P(c—x) is the posterior probability of class(target) given words removal, stemming and POS tagging. After the
predictor(attribute). cleaning of the dataset, it is now ready for determination of
• P(c) is the prior probability of class. topics and is thus sent to the topic function The topic
• P(x—c) is the likelihood which is the probability of function, which uses the Bag of Words approach, is then used
predictor given class. to choose the most appropriate topic for the corresponding
• P(x) is the prior probability of predictor. emails.
REFERENCES