TITEL
Response modeling in direct marketing: a data mining based approach for target selection
FöRFATTARE
Hossein Javaheri, Sadaf
INSTITUTION
Industriell ekonomi och samhällsvetenskap / Industriell marknadsföring och e-handel
SAMMANFATTNING
Identifying customers who are more likely to respond to a product offering
is an important issue in direct marketing. In direct marketing, data mining
has been used extensively to identify potential customers for a new product
(target selection). Using historical purchase data, a predictive response
model with data mining techniques is developed to predict a probability
that a customer is going to respond to a promotion or an offer. The purpose
of this thesis is to identify the Parsian bank customers who are more
likely to respond positively to a new product offering. To reach this
purpose a predictive response model using customer historical purchase data
is build with data mining techniques. Response modeling procedure consists
of several steps. In building a response model one has to deal with some
issues, such as: constructing all purchase behavior variables (RFM
variables), determining the inputs to the model (feature selection) and
class imbalance problem. The purpose of this study is to deal with all
these issues and steps of modeling. Thus various data mining techniques and
algorithms are used to implement each step of modeling and alleviate
related difficulties.
For modeling purpose customers' data (30,000 customers) were gathered from
Parsian bank. Based on literature and domain knowledge 85 RFM features and
their two-way interactions were constructed from collected data. Since
irrelevant or redundant features result in bad model performance thus
feature selection was performed in order to determine the inputs to the
model. Feature selection was done in three steps using F-score and backward
elimination on Random Forest. The data was highly unbalanced. We used under-
sampling for solving class imbalance problem. Finally SVM was used as a
classifier for classification purpose.
The result indicates that Parsian bank can reach three times as many
respondents as if they use no model (random sampling) for target selection.
By using this model Parsian bank not only can significantly reduce the
overall marketing cost but also can maximize customers' response to a
product offering, prevent customer annoyance and improve customer
relationship management.
ISSN 1653-0187 / ISRN LTU-PB-EX--08/014--SE / NR 2008:014
|