Förstasida | Sök | In English

Luleå tekniska universitet

Examensarbete, påbyggnadsutbildningar Marknadsföring och e-handel / 2008:014

Hämta PDF ( PDF 1497 kb )
TITEL
Response modeling in direct marketing: a data mining based approach for target selection

FöRFATTARE
Hossein Javaheri, Sadaf

INSTITUTION
Industriell ekonomi och samhällsvetenskap / Industriell marknadsföring och e-handel

SAMMANFATTNING
Identifying customers who are more likely to respond to a product offering is an important issue in direct marketing. In direct marketing, data mining has been used extensively to identify potential customers for a new product (target selection). Using historical purchase data, a predictive response model with data mining techniques is developed to predict a probability that a customer is going to respond to a promotion or an offer. The purpose of this thesis is to identify the Parsian bank customers who are more likely to respond positively to a new product offering. To reach this purpose a predictive response model using customer historical purchase data is build with data mining techniques. Response modeling procedure consists of several steps. In building a response model one has to deal with some issues, such as: constructing all purchase behavior variables (RFM variables), determining the inputs to the model (feature selection) and class imbalance problem. The purpose of this study is to deal with all these issues and steps of modeling. Thus various data mining techniques and algorithms are used to implement each step of modeling and alleviate related difficulties. For modeling purpose customers' data (30,000 customers) were gathered from Parsian bank. Based on literature and domain knowledge 85 RFM features and their two-way interactions were constructed from collected data. Since irrelevant or redundant features result in bad model performance thus feature selection was performed in order to determine the inputs to the model. Feature selection was done in three steps using F-score and backward elimination on Random Forest. The data was highly unbalanced. We used under- sampling for solving class imbalance problem. Finally SVM was used as a classifier for classification purpose. The result indicates that Parsian bank can reach three times as many respondents as if they use no model (random sampling) for target selection. By using this model Parsian bank not only can significantly reduce the overall marketing cost but also can maximize customers' response to a product offering, prevent customer annoyance and improve customer relationship management.

ISSN 1653-0187 / ISRN LTU-PB-EX--08/014--SE / NR 2008:014

Förstasida | Sök | Universitetet | Biblioteket


Till biblioteket
LULEÅ UNIVERSITETSBIBLIOTEK