Two steps optimization of a credit card limit

I have a problem similar to what is on the title but not the same. The problem on the title allows me to explain the dynamics of my need.

I have to determine what the optimal value is for a variable called QUOTA or LIMIT for a credit card.

The goal of the model is to allow me to minimize the probability of default, given this variable and others that characterize my costumer. What is the best way to determine this?

I have thought of two possibilities:

  • Train a model with all the exogenous and the LIMIT as variables for any classification model (with Y=Default/No Default)
  • Train a model with all the exogenous variables but not a LIMIT (with Y=Default/No Default), then train another model with the resulting probability and LIMIT as the only two variables (with Y=Default/No Default).

Do any of these possibilities sound right?

With both models, the application dataset should repeat costumers with two or more possible values for the LIMIT variable and obtain the default probability of every customer repetition. Does this sound correct?

Topic data-science-model logistic-regression optimization

Category Data Science


You can model it as a binary logistic regression. The target is the probability of default. One of the features is credit card limit.

Logistic regression allows an odds ratio to be calculated. The odd ratio will give the effect of increasing credit card limit on wether a default is predicted to happen.

Logistic regression allows a threshold to be set to "minimize the probability of default".

About

Geeks Mental is a community that publishes articles and tutorials about Web, Android, Data Science, new techniques and Linux security.