How can I use a class variable with many possible values in logistic regression?
I am attempting to build a logistic regression model that determines the probability of an outcome based on a set of independent variables.
For context, the data is based on a project in which sales representatives and branch managers for a builders merchant were given price recommendations for their customers' deals, and were given the option of saying 'Yes' or 'No' to these price recommendations. The Yes or No answer is my dependent variable; I need to determine which variables can predict with the highest probability whether the respondent will say 'no' to a price recommendation.
Most of the independent variables work fine in this model, except for one which I am currently unable to test; individual differences between the respondents.
My hypothesis is that the propensity to say 'no' will be stronger in some individual respondents than others, be it for psychological or geographical reasons, and that these individual differences will be a stronger determiner for the dependent variable than any other class variable.
There are about 800 respondents, so simply shoving them in as an independent variable does not produce the desired results.
Is there a method of doing this in logistic regression? Should I use another analysis technique for this?
I am using Statistical Analysis Software (SAS) to carry out the logistic regression.
Topic sas logistic-regression statistics
Category Data Science