What kind of regression model should I do?
my research question is the examine the effect of "receiving attention" from other members in an online community on "sustained participation" on the website.
I decided to measure "sustained participation" of each user by calculating average time difference between the submissions of the user. I calculated it in the following way:
and I measured "attention" by calculating total number of the comments each user received for all the submissions he/she has posted.I also want to consider total number of votes and total number of views as well. I am not sure if it is a good idea to add those as the independent variables into model too or not?
My problem is about Dependent variable:
since some people just participated two times in two successive days and the average between submission days is 1. and some people participated 100 times and their average between submission days of them is also 1. but it is obvious that the second group that have participated 100 times had sustained participation not the first group.
So I need to consider the number of the submissions in the model too! I do not know if there is a way to consider it? how I can handle this problem?
should I group the users and do data analysis separately for them. for example users who have participated less than 10 times in one group! users with 10-20 participation in another group and etc.
I would appreciate if anyone can help me! my paper's due is so close and I need some preliminary results.
Topic regression research
Category Data Science