What kind of model/type is this

Essentially I want to pass a program some variables, all gathered from a user on my site, and have the program give a score of how authentic the user is meant to be.

I already have a large set of data with already set scores and want to start creating the scores myself ( currently done through a third party)

After reading about machine learning and asking some friends I've chosen (still open to ideas) python, but I'm unsure which type of algorithm I'm going for, Supervised, unsupervised, reinforcement, semi-supervised etc.

My question is, which type am I looking for. Any example program ideas, or help is much appreciated.

Info I'll be passing on: Browser name, version Os name, version ISP Referrer Estimate of Longitude and Latitude OS System time ( and system time of area)

Topic python machine-learning

Category Data Science


This is supervised learning, more specifically a regression task (as opposed to classification, because the target is a numerical value).

The principle is this:

  • The model is trained with a training set made of instances with both the features (input variables used as indicators) and the target (the score which will be the output of the model). This way the model learns the patterns in the features which contribute to determining the target, assuming the features contain enough information to estimate the target.
  • Once the model is trained, it can be applied to any new set of instances containing only the features, and it predicts the target.
    • It is strongly recommended to first evaluate the model with a test set for which the target is known. This way we know how reliably the model can estimate the target. It's possible that the model doesn't work. The test set must be made of different instances than the training set, usually this is done by randomly splitting the full dataset with target between training and test set.
    • If the model performs decently, it can be applied "in production" to any new set of instances.

This is very common, there are many tutorials and many libraries available, the most standard with Python is scikit-learn.

You could try various methods, for example SVR or decision trees.

About

Geeks Mental is a community that publishes articles and tutorials about Web, Android, Data Science, new techniques and Linux security.