Should I concat multiple stock timeseries datasets into one?

I have several timeseries datasets of stock data, with fundamental indicators. I would like to build a model that selects stocks for buy and hold.

I understand that to perform this task I have two options:

  1. Train a model for each stock: This way, I understand that it is the most practical, however, the amount of data for each model will be very reduced (Each dataset has less than 1000 lines).

  2. Putting all the data together in a single dataset: I didn't find anything on the internet to support this idea, however, I understand that the model would be more robust and would have a much larger amount of data to be trained.

So, what would be the correct way to perform this type of analysis? Any of you would suggest another way?

Thannk you in advance!

Topic multivariate-distribution finance time-series machine-learning

Category Data Science


I would suggest option two. Because this way your model would have the chance to learn something for one stock, which it can apply for other stocks as well. If you provide the type of stock as an input feature, it should be able to distinguish between the specialities which only occure within one stock and the common things. So it is kind of able to transfer knowledge from one stock to the other.

But at the same time, I would suggest to try out both and choose whatever performs the best. This way you can also justify your choice in the end.

About

Geeks Mental is a community that publishes articles and tutorials about Web, Android, Data Science, new techniques and Linux security.