Division of data into training and validation sets
I have a multi-sensory dateset for the activities of daily living. It contains data from 10 volunteers each performing 9 activities. Each volunteer wears 6 sensors on their body with the recorded data type quaternions, acceleration, and angular velocity. For each volunteer, I have total of 7 CSV files i-e 6 for each sensor and one for annotation.
Now, I would like to divide the data of 7 volunteers into training and validation and the remaining 3 for testing. For 7 volunteers I have a total of almost 49 CSV files.
What should be the required approach to divide these into training and validation sets? I can find a lot of information regarding a single CSV files, but not about bunch of these.
I am looking forward to some advice.
Topic activity-recognition machine-learning-model deep-learning
Category Data Science