Clustering without information about identifier
I have a data-set with different products and binary value if it was sold in a store or not. I looks like:
product_id store_1 store_2 store_3 store_4 store_5 store_6
0 A 1 0 0 1 0 1
1 B 1 1 0 0 1 0
Is there any way to cluster these products with any information about the products itself?
One thought I had was to generate distance between products and then cluster the product X product matrix. Is this problem sort of similar to market basket analysis?
Any guidance will help.
Topic distance clustering
Category Data Science