Clustering with sets as values
I have gathered a large amount of qualitative data and am now looking to cluster it so as to make sense of it. For this, I am using Biolab's Orange.
In my data, specific values may co-occur in a given feature, or they may not. I am wondering how I could cluster the data (either in Orange or other software), where values that co-occur would be seen as two values, rather than one string.
To make matters clearer, imagine I have a feature X, with the possible values A and B. The values can occur in the following way: A, B, A and B.
Question: how can I cluster my data without "A and B" being treated as a separate string, but rather "A" and "B" co-occurring?
Topic orange3 orange clustering
Category Data Science