Identifing this dataset for sanitising
I am beginner here starting with data science for analytics. I am trying to figure out what data set this is and how to read it from python. I have an idea of the steps but not sure how to code it in python.
- Open read the file
- Search for keywords based on another file
- If keyword found, search for Term from that line up and copy value of id: which is below it. If more than one keyword is found copy their id value with a comma.
- Close the file If anyone can direct me to any tutorial that would help me here. I would appreciate it. I came across a lot of them but those data sets are tables and columns.
Category Data Science