How to segregate resume layouts into different types?
I'm looking for suggestions on how to segregate resume layouts into different types.
How does one proceed with such a task? Resumes are usually available in PDF or DOCX format, and when we parse the text from these documents we lose a lot of information about layout and metadata.
So how could one build a system to segregate resumes based on their layouts?
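To make concrete what I mean by layout information, here is a minimal sketch. It assumes word bounding boxes are already available from some PDF or DOCX extraction step. The box format `(x0, y0, x1, y1)`, the page size, the grid size, and the function names are all hypothetical and not taken from any particular library. The idea is to turn the boxes into a coarse occupancy grid and compare pages on that, ignoring the text itself.

```python
from math import sqrt

def layout_fingerprint(boxes, page_w, page_h, grid=4):
    """Map word boxes (x0, y0, x1, y1) to a normalized grid*grid occupancy vector."""
    cells = [0.0] * (grid * grid)
    for x0, y0, x1, y1 in boxes:
        # Use the centre of each box to pick the grid cell it falls into.
        cx = (x0 + x1) / 2
        cy = (y0 + y1) / 2
        col = min(int(cx / page_w * grid), grid - 1)
        row = min(int(cy / page_h * grid), grid - 1)
        cells[row * grid + col] += 1.0
    total = sum(cells)
    return [c / total for c in cells] if total else cells

def cosine(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    na = sqrt(sum(x * x for x in a))
    nb = sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0

# Made-up boxes on a 600 x 800 page: one single-column page, one two-column page.
single_column = [(50, y, 550, y + 10) for y in range(50, 750, 50)]
two_column = ([(50, y, 250, y + 10) for y in range(50, 750, 100)]
              + [(350, y, 550, y + 10) for y in range(50, 750, 100)])

fp_single = layout_fingerprint(single_column, 600, 800)
fp_double = layout_fingerprint(two_column, 600, 800)

# Low similarity here suggests the two pages use different layouts.
print(cosine(fp_single, fp_double))
```

Fingerprints like these could presumably be fed into a clustering step to group resumes by layout, but I don't know whether such a coarse feature is good enough in practice.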
Any suggestions would be really helpful.
Topic ocr deep-learning text-mining parsing nlp
Category Data Science