How to extract contents by topic from a document?
I am trying to extract information from resumes. I used pdfminer for the text extraction, but I need to extract the content of a resume section by section, according to its headings.
For example: my educational details appear under the heading EDUCATIONAL BACKGROUND, so I need to extract the content that belongs to each heading.
Is it possible to extract like that?
What will be the process behind that?
Can the problem be approached as a segmentation task, i.e. by splitting the extracted text at its headings?
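To make the segmentation idea concrete, here is a minimal sketch of what I have in mind. It assumes the text has already been extracted to a plain string (for example with pdfminer) and that headings are short lines written entirely in capitals, like EDUCATIONAL BACKGROUND. The function name `segment_by_heading`, the `HEADING_RE` pattern, the `PREAMBLE` key, and the sample resume text are all made up for illustration; real resumes will likely need a looser heading rule.

```python
import re

# Assumed heading rule (hypothetical): a short line made only of capital
# letters, spaces, '&' or '/', optionally ending in a colon.
HEADING_RE = re.compile(r"^[A-Z][A-Z &/]{2,40}:?$")


def segment_by_heading(text):
    """Split plain text into {heading: content}, using all-caps lines as boundaries."""
    current = "PREAMBLE"  # bucket for text before the first heading
    sections = {current: []}
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue  # skip blank lines
        if HEADING_RE.match(line):
            # Start a new section named after the heading line.
            current = line.rstrip(":")
            sections.setdefault(current, [])
        else:
            sections[current].append(line)
    return {name: "\n".join(lines) for name, lines in sections.items()}


# Made-up sample text standing in for the pdfminer output.
sample = """John Doe
john@example.com

EDUCATIONAL BACKGROUND
B.Sc. Computer Science, 2015

WORK EXPERIENCE
Software Engineer, Acme Corp
"""

sections = segment_by_heading(sample)
print(sections["EDUCATIONAL BACKGROUND"])
```

This only works when the headings survive text extraction as separate lines. A short all-caps line that is not a heading (an acronym on its own line, say) would be misread as one, so I am not sure this rule is robust enough.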