How to generate abbrevations from shortend words in medical records
I have text files which contains medical history of a patients and would want to extract information out of it.
Basically what want is generate english text of abbrevations,semantic category region,location and relation also correct spelling mistakes if any by parsing medical history records.
I have google around for open source tools and found cTAKES Metamap.However could not find python api for the same.
Can someone suggest how to make use of these tools or interact via python?and also is there a better way to extract information using regular epression..etc.
Sample sentence:
55 yr M comes to the ED with c/o of a chest pain that started 1 hr ago,
The cP are a/w SOB.
Expected o/p
55 yr(year) (M)male comes to the (ED)emergency department with (c/o)complaint of a chest pain that started 1 hr(hour) ago,The (cP)chest pain is associated with (SOB)short of breathing.
Apart from generating abbrevations,also need to cature temparatures,levels such as 5/4,semantic category region,location and relation also correct spelling mistakes in the clinical notes.
While am waiting for UMLS license,just wanted to check can above functionality be achieved through UMLS/cTaked/Metamap? or cutsom script needs to be written in conjunction with python/NLP/RE and open source clinical abbrevations dictionary(please suggest)?
Category Data Science