Extract segment from document scan

Question

feeeper

2022年6月3日 01:00

I need to extract some "valuable" information from document scan. For example, document's number, incoming date, organizations, persons, etc.

Example document:

I'm trying to extract highlighted segment of the document. Original scan doesn't have that highlighting. And value can be handwritten or typewritten.

I tried U-Net and Mask RCNN for my dataset (~100 examples). Without any success.

Any ideas?

fuwiak · Accepted Answer · 2019年12月9日 17:53

fuwiak answered at 2019年12月9日 17:53

Priviet, feeper!

I created some simple program to extract data from documents. Works pretty well.

Best