Extracting structure and content from invoices
Lately, I have been largely inspired by this https://rossum.ai/, which is able to extract text from invoice documents.
Do you have any ideas on how this could be implemented? It's clear that they did a lot of research to reach this performance level, but in my case I am interested in the overall approach to such problems.
If I understand correctly, the first part of the pipeline is to extract different blocks from the document. In that case, is object detection the right approach to get bounding boxes around the blocks? I guess it might not be really good at extracting tabular data.
If not object detection, what is the correct way to tackle the problem?
Thanks.
Topic object-detection ocr text
Category Data Science