BIO tagging software

I would like to label character data with BIO tags as part of an active learning process on unlabelled data. I am assuming there are open source GUI tools available which I can use to make this easier - i.e. present the string to be labeled and some way of tagging characters from a predefined set of tags (and probably allow new tags to be added).

I have not been able to find anything though - ideally cross-platform (Linux and macOS at a minimum). It could be a cloud service or AWS product but I'd prefer something which is locally installed.

Does anyone know of such a thing? Even something as simple as search term would help.

Topic active-learning machine-learning

Category Data Science


Another solution, without however the active learning support that prodigy offers, is doccano. It's open-source (free), easy to install and supports various applications, apart from NER (BIO tagging), also classification and translation.


Prodigy is a tool for data annotation, I think it shines in NLP tasks.

I've also heard about Amazon mechanical turk, I know it can be used for annotation, but it's more general purpose.

About

Geeks Mental is a community that publishes articles and tutorials about Web, Android, Data Science, new techniques and Linux security.