Open Source Annotation Tools for spaCy's Entity Linker?

According to the Tutorial "Training a custom ENTITY LINKING model with spaCy" (20:33) this is the training data format for spaCy's Entity Linker: TRAIN_DATA = ("Emerson was born on a farm in Blackbutt, Queensland.", {"links": {(0, 7): { "Q312545": 1.0 }}}) My search for open source annotation tool is not successful. Do you know an open source/free annotation tool for this format?
Category: Data Science

Entity linking vs aliasing

The process of finding entity in a knowledge base (KB) that a given keyphrase in a text refers to is called entity linking. I have the opposite problem. I have an entity in my knowledge base (KB) and I want to find all the ways people might refer to this entity. For instance, I have "Madonna" (singer) and I am looking for aliases like "Louise Ciccone", "Madonna Ritchie", "Queen of Pop", "Mo", etc. Is it called aliasing? Or there is …
Category: Data Science

complete entity extraction from unstructured data

I understand there are many techniques/libraries/packages to extract named entities like people, places etc. from data. Personally, for me an entity is something like: first name: john surname: smith dob: 1/1/2000 shoesize: 6 address: ... etc. So an entity is a class having fields, to use object orientated terminology. One would expect that these fields/attributes would occur close in a unstructured data (closeness could be defined by word distance). Are there techniques to extract, what I would call, complete entities? …
Category: Data Science

how to find the best candidate

I am doing some work about entity disambiguation. Please suppose, there are some candidates for each entity, e.g. e1 has three candidates c1, c2, c3. Each candidate has two values: v1 and v2 We know that higher values are better but I do not know about their weights. I am looking for an approach to find the best candidates (there exist some training data, the suitable candidate of the entity is available). Could you please advise me which one should …
Category: Data Science

About

Geeks Mental is a community that publishes articles and tutorials about Web, Android, Data Science, new techniques and Linux security.