How can i determine the company role of each employee based on a paragraph of text?

yishairasowsky · March 18, 2020, 7:42pm

I want to do this using machine learning.

I do not know necessarily what role titles will be given.

Example input:

"The CEO is John Smith; Jane Jones is the COO, and recently Mark Wright was made the CTO."

Desired output:

{"CEO": "John Smith", "COO":"Jane Jones", "CTO":"Mark Wright"}

yishairasowsky · March 19, 2020, 6:36pm

if i have thousands of paragraphs which indicate the titles in the staff of a team, e.g. "CEO is Smith, the CTO will be Jones, for ten years Higgins has been the VP", then how can I se machine to extract this information and classify it neatly?

ines · March 20, 2020, 8:22pm

What works best ultimately depends on your data, but you could try and solve this with a combination of named entity recognition to first detect all names in the text, and then use rules based on the syntax and keywords ("CEO", "CTO" etc.) to extract the relationships.

This example shows a pretty similar use case:

Also see the documentation on NER with Prodigy here:

Topic		Replies	Views
Extracting current and prior company affiliations from bios usage , ner , best-practices	4	1545	February 1, 2019
Company name matching usage , ner	1	1326	March 16, 2020
Parsing/Identifying sections in job descriptions usage , ner , custom	3	3259	June 16, 2022
Using the NER_manual interface to annotate text classification usage , textcat , front-end	4	414	September 14, 2022
what is best way to to extract paragraph or long sentences in a text document? usage	18	3685	August 9, 2020

How can i determine the company role of each employee based on a paragraph of text?

Related topics