John Doe is a web developer at Google.
NER stands for Named Entity Recognition. It is a subtask that involves identifying and classifying named entities in text into predefined categories such as the names of persons, organizations, locations, expressions of times, quantities, monetary values, percentages, etc.
Generative models like ChatGPT, GPT-3.5, GPT-4, LLaMA 3, Yi 34B, or Mixtral 8x7B, are very good at performing entity extraction.
NER is crucial for many NLP applications like question answering, text summarization, and machine translation, as it provides detailed information about the key elements of a text, enabling deeper understanding and processing. For instance, knowing that "Paris" refers to a location in a given text can significantly influence the interpretation of that text and the response generated by an NLP system.
Let's say you have the following sentence:
John Doe is a web developer at Google.
You would like to automatically detect that "John Doe" is a name, "web developer" is a job title, and "Google" is a company. And this is exactly what NER is going to do.
The world is full of unstructured data, especially the web. Being able to extract structured information from it can give access to a lot of valuable information. Here are a couple of examples.
When dealing with lots of customer requests (support, sales, ...) it definitely helps to apply NER in order to automatically sort these incoming requests. For example you could automatically extract the type of product mentioned in the request and route this to the right service accordingly.
Extracting and consolidating financial data can be long and tedious. NER can definitely boost your productivity here by helping you extract the right data in a second.
HR services are sometimes having a hard time reading all these applications. It can be interesting for them to automatically highlight interesting entities like company names, skills,... in order to save time.
Many B2B leads can be found on public websites or company brochures, but extracting them manually can sometimes be a pain. Thanks to NER you can automatically extract a person, with her jobtitle, and company, if they exist.
NLP Cloud proposes an entity extraction API that allows to perform Named Entity Recognition out of the box, based on spaCy, Ginza, or more advanced generative AI models equivalent to GPT-4, GPT-3.5, or ChatGPT, like LLaMA 3, Dolphin, Yi 34B, Mixtral 8x7B, and more. For advanced entity extraction on specific documents we recommend that fine-tune your own generative models for NER on NLP Cloud.
For more details, see our documentation about entity extraction here. For advanced usage, see the text generation API endpoint here. And easily test entity extraction on our playground.
Testing NER locally is one thing, but using it reliably in production is another thing. With NLP Cloud you can just do both!