Keyword and Keyphrase Extraction
For large inputs (above 128,000 tokens) you will need to use the asynchronous mode: see more in the documentation.
What is Keyword/Keyphrase Extraction and Why Use GPT?
Keyword extraction means pulling one or several important words out of a piece of text. These words should capture the core ideas of the text.
For example, imagine you have the following content:
Information Retrieval (IR) is the process of obtaining resources relevant to the information need. For instance, a search query on a web search engine can be an information need. The search engine can return web pages that represent relevant resources.
The important keywords in this example could be information, resources, search.
If single keywords are too coarse, you might want to extract keyphrases instead: combinations of several words. For example, in the above content, important keyphrases could be information retrieval, relevant resources, search query, search engine.
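To make the difference between keywords and keyphrases concrete, here is a minimal sketch of a naive frequency-based baseline. It is not how the AI models mentioned on this page work. The stopword list and the function names (tokenize, top_keywords, top_keyphrases) are assumptions made up for this illustration.

```python
import re
from collections import Counter
from typing import List

# Tiny illustrative stopword list (an assumption; real lists are much longer).
STOPWORDS = {"a", "an", "the", "is", "of", "to", "on", "for",
             "can", "be", "that", "in", "and"}

SAMPLE = (
    "Information Retrieval (IR) is the process of obtaining resources "
    "relevant to the information need. For instance, a search query on a "
    "web search engine can be an information need. The search engine can "
    "return web pages that represent relevant resources."
)


def tokenize(text: str) -> List[str]:
    """Lowercase the text and keep alphabetic runs only."""
    return re.findall(r"[a-z]+", text.lower())


def top_keywords(text: str, n: int = 3) -> List[str]:
    """Return the n most frequent single words that are not stopwords."""
    words = [w for w in tokenize(text) if w not in STOPWORDS]
    return [word for word, _ in Counter(words).most_common(n)]


def top_keyphrases(text: str, n: int = 3) -> List[str]:
    """Return the n most frequent adjacent word pairs without stopwords."""
    tokens = tokenize(text)
    pairs = [
        f"{first} {second}"
        for first, second in zip(tokens, tokens[1:])
        if first not in STOPWORDS and second not in STOPWORDS
    ]
    return [phrase for phrase, _ in Counter(pairs).most_common(n)]


if __name__ == "__main__":
    print(top_keywords(SAMPLE, 2))
    print(top_keyphrases(SAMPLE, 2))
```

On the sample paragraph, the two most frequent word pairs are "information need" and "search engine", ahead of "information retrieval", which appears only once. Raw frequency alone therefore misses a phrase a human reader would probably pick first.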
Performing keyword and keyphrase extraction is harder than it sounds. It takes an advanced AI model to identify the core ideas in a piece of text.
For advanced use cases, it is possible to fine-tune Dolphin, Yi 34B, and Mixtral 8x7B (train them with your own data), which is a great way to perform keyword and keyphrase extraction that is tailored to your use case or industry.
Why Use Keyword/Keyphrase Extraction?
Keyword and keyphrase extraction is a great way to quickly get a good grasp of a piece of text, and potentially categorize the text for later use. Here are a few examples:
Social Media Analysis
Huge numbers of ideas are shared on social media, and you might want to understand the main ideas behind this chaos. Keyword/keyphrase extraction lets you do this quickly.
Customer Feedback
Asking for customer feedback is a great practice, but analyzing the results properly takes a lot of time. Keyword/keyphrase extraction makes this qualitative analysis much easier.
Competition Monitoring
Do you want to monitor your competitors' brands? You can do it by retrieving their content and extracting the most important ideas.
SEO
Finding the right keywords for your positioning can be tricky. One strategy is to analyze your competitors' websites and understand which keywords they are targeting.
Use GPU
Choose whether to run the model on a GPU. Machine learning models generally run much faster on GPUs.
Language
AI models don't always work well with non-English languages.
We do our best to add non-English models whenever possible. See for example Fine-tuned LLaMA 3.1 405B, LLaMA 3 70B, Dolphin, ChatDolphin, XLM Roberta Large XNLI, Paraphrase Multilingual Mpnet Base V2, or spaCy. Unfortunately, not all models handle non-English languages well.
To address this, we developed a multilingual add-on that automatically translates your input into English, performs the actual NLP operation, and then translates the result back into your original language. It makes your requests a bit slower but often returns very good results.
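The translate, process, translate-back flow described above can be sketched as follows. This is only an illustration of the idea, not the service's actual implementation: the function extract_multilingual, the two callable signatures, and the toy stand-ins are all hypothetical names introduced for this example.

```python
from typing import Callable, List

# Hypothetical signatures, assumed for illustration only.
# translate(text, source_lang, target_lang) -> translated text
Translate = Callable[[str, str, str], str]
# extract(english_text) -> list of English keyphrases
Extract = Callable[[str], List[str]]


def extract_multilingual(text: str, lang: str,
                         translate: Translate, extract: Extract) -> List[str]:
    """Translate to English, extract keyphrases, translate each one back."""
    if lang == "en":
        # English input needs no translation round trip.
        return extract(text)
    english_text = translate(text, lang, "en")
    english_phrases = extract(english_text)
    return [translate(phrase, "en", lang) for phrase in english_phrases]


# Toy stand-ins so the sketch runs without any external service.
TOY_DICTIONARY = {
    ("fr", "en"): {"moteur de recherche": "search engine"},
    ("en", "fr"): {"search engine": "moteur de recherche"},
}


def toy_translate(text: str, source: str, target: str) -> str:
    """Look the text up in a tiny table; return it unchanged if unknown."""
    return TOY_DICTIONARY[(source, target)].get(text, text)


def toy_extract(text: str) -> List[str]:
    """Pretend the whole input is a single keyphrase."""
    return [text]


if __name__ == "__main__":
    print(extract_multilingual("moteur de recherche", "fr",
                               toy_translate, toy_extract))
```

The extra translation calls on the way in and on the way out are where the added latency comes from: one call for the input, plus one per returned keyphrase in this sketch.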
Even models that natively understand non-English languages sometimes work better with the multilingual add-on.
Simply select your language in the list, and from now on you can write the input text in your own language!
This multilingual add-on is a free feature.