This is a named entity extraction & linking API that performs very well even on short texts, on which many other similar services do not. It currently works on texts in English, French, German, Italian, Portuguese, Russian, Spanish. With this API you will be able to automatically tag your texts, extracting Wikipedia entities and enriching your data.
We support both GET and POST methods to query the API.
Remember to authenticate yourself specifying the token parameter (or the legacy $app_id and $app_key pair). See the API doc about authentication for any questions.
Type | string |
Type | string |
Default value | auto |
Accepted values | de | en | es | fr | it | pt | ru | auto |
Type | integer |
Default value | 0 |
Accepted values | 0 .. +inf |
Type | float |
Default value | 0.6 |
Accepted values | 0.0 .. 1.0 |
Type | integer |
Default value | 2 |
Accepted values | 2 .. +inf |
Type | comma-separated list |
Default value | <empty string> |
Accepted values | types, categories, abstract, image, lod, alternate_labels |
Example | include=types,lod |
Type | comma-separated list |
Default value | <empty string> |
Accepted values | phone, vat |
Example | extra_types=phone,vat |
Type | string |
Default value | <empty string> |
Accepted values | AD, AE, AM, AO, AQ, AR, AU, BB, BR, BS, BY, CA, CH, CL, CN, CX, DE, FR, GB, HU, IT, JP, KR, MX, NZ, PG, PL, RE, SE, SG, US, YT, ZW |
Looking for some advanced parameter? Show me more
Type | float |
Default value | 0.3 |
Accepted values | 0.0 .. 0.5 |
The response is structured in JSON as follow:
{
"timestamp": "Date and time of the response generation process",
"time": "Time elapsed for generating the response (milliseconds)",
"lang": "The language used to tag the input text",
"langConfidence": "Accuracy of the language detection, from 0.0 to 1.0. Present only if auto-detection is on",
"text": "The annotated text. Present only if the 'html' parameter has been used",
"annotations": [
{
"id": "ID of the linked Wikipedia resource",
"title": "Title of the linked Wikipedia resource",
"uri": "URL of the entity on Wikipedia",
"label": "Most common name used to represent the resource",
"confidence": "Value of confidence for this annotation",
"spot": "Annotated string, as it is in the input text",
"start": "Character position in the input text where the annotation begins",
"end": "Character position in the input text where the annotation ends",
"types": ["List of types of the linked DBpedia resource","Only if 'include' parameter contains 'types'"],
"categories": [
"List of the category of the linked DBpedia resource",
"Only if 'include' parameter contains 'categories'"
],
"abstract": "Abstract of the linked Wikipedia resource. Only if 'include' parameter contains 'abstract'",
"lod": {
"wikipedia": "URL of the Wikipedia article that represents the resource",
"dbpedia": "URI of the resource on DBpedia"
},
"alternateLabels": [
"List of other names used when referring to the entity",
"Only if 'include' parameter contains 'alternate_labels'"
],
"image": {
"full": "URL of a depiction of the resource on Wikipedia. Only if 'include' parameter contains 'image'",
"thumbnail": "URL of the thumbnail of the depiction. Only if 'include' parameter contains 'image'",
}
}
],
"topEntities": [ # Only if 'top_entities' parameter is greater than 0
{
"id": "ID of the linked Wikipedia resource",
"uri": "URL of the entity on Wikipedia",
"score": "The result of the ranking algorithm"
}
]
}
For more information about status codes and error handling please refer to the dandelion generic API documentations. The cost of each request can be found in the response headers as described here.
Connection: keep-alive
Content-Length: 2748
Content-Type: application/json;charset=UTF-8
Date: Wed, 21 Oct 2015 16:29:37 GMT
Server: Apache-Coyote/1.1
X-DL-units: 1
X-DL-units-left: 999
X-DL-units-reset: 2015-10-22 00:00:00 +0000
{
"timestamp": "2015-10-21T16:29:37",
"time": 2,
"lang": "en",
"annotations": [
{
"abstract": "A physician is a professional who practices medicine, which is concerned with promoting, maintaining or restoring human health through the study, diagnosis, and treatment of disease, injury, and other physical and mental impairments. They may focus their practice on certain disease categories, types of patients, or methods of treatment \u2013 known as specialist medical practitioners \u2013 or assume responsibility for the provision of continuing and comprehensive medical care to individuals, families, and communities \u2013 known as general practitioners. Medical practice properly requires both a detailed knowledge of the academic disciplines (such as anatomy and physiology) underlying diseases and their treatment \u2013 the science of medicine \u2013 and also a decent competence in its applied practice \u2013 the art or craft of medicine.",
"id": 23315,
"title": "Physician",
"start": 4,
"categories": [
"Physicians",
"Healthcare occupations",
"Occupations"
],
"lod": {
"wikipedia": "http://en.wikipedia.org/wiki/Physician",
"dbpedia": "http://dbpedia.org/resource/Physician"
},
"label": "Physician",
"types": [],
"confidence": 0.438,
"uri": "http://en.wikipedia.org/wiki/Physician",
"end": 10,
"spot": "doctor"
},
{
"abstract": "The apple is the pomaceous fruit of the apple tree, species Malus domestica in the rose family (Rosaceae). It is one of the most widely cultivated tree fruits, and the most widely known of the many members of genus Malus that are used by humans. Apples grow on small, deciduous trees. The tree originated in Central Asia, where its wild ancestor, Malus sieversii, is still found today. Apples have been grown for thousands of years in Asia and Europe, and were brought to North America by European colonists. Apples have been present in the mythology and religions of many cultures, including Norse, Greek and Christian traditions. In 2010, the fruit's genome was decoded, leading to new understandings of disease control and selective breeding in apple production.",
"id": 18978754,
"title": "Apple",
"start": 19,
"categories": [
"Apples",
"Malus",
"Plants described in 1803",
"Sequenced genomes"
],
"lod": {
"wikipedia": "http://en.wikipedia.org/wiki/Apple",
"dbpedia": "http://dbpedia.org/resource/Apple"
},
"label": "Apple",
"types": [
"http://dbpedia.org/ontology/Eukaryote",
"http://dbpedia.org/ontology/Plant",
"http://dbpedia.org/ontology/Species"
],
"confidence": 0.7869,
"uri": "http://en.wikipedia.org/wiki/Apple",
"end": 24,
"spot": "apple"
},
{
"abstract": "The orange (specifically, the sweet orange) is the fruit of the citrus species Citrus × sinensis in the family Rutaceae. The fruit of the Citrus sinensis is called sweet orange to distinguish it from that of the Citrus aurantium, the bitter orange. The orange is a hybrid, possibly between pomelo (Citrus maxima) and mandarin (Citrus reticulata), cultivated since ancient times.",
"id": 4984440,
"title": "Orange (fruit)",
"start": 43,
"categories": [
"Oranges",
"Citrus hybrids",
"Tropical agriculture",
"Symbols of Florida",
"Symbols of California",
"United States state plants",
"World Digital Library related"
],
"lod": {
"wikipedia": "http://en.wikipedia.org/wiki/Orange_(fruit)",
"dbpedia": "http://dbpedia.org/resource/Orange_(fruit)"
},
"label": "Orange",
"types": [
"http://dbpedia.org/ontology/Eukaryote",
"http://dbpedia.org/ontology/FloweringPlant",
"http://dbpedia.org/ontology/Plant",
"http://dbpedia.org/ontology/Species"
],
"confidence": 0.7515,
"uri": "http://en.wikipedia.org/wiki/Orange_(fruit)",
"end": 49,
"spot": "orange"
}
]
}
If you're new to the Entity Extraction API you may want to:
Dandelion API
built with ❤ by SpazioDati S.r.l.
Company subject to management and coordination of Cerved Group S.p.A.
site privacy | api privacy | tos | cookies | consent preferences
We're a startup based in Italy, specialized in Semantics & Big Data.
Find out more about us at spaziodati.eu