description |
---|
Topics assigned to works |
Concepts are abstract ideas that works are about. OpenAlex indexes about 65k concepts.
- Get all the concepts used by OpenAlex:
https://api.openalex.org/concepts
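A minimal sketch of calling this endpoint from Python using only the standard library. The `per-page` query parameter and the `results` key in the response are assumptions about the API's paging and response shape that this page does not state. `build_concepts_url` and `fetch_concepts` are illustrative helper names, not part of any OpenAlex client.

```python
import json
import urllib.parse
import urllib.request

BASE_URL = "https://api.openalex.org/concepts"


def build_concepts_url(per_page=25):
    """Return the concepts list URL with a page-size query parameter."""
    # "per-page" is an assumed paging parameter; confirm it in the API docs.
    query = urllib.parse.urlencode({"per-page": per_page})
    return f"{BASE_URL}?{query}"


def fetch_concepts(per_page=25):
    """Fetch one page of concepts and return the decoded JSON body."""
    url = build_concepts_url(per_page)
    with urllib.request.urlopen(url, timeout=30) as response:
        return json.load(response)
```

Calling `fetch_concepts(per_page=5)` performs a live request; under the assumptions above, the concept records would then be found in the returned dictionary's `results` list.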
The Canonical External ID for OpenAlex concepts is the Wikidata ID, and each of our concepts has one, because all OpenAlex concepts are also Wikidata concepts.
Concepts are hierarchical, like a tree. There are 19 root-level concepts, and six layers of descendants branching out from them, containing about 65 thousand concepts all told. This concept tree is a modified version of the one created by Microsoft Academic Graph (MAG).
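To illustrate the tree structure, the sketch below rebuilds a root-to-leaf path for one concept from its `ancestors` list. It assumes each concept and each ancestor carries a numeric `level` field with 0 at the root and a `display_name` field; the sample record is invented for illustration and is not real OpenAlex data.

```python
def lineage(concept):
    """Return display names ordered from the root-most ancestor to the concept."""
    # Sort ancestors by their assumed "level" field (0 = root-level concept).
    ancestors = sorted(concept.get("ancestors", []), key=lambda a: a["level"])
    return [a["display_name"] for a in ancestors] + [concept["display_name"]]


# Invented sample record; names and levels are placeholders.
sample_concept = {
    "display_name": "Leaf topic",
    "level": 2,
    "ancestors": [
        {"display_name": "Mid topic", "level": 1},
        {"display_name": "Root topic", "level": 0},
    ],
}

path = lineage(sample_concept)
print(" > ".join(path))
```

A concept may have more than one ancestor at the same level, in which case this flat ordering is only an approximation of its position in the tree.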
You can view all the concepts and their position in the tree as a spreadsheet here. About 85% of works are tagged with at least one concept (here's the breakdown of concept counts per work).
Each work is tagged with multiple concepts, based on the title, abstract, and the title of its host venue. The tagging is done using an automated classifier that was trained on MAG’s corpus; you can read more about the development and operation of this classifier in "Automated concept tagging for OpenAlex, an open index of scholarly articles". You can implement the classifier yourself using our models and code.
Each concept on a work carries a score showing the classifier's confidence in choosing that concept. However, when we assign a lower-level child concept, we also assign all of its parent concepts, all the way up to the root. Parents added this way were not necessarily picked by the classifier itself, so some concept assignment scores will be 0.0. The tagger adds concepts to works written in different languages, but it is optimized for English.
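As a rough sketch of how a consumer might handle these scores, the function below separates a work's concepts into those with a positive score and those with a score of 0.0. It assumes each entry in the work's `concepts` list has `display_name` and `score` fields; the sample work and its values are invented for illustration.

```python
def split_by_score(work):
    """Split a work's concepts into (scored, zero_scored) lists."""
    concepts = work.get("concepts", [])
    # Entries with a positive score were given a confidence by the classifier.
    scored = [c for c in concepts if c.get("score", 0.0) > 0.0]
    # Entries at 0.0 include parents added while walking up to the root.
    zero_scored = [c for c in concepts if c.get("score", 0.0) == 0.0]
    return scored, zero_scored


# Invented sample work; names and scores are placeholders.
sample_work = {
    "concepts": [
        {"display_name": "Leaf topic", "score": 0.62},
        {"display_name": "Mid topic", "score": 0.41},
        {"display_name": "Root topic", "score": 0.0},
    ]
}

scored, zero_scored = split_by_score(sample_work)
print([c["display_name"] for c in scored])
print([c["display_name"] for c in zero_scored])
```

Whether to drop zero-scored concepts depends on the use case: they still describe the work's broader subject area, but they say nothing about classifier confidence.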
Concepts are linked to works via the `concepts` property. They’re also linked to authors, sources, and institutions via the `x_concepts` property, and to other concepts via the `ancestors` and `related_concepts` properties.
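Because works carry a `concepts` property, one way to use that link is to ask the API for works tagged with a given concept. The sketch below only builds such a URL. The `/works` endpoint and the `filter=concepts.id:...` syntax are assumptions not stated on this page, `works_by_concept_url` is an illustrative helper name, and `C0000000` is a placeholder rather than a real concept ID.

```python
import urllib.parse

WORKS_URL = "https://api.openalex.org/works"


def works_by_concept_url(concept_id):
    """Build a works query URL filtered by a single concept ID."""
    # The "concepts.id" filter name is assumed; confirm it in the API docs.
    # safe=":" keeps the colon in the filter expression readable.
    query = urllib.parse.urlencode({"filter": f"concepts.id:{concept_id}"}, safe=":")
    return f"{WORKS_URL}?{query}"


# Placeholder ID for illustration only.
example_url = works_by_concept_url("C0000000")
print(example_url)
```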
Learn more about what you can do with concepts: