Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: Update to Categories

...

An Apache Atlas repository may contain many glossaries, particularly when it is part of a bigger enterprise cluster of repositories.  Each glossary may come from a specific team or external organization.  Or it may be focused on a particular topic or set of use cases.  Figure 2 shows how a glossary is defined.

Image Modified

Figure 2: The glossary providing object provides the anchor point for the glossary content

 

The anchor for each glossary is the Glossary object. 

The classifications used associated with the glossary object are used to document the type of vocabulary it contains and its purpose:

  • Taxonomy - This is a glossary where the terms have been organized into a category hierarchy that reflects their meaning or use.
  • Canonical Vocabulary - this glossary provides the standard vocabulary definitions for an organization.  Typically terms from other glossaries are linked to terms from the canonical glossary.
  • ExternallySourcedGlossary - The content from this glossary comes from an external source.  It may be, for example an industry-specific glossary, or from a standards body, or from an open data site, or from a commercial organization.

These classifications are independent of one another so a  Glossary object may have none, one or all of these classifications attached.

 

Category Hierarchies

The vocabulary for the glossary is organized into a hierarchy of categories.  The These categories are effectively provide a folder structure for the glossary.  Some of these categories represent the terms for a subject area and this is identified using the SubjectArea classification.  The categories may follow a formally published category hierarchy.  The LibraryCategoryReference classification is used to document the links to the external category's reference Id.Figure 3 shows the definition for a glossary category.

Image Added

Figure 3: The glossary category and its hierarchy

GlossaryCategory represents a category in a glossary. 

CategoryAnchor links each category to exactly one Glossary object.  This means that this is its home glossary.  If the Glossary object is deleted then so are all of the categories linked to it.

CategoryHierarchyLink is a relationship used to organize categories into a hierarchy to, for example, create a structure for a taxonomy.  A category may have none or one super-categories.  However this super-category may be in a different glossary.

SubjectArea is a classification for a category that indicates that the category represents a subject area.

LibraryCategoryReference provides reference information for how this category corresponds to a category in an external glossary.

 

Image Removed

Figure 3: The category hierarchy

 

 

Terms

The vocabulary for the glossary is documented using terms.  Each term represents a concept of short phrase in the vocabulary.  A Just like a category, a term is owned by a glossary but can be linked into a category from any glossary.  Figure 4 shoes shows the glossary term.

Figure 4: Terms

 

...