Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: Updates to remove complext structs and classifications

...

Details of the new models are shown below.  The original Apache models are linked to on page: Atlas Model.

Existing Base Model

The existing base model appears at this point defining key concepts such as Referenceable, Asset, Infrastructure, Process and DataSet.

Image Added

Figure 2: Existing base model

 

 

Linked Media Types

Linked media types describes the simple structures that are used repeatedly in open metadata to connect it to documents and entities in other types of repositories.

Image RemovedImage Added

Figure

2

3: Linked

Media Types

media for any referenceable entity

 

 

Existing Base Model

The existing base model appears at this point defining key concepts such as Referenceable, Asset, Infrastructure, Process and DataSet.

Property Facets

Property facets allow any entity to be extended with additional properties.  This is particularly useful for storing metadata that originated in another type of metadata repository, or tool since it allows vendor/tool specific values to be stored.

Image Added

Figure 4: Adding custom properties to any referenceable entity

Image Removed

 

 

User Identities

Most metadata repositories are run in a secure mode requiring incoming requests to include the requestor’s security credentials.  Therefore we have an identifier for each unique logged on security identity (aka userId).  The UserIdentity can capture this identity.

 

Image Added

Figure 5: Understanding the actors working on the metadata and data assets

Image Removed

 

 

Locations

It is important to understand where assets are located to ensure they are properly protected and comply with data sovereignty laws.  The open metadata model allows location information to be captured at many levels of granularity.

Image Added

Figure 6: Understanding where data assets and services are located

Image Removed

 

 

Hosts and Platforms

The host and platform metadata entities provide a simple model for the system infrastructure (nodes, computers, etc) that data resources are hosted on.

Image Modified

Figure 7: Defining the platform that the data assets and services run on

 

 

Complex Hosts

The Complex complex hosts handle environments where many nodes are acting together as a cluster, and where virtualized containers (such as Docker) are being used.

Image Modified

Figure 8: Supporting server clusters and server virtualization (server containers)

 

 

Servers

Servers describe the middleware servers (such as application servers, data movement engines and database servers) that run on the Hosts.

Image Modified

 

Figure 9: Servers and their connectivity and capabilities

 

 

 

Cloud Platforms and Services

The cloud platforms and services model show that cloud computing is not so different from what we have been doing before.  Cloud infrastructure and services are classified as such to show that the organization is not completely in control of the technology supporting their data and processes.

 

Image Removed 

 

Security Infrastructure

Security infrastructure needs to be embedded in the servers and hosts that support an organization's data and processing.  These classifications are used to identify components that are supporting different security roles.

Data Stores and Data Sets

The base model introduced the concept of a data set.  The data store definition shows how the data set relates to the server that it is hosted on.  In addition, some data sets are virtual - that is they are build up from calling other data sets.  Figure 10 shows the data stores and virtual data sets linking to the data set.

 

Image Added

Figure 10: Data stores hosting data sets

 

 

Applications and Processes

Applications provide business or management logic.  They are often custom built but may also be brought as a package.  They are deployed onto a server.  Some applications are written to support specific processes.  Figure 11 shows how applications relate to processes and the servers that host them

 

Image Added

Figure 11: Applications and the servers they run on

Image Removed

 

 

Networks and Gateways

The network model for open metadata is very simple, to allow hosts to be grouped into the networks they are connected to.  This can show details such as where hosts are isolated in private networks, where the gateways onto the Internet. 

 

Image Modified

 

Figure 12: The networks that specific hosts connect to

 

 

Authoritative Assets

Assets often describe objects in the real world.  They may have multiple names depending on the types of technology used to access them. As such an asset may have multiple identifiers.  The Authoritative Asset supports the de-duplication and consolidation of asset descriptions.  This is shown in figure 4.

Cloud Platforms and Services

The cloud platforms and services model show that cloud computing is not so different from what we have been doing before.  Cloud infrastructure and services are classified as such to show that the organization is not completely in control of the technology supporting their data and processes.

 

Image Added

Figure 13: Cloud platforms and services

Image Removed

Figure 4: The Authoritative Asset Model

 

 

 

 

...