...
Details of the new models are shown below. The original Apache models are linked to on page: Atlas Model.
Existing Base Model
The existing base model appears at this point defining key concepts such as Referenceable, Asset, Infrastructure, Process and DataSet.
Figure 2: Existing base model |
Linked Media Types
Linked media types describes the simple structures that are used repeatedly in open metadata to connect it to documents and entities in other types of repositories.
Figure 23: Linked Media Typesmedia for any referenceable entity |
Existing Base Model
The existing base model appears at this point defining key concepts such as Referenceable, Asset, Infrastructure, Process and DataSet.
Property Facets
Property facets allow any entity to be extended with additional properties. This is particularly useful for storing metadata that originated in another type of metadata repository, or tool since it allows vendor/tool specific values to be stored.
Figure 4: Adding custom properties to any referenceable entity |
User Identities
Most metadata repositories are run in a secure mode requiring incoming requests to include the requestor’s security credentials. Therefore we have an identifier for each unique logged on security identity (aka userId). The UserIdentity can capture this identity.
Figure 5: Understanding the actors working on the metadata and data assets |
Locations
It is important to understand where assets are located to ensure they are properly protected and comply with data sovereignty laws. The open metadata model allows location information to be captured at many levels of granularity.
Figure 6: Understanding where data assets and services are located |
Hosts and Platforms
The host and platform metadata entities provide a simple model for the system infrastructure (nodes, computers, etc) that data resources are hosted on.
Figure 7: Defining the platform that the data assets and services run on |
Complex Hosts
The Complex complex hosts handle environments where many nodes are acting together as a cluster, and where virtualized containers (such as Docker) are being used.
Figure 8: Supporting server clusters and server virtualization (server containers) |
Servers
Servers describe the middleware servers (such as application servers, data movement engines and database servers) that run on the Hosts.
Figure 9: Servers and their connectivity and capabilities |
Cloud Platforms and Services
The cloud platforms and services model show that cloud computing is not so different from what we have been doing before. Cloud infrastructure and services are classified as such to show that the organization is not completely in control of the technology supporting their data and processes.
Security Infrastructure
Security infrastructure needs to be embedded in the servers and hosts that support an organization's data and processing. These classifications are used to identify components that are supporting different security roles.
Data Stores and Data Sets
The base model introduced the concept of a data set. The data store definition shows how the data set relates to the server that it is hosted on. In addition, some data sets are virtual - that is they are build up from calling other data sets. Figure 10 shows the data stores and virtual data sets linking to the data set.
Figure 10: Data stores hosting data sets |
Applications and Processes
Applications provide business or management logic. They are often custom built but may also be brought as a package. They are deployed onto a server. Some applications are written to support specific processes. Figure 11 shows how applications relate to processes and the servers that host them
Figure 11: Applications and the servers they run on |
Networks and Gateways
The network model for open metadata is very simple, to allow hosts to be grouped into the networks they are connected to. This can show details such as where hosts are isolated in private networks, where the gateways onto the Internet.
Figure 12: The networks that specific hosts connect to |
Authoritative Assets
Assets often describe objects in the real world. They may have multiple names depending on the types of technology used to access them. As such an asset may have multiple identifiers. The Authoritative Asset supports the de-duplication and consolidation of asset descriptions. This is shown in figure 4.
Cloud Platforms and Services
The cloud platforms and services model show that cloud computing is not so different from what we have been doing before. Cloud infrastructure and services are classified as such to show that the organization is not completely in control of the technology supporting their data and processes.
Figure 13: Cloud platforms and services Figure 4: The Authoritative Asset Model |
...