Security Features of Apache Atlas

Overview

The following features are available for enhancing the security of the platform:

  • SSL
  • Service Authentication
  • SPNEGO-based HTTP Authentication

SSL

Both one-way SSL (server authentication) and two-way SSL (server and client authentication) are supported. The following application properties (properties configured in the application.properties file) are available for configuring SSL:

  • atlas.enableTLS (false|true) [default: false] - enable/disable the SSL listener
  • keystore.file - the path to the keystore file leveraged by the server. This file contains the server certificate.
  • truststore.file - the path to the truststore file. This file contains the certificates of other trusted entities (e.g. the certificates for client processes if two-way SSL is enabled). In most instances this can be set to the same value as the keystore.file property (especially if one-way SSL is enabled).
  • client.auth.enabled (false|true) [default: false] - enable/disable client authentication. If enabled, the client will have to authenticate to the server during the transport session key creation process (i.e. two-way SSL is in effect).
  • cert.stores.credential.provider.path - the path to the Credential Provider store file. The passwords for the keystore, truststore, and server certificate are maintained in this secure file. Use the cputil script in the 'bin' directory (see below) to populate this file with the required passwords.
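
As a rough illustration of the properties above, a server-side application.properties fragment for two-way SSL might look like the following. All file paths are hypothetical placeholders, and the "jceks://file/..." form of the credential provider path is the convention used by the Hadoop Credential Provider facility, assumed here to apply to Atlas as well.

```properties
# Sketch only: every path below is a hypothetical placeholder.
# Enable the SSL listener.
atlas.enableTLS=true
# Keystore holding the server certificate.
keystore.file=/path/to/atlas.keystore.jks
# Truststore holding certificates of trusted clients or CAs.
truststore.file=/path/to/atlas.truststore.jks
# Require clients to present a certificate (two-way SSL).
client.auth.enabled=true
# Credential store created with the cputil script
# (Hadoop-style provider URI, assumed).
cert.stores.credential.provider.path=jceks://file/path/to/atlas-creds.jceks
```

For one-way SSL, client.auth.enabled can be left at its default of false.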

Credential Provider Utility Script

To avoid clear-text passwords, the Atlas platform uses the Credential Provider facility for secure password storage (see Hadoop Credential Command Reference for more information about this facility). The cputil script in the 'bin' directory can be used to create the required password store.

To create the credential provider for Atlas:

The credential provider will be generated and saved to the path provided.

Service Authentication

The Atlas platform, upon startup, is associated with an authenticated identity. By default, in an insecure environment, that identity is the same as the OS authenticated user launching the server. However, in a secure cluster leveraging Kerberos, it is considered a best practice to configure a keytab and principal so that the platform can authenticate to the KDC. This allows the service to subsequently interact with other secure cluster services (e.g. HDFS).

The properties for configuring service authentication are:

  • atlas.authentication.method (simple|kerberos) [default: simple] - the authentication method to use. 'simple' uses the OS authenticated identity and is the default mechanism. 'kerberos' indicates that the service is required to authenticate to the KDC using the configured keytab and principal.
  • atlas.authentication.keytab - the path to the keytab file.
  • atlas.authentication.principal - the principal to use for authenticating to the KDC. The principal is generally of the form "user/host@realm". You may use the '_HOST' token for the hostname and the local hostname will be substituted in by the runtime (e.g. "Atlas/_HOST@EXAMPLE.COM").
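
A minimal sketch of these settings in application.properties, assuming a Kerberos-secured cluster, could read as follows. The keytab path is a hypothetical placeholder; the principal reuses the example form given above.

```properties
# Sketch only: the keytab path is a hypothetical placeholder.
# Authenticate the Atlas service to the KDC instead of using the OS identity.
atlas.authentication.method=kerberos
# Keytab containing the credentials for the service principal.
atlas.authentication.keytab=/path/to/atlas.service.keytab
# _HOST is replaced with the local hostname at runtime.
atlas.authentication.principal=Atlas/_HOST@EXAMPLE.COM
```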

SPNEGO-based HTTP Authentication

HTTP access to the Atlas platform can be secured by enabling the platform's SPNEGO support. There are currently two supported authentication mechanisms:

  • simple - authentication is performed via a provided user name
  • kerberos - the KDC authenticated identity of the client is leveraged to authenticate to the server

Kerberos support requires the client accessing the server to first authenticate to the KDC (usually via the 'kinit' command). Once authenticated, the user may access the server; the authenticated identity is conveyed to the server via the SPNEGO negotiation mechanism.

The properties for configuring the SPNEGO support are:

  • atlas.http.authentication.enabled (true|false) [default: false] - a property indicating whether to enable HTTP authentication
  • atlas.http.authentication.type (simple|kerberos) [default: simple] - the authentication type
  • atlas.http.authentication.kerberos.principal - the web-application Kerberos principal name. The Kerberos principal name must start with "HTTP/...". For example: "HTTP/localhost@LOCALHOST". There is no default value.
  • atlas.http.authentication.kerberos.keytab - the path to the keytab file containing the credentials for the Kerberos principal.
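
Put together, a server configured for Kerberos-based HTTP authentication might carry a fragment along these lines in application.properties. The hostname, realm, and keytab path are hypothetical placeholders chosen only for illustration.

```properties
# Sketch only: hostname, realm, and keytab path are hypothetical.
# Turn on HTTP authentication and select the Kerberos (SPNEGO) mechanism.
atlas.http.authentication.enabled=true
atlas.http.authentication.type=kerberos
# The web-application principal must start with "HTTP/".
atlas.http.authentication.kerberos.principal=HTTP/atlas.example.com@EXAMPLE.COM
# Keytab containing the credentials for the principal above.
atlas.http.authentication.kerberos.keytab=/path/to/spnego.service.keytab
```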

For a more detailed discussion of the HTTP authentication mechanism refer to Hadoop Auth, Java HTTP SPNEGO 2.6.0 - Server Side Configuration. The prefix that document references is "atlas.http.authentication" in the case of the Atlas authentication implementation.

Client security configuration

Atlas client code that communicates with an Atlas server configured for SSL transport and/or Kerberos authentication requires a client configuration file containing the security properties needed to communicate with, or authenticate to, the server. Create a client.properties file with the appropriate settings (see below) and place it on the client's classpath or in the directory specified by the "atlas.conf" system property.

The client properties for SSL communication are:

  • atlas.enableTLS (false|true) [default: false] - enable/disable the SSL client communication infrastructure.
  • keystore.file - the path to the keystore file leveraged by the client. This file contains the client certificate and is only required if two-way SSL is enabled at the server.
  • truststore.file - the path to the truststore file. This file contains the certificates of trusted entities (e.g. the certificates for the server or a shared certification authority). This file is required for both one-way and two-way SSL.
  • cert.stores.credential.provider.path - the path to the Credential Provider store file. The passwords for the keystore, truststore, and client certificate are maintained in this secure file.
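
For illustration, a client.properties fragment for a client talking to a server with two-way SSL enabled might look like this. The paths are hypothetical placeholders, and the "jceks://file/..." provider path follows the Hadoop Credential Provider convention, which is assumed rather than stated by this document.

```properties
# Sketch only: every path below is a hypothetical placeholder.
# Enable SSL for client communication.
atlas.enableTLS=true
# Needed only when the server requires client certificates (two-way SSL).
keystore.file=/path/to/client.keystore.jks
# Always needed: certificates of the server or a shared CA.
truststore.file=/path/to/client.truststore.jks
# Credential store holding the keystore, truststore, and certificate passwords
# (Hadoop-style provider URI, assumed).
cert.stores.credential.provider.path=jceks://file/path/to/client-creds.jceks
```

With one-way SSL, the keystore.file line can be dropped and only the truststore is consulted.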

The property required for authenticating to the server (if authentication is enabled) is:

  • atlas.http.authentication.type (simple|kerberos) [default: simple] - the authentication type

If the authentication type specified is 'kerberos', the Kerberos ticket cache is used to authenticate to the server. The client must therefore authenticate to the KDC, using 'kinit' or a similar mechanism, before communicating with the server.
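
As a small sketch, the corresponding line in client.properties for a Kerberos-secured server would be:

```properties
# Use the Kerberos ticket cache (populated beforehand, e.g. via kinit)
# to authenticate to the Atlas server.
atlas.http.authentication.type=kerberos
```

This line can sit alongside the SSL client properties in the same client.properties file when the server uses both SSL and Kerberos.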