Pre-installation

Before installation, there are several considerations to make when configuring your Kubernetes cluster for the Zebrium application. For the Zebrium software to be fully functional, the following requirements must be met. Additional details and examples for each requirement are covered in the topics below:

  • Kubernetes version 1.19 or higher.
  • Helm version 3, which is required to install Zebrium.
  • Kubernetes cluster availability meeting or exceeding the Zebrium sizing specifications. For more information, see Sizing Considerations.
  • The ability to provision block storage. For more information, see Storage Considerations.
  • An ingress controller with HTTPS support for a fully qualified domain name (FQDN). For more information, see Ingress Considerations.
  • Access to the Zebrium registry. For more information, see Helm Chart and Image Repository Access.
  • A Helm override file containing your configuration (a minimal skeleton is sketched after this list).
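
For reference, a minimal skeleton of such an override file is sketched below. It simply gathers the settings discussed in the following topics (hostname, storage classes, and the optional packaged ingress controller); every value shown is a placeholder to be replaced with your own configuration.

# override.yaml -- minimal sketch; see the topics below for each setting.
global:
  ingress:
    # FQDN used to reach the Zebrium application (see Ingress Considerations).
    hostname: 'zebrium.example.com'
zebrium-core:
  storageProvisioners:
    # Storage class configuration (see Storage Considerations).
    vertica:
      enabled: true
    core:
      enabled: true
# Optional packaged ingress controller (see Ingress Controllers).
ingress-nginx:
  enabled: false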

After you have met these requirements, review the Additional Configurations topic. You are then ready to move on to the Installation topic.

Sizing Considerations

Zebrium provides a sizing calculator at https://sizing.zebrium.com/ to assist with determining the vCPU, memory, and storage requirements of the destination Kubernetes cluster. The sizing guide provides several key metrics for different Zebrium ingest volumes, which are detailed below:

  • Minimum Node Size (vCPU). The minimum vCPU count for each node. This is based on the sizing of the Vertica container, which is the largest requester of resources.
  • Minimum Node Size (Memory). The minimum memory for each node, also based on the sizing of the Vertica container.
  • Total vCPUs required. The total number of vCPUs that must be available in the cluster and allocated specifically to the Zebrium deployment.
  • Total Memory required. The total memory that must be available in the cluster and allocated specifically to the Zebrium deployment.
  • Total SSD Storage required. The total amount of disk space that must be provisionable within the cluster and allocated specifically to the Zebrium deployment. While this is specified as a sum across the entire cluster, the storage requirements per node will vary with the containers deployed to it.

The Zebrium sizing calculator also exposes information about the major components that are deployed into the cluster, and their respective configurations are displayed under the Show Advanced Info button. You can use this information with the metrics above to more effectively determine the necessary resource requirements.

Example 1

ABC Corp is looking to install a Zebrium deployment in their local cluster with the intention to ingest up to 50 GB a day. After consulting the sizing guide, they provision a cluster with 2 nodes, each with 12 vCPU and 24 GB of RAM. When installed, the Vertica pod is deployed to one node, requiring 280 GB of storage, while the zapi and core-engine pods are deployed to the second node, requiring 150 GB of storage.

Example 2

CDE Corp is looking to install a Zebrium deployment in their local cluster with the intention to ingest up to 50 GB a day. After consulting the sizing guide, they provision a cluster with 2 nodes: one node has 12 vCPU and 24 GB of RAM, and the other has 6 vCPU and 18 GB of RAM. When they install the Zebrium application, the Vertica and zapi pods are deployed to the first node, requiring 380 GB of storage, 9 vCPUs, and 18 GB of RAM. However, they notice that the core-engine logs container is pending and will not start. This is because, while the second node has 6 vCPU, the system pods deployed to that node are already using 1.1 vCPUs, leaving only 4.9 vCPUs available to core-engine logs, which requests 5 vCPUs.

While it is possible to meet the total resource requirements while disregarding the minimum node sizing, you run the risk of pods being unable to schedule due to a lack of resources. Consult the Advanced Info section of the sizing guide to ensure that all pods will have sufficient headroom to provision, taking into account any other applications or pods deployed on the node.

Storage Considerations

The Zebrium deployment is made up of several StatefulSets, all of which require persistent volumes. Due to the constraints of our current database, Zebrium does not support NFS mounts; all provisioned volumes must be physical or block storage. While the Zebrium deployment provides several ways to define the respective Kubernetes storage classes, it does not configure any external dependencies or permissions that may be needed for provisioning to happen correctly. It is the responsibility of the cluster operator to ensure that all external dependencies are met for the provisioner of choice.

The Zebrium application separates volumes into two different flavors: core and Vertica. This gives operators the flexibility to define different retention and drive configurations for volumes mounted onto our central database (Vertica) versus those used by our workers as buffers and working directories. Zebrium generally recommends configuring the reclaimPolicy for your Vertica storage class to Retain to prevent unintentional data loss if the corresponding StatefulSet or PVC is lost or deleted. This is not required for the core storage classes, as loss of the data stored in these claims is not detrimental to the system.

Our application offers cluster operators two different options for satisfying the storage needs of the Zebrium deployment. Operators can bring their own storage classes, or use our Helm chart to define Zebrium-specific classes. Both options are explored in more detail below.

Bring Your Own Storage Classes (BYOSC)

Cluster operators might choose to use existing storage classes instead of redefining them inside the Zebrium Helm charts. To bring your own storage class, add the following section to your override file. In the example below, we use the StorageClass called gp2 instead of the Zebrium-created classes.

zebrium-core:
  storageProvisioners:
    vertica:
      enabled: false
      customStorageClass: "gp2"
    core:
      enabled: false
      customStorageClass: "gp2"

This configuration disables the creation of the Zebrium storage classes and instead instructs the PVCs to use the gp2 class.
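
If a suitable class does not already exist, its definition is outside the Zebrium chart and depends on your environment. As a hedged sketch only, a gp2-style class on an AWS cluster using the in-tree EBS provisioner might look like the following; your own class, provisioner, and parameters may differ.

apiVersion: storage.k8s.io/v1
kind: StorageClass
metadata:
  name: gp2
# Provisioner shown for illustration; use whichever provisioner backs your cluster.
provisioner: kubernetes.io/aws-ebs
parameters:
  type: gp2
reclaimPolicy: Retain
volumeBindingMode: WaitForFirstConsumer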

Using Zebrium Storage Classes

By default, Zebrium provisions two storage classes to be used by the Zebrium application. Configuration for these storage classes is managed through the storageProvisioners section of the Helm chart. The default options for this setting are shown below:

storageProvisioners:
  vertica:
    enabled: true
    provisioner: kubernetes.io/no-provisioner
    reclaimPolicy: Retain
    parameters: {}
  core:
    enabled: true
    provisioner: kubernetes.io/no-provisioner
    reclaimPolicy: Retain
    parameters: {}

In the example above, there are two separate storage provisioner declarations, vertica and core. Both function in exactly the same way, so we will focus only on core here. There are four configuration options available for the core storage provisioner:

  • enabled. Enables or disables the creation of the storage class within Helm. See BYOSC for an example of disabling this.
  • provisioner. Configures which underlying storage class provisioner is used. We support any provisioner that is available to Kubernetes and has been configured for your system. See the example after this list.
  • reclaimPolicy. Since these volumes are dynamically created, a reclaim policy must be defined for when the resource is deleted. The available options are Retain or Delete. Read more here.
  • parameters. Every provisioner accepts a set of additional parameters that describe the volumes. These are unique to your selected provisioner. Read more here.
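
As a hedged example of these options in an override file, the sketch below assumes the AWS EBS CSI driver (ebs.csi.aws.com) is installed in the cluster; the type parameter is specific to that driver, and your chosen provisioner will accept its own parameters.

zebrium-core:
  storageProvisioners:
    vertica:
      enabled: true
      provisioner: ebs.csi.aws.com   # assumes the EBS CSI driver is installed
      reclaimPolicy: Retain          # recommended for Vertica volumes
      parameters:
        type: gp3                    # parameter specific to the EBS CSI driver
    core:
      enabled: true
      provisioner: ebs.csi.aws.com
      reclaimPolicy: Delete          # acceptable for core volumes
      parameters:
        type: gp3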

Dynamic vs Manual Volume Provisioning

While the Zebrium application dynamically creates persistent volume claims as the necessary pods are scheduled, cluster operators may choose a provisioner that does not support dynamic provisioning. An example of such a provisioner is local storage (the kubernetes.io/no-provisioner default shown above). When using such provisioners, it is the responsibility of the cluster operator to ensure that any needed persistent volumes are created and available to the requesting persistent volume claims. A walkthrough of this can be found here.
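
When using such a provisioner, the operator pre-creates PersistentVolumes sized for the workload. The sketch below shows one such local volume; the volume name, storage class name, node path, and node name are all placeholders, and the 280 GB figure simply mirrors the Vertica requirement from Example 1 above.

apiVersion: v1
kind: PersistentVolume
metadata:
  name: zebrium-vertica-pv-0        # placeholder name
spec:
  capacity:
    storage: 280Gi                  # size from Example 1; use your own sizing
  accessModes:
    - ReadWriteOnce
  persistentVolumeReclaimPolicy: Retain
  storageClassName: zebrium-vertica # placeholder; must match the class the PVC requests
  local:
    path: /mnt/disks/ssd1           # placeholder path on the node
  nodeAffinity:                     # required for local volumes
    required:
      nodeSelectorTerms:
        - matchExpressions:
            - key: kubernetes.io/hostname
              operator: In
              values:
                - node-1            # placeholder node name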

Ingress Considerations

The Zebrium application leverages Kubernetes ingress as its preferred method for exposing its internal services and user interfaces to external consumers. Ingress resources are automatically created for each necessary route, and can be customized through the Helm chart parameters.

Helm Parameter Overrides

The Helm chart provides several levels of configuration for modifying the provisioned ingress resources to tailor them to your desired ingress controller's requirements. Since ingress frequently uses annotations to configure controller-specific options, our Helm chart provides two ways to customize the ingress resources: through a global configuration or through application-level configurations. When the chart is templated, global values override the resource-level configurations when both are set.

Global Overrides

Below are the available global configurations for all ingress resources. For annotations and TLS, the Helm chart combines the values defined at the global and individual levels. For the most up-to-date list of all options, see the values.yaml file of the current Helm chart.

global:
  ingress:
    # -- Ingress Class to use for all objects.
    className:
    # -- Hostname to expose all ingress objects on.
    hostname: 'zebrium.example.com'
    # -- Global Annotations to add to all ingress objects
    annotations: {}
    tls: []

Resource Overrides

Below are the locations of the individual ingress resource settings, allowing you to modify a particular ingress resource instead of all resources. For the most up-to-date list of all options, see the values.yaml file of the current Helm chart.

zebrium-core:
  zapi:
    ingress:
      path: '/api/v2'
      annotations: {}
      tls: []
  report:
    ingress:
      path: '/report/v1'
      annotations: {}
      tls: []
  mwsd:
    ingress:
      path: '/mwsd/v1'
      annotations: {}
      tls: []
zebrium-ui:
  ingress:
    path: '/'
    annotations: {}
    tls: []
zebrium-auth:
  ingress:
    path: '/auth'
    annotations: {}
    tls: []
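
For example, to add a controller-specific annotation to only the zapi ingress resource, an override like the following sketch could be used (the annotation shown is specific to the nginx ingress controller, as in the packaged controller example below; substitute annotations appropriate to your own controller):

zebrium-core:
  zapi:
    ingress:
      annotations:
        # nginx-specific annotation; use annotations for your own controller.
        nginx.ingress.kubernetes.io/proxy-body-size: '0'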

Ingress Controllers

To expose the ingress resources defined by the Zebrium deployment, an ingress controller must be installed and configured by your cluster operator.

Packaged Ingress Controller

We do provide the option to install ingress-nginx as part of the Zebrium chart. If you wish to use the provided ingress-nginx, you can use the following configuration to get started:

ingress-nginx:
  enabled: true
global: 
  ingress: 
    className: nginx
    annotations: 
      nginx.ingress.kubernetes.io/proxy-body-size: '0'

Hostname and DNS Resolution

Currently, the Zebrium deployment requires a DNS hostname that allows access to the ingress endpoints. This endpoint needs to be a fully qualified domain name (FQDN). Cluster operators should also ensure that this FQDN is added as a record to their DNS server and is resolvable from all systems that will access the Zebrium installation. Network access from those systems to the ingress endpoint should also be verified. To set the FQDN in the Zebrium Helm chart, use the following override:

global:
  ingress:
    hostname: ""

TLS

Due to browser security configurations, the Zebrium user interface must be served over HTTPS with a backing TLS certificate. Failure to do so will create a sign-in loop within the user interface, blocking the user from accessing the system.

There are several ways to secure the ingress endpoint with a TLS certificate, including through the ingress resources themselves, through configuration of your ingress controller, using tools like cert-manager, a service mesh, or attaching certificates directly to provisioned resources, like cloud load balancers. It is at the discretion of the cluster operator to determine the best solution for their environment.
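
As one illustration of the first approach, the hedged sketch below attaches a TLS secret through the global ingress values and requests a certificate from cert-manager. It assumes cert-manager is already installed, the cluster-issuer name letsencrypt-prod is an example only, and the tls entries are assumed to follow the standard Kubernetes ingress TLS format.

global:
  ingress:
    hostname: 'zebrium.example.com'
    annotations:
      # Assumes cert-manager is installed; the issuer name is an example only.
      cert-manager.io/cluster-issuer: 'letsencrypt-prod'
    tls:
      # cert-manager will create this secret, or you can reference an existing
      # TLS secret in the release namespace.
      - secretName: 'zebrium-tls'
        hosts:
          - 'zebrium.example.com'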

Helm Chart and Image Repository Access

Zebrium hosts its Helm charts and associated Docker images within its own registry. Zebrium will provide credentials (username/password) to access these resources as part of the on-prem onboarding process. As part of the installation process, you will create a Kubernetes image pull secret.
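
For reference, such a pull secret can be expressed as a manifest like the sketch below; the secret name and registry host are placeholders, the credentials are those supplied during onboarding, and the Installation topic covers the exact steps and naming expected by the chart.

apiVersion: v1
kind: Secret
metadata:
  name: zebrium-registry            # placeholder name
type: kubernetes.io/dockerconfigjson
stringData:
  # Substitute the registry host, username, and password provided by Zebrium.
  # "auth" is the base64 encoding of "<username>:<password>".
  .dockerconfigjson: |
    {
      "auths": {
        "<registry-host>": {
          "username": "<username>",
          "password": "<password>",
          "auth": "<base64 of username:password>"
        }
      }
    }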