Skip to content

Operations Overview

Once the 3-node bare-metal cluster is deployed and running, Day 2 operations are managed through the Palette console. This section covers cluster lifecycle management, monitoring, and maintenance procedures.

Operations Scope

Area Description
Cluster Management Profiles, upgrades, A/B partitions, node lifecycle
Monitoring Prometheus and Grafana monitoring stack

Palette Management Console

All Day 2 operations are performed through the Palette tenant console:

Console URL Purpose
Tenant Console https://10.25.232.155/ Cluster management, profiles, edge hosts
System Console https://10.25.232.155/system System administration, registry, tenants
Local UI https://10.25.232.252:5080 Node-level management (PMA only)

Key Operations

Cluster Profile Updates

Cluster profiles define the full stack deployed on the cluster. Updates to profiles are automatically reconciled to running clusters:

  1. Edit a cluster profile in Palette (e.g., update a pack version)
  2. Palette detects the drift and creates a pending update
  3. Review the diff in the Palette UI
  4. Apply the update -- Palette orchestrates the rollout across nodes

Node Management

  • Add nodes -- Register additional edge hosts and add them to the machine pool
  • Remove nodes -- Cordon, drain, and remove nodes from the cluster
  • Replace nodes -- Swap failed hardware by registering a new node with the same profile

Backup and Recovery

  • Cluster profiles are stored in Palette and can be re-applied to new clusters
  • etcd snapshots are managed by PXKe for cluster state recovery
  • Portworx snapshots provide application-level data protection

Operational Contacts

For escalation during the POC:

Role Contact Availability
Spectro Cloud SE Craig Smith (craig.smith@spectrocloud.com) Primary technical contact
Spectro Cloud SE Kyle Jepson (kyle.jepson@spectrocloud.com) Backup SE
Portworx CNA David Castellani (dcastellani@purestorage.com) Storage architecture
Toyota Infra Ramana Buka On-site infrastructure
Toyota L1/L2 Duane Gardner, Cameron Shawcross ACM on-site support
Toyota Ubuntu Remedios Cardena IVS Ubuntu expert