Operations Overview¶
Once the 3-node bare-metal cluster is deployed and running, Day 2 operations are managed through the Palette console. This section covers cluster lifecycle management, monitoring, and maintenance procedures.
Operations Scope¶
| Area | Description |
|---|---|
| Cluster Management | Profiles, upgrades, A/B partitions, node lifecycle |
| Monitoring | Prometheus and Grafana monitoring stack |
Palette Management Console¶
All Day 2 operations are performed through the Palette tenant console:
| Console | URL | Purpose |
|---|---|---|
| Tenant Console | https://10.25.232.155/ |
Cluster management, profiles, edge hosts |
| System Console | https://10.25.232.155/system |
System administration, registry, tenants |
| Local UI | https://10.25.232.252:5080 |
Node-level management (PMA only) |
Key Operations¶
Cluster Profile Updates¶
Cluster profiles define the full stack deployed on the cluster. Updates to profiles are automatically reconciled to running clusters:
- Edit a cluster profile in Palette (e.g., update a pack version)
- Palette detects the drift and creates a pending update
- Review the diff in the Palette UI
- Apply the update -- Palette orchestrates the rollout across nodes
Node Management¶
- Add nodes -- Register additional edge hosts and add them to the machine pool
- Remove nodes -- Cordon, drain, and remove nodes from the cluster
- Replace nodes -- Swap failed hardware by registering a new node with the same profile
Backup and Recovery¶
- Cluster profiles are stored in Palette and can be re-applied to new clusters
- etcd snapshots are managed by PXKe for cluster state recovery
- Portworx snapshots provide application-level data protection
Operational Contacts¶
For escalation during the POC:
| Role | Contact | Availability |
|---|---|---|
| Spectro Cloud SE | Craig Smith (craig.smith@spectrocloud.com) | Primary technical contact |
| Spectro Cloud SE | Kyle Jepson (kyle.jepson@spectrocloud.com) | Backup SE |
| Portworx CNA | David Castellani (dcastellani@purestorage.com) | Storage architecture |
| Toyota Infra | Ramana Buka | On-site infrastructure |
| Toyota L1/L2 | Duane Gardner, Cameron Shawcross | ACM on-site support |
| Toyota Ubuntu | Remedios Cardena | IVS Ubuntu expert |