An open reference for where we begin investigating HPC environments
This is an open, living reference describing how the Concertim team begins investigating HPC health observations. It's shared so customers and peers can compare our practices against their own — not as a prescription of what anyone must do. Each entry captures an observation, what it can mean for the service, and where we'd start looking into it.
Observations are grouped into topics: Vitals (essential health indicators), Security, and Performance. Use the topic pills to focus on an area; the scope pills and the search box narrow the results further. You can browse the grid or switch to a focused walkthrough with the ≡ button (arrow keys navigate).
Some entries are still being written and are badged Draft — they're shown so the gaps are visible, not hidden.
Prefer a printable copy for offline use? Download the companion PDF ↓