I built slmtop in Rust: an htop-like terminal dashboard for monitoring Slurm clusters in real time
Posted by Alone-Acanthisitta-2@reddit | sysadmin | View on Reddit | 1 comments
I built slmtop**: an htop-like terminal dashboard for Slurm clusters**
If you use Slurm on an HPC cluster, you probably spend a lot of time with squeue, sinfo, scontrol, sacct, and watch.
I wanted a faster, more visual way to monitor jobs and cluster resources, so I built slmtop:
https://github.com/dawnmy/slmtop
slmtop is a Rust-based interactive TUI for real-time Slurm monitoring. It shows jobs, nodes, GPUs/resources, disks, and accounting summaries in one terminal dashboard.
Key features:
- Real-time Slurm job and node monitoring
- htop-like interactive terminal UI
- GPU/resource overview
- Search and filters, e.g.
owner=me state=running gpu=a100 - Sortable tables with keyboard or mouse
- Job detail popup and guarded actions: cancel, hold, release, requeue
- Per-user resource summaries
- Multiple color themes
Example:
```
slmtop
slmtop --user bob
slmtop -T nightowl --refresh-interval 2
```
VA_Network_Nerd@reddit
Sorry, it seems this comment or thread has violated a sub-reddit rule and has been removed by a moderator.
Do Not Conduct Marketing Operations Within This Community.
Your content may be better suited for our companion sub-reddit: /r/SysAdminBlogs
If you wish to appeal this action please don't hesitate to message the moderation team.