apptainer by apptainer

Apptainer: Application containers for Linux

updated at May 19, 2024, 3:12 p.m.

Go

30 +1

945 +19

118 +1

warewulf by warewulf

Warewulf is a stateless and diskless container operating system provisioning system for large clusters of bare metal and/or virtual systems.

updated at May 19, 2024, 1:15 p.m.

Go

21 +0

205 +1

72 +0

dcgm-exporter by NVIDIA

NVIDIA GPU metrics exporter for Prometheus leveraging DCGM

updated at May 19, 2024, 7:59 a.m.

Go

14 +0

675 +7

125 +2

cgroup_exporter by treydock

No description provided.

updated at May 18, 2024, 4:51 p.m.

Go

4 +0

15 +0

4 +0

goslmailer by CLIP-HPC

GoSlurmMailer: a drop-in replacement for the default Slurm MailProg. Delivers Slurm job messages to various destinations.

updated at May 17, 2024, 2:43 p.m.

Go

4 +0

38 +1

6 +0

infiniband_exporter by treydock

No description provided.

updated at May 17, 2024, 9:28 a.m.

Go

4 +0

32 +0

4 +0

kube-batch by kubernetes-retired

A batch scheduler for Kubernetes for high-performance workloads, e.g. AI/ML, big data, HPC

updated at May 9, 2024, 3:01 a.m.

Go

51 +0

1,069 +0

265 +0

lustre_exporter by GSI-HPC

Prometheus exporter for use with the Lustre parallel filesystem

updated at April 29, 2024, 6:23 a.m.

Go

3 +0

15 +0

18 +0

gpfs_exporter by treydock

No description provided.

updated at April 2, 2024, 7:29 a.m.

Go

5 +0

32 +0

12 +0

slurm-exporter by ubccr

Slurm Exporter for Prometheus

updated at March 27, 2024, 12:03 a.m.

Go

5 +0

11 +0

2 +0

cgroups_exporter by phpHavok

A Prometheus exporter for cgroup-level metrics.

updated at October 12, 2023, 2:56 p.m.

Go

2 +0

4 +0

4 +0

prometheus-slurm-exporter by treydock

Prometheus exporter for performance metrics from Slurm.

updated at June 16, 2021, 3:21 p.m.

Go

2 +0

0 +0

0 +0
