kube-batch by kubernetes-retired

A batch scheduler of kubernetes for high performance workload, e.g. AI/ML, BigData, HPC

created at June 30, 2017, 2:08 a.m.

Go

51 +0

1,068 -1

270 +0

GitHub
apptainer by apptainer

Apptainer: Application containers for Linux

created at Nov. 30, 2021, 1:45 p.m.

Go

29 +0

917 +6

117 +1

GitHub
dcgm-exporter by NVIDIA

NVIDIA GPU metrics exporter for Prometheus leveraging DCGM

created at Aug. 11, 2021, 3:40 p.m.

Go

15 +0

658 +7

124 +1

GitHub
warewulf by warewulf

Warewulf is a stateless and diskless container operating system provisioning system for large clusters of bare metal and/or virtual systems.

created at Oct. 28, 2020, 3:31 p.m.

Go

21 +1

203 +2

71 +0

GitHub
goslmailer by CLIP-HPC

GoSlurmMailer - drop in replacement for default slurm MailProg. Delivers slurm job messages to various destinations.

created at May 16, 2022, 11:55 a.m.

Go

4 +0

36 +0

6 +0

GitHub
gpfs_exporter by treydock

None

created at Jan. 18, 2020, 11:11 p.m.

Go

5 +0

32 +0

12 +0

GitHub
infiniband_exporter by treydock

None

created at April 24, 2021, 4:34 p.m.

Go

4 +0

31 +0

4 +0

GitHub
lustre_exporter by GSI-HPC

Prometheus exporter for use with the Lustre parallel filesystem

created at Feb. 9, 2021, 10:45 a.m.

Go

3 +0

15 +0

18 +1

GitHub
cgroup_exporter by treydock

None

created at Feb. 12, 2020, 4 p.m.

Go

4 +0

15 +0

3 +0

GitHub
slurm-exporter by ubccr

Slurm Exporter for Prometheus

created at June 4, 2020, 12:50 p.m.

Go

5 +0

11 +0

2 +0

GitHub
cgroups_exporter by phpHavok

A Prometheus exporter for cgroup-level metrics.

created at April 23, 2021, 3:16 p.m.

Go

2 +0

4 +0

4 +0

GitHub
prometheus-slurm-exporter by treydock

Prometheus exporter for performance metrics from Slurm.

created at Aug. 4, 2020, 8:41 p.m.

Go

2 +0

0 +0

0 +0

GitHub