hpc-team
HPCasCode
Graph
46a46294866a3a5fa486b8e70583221ab324c933
finally corrected the cuda collectd script
Update cuda_collectd.py.j2
Merge branch 'gpu_util' into 'master'
Remove unused fan_speed monitoring, add monitoring of memory and gpu utilization
forgot to include the buddyinfo script for collectd
Merge branch 'cuda_collectd' into 'master'
more meaningful comments in name: task,
ansible_include_errors
Placed comment after include, as newer Ansible versions will include a file but not execute it without such a line
templated the outgoing and ingoing ports on the NAT which runs on the DTN node
nat_role
template out device names for iptables. Use ip route <IP> to determine device names.
change the way a script is called to determine number of GPUs
slurm_gres_probe
Merge branch 'gluster' into 'master'
Added "set quorum ratio" command
changed a role to run on "+ delegate_to: "{{ gluster_servers[0] }}"" so it is consistent within playbook
collectd monitoring changes ... monitor additional things in proc, don't monitor slabinfo as it's noisy, monitor gpus
Merge branch 'master' of gitlab.erc.monash.edu.au:hpc-team/ansible_cluster_in_a_box into improve_nat_role
we found that kernel packages reinstalled default centos repos. We need to remove them at this point (again)
Merge branch 'nagios_slurmctld' into 'master'
nagios script for slurmctld
Merge branch 'prolog' into 'master'
Update main.yml
Remove adding cluster on slurm
Removed the /tmp clean-up until the script is sorted
Removed the /tmp clean-up until the script is sorted
Dropping cache on epilog & prolog
Merge branch 'update_kernel' into 'master'
Update the mofed version
Merge branch 'collectd' into 'master'
add a collectd role that can be deployed to compute and login nodes
Merge branch 'fix_hpcsystems' into 'master'
Merge branch 'fix_gluster' into 'master'
update the hpcsystems role to include the naggy_quota cronjob and update the tools via pip
fix the volcreate to be more sane