Skip to content
GitLab
Explore
Sign in
Primary navigation
Search or go to…
Project
H
HPCasCode
Manage
Activity
Members
Labels
Plan
Issues
Issue boards
Milestones
Wiki
Code
Merge requests
Repository
Branches
Commits
Tags
Repository graph
Compare revisions
Deploy
Releases
Model registry
Operate
Environments
Monitor
Incidents
Analyze
Value stream analytics
Contributor analytics
Repository analytics
Model experiments
Help
Help
Support
GitLab documentation
Compare GitLab plans
Community forum
Contribute to GitLab
Provide feedback
Keyboard shortcuts
?
Snippets
Groups
Projects
Show more breadcrumbs
hpc-team
HPCasCode
Graph
007a2a0a341a600fb1a68946f326c7fc9e4a48f3
Select Git revision
Branches
20
20180923
20181019
20181022
201902
24june
34-nvme-disk-not-mounted
AddKaraageCommitVersion
CRAMSdev
FixCvlSssdConfig
FixSlurmConfigBackup
Oct2018
Sep2019
a16s
add-pmiX-functionality-to-slurm01
add_slurmstats
add_strigger_clusterbuild
admin_packages
allow_private_network_input
ansible_include_errors
autoupdate_update
20 results
You can move around the graph by using the arrow keys.
Begin with the selected commit
Created with Raphaël 2.2.0
7
Aug
1
31
Jul
13
11
10
5
19
Jun
26
May
18
15
11
21
Apr
7
6
30
Mar
27
24
21
20
10
23
Feb
7
1
31
Jan
6
5
4
3
2
21
Dec
20
15
13
12
8
5
28
Nov
25
24
23
22
16
11
10
3
25
Oct
24
21
7
20
Sep
15
13
7
6
16
Aug
11
12
11
5
4
3
2
1
29
Jul
25
22
21
20
13
6
4
24
Jun
22
21
20
19
16
14
13
7
2
31
May
30
27
26
25
18
12
4
3
2
28
Apr
27
26
22
20
18
15
14
5
4
1
30
Mar
24
23
22
21
17
14
10
9
3
1
2
1
29
Feb
23
19
15
11
10
9
4
22
Jan
15
12
17
Dec
15
8
7
4
2
1
26
Nov
25
24
20
19
18
17
11
10
5
4
3
2
30
Oct
28
27
26
20
16
15
14
8
30
Sep
29
25
24
23
22
21
18
17
16
14
13
11
10
9
8
7
4
1
31
Aug
29
28
27
26
25
21
20
19
14
13
12
10
7
5
4
3
31
Jul
30
24
23
22
20
17
Merge branch 'cuda_collectd' into 'master'
more meaningful comments in name: task,
ansible_include…
ansible_include_errors
Placed comment after include, as newer ansible version will include a file but no execute it without such a line
templated the outgoing and ingoing ports on the NAT which runs on the DTN node
nat_role
nat_role
template out device names for iptables. Use ip route <IP> to determine device names.
change the way a script is called to determine number of GPUs
slurm_gres_probe
slurm_gres_probe
Merge branch 'gluster' into 'master'
Added "set quorum ratio" command
changed a role to run on "+ delegate_to: "{{ gluster_servers[0] }}"" so it is consistent within playbook
collectd monitoring changes ... monitor additional things in proc, don't monitor slabinfo as its noisy, monitor gpus
Merge branch 'master' of gitlab.erc.monash.edu.au:hpc-team/ansible_cluster_in_a_box into improve_nat_role
changed a role to run on "+ delegate_to: "{{ gluster_servers[0] }}"" so it is consistent within playbook
Added "set quorum ratio" command
Placed comment after include, as newer ansible version will include a file but no execute it without such a line
more meaningful comments in name: task,
template out device names for iptables. Use ip route <IP> to determine device names.
templated the outgoing and ingoing ports on the NAT which runs on the DTN node
change the way a script is called to determine number of GPUs
we found that kernel packages reinstalled default centos repos. We need to remove them at this point (again)
Merge branch 'nagios_slurmctld' into 'master'
nagios script for slurmctld
Merge branch 'prolog' into 'master'
Update main.yml
Remove adding cluster on slurm
Removed the /tmp clean-up until the script is sorted
Removed the /tmp clean-up until the script is sorted
Dropping cache on epilog & prolog
Merge branch 'update_kernel' into 'master'
Update the mofed version
Merge branch 'collectd' into 'master'
add a collectd role that can be deploed to compute and login nodes
Merge branch 'fix_hpcsystems' into 'master'
Merge branch 'fix_gluster' into 'master'
update the hpcsystems role to include the naggy_quota cronjob and update the tools via pip
fix the volcreate to be more sane
Merge branch 'sqlbackup' into 'master'
Removed id_rsa and .pub . This is moved to clusterbuild files dir for security reasons
Added new role to enable Slurm DB SQL to be backed up and then ssh to a node.
Merge branch 'nvidiaxorg' into 'master'
add a flag to nvidia-xconfig because it is not inputing busids for the m3f class nodes
Loading