- Nov 27, 2017
-
- Nov 24, 2017
-
- Nov 13, 2017
-
- Aug 30, 2017
-
-
Kerri Wait authored
Former-commit-id: 6ef94fd1
-
- Aug 11, 2017
-
-
Chris Hines authored
finally corrected the cuda collectd script See merge request !140 Former-commit-id: b2fbc2b7
-
Chris Hines authored
Former-commit-id: 5a682b06
-
Chris Hines authored
Former-commit-id: 359f414a
-
Lance Wilson authored
Remove unused fan_speed monitoring, add monitoring of memory and gpu utilization See merge request !139 Former-commit-id: 48b9a2f8
-
Chris Hines authored
Former-commit-id: 008241b7
-
- Aug 08, 2017
-
-
Chris Hines authored
Former-commit-id: 35af92cb
-
- Aug 07, 2017
-
-
Chris Hines authored
Gluster See merge request !135 Former-commit-id: 742032ba
-
Former-commit-id: 6d320631
-
Changed a role to run with delegate_to: "{{ gluster_servers[0] }}" so it is consistent within the playbook. Added a ping value and restarted the daemon. This was to try to fix startup bugs we see in Monarch. Former-commit-id: 7ef98227
-
Chris Hines authored
collectd monitoring changes ... monitor additional things in proc, don't monitor slabinfo as it's noisy, monitor GPUs Former-commit-id: 71b1c79d
- Jul 31, 2017
-
-
Chris Hines authored
nagios script for slurmctld See merge request !132 Former-commit-id: a288ef90
-
Chris Hines authored
Former-commit-id: bf7146ab
-
- Jul 13, 2017
-
-
Chris Hines authored
Dropping cache on epilog & prolog See merge request !130 Former-commit-id: 7f2582b8
-
Chris Hines authored
Former-commit-id: 059f0f24
-
- Jul 11, 2017
-
-
Gin Tan (Monash University) authored
Former-commit-id: 811cbc72
-
Gin Tan (Monash University) authored
Former-commit-id: 1cfb95f8
-
Gin Tan (Monash University) authored
Former-commit-id: 92795eb7
-
- Jul 10, 2017
-
-
Gin Tan (Monash University) authored
Former-commit-id: 1474dab1
-
- Jul 05, 2017
-
-
Gin Tan (Monash University) authored
Former-commit-id: 325432a2
- Jun 19, 2017
-
-
Chris Hines authored
add a collectd role that can be deployed to compute and login nodes See merge request !128 Former-commit-id: 44938884
-
Chris Hines authored
Former-commit-id: b89bb713
-
- May 26, 2017
-
-
Chris Hines authored
update the hpcsystems role to include the naggy_quota cronjob and update the tools via pip See merge request !127 Former-commit-id: 8aff3c00
-
Chris Hines authored
fix the volcreate to be more sane See merge request !126 Former-commit-id: 7eb1ea60
-
Chris Hines authored
Former-commit-id: 75d04efa
-
Chris Hines authored
Former-commit-id: 55e2e3c7
-
- May 18, 2017
-
-
Chris Hines authored
Added new role to enable Slurm DB SQL to be backed up and then ssh to a node. See merge request !125 Former-commit-id: b4c0642b
-
- May 15, 2017
-
-
Simon Michnowicz (Monash University) authored
Former-commit-id: 63ee9bab
-
- May 11, 2017
-
-
Simon Michnowicz (Monash University) authored
Location of node, backup dir, and dummy user account are contained in defaults/main.yml. Both SQL nodes and Management node need to have this role applied, with 'server' parameter determining the different Former-commit-id: ca6e0cd5
-
- Apr 21, 2017
-
-
Chris Hines authored
add a flag to nvidia-xconfig because it is not inputting busids for the m3f class nodes See merge request !124 Former-commit-id: d085776b
-
Chris Hines authored
Former-commit-id: 6eedf852
-
- Mar 30, 2017
-
-
Chris Hines authored
Update provision_slurm.py.j2 to import defaultdict to silence this error See merge request !122 Former-commit-id: fc1d70a8
-
- Mar 27, 2017
-
-
Kerri Wait authored
Traceback (most recent call last):
  File "/usr/local/sbin/provision_slurm.py", line 100, in <module>
    mk_slurmuser_batch(usergroup,"default")
  File "/usr/local/sbin/provision_slurm.py", line 67, in mk_slurmuser_batch
    userdict = defaultdict(list)
NameError: global name 'defaultdict' is not defined
Former-commit-id: 0cccb855
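The NameError in this traceback arises because defaultdict(list) is called without defaultdict having been imported from the collections module. The following is a minimal sketch of the fix, assuming only what the traceback shows. The helper name group_users and the sample data are hypothetical and are not taken from provision_slurm.py.

```python
# The import that the traceback shows was missing.
from collections import defaultdict


def group_users(pairs):
    """Group (account, user) pairs into lists keyed by account.

    Hypothetical stand-in for the kind of grouping that
    mk_slurmuser_batch might do; the real function body is not
    shown in the commit log.
    """
    userdict = defaultdict(list)  # missing keys start as an empty list
    for account, user in pairs:
        userdict[account].append(user)
    return dict(userdict)


# Illustrative data only.
grouped = group_users([("default", "alice"), ("default", "bob"), ("gpu", "carol")])
```

Without the import on the first line, the call to defaultdict(list) raises the same NameError reported above.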
-
- Mar 24, 2017
-
-
Chris Hines authored
Fix provision slurm See merge request !120 Former-commit-id: 2ecf42f1
-
- Mar 21, 2017
-
-
Chris Hines authored
changed zz_modulecmd to modulecmd, so the list of custom modules loads properly See merge request !121 Former-commit-id: 1c47f801
-