- Nov 24, 2017
-
-
Jafar Lie authored
-
- Nov 13, 2017
-
-
Jafar Lie authored
-
- Aug 30, 2017
-
-
Kerri Wait authored
-
- Aug 11, 2017
-
-
Chris Hines authored
-
Chris Hines authored
-
Chris Hines authored
-
- Aug 08, 2017
-
-
Chris Hines authored
-
- Aug 07, 2017
-
-
-
Changed a role to run with delegate_to: "{{ gluster_servers[0] }}" so it is consistent within the playbook. Added a ping value and restarted the daemon. This was to try to fix startup bugs we see in Monarch.
-
Chris Hines authored
collectd monitoring changes ... monitor additional things in proc, don't monitor slabinfo as it's noisy, monitor GPUs
-
- Jul 31, 2017
-
-
Chris Hines authored
-
- Jul 13, 2017
-
-
Chris Hines authored
-
- Jul 11, 2017
-
-
Gin Tan (Monash University) authored
-
Gin Tan (Monash University) authored
-
Gin Tan (Monash University) authored
-
- Jul 10, 2017
-
-
Gin Tan (Monash University) authored
-
- Jul 05, 2017
-
-
Gin Tan (Monash University) authored
-
- Jun 19, 2017
-
-
Chris Hines authored
-
- May 26, 2017
-
-
Chris Hines authored
-
Chris Hines authored
-
- May 15, 2017
-
-
Simon Michnowicz (Monash University) authored
-
- May 11, 2017
-
-
Simon Michnowicz (Monash University) authored
Location of node, backup dir, and dummy user account are contained in defaults/main.yml. Both SQL nodes and the Management node need to have this role applied, with the 'server' parameter determining the different
-
- Apr 21, 2017
-
-
Chris Hines authored
-
- Mar 27, 2017
-
-
Kerri Wait authored
Traceback (most recent call last):
  File "/usr/local/sbin/provision_slurm.py", line 100, in <module>
    mk_slurmuser_batch(usergroup,"default")
  File "/usr/local/sbin/provision_slurm.py", line 67, in mk_slurmuser_batch
    userdict = defaultdict(list)
NameError: global name 'defaultdict' is not defined
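The NameError above means `defaultdict` was used without being imported from `collections`. A minimal sketch of the pattern with the import in place follows. The helper name `group_users` and the shape of its input are assumptions for illustration only, not the contents of provision_slurm.py.

```python
from collections import defaultdict  # the name the traceback reports as undefined


def group_users(pairs):
    """Group (account, user) pairs into {account: [users]} (hypothetical helper)."""
    userdict = defaultdict(list)  # missing keys start out as an empty list
    for account, user in pairs:
        userdict[account].append(user)
    return dict(userdict)
```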
-
- Mar 21, 2017
-
-
Jafar Lie authored
-
- Mar 20, 2017
-
-
Kerri Wait authored
-
Kerri Wait authored
-
Kerri Wait authored
Update provision_slurm.py.j2 to fix logic in mk_slurmuser_batch. Previously, no users were added if any one user in the list was already a member of 'default'.
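The behaviour this message describes can be sketched as filtering out users who are already members and adding the rest, rather than abandoning the whole batch. The function name `users_to_add` and its arguments are hypothetical. The real mk_slurmuser_batch is not shown in this log and may differ.

```python
def users_to_add(candidates, existing_members):
    """Return the candidates that are not yet members, preserving order.

    Hypothetical helper: one existing member no longer blocks the others.
    """
    existing = set(existing_members)
    return [user for user in candidates if user not in existing]
```

With this shape, a batch containing one existing member still yields the remaining users to add.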
-
- Mar 10, 2017
-
-
Gin Tan (Monash University) authored
-
- Feb 23, 2017
-
-
Chris Hines authored
Template xorg.conf when running on a node with 1 GPU (this template works for M3 K1 nodes; will need to do something smarter for other clusters with GPUs)
-
Chris Hines authored
-
Chris Hines authored
-
Chris Hines authored
-
Chris Hines authored
-
Chris Hines authored
-
Chris Hines authored
-
- Feb 07, 2017
-
-
Chris Hines authored
-
- Feb 01, 2017
-
-
Chris Hines authored
-
Chris Hines authored
-
- Jan 06, 2017
-
-
Chris Hines authored
The GPU role used to just detect whether any driver was installed and skip installation if one was found. It will now execute nvidia-smi to determine the current driver version and compare it against the desired driver version, installing in the case of a mismatch. Note that if an existing driver is installed, the nvidia persistence daemon must be stopped before installation can proceed.
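The version check described above could look roughly like the following sketch. The role itself is an Ansible role and its actual tasks are not shown here, so the function names and the exact nvidia-smi query used are assumptions. The `run` parameter exists only so the check can be exercised without a GPU.

```python
import subprocess


def installed_driver_version(run=subprocess.run):
    """Return the driver version reported by nvidia-smi, or None if unavailable."""
    try:
        result = run(
            ["nvidia-smi", "--query-gpu=driver_version", "--format=csv,noheader"],
            capture_output=True,
            text=True,
            check=True,
        )
    except (OSError, subprocess.CalledProcessError):
        # nvidia-smi is missing or failed: treat as no usable driver
        return None
    lines = result.stdout.strip().splitlines()
    # One line is printed per GPU; all GPUs share the same driver version
    return lines[0].strip() if lines else None


def needs_install(installed, desired):
    """Install when no driver is present or the versions differ."""
    return installed != desired
```

When `needs_install` is true and a driver is already present, the persistence daemon would have to be stopped first, as the commit message notes.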
-