- Jul 05, 2017
-
-
Gin Tan (Monash University) authored
-
- Jun 19, 2017
-
-
Chris Hines authored
add a collectd role that can be deployed to compute and login nodes See merge request !128
-
Chris Hines authored
-
- May 26, 2017
-
-
Chris Hines authored
update the hpcsystems role to include the naggy_quota cronjob and update the tools via pip See merge request !127
-
Chris Hines authored
fix the volcreate to be more sane See merge request !126
-
Chris Hines authored
-
Chris Hines authored
-
- May 18, 2017
-
-
Chris Hines authored
Added new role to enable Slurm DB SQL to be backed up and then ssh to a node. See merge request !125
-
- May 15, 2017
-
-
Simon Michnowicz (Monash University) authored
-
- May 11, 2017
-
-
Simon Michnowicz (Monash University) authored
The location of the node, backup dir, and dummy user account are contained in defaults/main.yml. Both the SQL nodes and the Management node need to have this role applied, with the 'server' parameter determining the different
-
- Apr 21, 2017
-
-
Chris Hines authored
add a flag to nvidia-xconfig because it is not inputting busids for the m3f class nodes See merge request !124
-
Chris Hines authored
-
- Mar 30, 2017
-
-
Chris Hines authored
Update provision_slurm.py.j2 to import defaultdict to silence this error See merge request !122
-
- Mar 27, 2017
-
-
Kerri Wait authored
Traceback (most recent call last):
  File "/usr/local/sbin/provision_slurm.py", line 100, in <module>
    mk_slurmuser_batch(usergroup,"default")
  File "/usr/local/sbin/provision_slurm.py", line 67, in mk_slurmuser_batch
    userdict = defaultdict(list)
NameError: global name 'defaultdict' is not defined
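The traceback above is what Python reports when `defaultdict` is used without being imported from `collections`. A minimal sketch of the fix follows; `group_users_by_account` and the sample user and account names are hypothetical and only illustrate the pattern, they are not the real `mk_slurmuser_batch`.

```python
# The import whose absence produces the NameError in the traceback above.
from collections import defaultdict


def group_users_by_account(pairs):
    """Group (user, account) pairs into {account: [users]}.

    Hypothetical helper for illustration only; the real
    mk_slurmuser_batch in provision_slurm.py is not shown in this log.
    """
    userdict = defaultdict(list)  # missing keys start as an empty list
    for user, account in pairs:
        userdict[account].append(user)
    return dict(userdict)


if __name__ == "__main__":
    # Sample data is made up for the example.
    sample = [("alice", "default"), ("bob", "proj1"), ("carol", "default")]
    print(group_users_by_account(sample))
```

Without the import line, the first call to `defaultdict(list)` raises `NameError`, which matches the failure at line 67 in the traceback.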
-
- Mar 24, 2017
-
-
Chris Hines authored
Fix provision slurm See merge request !120
-
- Mar 21, 2017
-
-
Chris Hines authored
changed zz_modulecmd to modulecmd, so the list of custom modules loads properly See merge request !121
-
Jafar Lie authored
-
- Mar 20, 2017
-
-
Kerri Wait authored
-
Kerri Wait authored
-
Kerri Wait authored
Update provision_slurm.py.j2 to fix logic in mk_slurmuser_batch. This previously did not add any users if one user in the list was a member of 'default'.
-
- Mar 10, 2017
-
-
Gin Tan (Monash University) authored
-
- Feb 23, 2017
-
-
Chris Hines authored
template xorg.conf when running on a node with 1 GPU (this template works for M3 … See merge request !119
-
Chris Hines authored
template xorg.conf when running on a node with 1 GPU (this template works for M3 K1 nodes, will need to do something smarter for other clusters with GPUs)
-
Chris Hines authored
Ansible check See merge request !118
-
Chris Hines authored
-
Chris Hines authored
-
Chris Hines authored
-
Chris Hines authored
-
Chris Hines authored
-
Chris Hines authored
-
- Feb 07, 2017
-
-
Chris Hines authored
missing pip from the list of dependencies See merge request !117
-
Chris Hines authored
-
- Feb 01, 2017
-
-
Chris Hines authored
start ntpd by default See merge request !116
-
Chris Hines authored
executable bit should not be set on systemd service files See merge request !115
-
Chris Hines authored
-
Chris Hines authored
-
- Jan 06, 2017
-
-
Gin Tan authored
Extend the GPU role. The GPU role used to just detect whether any driver was installed and skip installation if one was found. It will now execute nvidia-smi to determine the current driver version and compare it against the desired driver version, installing in the case of a mismatch. Note that if an existing driver is installed, the nvidia persistence daemon must be stopped before installation can proceed. See merge request !114
-
Chris Hines authored
The GPU role used to just detect whether any driver was installed and skip installation if one was found. It will now execute nvidia-smi to determine the current driver version and compare it against the desired driver version, installing in the case of a mismatch. Note that if an existing driver is installed, the nvidia persistence daemon must be stopped before installation can proceed.
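The version check described above can be sketched in Python as follows. This is an illustration under stated assumptions, not the role's actual Ansible tasks: the function names are hypothetical, the version strings in the usage note are made-up examples, and the sketch assumes `nvidia-smi` is queried with `--query-gpu=driver_version --format=csv,noheader`, which prints one version line per GPU.

```python
import subprocess


def parse_driver_version(smi_output):
    """Return the first non-empty line of the nvidia-smi query output, or None."""
    for line in smi_output.splitlines():
        line = line.strip()
        if line:
            return line
    return None


def needs_install(installed, desired):
    """Install when no driver is detected or the versions do not match."""
    return installed is None or installed != desired


def installed_driver_version():
    """Ask nvidia-smi for the driver version; None if it is missing or fails."""
    try:
        out = subprocess.run(
            ["nvidia-smi", "--query-gpu=driver_version", "--format=csv,noheader"],
            capture_output=True, text=True, check=True,
        ).stdout
    except (OSError, subprocess.CalledProcessError):
        # No nvidia-smi binary, or the driver is not loaded.
        return None
    return parse_driver_version(out)
```

For example, `needs_install(installed_driver_version(), "375.26")` would be true on a node with no driver or with a different version. Stopping the persistence daemon before reinstalling, as the commit message notes, is left out of this sketch.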
-
- Jan 05, 2017
-
-
Gin Tan authored
Add /usr/local/bin to the end of the default path. This was initially removed as it resulted in programs in /etc/profile.d searching /usr/local/bin, which, if the NFS server had hung, resulted in ec2-user not being able to log in. Subsequently verified that it seems safe to add it to the end of the path, obviously subject to not having anything called from profile that exists in /usr/local. See merge request !113