HPCasCode issueshttps://gitlab.erc.monash.edu.au/hpc-team/HPCasCode/-/issues2022-09-24T09:26:06+10:00https://gitlab.erc.monash.edu.au/hpc-team/HPCasCode/-/issues/37cgroup_allowed_devices.conf does not exist and should not2022-09-24T09:26:06+10:00Andreas Hamachercgroup_allowed_devices.conf does not exist and should not[USERNAME@hi00 conf]$ cat cgroup.conf
CgroupAutomount=yes
ConstrainDevices=yes
ConstrainCores=yes
ConstrainRAMSpace=yes
ConstrainKmemSpace=no`
AllowedDevicesFile=/opt/slurm-22.05.3/etc/cgroup_allowed_devices.conf
/opt/slurm-22.05.3/etc/...[USERNAME@hi00 conf]$ cat cgroup.conf
CgroupAutomount=yes
ConstrainDevices=yes
ConstrainCores=yes
ConstrainRAMSpace=yes
ConstrainKmemSpace=no`
AllowedDevicesFile=/opt/slurm-22.05.3/etc/cgroup_allowed_devices.conf
/opt/slurm-22.05.3/etc/cgroup_allowed_devices.conf does not existAndreas HamacherAndreas Hamacherhttps://gitlab.erc.monash.edu.au/hpc-team/HPCasCode/-/issues/36SSSD cache sometimes unreliable and needing a restart2022-09-20T12:19:13+10:00Andreas HamacherSSSD cache sometimes unreliable and needing a restart@chines If you find the time maybe have a look, if not, thats ok.
Cryospark died when writing this.
a monarch-mgmt died as well quite recently@chines If you find the time maybe have a look, if not, thats ok.
Cryospark died when writing this.
a monarch-mgmt died as well quite recentlyhttps://gitlab.erc.monash.edu.au/hpc-team/HPCasCode/-/issues/34NVME disk not mounted2021-10-28T10:16:51+11:00Chris HinesNVME disk not mountedWhen we have NVME disks in VMs we attempt to use them as spank private tmpdir BUT there is an assumption that they are mounted on /mnt/nvme which is not true for all images/flavours that include nvmeWhen we have NVME disks in VMs we attempt to use them as spank private tmpdir BUT there is an assumption that they are mounted on /mnt/nvme which is not true for all images/flavours that include nvmehttps://gitlab.erc.monash.edu.au/hpc-team/HPCasCode/-/issues/28enable kernel version pinning for ubuntu2020-06-11T11:07:26+10:00Andreas Hamacherenable kernel version pinning for ubuntu/roles/upgrade/tasks/main.yml has been rewritten for yum modules but not for apt/roles/upgrade/tasks/main.yml has been rewritten for yum modules but not for aptmerge back cvl@uwa and have it as a valid third cluster on the pipelinehttps://gitlab.erc.monash.edu.au/hpc-team/HPCasCode/-/issues/6Make sure ldap servers have larger size counts2020-02-21T11:28:26+11:00Chris HinesMake sure ldap servers have larger size countshttps://confluence.atlassian.com/crowdkb/openldap-only-synchronizes-500-user-942838720.htmlhttps://confluence.atlassian.com/crowdkb/openldap-only-synchronizes-500-user-942838720.htmlhttps://gitlab.erc.monash.edu.au/hpc-team/HPCasCode/-/issues/5pam_slurm_adopt should install on version mismatch2019-11-25T17:26:48+11:00Chris Hinespam_slurm_adopt should install on version mismatchhttps://gitlab.erc.monash.edu.au/hpc-team/HPCasCode/-/issues/3the pid file paths given in the systemd unit files for slurm do not match the...2019-12-11T12:51:58+11:00Chris Hinesthe pid file paths given in the systemd unit files for slurm do not match the paths given in the slurm.conf and slurmdbd.confThis is hard to resolve since the *.conf files are stored in the cluster specific repo, but the templates for the services are in this repo. Perhaps as part of installing the service files we can cat the conf files and store the valueThis is hard to resolve since the *.conf files are stored in the cluster specific repo, but the templates for the services are in this repo. Perhaps as part of installing the service files we can cat the conf files and store the valuehttps://gitlab.erc.monash.edu.au/hpc-team/HPCasCode/-/issues/2role duplication ?2020-01-28T18:37:08+11:00Andreas Hamacherrole duplication ?At least this template exists twice:
git diff ./roles/slurm-common/templates/job_submit.lua.j2 ./roles/slurm_config/templates/job_submit.lua.j2
I reckon the underlying problem is that one of the roles slurm-common and slurm_config is "old"At least this template exists twice:
git diff ./roles/slurm-common/templates/job_submit.lua.j2 ./roles/slurm_config/templates/job_submit.lua.j2
I reckon the underlying problem is that one of the roles slurm-common and slurm_config is "old"Gin TanGin Tan