Skip to content

GitLab

  • Menu
Projects Groups Snippets
    • Loading...
  • Help
    • Help
    • Support
    • Community forum
    • Submit feedback
    • Contribute to GitLab
  • Sign in
  • H HPCasCode
  • Project information
    • Project information
    • Activity
    • Labels
    • Members
  • Repository
    • Repository
    • Files
    • Commits
    • Branches
    • Tags
    • Contributors
    • Graph
    • Compare
  • Issues 8
    • Issues 8
    • List
    • Boards
    • Service Desk
    • Milestones
  • Merge requests 2
    • Merge requests 2
  • Deployments
    • Deployments
    • Environments
    • Releases
  • Monitor
    • Monitor
    • Incidents
  • Analytics
    • Analytics
    • Value stream
    • Repository
  • Wiki
    • Wiki
  • Activity
  • Graph
  • Create a new issue
  • Commits
  • Issue Boards
Collapse sidebar
  • hpc-team
  • HPCasCode
  • Issues

  • Open 8
  • Closed 26
  • All 34
New issue
  • Priority Created date Updated date Milestone due date Due date Popularity Label priority Manual Title
  • SSSD cache sometimes unreliable and needing a restart
    #36 · created Dec 06, 2021 by Andreas Hamacher
    • 1
    updated Dec 06, 2021
  • role slurmdb-config requires commited password
    #35 · created Nov 10, 2021 by Andreas Hamacher
    • 0
    updated Nov 10, 2021
  • NVME disk not mounted
    #34 · created Oct 28, 2021 by Chris Hines
    • 1
    • 2
    updated Oct 28, 2021
  • edgecases for COLD-drain-jobs and auto-resumes
    #33 · created Jul 24, 2020 by Andreas Hamacher
    • CLOSED
    • 0
    updated Sep 03, 2020
  • hypervisor maintenance
    #31 · created Jul 08, 2020 by Andreas Hamacher   Automatic rolling node updates   Future Releases
    • CLOSED
    • 0
    updated Sep 03, 2020
  • humans do not work at night
    #30 · created Jun 12, 2020 by Andreas Hamacher   Automatic rolling node updates   Phase1
    • CLOSED
    • 0
    updated Sep 29, 2021
  • run ansible --check and output inconsistencies for consumption by autoupdate
    #29 · created Jun 11, 2020 by Andreas Hamacher   Automatic rolling node updates   Doing Phase1
    • CLOSED
    • 1
    updated Jun 12, 2020
  • enable kernel version pinning for ubuntu
    #28 · created Jun 11, 2020 by Andreas Hamacher   merge back cvl@uwa and have it as a valid third cluster on the pipeline
    • 0
    updated Jun 11, 2020
  • automated hypervisor_upgrade while rolling upgrades
    #27 · created Jun 05, 2020 by Andreas Hamacher   Automatic rolling node updates   Future Releases
    • CLOSED
    • 1
    updated Sep 03, 2020
  • Readme improvements
    #25 · created May 17, 2020 by Andreas Hamacher
    • CLOSED
    • 0
    updated May 20, 2020
  • (to be discussed) Develop a heuristic to maximize uptame and minimize user impact
    #24 · created May 17, 2020 by Andreas Hamacher   Automatic rolling node updates   Phase 3
    • CLOSED
    • 1
    updated Sep 03, 2020
  • Documentation
    #23 · created May 17, 2020 by Andreas Hamacher   Automatic rolling node updates   Phase 3
    • CLOSED
    • 0
    updated Sep 29, 2021
  • think about max-retry time e.g. stop trying after 5 failures
    #22 · created May 17, 2020 by Andreas Hamacher   Automatic rolling node updates   Phase 2
    • CLOSED
    • 0
    updated Sep 29, 2021
  • a mechanism to pause the whole thing during maintenance or while bugfixing a playbook
    #21 · created May 17, 2020 by Andreas Hamacher   Automatic rolling node updates   Phase1
    • CLOSED
    • 0
    updated Sep 29, 2021
  • a mechanism to update the repository and playbook
    #20 · created May 17, 2020 by Andreas Hamacher   Automatic rolling node updates   Phase 2
    • CLOSED
    • 0
    updated Sep 29, 2021
  • a mechanism to online update the nodelist to be able to add nodes to the mechanism
    #19 · created May 17, 2020 by Andreas Hamacher   Automatic rolling node updates   Phase 2
    • CLOSED
    • 0
    updated Sep 29, 2021
  • Canary testing: new commit( remember sha) , test canary nodelist first, only when that one passes then role out largely
    #18 · created May 17, 2020 by Andreas Hamacher   Automatic rolling node updates   Phase1
    • CLOSED
    • 0
    updated Sep 29, 2021
  • Host blacklist, whitelist and canarilist ( crash if a node is blacklisted AND ( white or canary)) just as an extra safety mechanism for e.g. sql and mgmtnodes
    #17 · created May 17, 2020 by Andreas Hamacher   Automatic rolling node updates   Phase1
    • CLOSED
    • 1
    updated Sep 29, 2021
  • hook up a change detector, capable of returning ERROR, GOOD, HotChange-nodelist or DrainChange-nodelist
    #16 · created May 17, 2020 by Andreas Hamacher   Automatic rolling node updates   Phase1
    • CLOSED
    • 0
    updated Jun 16, 2020
  • use ansible vault
    #15 · created May 17, 2020 by Andreas Hamacher   Automatic rolling node updates   Phase1
    • CLOSED
    • 2
    updated Sep 29, 2021
  • Prev
  • 1
  • 2
  • Next