Patching Windows VMs with GCP’s VM Manager

Overview

In this blog post I share some of my experience using GCP's VM Manager to patch one of our customer's Windows VMs.

Introduction

Whilst I am a huge fan of short lived, immutable VMs with system state turned over to managed services like Cloud SQL, sometimes this simply isn’t practical or possible.

In these situations we are often left with long running, stateful instances that require the same sort of maintenance as ‘traditional’ infrastructure. But how do we manage these without the pain and ancillary infrastructure this often requires?

Recently I have been working with a client that has a stateful .NET application running on top of Windows Server 2016 GCE instances. The client tasked me with determining a patching strategy for these VMs to ensure stability and, more importantly, security.

Now I have plenty of experience patching Windows servers from previous jobs and have seen it done both well and badly. Some particularly bad examples that stick out to me are:

  • Logging in once a month to spend half a day patching a PCI CDE manually
  • Having business critical, clustered services with different nodes running at entirely different patch levels (and not just for a short testing window!)
  • Requiring nearly a fortnight of application testing in non-production before roll-out could be authorized for production.

Over the course of my career I have had the pleasure (or misfortune) to have worked with both WSUS and SCCM. These tools, however, can be complex and temperamental, and in the case of SCCM require deep pockets.

Generally I have found the best patching strategies to follow these ideas:

  • Automated management with little human interaction; people get busy, and patching is too often the task that gets kicked down the road to the next month.
  • Regular patching to ensure the baseline is not far behind the latest releases. In my experience, having too much drift can cause support problems with vendors for whom the default response is “ensure you are running the latest patches”.
  • Deploying to non-production environments before production. I like to think this goes without saying but in the (hopefully unlikely) event a patch causes issues it should be caught in non-prod first.

Anyway, back to my client’s request. In April Google launched their OS patch management service to the masses. This is a tool I was curious about but hadn’t had a requirement to implement at the time. This, though, felt like the right opportunity to test it.

Prerequisites and Setup

Initially I wanted to see what the tool could show me with the OS Inventory Management functionality before committing to patching. As I did this in Terraform, here is my code:

resource "google_compute_project_metadata_item" "guest_attributes" {
  key   = "enable-guest-attributes"
  value = "TRUE"
}

resource "google_compute_project_metadata_item" "osconfig" {
  key   = "enable-osconfig"
  value = "TRUE"
}

resource "google_project_service" "osconfig" {
  service            = "osconfig.googleapis.com"
  disable_on_destroy = false
}

The above is really the result of the setup requirements found in the Google documentation, but in summary it requires a couple of metadata values to be set and the OS Config API to be enabled.
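If you only want to enrol a subset of VMs rather than the whole project, the same metadata keys can instead be set at the instance level (instance metadata takes precedence over project metadata). A sketch, where the instance name, machine type and network are placeholder assumptions:

```hcl
# Hypothetical instance with OS Config enabled per instance
# rather than via project-wide metadata
resource "google_compute_instance" "win_app" {
  name         = "win-app-1"
  machine_type = "n1-standard-2"
  zone         = "europe-west2-a"

  boot_disk {
    initialize_params {
      # Google-maintained Windows Server 2016 image family
      image = "windows-cloud/windows-server-2016-dc"
    }
  }

  network_interface {
    network = "default"
  }

  # Same keys as the project-wide metadata above
  metadata = {
    enable-osconfig         = "TRUE"
    enable-guest-attributes = "TRUE"
  }
}
```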

The next step is to ensure that the OS you want to monitor has the required agent. Fortunately, as the instances used a recent Google-baked 2016 image, I could skip this step. If, however, you aren't so fortunate, installation instructions are here.

At this point I also want to state that Automatic Windows Updates had been disabled previously by setting a registry key in the startup script using the PowerShell below.

# Disable Automatic Updates so patching is controlled by VM Manager
Set-ItemProperty -Path HKLM:\SOFTWARE\Policies\Microsoft\Windows\WindowsUpdate\AU -Name AUOptions -Value 1


Compliance Graphs

Like most ops guys I do like a pretty graph (and could probably stare at Grafana dashboards for hours!) and in this regard Google doesn’t disappoint with clear reporting on a pie chart, even broken down by OS.

patch management dashboard 

For the eagle-eyed among you, the three VMs reporting back no data are actually GKE nodes running Google’s Container-Optimized OS, which Google is responsible for maintaining.

By selecting view details and then selecting a specific VM, I can get a breakdown of available patches, including their categories, KB numbers and when they were published.

VM instance details

Turning Insight into Action

So, satisfied with the insight I now had into the VMs, I was keen to try out the functionality to apply patches. This is done by creating a Google OS Patch Deployment, which can be done in the Console or with Terraform.

resource "google_os_config_patch_deployment" "win-patch" {
  patch_deployment_id = "win-patch"

  instance_filter {
    group_labels {
      labels = {
        win-patch = "true"
      }
    }
    zones = ["europe-west2-a", "europe-west2-b", "europe-west2-c"]
  }

  patch_config {
    reboot_config = "DEFAULT"
    windows_update {
      classifications = ["CRITICAL", "SECURITY", "DEFINITION"]
    }
  }

  duration = "3600s"

  recurring_schedule {
    time_zone {
      id = "Europe/London"
    }
    time_of_day {
      hours = 2
    }
    weekly {
      day_of_week = var.win_patch_day
    }
  }

  rollout {
    mode = "ZONE_BY_ZONE"
    disruption_budget {
      fixed = 5
    }
  }
}

The Terraform documentation for this resource is quite an interesting read. There are, for example, options available for pre- and post-patching scripts, which may be very useful in environments where automated testing exists.
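As a sketch of what those hooks look like on Windows, the patch_config block accepts optional pre_step and post_step blocks; the script paths below are hypothetical:

```hcl
patch_config {
  reboot_config = "DEFAULT"

  # Hypothetical local scripts; both steps are optional
  pre_step {
    windows_exec_step_config {
      interpreter = "POWERSHELL"
      local_path  = "C:\\patching\\pre-patch.ps1"
    }
  }

  post_step {
    windows_exec_step_config {
      interpreter = "POWERSHELL"
      local_path  = "C:\\patching\\post-patch.ps1"
    }
  }
}
```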

To break down the code above, it:

  • Targets VMs with the label win-patch = “true” that exist in any of the three europe-west2 zones.
  • Sets the reboot config to default, which in Windows terms means only rebooting if required.
  • Schedules a weekly patch run at 2AM on a day of the week I have variablised (to allow me to set different days for non-prod and prod environments).
  • Defines a rollout plan which allows up to 5 VMs to be disrupted simultaneously. (In theory you can put a percentage in this field instead, but I had difficulty doing so.)
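For reference, the variable behind var.win_patch_day can be declared along these lines; the default shown here is an assumption, and the provider expects upper-case day names:

```hcl
variable "win_patch_day" {
  type        = string
  description = "Day of week for the weekly Windows patch run"
  default     = "THURSDAY" # e.g. MONDAY, SATURDAY, ...
}
```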

As another quick note, the above resource is new and was only added in provider version 3.30.0. I was required to update to a newer provider as we were a couple of versions behind.
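If you need to pin or bump the provider yourself, a minimal constraint looks like this (Terraform 0.13+ syntax assumed):

```hcl
terraform {
  required_providers {
    google = {
      source  = "hashicorp/google"
      version = ">= 3.30.0" # google_os_config_patch_deployment added in 3.30.0
    }
  }
}
```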

Reviewing Patch Jobs

When a patch job is executed, its progress can be watched in real time or, more likely (with patching done in the early hours), reviewed the following morning within VM Manager.

Reviewing Patch Jobs

It is also possible to drill even further into the logs on specific machines to see how the job progressed. As you can see this job also required a reboot which the machine executed automatically.

Operations Log

Concluding Thoughts

Simply put, I am a fan. This solution has enabled me to deploy Windows patches in an automated, reasonably (though not completely) controlled way at minimal to no cost. However, I did have to make some compromises and assumptions:

  • Microsoft typically releases patches on the 2nd Tuesday of the month, in what has affectionately been coined “Patch Tuesday”. I have therefore configured my schedule so that non-production environments get patched on Thursday, with production the following Monday. In theory this means non-production will always be patched first, unless Microsoft releases patches outside their usual routine.
  • I am also trusting the Windows update source configured on my machines to be ‘safe’. As it is left as the default, this should be Microsoft (or potentially Google). Google themselves state that for absolute control they recommend deploying a WSUS server as part of the OS patch management solution. I felt that in this environment this would be overkill and would require additional unwelcome management.

Finally, I hope this post has given some food for thought and at least presented another method to help maintain long-lived GCP VMs.
