SweetOps #kops for June, 2019

We are looking at upgrading from Kops 1.11 to 1.12. The upgrade instructions mention that it is a disruptive upgrade without going into details of how much. Is there anyone who has gone through it and can share their experience? cc @Jeremy G (Cloud Posse)

Erik Osterman (Cloud Posse)

07:38:46 PM

@btai @rohit.verma @Jan

btai

07:38:50 PM

@btai has joined the channel

rohit.verma

07:38:50 PM

@rohit.verma has joined the channel

btai

07:39:45 PM

unfortunately still on 1.11.9 for our kops clusters

btai

07:42:53 PM

also im not sure if i will run into this issue even when i do upgrade to 1.12 as my clusters are ephemeral (i would spin up a new 1.12 cluster and deploy/cutover to it)

btai

07:44:19 PM

Technically there is no usable upgrade path from etcd2 to etcd3 that supports HA scenarios, but kops has enabled it using etcd-manager. Nonetheless, this remains a higher-risk upgrade than most other kubernetes upgrades - you are strongly recommended to plan accordingly: back up critical data, schedule the upgrade during a maintenance window, think about how you could recover onto a new cluster, try it on non-production clusters first.

btai

07:46:46 PM

it almost sounds to me that spinning up a new cluster is prob the safest way forward, but im trying to imagine the way you guys are terraforming the cluster/env might make it hard to do that type of blue/green cutover?

Daren

08:41:24 PM

blue/green is difficult for us right now. We are already on etcd 3, but no TLS or etcd-manager. We also use calico

Erik Osterman (Cloud Posse)

08:42:07 PM

Can you use route53 to route traffic to both cluster?

Erik Osterman (Cloud Posse)

08:42:15 PM

that would give you a fall back plan

Erik Osterman (Cloud Posse)

08:42:32 PM

or use external CDN (e.g. cloudflare) with multiple origins

Daren

08:42:39 PM

We still use VPC peering to bridge kops and our backend vpc

Erik Osterman (Cloud Posse)

08:42:55 PM

oh, but you can peer the VPC to both k8s clusters

Erik Osterman (Cloud Posse)

08:43:06 PM

so you create a new kops vpc

Daren

08:43:14 PM

Yes, I said “difficult” not impossible

Erik Osterman (Cloud Posse)

08:43:21 PM

haha, true

Erik Osterman (Cloud Posse)

08:43:40 PM

though this could be a good capability to support

Erik Osterman (Cloud Posse)

08:43:45 PM

even for future upgrades

Daren

08:43:59 PM

yes

Erik Osterman (Cloud Posse)

08:44:09 PM

at the rate k8s moves, this won’t be the last breaking change

btai

09:48:35 PM

thats exactly what we do

btai

09:49:17 PM

kops cluster in its own vpc, peered to our database vpc, new cluster comes up will also vpc peer into db.

Daren

10:46:28 PM

How are you provisioning the kops vpc peering connection?

btai

11:42:46 PM

i would suggest using terraform. cloudposse has an example thats pretty good

Andriy Knysh (Cloud Posse)

01:34:17 AM

some examples of VPC peering https://github.com/cloudposse/terraform-root-modules/tree/master/aws/kops-legacy-account-vpc-peering

cloudposse/terraform-root-modules

Example Terraform service catalog of “root module” blueprints for provisioning reference architectures - cloudposse/terraform-root-modules

Andriy Knysh (Cloud Posse)

01:34:48 AM

https://github.com/cloudposse/terraform-root-modules/blob/master/aws/vpc-peering/main.tf

cloudposse/terraform-root-modules

Example Terraform service catalog of “root module” blueprints for provisioning reference architectures - cloudposse/terraform-root-modules

btai

09:50:25 PM

allows for quick route53 cutover/ you can also do a weighted cutover via route 53 and you have a pretty fast rollback strategy (point route53 back to old cluster)

Jeremy G (Cloud Posse)

04:28:44 AM

Discussion of some of the options for upgrading etcd (none great): https://gravitational.com/blog/kubernetes-and-offline-etcd-upgrades/

The Horrors of Upgrading Etcd Beneath Kubernetes

Proud new Kubernetes cluster owners are often lulled into a false sense of operational confidence by its consensus database’s glorious simplicity. In this Q&A, we dig into the challenges of in-place upgrades of etcd beneath autonomous Kubernetes clusters running within air-gapped environments.

Jeremy G (Cloud Posse)

04:31:30 AM

Step-by-step instructions for upgrading kops cluster by replacing it. Probably best for 1.11 to 1.12 upgrade. (I’ve never tried it. I have not had to upgrade a cluster from 1.11 to 1.12 yet.) https://www.bluematador.com/blog/upgrading-your-aws-kubernetes-cluster-by-replacing-it

Upgrading Your AWS Kubernetes Cluster By Replacing It attachment image

How to use kops to quickly spin up a production-ready Kubernetes cluster to replace your old cluster in AWS.

2019-06-20

pericdaniel

02:25:59 PM

Are people using kops for GCP or Azure? or are people using Kubespray for more multi platform

btai

10:02:15 PM

last i checked (late 2018) kops didnt support azure, but AKS has been plenty good.

Erik Osterman (Cloud Posse)

10:02:39 PM

I wouldn’t use kops even for GCP

btai

10:02:40 PM

however azure postgres (and azure as a whole) have not had great uptime imo

Erik Osterman (Cloud Posse)

10:03:19 PM

I feel like the best option is to use the best tool for the platform. using any kind of generalized tool will likely not give you all the extra jazz provided by the platform.

btai

10:03:29 PM

AKS has been chugging along though but if you have other dependencies within azure

Erik Osterman (Cloud Posse)

10:03:35 PM

e.g. on google, I’d prefer to operate GKE over GCP+Kubernetes

btai

10:03:59 PM

same, AKS is great cause it feels to me fully managed (as opposed to EKS which is highly configurable and even the generic case takes more effort to spin up)