SweetOps #kubernetes for November, 2018

Archive: https://archive.sweetops.com/kubernetes/

2018-11-04

rohit.verma

hi all, what are your opinions on the different networking options for kubernetes on aws? Which is preferred and has felt most robust? We tried aws-vpc-cni but felt that it’s not stable enough, even with 1.1.0 on kubernetes 1.10.6. It gets even less stable when the worker nodes themselves are unstable, and it started throwing exceptions such as the sandbox IP having changed, etc.

rohit.verma

08:44:08 AM

we then switched to calico, but we’ve observed that it seems to affect the way pods terminate. If we delete a deployment, pods remain in the terminating state for 5+ minutes.

Erik Osterman (Cloud Posse)

04:13:36 PM

Pods stuck in a terminating state are a very frequently observed problem. Could it be related to the network layer? Maybe, but I would explore other possibilities first. To me the network culprit seems like a red herring.

Erik Osterman (Cloud Posse)

04:14:15 PM

Lots of posts/issues on it. Usually it’s related to zombie processes.

2018-11-06

Andriy Knysh (Cloud Posse)

02:23:05 PM

@rohit.verma we saw something like that with some k8s pods, in particular kiam: when deleted, the pods take many minutes to terminate

Andriy Knysh (Cloud Posse)

02:23:35 PM

so maybe it’s an issue with some deployments, not the network itself?

rohit.verma

03:48:42 PM

But the pods I am referring to here are generic, like nginx or a spring boot app

rohit.verma

03:49:28 PM

Anyway, I’m more concerned about a general opinion on the different kubernetes networking options

Erik Osterman (Cloud Posse)

04:04:26 PM

We haven’t had the opportunity to explore/optimize the network layer in k8s

Erik Osterman (Cloud Posse)

04:05:35 PM

Also are you familiar with the dumb-init “fix” ?

Erik Osterman (Cloud Posse)

04:05:55 PM

This is to address the same symptoms

Erik Osterman (Cloud Posse)

04:07:01 PM

https://github.com/Yelp/dumb-init/blob/master/README.md

Yelp/dumb-init

A minimal init system for Linux containers. Contribute to Yelp/dumb-init development by creating an account on GitHub.
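
For readers unfamiliar with the idea behind dumb-init: the application is not run as PID 1 directly. A tiny init process starts it, relays termination signals to it, and reaps exited children so zombies do not pile up. The sketch below is only an illustration of that idea in Python; it is not dumb-init's implementation, and the function name `run_as_init` is made up for this example.

```python
import os
import signal


def run_as_init(argv):
    """Run argv as a child, forward signals to it, and reap exited children."""
    child_pid = os.fork()
    if child_pid == 0:
        # Child: replace this process with the real application.
        try:
            os.execvp(argv[0], argv)
        finally:
            os._exit(127)  # only reached if exec failed

    def forward(signum, _frame):
        # Relay the signal so the application can shut down cleanly.
        try:
            os.kill(child_pid, signum)
        except ProcessLookupError:
            pass  # child already exited

    previous = {
        sig: signal.signal(sig, forward)
        for sig in (signal.SIGTERM, signal.SIGINT)
    }
    exit_code = 0
    try:
        while True:
            try:
                # -1 waits for any child, so re-parented orphans are reaped too.
                pid, status = os.waitpid(-1, 0)
            except ChildProcessError:
                break  # no children left
            if pid == child_pid:
                if os.WIFSIGNALED(status):
                    exit_code = 128 + os.WTERMSIG(status)
                else:
                    exit_code = os.WEXITSTATUS(status)
                break  # the main application is gone, so we are done
    finally:
        # Put the original handlers back.
        for sig, handler in previous.items():
            signal.signal(sig, handler if handler is not None else signal.SIG_DFL)
    return exit_code
```

In a container this kind of wrapper would be the entrypoint, with the real command passed as `argv`. In practice you would use dumb-init itself (or a similar tool) rather than a script like this.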

onzyone

04:07:42 PM

@Andriy Knysh (Cloud Posse) hello again, do you have any docs and best practices for promoting kube within nonp, i.e. dev to staging?

Erik Osterman (Cloud Posse)

07:19:35 PM

for clarification, are you talking about promoting images and helm charts? or promoting usage of kubernetes within a company

onzyone

03:57:41 PM

right now we are using different namespaces in k8s

onzyone

03:58:25 PM

currently it is within company

Erik Osterman (Cloud Posse)

04:12:58 PM

So the same cluster for staging and production?

onzyone

04:27:09 PM

dev and staging

onzyone

02:08:21 PM

bump

Erik Osterman (Cloud Posse)

09:50:28 PM

sorry, i let this fall through the cracks.

Erik Osterman (Cloud Posse)

09:50:57 PM

we don’t have a well documented process for what you want. we’ve implemented and documented it internally for customers, but still need to document it on our site.

Erik Osterman (Cloud Posse)

09:51:01 PM

we have something rough here: https://docs.cloudposse.com/release-engineering/cicd-process/

Erik Osterman (Cloud Posse)

09:51:21 PM

also, looks like the video was taken down =/

onzyone

02:56:54 PM

nice

onzyone

02:57:24 PM

this is the same thing that I had in mind …

onzyone

02:59:01 PM

so what is your view on databases with persistent volumes?

Erik Osterman (Cloud Posse)

03:25:57 PM

Use fully managed databases for anything you care about

Erik Osterman (Cloud Posse)

03:26:12 PM

Use database containers for disposable environments

Erik Osterman (Cloud Posse)

03:26:44 PM

So for example, when we deploy environments for every PR we use containers

onzyone

06:34:52 PM

what are your thoughts on some of the work that Kelsey Hightower has done in this space? https://github.com/kelseyhightower/pipeline

kelseyhightower/pipeline

A step by step guide on creating build and deployment pipelines for Kubernetes. - kelseyhightower/pipeline

Erik Osterman (Cloud Posse)

07:30:16 PM

haven’t taken a look at it

Erik Osterman (Cloud Posse)

04:15:07 PM

Nonp?

onzyone

04:15:47 PM

ya we run two accounts … where one is prod and one is non-prod

onzyone

04:16:01 PM

and all our non-prod stuff happens in nonp

Erik Osterman (Cloud Posse)

04:16:22 PM

Aha gotcha

Erik Osterman (Cloud Posse)

04:17:10 PM

We don’t have the promotion process documented but I can share how it looks (we use Codefresh)

Erik Osterman (Cloud Posse)

04:17:28 PM

I am currently on my phone so will share a little later

onzyone

04:17:49 PM

np sounds good

Tee

10:09:30 PM

using kops or terraform for creating a production kubernetes cluster: which is better, and what are the cons?

Andriy Knysh (Cloud Posse)

10:12:30 PM

@Tee we use terraform to create kops resources, e.g.

Andriy Knysh (Cloud Posse)

10:12:31 PM

https://github.com/cloudposse/terraform-root-modules/blob/master/aws/kops/main.tf

cloudposse/terraform-root-modules

Collection of Terraform root module invocations for provisioning reference architectures - cloudposse/terraform-root-modules

Andriy Knysh (Cloud Posse)

10:12:46 PM

https://github.com/cloudposse/terraform-root-modules/tree/master/aws/kops-aws-platform

Andriy Knysh (Cloud Posse)

10:13:01 PM

and then use kops to provision k8s clusters

Erik Osterman (Cloud Posse)

10:13:23 PM

there was some discussion earlier in #terraform I think related to EKS

Andriy Knysh (Cloud Posse)

10:13:26 PM

we also have TF modules for EKS

Erik Osterman (Cloud Posse)

10:13:47 PM

@Tee are you thinking GCP or AWS?

Tee

10:13:52 PM

AWS

Erik Osterman (Cloud Posse)

10:14:30 PM

so on AWS, my opinion is that it’s more work than necessary to manage EKS with terraform. the challenge comes down to upgrading. there are some discussions on strategies for that.

Andriy Knysh (Cloud Posse)

10:14:52 PM

https://sweetops.slack.com/archives/CB6GHNLG0/p1541085179253600

@Andriy Knysh (Cloud Posse) are ya’ll using it with kops? Looks like it. How does TF generation fit in if at all?

Erik Osterman (Cloud Posse)

10:15:28 PM

with kops, the ability to do rolling updates is built in; a purpose-built tool like kops will do a better job at managing lifecycles.

Erik Osterman (Cloud Posse)

10:15:48 PM

if fargate announces EKS support at the end of the month, I might change my stance

Tee

10:16:12 PM

But EKS and Fargate get pretty expensive

Tee

10:16:19 PM

as far as I can tell

Erik Osterman (Cloud Posse)

10:16:35 PM

humans aren’t cheap either

Tee

10:16:51 PM

Right

Tee

10:18:03 PM

So what do you suggest for the long term, not considering the cost? With fewer bottlenecks and nightmares

Andriy Knysh (Cloud Posse)

10:19:01 PM

not an easy question

Tee

10:19:13 PM

I mean in terms of stability

Andriy Knysh (Cloud Posse)

10:19:49 PM

kops is well established and works well, and does lifecycle management

Andriy Knysh (Cloud Posse)

10:20:28 PM

EKS is new and lacks a lot of features, but it will stay and they will improve it

Andriy Knysh (Cloud Posse)

10:20:45 PM

Fargate will improve and cost will be reduced

Erik Osterman (Cloud Posse)

10:21:06 PM

(we’re not using EKS in production yet, so our story will be biased towards kops)

Tee

10:22:05 PM

Oh ok. Thanks @Erik Osterman (Cloud Posse) & @Andriy Knysh (Cloud Posse) for your suggestions.

Andriy Knysh (Cloud Posse)

10:23:06 PM

yea, the point is that with the current state of EKS, you need to do more work and provision even more resources than with kops

Andriy Knysh (Cloud Posse)

10:23:28 PM

and it does not support many features

Andriy Knysh (Cloud Posse)

10:24:01 PM

Fargate could improve it, but as many mentioned it’s costly (and it does not exist yet)

Tee

10:27:21 PM

That makes sense

Matthew

10:27:37 PM

I am currently moving all of our infrastructure off Mesosphere DC/OS onto EKS and EKS has been phenomenal in my opinion - just lots of support from many different aspects such as AWS and the Kubernetes community

Matthew

10:27:55 PM

as well as great folks like Cloud Posse

Andriy Knysh (Cloud Posse)

10:28:18 PM

yea thanks @Matthew

Andriy Knysh (Cloud Posse)

10:28:57 PM

the point is that with EKS, if for example you need to perform a rolling update, it’s not supported out of the gate

Andriy Knysh (Cloud Posse)

10:29:07 PM

so a lot of friction with many things

Andriy Knysh (Cloud Posse)

10:29:26 PM

with kops it just works

Andriy Knysh (Cloud Posse)

10:30:19 PM

but sure for longterm EKS/Fargate would be better

Matthew

10:30:21 PM

Yeah i’ve talked with an EKS specialist from AWS and they currently suggest a blue/green strategy for upgrading, which can be tedious and at times break backwards compatibility

btai

11:51:48 PM

how do you export a single context of your kubeconfig?

btai

11:52:40 PM

say my local kubeconfig has dev, qa and prod contexts

Andriy Knysh (Cloud Posse)

12:30:51 AM

@btai we don’t have multiple contexts. We use containers + ENV vars pattern (implemented in geodesic + repo per env + Dockerfile(s)). So in each container (prod, staging, dev, etc), when we run it, we have all ENV vars defined for that particular env (ENV vars come from Dockerfiles or from SSM if they are secrets). That includes everything for Terraform, kops, k8s, etc.

Andriy Knysh (Cloud Posse)

12:31:47 AM

So when we do for example kops export kubecfg, the environment knows what context we want

Andriy Knysh (Cloud Posse)

12:36:21 AM

and we can run those geodesic containers locally and also in CI/CD pipelines (for which we use Codefresh since it can run each pipeline step as a Docker container)

btai

12:58:33 AM

nice thanks
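
On the original question of exporting a single context: with kubectl, `kubectl config view --minify --flatten --context=<name>` prints a self-contained config for one context. The same selection can be sketched in Python on a kubeconfig that has already been parsed into a dict. The `SAMPLE` data and the function name below are made up for illustration and simply mirror the dev/qa/prod example.

```python
import json


def extract_context(kubeconfig, context_name):
    """Return a new kubeconfig dict holding one context plus what it references."""
    context = next(
        (c for c in kubeconfig.get("contexts", []) if c["name"] == context_name),
        None,
    )
    if context is None:
        raise KeyError(f"context {context_name!r} not found")
    cluster_name = context["context"]["cluster"]
    user_name = context["context"]["user"]
    return {
        "apiVersion": "v1",
        "kind": "Config",
        "current-context": context_name,
        "contexts": [context],
        "clusters": [
            c for c in kubeconfig.get("clusters", []) if c["name"] == cluster_name
        ],
        "users": [u for u in kubeconfig.get("users", []) if u["name"] == user_name],
    }


# Hypothetical three-context config (servers and tokens are placeholders).
ENVS = ("dev", "qa", "prod")
SAMPLE = {
    "apiVersion": "v1",
    "kind": "Config",
    "current-context": "dev",
    "clusters": [
        {"name": n, "cluster": {"server": f"https://{n}.example.invalid"}}
        for n in ENVS
    ],
    "users": [{"name": f"{n}-admin", "user": {"token": "REDACTED"}} for n in ENVS],
    "contexts": [
        {"name": n, "context": {"cluster": n, "user": f"{n}-admin"}} for n in ENVS
    ],
}

print(json.dumps(extract_context(SAMPLE, "qa"), indent=2))
```

The output is JSON, which kubectl generally accepts as a kubeconfig because JSON is a subset of YAML. Real configs may also reference certificate files by path, which is what the `--flatten` flag handles and this sketch does not.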

rms1000watt

06:10:07 AM

Have you guys used Codefresh enterprise? I know you’re all big into codefresh here. Just curious of any pitfalls or bits of advice you guys have

rms1000watt

06:10:20 AM

(enterprise to run on-prem)

Erik Osterman (Cloud Posse)

06:10:58 AM

So Codefresh enterprise has 3 variations: full SaaS, hybrid and on-prem

Erik Osterman (Cloud Posse)

06:11:35 AM

we’ve been working exclusively with the enterprise SaaS

rms1000watt

06:11:42 AM

Ooo

Erik Osterman (Cloud Posse)

06:11:43 AM

what’s the primary driver for going on-prem?

rms1000watt

06:12:09 AM

compliance requiring no dependence on external SaaS providers

Erik Osterman (Cloud Posse)

06:12:23 AM

which compliance certification?

rms1000watt

06:12:52 AM

oh wow, you haven’t even taken me out on a date yet to be asking such risqué questions.

rms1000watt

06:13:02 AM

lol jk, I think fedramp

Erik Osterman (Cloud Posse)

06:13:04 AM

lol

Erik Osterman (Cloud Posse)

06:13:13 AM

ok - that’s a whole ’nother cup of tea

Erik Osterman (Cloud Posse)

06:13:16 AM

not familiar with it

Erik Osterman (Cloud Posse)

06:13:51 AM

but sounds like you’d need full on-prem.

Erik Osterman (Cloud Posse)

06:14:02 AM

so i probably couldn’t tell you more than you already know

rms1000watt

06:14:30 AM

No worries. We’re new to codefresh, so just probing for any gotchas really

Erik Osterman (Cloud Posse)

06:14:40 AM

@dustinvb can definitely elaborate

Erik Osterman (Cloud Posse)

06:14:54 AM

are you using the helm based install?

rms1000watt

06:15:38 AM

I think at the moment, yes

rms1000watt

06:16:00 AM

debating about the release of terraform 0.12 and using all the templating stuff

rms1000watt

06:16:16 AM

rather than 2 templating engines.. tiller.. and all that jazz

Erik Osterman (Cloud Posse)

06:17:21 AM

so i get where you’re coming from - but from what i’ve gleaned the current helm provider is too basic to handle all kinds of helm charts. maybe with 0.12 it’s better off

Erik Osterman (Cloud Posse)

06:17:38 AM

you’ve seen our helmfiles repo? basically you can’t do half of what we do with helmfile using that provider

rms1000watt

06:17:58 AM

https://marketplace.fedramp.gov/#/product/aiware-government?sort=productName

Just to cover my ass that I’m not providing any confidential information.. it’s publicly available that we’re on fedramp ^^^ lol

rms1000watt

06:18:35 AM

Oh, my bad. I didn’t mean the helm provider.. I meant generating the k8s.yml files on the fly based on the infra-state.. no helm installation anywhere

rms1000watt

06:19:12 AM

just a thought at the moment - not necessarily going that direction for sure

Erik Osterman (Cloud Posse)

06:19:55 AM

but yea, you could basically create terraform modules in place of helm charts

Erik Osterman (Cloud Posse)

06:20:04 AM

… if terraform templating is sufficient

rms1000watt

06:20:22 AM

hehe, yeah, big “if”

Erik Osterman (Cloud Posse)

06:20:30 AM

it’s been my experience, the “simple” case always works well regardless of the technology

rms1000watt

06:24:35 AM

how have you guys been liking helm? any complaints with the tiller stuff, or are you guys experienced enough with it all that nothing really bugs you?

Erik Osterman (Cloud Posse)

06:24:47 AM

i mean, it sucks about the tiller and all

Erik Osterman (Cloud Posse)

06:24:55 AM

but i look at helm more like an interface

Erik Osterman (Cloud Posse)

06:25:22 AM

and the interface won’t change dramatically, but the underlying implementation is getting a big overhaul as you’re probably aware

Erik Osterman (Cloud Posse)

06:25:33 AM

as part of that tiller is going away and the template engine going pluggable

rms1000watt

06:25:41 AM

“tillerless helm” is the buzz

rms1000watt

06:25:42 AM

yea

Erik Osterman (Cloud Posse)

06:25:57 AM

as a way to manage complex apps it’s great

Erik Osterman (Cloud Posse)

06:26:01 AM

and app dependencies

Erik Osterman (Cloud Posse)

06:27:02 AM

i say (and with some humility) that those before us have invested a lot of time in what it takes to manage software releases

Erik Osterman (Cloud Posse)

06:27:11 AM

deb, rpm, apk, etc.

Erik Osterman (Cloud Posse)

06:27:54 AM

we tried to avoid that with just a Makefile; it worked well until it didn’t. in the end, we needed all that a package manager provides and conceded to building .apk alpine packages

Erik Osterman (Cloud Posse)

06:28:29 AM

my point is that just templatizing raw kubernetes resources and applying them seems easy enough and i’m sure you can get away with it for a long time

Erik Osterman (Cloud Posse)

06:30:29 AM

but then you realize you want to have dependencies, triggers on deployment or uninstall, and rollbacks, etc. then you’re on your own.

Erik Osterman (Cloud Posse)

06:31:05 AM

the more homegrown/spun, the more the solution diverges from the trajectory the community is taking

Erik Osterman (Cloud Posse)

06:31:16 AM

because the community is solving problems around a standardized toolset

rms1000watt

06:31:25 AM

all true

rms1000watt

06:31:33 AM

so i’m curious.. you bring up rollbacks

rms1000watt

06:31:52 AM

codefresh/spinnaker’s solutions didn’t offer enough in that aspect?

Erik Osterman (Cloud Posse)

06:32:14 AM

codefresh relies on the fact that helm does rollbacks automatically

Erik Osterman (Cloud Posse)

06:32:42 AM

and even bakes that into the UI with one-click rollbacks

Erik Osterman (Cloud Posse)

06:33:11 AM

they also have some even more cool stuff in the works - but you’ll have to ask them to see it

rms1000watt

06:33:26 AM

For sure

rms1000watt

06:33:34 AM

we have meetings setup with them

rms1000watt

06:33:40 AM

We’ll probe

Erik Osterman (Cloud Posse)

06:33:56 AM

very cool! hit me afterwards and let me know how it goes

rms1000watt

06:34:40 AM

does all this reveal a well needed niche (product offering) in the CI/CD process for k8s?

rms1000watt

06:34:52 AM

since there always ends up being handrolled stuff?

Erik Osterman (Cloud Posse)

06:35:17 AM

haha, not sure - there are more CI/CD platforms today than ever

rms1000watt

06:35:21 AM

https://github.com/gaia-pipeline/gaia I like their philosophy in particular

gaia-pipeline/gaia

Build powerful pipelines in any programming language. - gaia-pipeline/gaia

rms1000watt

06:35:21 AM

yea

Erik Osterman (Cloud Posse)

06:35:23 AM

i can’t keep them straight anymore

Erik Osterman (Cloud Posse)

06:35:41 AM

spinnaker is now coming out with an enterprise offering too

rms1000watt

06:35:49 AM

haha nice. Well, after the bloodbath, hopefully the best solution reigns supreme

rms1000watt

06:36:25 AM

halyard was surprising when I first played with it

Erik Osterman (Cloud Posse)

06:36:27 AM

and then github actions

rms1000watt

06:36:42 AM

then I looked at the helm chart for spinnaker.. and it was just a bunch of hal commands

rms1000watt

06:36:44 AM

yea

Erik Osterman (Cloud Posse)

06:37:15 AM

but i agree that there’s still big room for improvement

Erik Osterman (Cloud Posse)

06:37:28 AM

the fact there is so much handrolling and independent tooling

Erik Osterman (Cloud Posse)

06:38:40 AM

I think codefresh is well poised to do that as it relates to cicd+kubernetes+helm

rms1000watt

06:39:13 AM

does your gut think helm isn’t going anywhere?

Erik Osterman (Cloud Posse)

06:42:14 AM

until I see an alternative that has anywhere near the critical mass of helm, yes - i think it’s here for the foreseeable future

Erik Osterman (Cloud Posse)

06:42:43 AM

for example, there’s ksonnet (based on jsonnet) which looks interesting

Erik Osterman (Cloud Posse)

06:43:20 AM

but i think some variation of that could be used as a pluggable engine for helm

Erik Osterman (Cloud Posse)

06:43:43 AM

also, i don’t want to see proliferation of more packaging systems right now - it’s too early

Erik Osterman (Cloud Posse)

06:45:35 AM

https://github.com/rimusz/helm-tiller

rimusz/helm-tiller

Helm tiller plugin aka Tillerless Helm. Contribute to rimusz/helm-tiller development by creating an account on GitHub.

Erik Osterman (Cloud Posse)

06:45:39 AM

have you seen this plugin?

Erik Osterman (Cloud Posse)

06:45:51 AM

this is pretty smart.

rms1000watt

06:46:07 AM

I thiiiink I’ve seen this one.. if not it was something similar

Erik Osterman (Cloud Posse)

06:46:08 AM

basically, it’s a drop in replacement. it still stores all configs in the cluster (per namespace if you want)

Erik Osterman (Cloud Posse)

06:46:17 AM

you run a temporary tiller locally

Erik Osterman (Cloud Posse)

06:46:29 AM

this can be run as part of CI

rms1000watt

06:47:18 AM

interesting.. hmm.. nice actually!

Erik Osterman (Cloud Posse)

06:47:38 AM

(though would break the codefresh helm UI, since it would need to talk to the tiller and there would be none running)

rms1000watt

06:47:56 AM

ah, right

2018-11-07

Erik Osterman (Cloud Posse)

07:35:49 AM

https://github.com/technosophos/helm-ksonnet/blob/master/README.md

technosophos/helm-ksonnet

Experimental ksonnet plugin for Helm. Contribute to technosophos/helm-ksonnet development by creating an account on GitHub.

Erik Osterman (Cloud Posse)

07:35:52 AM

Dig it

Erik Osterman (Cloud Posse)

07:45:10 AM

https://github.com/helm/helm/issues/2577#issuecomment-339238305

Proposal: Jsonnet template integration · Issue #2577 · helm/helm

In order to provide jsonnet rendering for helm charts a new ReleaseModule similar to the Rudder ReleaseModule should be developed. This module would take charts and render them as Jsonnet templates…

Erik Osterman (Cloud Posse)

07:45:39 AM

Guess my hopes of seeing ksonnet as a template engine in helm were misguided

Erik Osterman (Cloud Posse)

07:47:25 AM

I know Lua is coming. I’d heard such great things about jsonnet, that I assumed it would be well suited. But Lua I guess is a better understood embeddable language

Erik Osterman (Cloud Posse)

07:48:12 AM

Last I had to write Lua was 14 years ago when dealing with Nginx

2018-11-08

rms1000watt

07:15:18 AM

For the codefresh peeps out there.. does it matter what/how the ingress controller looks when using codefresh for deployments?

rms1000watt

07:16:06 AM

Was reading through: https://docs.traefik.io/user-guide/kubernetes/#traffic-splitting and it came to mind

2018-11-09

Erik Osterman (Cloud Posse)

03:24:22 PM

Nope, we use for example the Cloudflare Access/Argo ingress and the nginx-ingress controller in the same cluster

Ryan Ryke

03:33:01 PM

have you seen this https://www.youtube.com/watch?v=kOa_llowQ1c

Kubernetes The Easy Way! (For Developers In 2018)

Erik Osterman (Cloud Posse)

07:37:25 PM

I love his presentations and he’s definitely the best evangelist for kubernetes

Erik Osterman (Cloud Posse)

07:37:51 PM

and i think he’s presenting the simple side that should be presented

Erik Osterman (Cloud Posse)

07:38:00 PM

and here comes the but…..

Erik Osterman (Cloud Posse)

07:38:38 PM

but in the real world of deploying complex applications with interdependencies, secrets, configurations, etc… it devolves into something much more complicated

Ryan Ryke

07:38:43 PM

his presentations are always awesome

Ryan Ryke

07:38:49 PM

for sure

Erik Osterman (Cloud Posse)

07:38:57 PM

and the gap to cross from the hello world examples to customer apps is huge

Ryan Ryke

07:38:57 PM

he makes it look so “easy button”

Erik Osterman (Cloud Posse)

07:39:00 PM

https://github.com/cloudposse/helmfiles/blob/master/helmfile.d/0400.kube-prometheus.yaml

cloudposse/helmfiles

Comprehensive Distribution of Helmfiles. Works with helmfile.d - cloudposse/helmfiles

Erik Osterman (Cloud Posse)

07:39:08 PM

PLEASE SOMEONE SHOW ME HOW TO MAKE THIS EASIER

Erik Osterman (Cloud Posse)

07:39:13 PM

i want to

Erik Osterman (Cloud Posse)

07:39:14 PM

i hate this

Erik Osterman (Cloud Posse)

07:39:31 PM

and here’s the rest of all the other apps

Erik Osterman (Cloud Posse)

07:39:32 PM

https://github.com/cloudposse/helmfiles/tree/master/helmfile.d

Ryan Ryke

07:40:19 PM

so with one of my customers we are working on two distinct steps… one to build the app, then a separate one to update (deploy) the app on an ongoing basis

Erik Osterman (Cloud Posse)

07:40:21 PM

so we’re using helm, and some hate on helm for one reason or another. but one thing’s for sure, this is hiding an even more enormous pile of YAML/go templating on the backend.

Ryan Ryke

03:33:11 PM

i hate kelsey hightower in the best way possible

Andriy Knysh (Cloud Posse)

06:56:00 PM

this is interesting https://aws.amazon.com/blogs/opensource/continuous-delivery-eks-jenkins-x/

Continuous Delivery with Amazon EKS and Jenkins X | Amazon Web Services

Amazon Elastic Container Service for Kubernetes (Amazon EKS) provides a container orchestration platform for building and deploying modern cloud applications using Kubernetes. Jenkins X is built on Kubernetes to provide automated CI/CD for such applications. Together, Amazon EKS and Jenkins X provide a continuous delivery platform that allows developers to focus on their applications. This […]

Andriy Knysh (Cloud Posse)

06:57:25 PM

i did not think it could do so much, it creates pipelines for infrastructure itself (prod and staging), and pipelines for the app, and even spawns a separate testing/staging env in k8s for each PR, and comments on GitHub on PRs (like atlantis), and creates GitHub repos with Helm charts for the infrastructure (prod and staging)

Andriy Knysh (Cloud Posse)

07:00:34 PM

https://github.com/jenkins-x/sso-operator (@Erik Osterman (Cloud Posse) already posted it before)

jenkins-x/sso-operator

Single Sign-On Kubernetes operator for Dex identity provider - jenkins-x/sso-operator

Andriy Knysh (Cloud Posse)

07:03:02 PM

one thing it can’t do is to upgrade the k8s cluster b/c it itself sits in the same cluster

Erik Osterman (Cloud Posse)

07:03:49 PM

wow

ramesh.mimit

12:28:55 AM

@here Any recommendations for learning distributed systems from basics to advanced?

ramesh.mimit

12:29:36 AM

I’ve noticed a lot of people know the tools but not the concepts…

2018-11-11

Andriy Knysh (Cloud Posse)

04:39:06 PM

@ramesh.mimit I found this site very interesting and with lots of resources about distributed systems, and real-life examples from many companies

Andriy Knysh (Cloud Posse)

04:39:07 PM

http://highscalability.com/

Andriy Knysh (Cloud Posse)

04:39:15 PM

http://highscalability.com/all-time-favorites/

Andriy Knysh (Cloud Posse)

04:39:26 PM

http://highscalability.com/blog/category/example

Andriy Knysh (Cloud Posse)

04:40:18 PM

http://highscalability.squarespace.com/blog/category/strategy

ramesh.mimit

09:31:47 PM

@Andriy Knysh (Cloud Posse) thanks..

Erik Osterman (Cloud Posse)

12:03:32 AM

https://news.ycombinator.com/item?id=18428497

Erik Osterman (Cloud Posse)

12:04:07 AM

“Google Kubernetes Engine’s third consecutive day of service disruption”

2018-11-12

btai

06:42:05 PM

anyone use the official python kube library?

btai

06:51:03 PM

can you load the config from a dict?

Erik Osterman (Cloud Posse)

08:06:48 PM

~~~why not use config profiles instead?~~~

Erik Osterman (Cloud Posse)

08:07:05 PM

~~~e.g. AWS_DEFAULT_PROFILE=cp-prod-admin~~~

Erik Osterman (Cloud Posse)

08:07:40 PM

~~~the underlying aws SDK should then handle everything automatically~~~

btai

10:17:34 PM

the kube config?

Erik Osterman (Cloud Posse)

10:39:21 PM

heh, my bad @btai
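
Back to the question about loading the kube config from a dict with the Python client: one approach that relies only on the long-standing file-based loader, `kubernetes.config.load_kube_config(config_file=...)`, is to write the dict to a temporary file first. This is a sketch under that assumption; the helper names are made up here. Newer releases of the client reportedly also ship a `load_kube_config_from_dict` helper, so check your installed version before using a workaround like this.

```python
import json
import os
import tempfile


def write_kubeconfig(config_dict):
    """Write the dict to a private temp file and return its path."""
    fd, path = tempfile.mkstemp(prefix="kubeconfig-", suffix=".json")
    with os.fdopen(fd, "w") as handle:
        # JSON is a subset of YAML, so a YAML-based loader can read it.
        json.dump(config_dict, handle)
    return path


def load_kube_config_from(config_dict, context=None):
    """Load a dict-shaped kubeconfig through the file-based loader."""
    from kubernetes import config  # third-party: pip install kubernetes

    path = write_kubeconfig(config_dict)
    try:
        config.load_kube_config(config_file=path, context=context)
    finally:
        os.remove(path)  # the file may hold credentials; do not leave it behind
```

`mkstemp` creates the file readable only by the current user, which matters because a kubeconfig usually contains tokens or client keys.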

btai

07:38:32 AM

how would i run kubectl within a container running from a job?

Erik Osterman (Cloud Posse)

07:42:31 AM

here’s an example doing it from a deployment: https://github.com/onfido/k8s-rabbit-pod-autoscaler

onfido/k8s-rabbit-pod-autoscaler

Kubernetes autoscaler for pods that consume RabbitMQ - onfido/k8s-rabbit-pod-autoscaler

Erik Osterman (Cloud Posse)

07:42:40 AM

doing it from a job wouldn’t be any different

Erik Osterman (Cloud Posse)

07:42:46 AM

just need the proper role bindings

Erik Osterman (Cloud Posse)

07:43:26 AM

in this case, kubectl is getting called from inside autoscale.sh

btai

07:45:35 AM

so if i have the wrong role bindings

btai

07:45:49 AM

would i be getting this error:

btai

07:46:25 AM

The connection to the server localhost:8080 was refused - did you specify the right host or port?

Erik Osterman (Cloud Posse)

07:48:44 AM

all i know is when we implemented it for redis using the strategy above (for rabbit), we didn’t need to specify any hosts

Erik Osterman (Cloud Posse)

07:48:53 AM

it just autodiscovers it

btai

07:48:59 AM

the pod autodiscovers

btai

07:49:04 AM

ok thats what i was hoping for

Erik Osterman (Cloud Posse)

07:49:05 AM

it also provides a kube context

btai

07:49:47 AM

so the pod itself didnt have any kubeconfig or kube api secrets

Erik Osterman (Cloud Posse)

07:50:09 AM

yea, it didn’t have anything like that
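
Some background on the "autodiscovery": inside a pod, clients find the API server through the `KUBERNETES_SERVICE_HOST` and `KUBERNETES_SERVICE_PORT` environment variables and authenticate with the service account token mounted under `/var/run/secrets/kubernetes.io/serviceaccount/`. When kubectl finds neither a kubeconfig nor those in-cluster settings, it falls back to `localhost:8080`, which produces the error quoted above. A small check like the following (function name made up for this sketch) could be run inside the job's container to see which of those inputs is missing. It does not say why they are missing on a particular cluster.

```python
import os

TOKEN_PATH = "/var/run/secrets/kubernetes.io/serviceaccount/token"


def missing_in_cluster_inputs(environ=os.environ, token_path=TOKEN_PATH):
    """List the in-cluster settings that are absent in this container."""
    missing = [
        name
        for name in ("KUBERNETES_SERVICE_HOST", "KUBERNETES_SERVICE_PORT")
        if not environ.get(name)
    ]
    if not os.path.isfile(token_path):
        missing.append(token_path)
    return missing


if __name__ == "__main__":
    problems = missing_in_cluster_inputs()
    if problems:
        print("in-cluster config unavailable, missing:", ", ".join(problems))
    else:
        print("in-cluster config inputs are present")
```

If the token file is missing, one thing worth checking is whether the pod spec or service account disables token mounting via `automountServiceAccountToken: false`.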

Erik Osterman (Cloud Posse)

07:50:28 AM

https://github.com/vanvalenlab/kiosk-autoscaler

btai

07:52:06 AM

yeah i have a job basically doing the same thing

btai

07:52:18 AM

executing a shell script that makes a kubectl call

btai

07:52:34 AM

but i get the above error

Erik Osterman (Cloud Posse)

07:52:46 AM

kops cluster?

btai

07:52:49 AM

aks

Erik Osterman (Cloud Posse)

07:52:58 AM

we tested it on gke and kops

btai

07:53:29 AM

yeah it works in kops

btai

07:53:33 AM

that job

Erik Osterman (Cloud Posse)

07:53:50 AM

oh interesting!

Erik Osterman (Cloud Posse)

07:53:58 AM

you have rbac enabled in kops?

btai

07:54:01 AM

although the kops

btai

07:54:06 AM

doesnt have rbac enabled

Erik Osterman (Cloud Posse)

07:54:10 AM

haha

btai

07:54:11 AM

yeah

btai

07:54:59 AM

so do i create a clusterrolebinding for the job?

Erik Osterman (Cloud Posse)

07:56:53 AM

https://github.com/onfido/k8s-rabbit-pod-autoscaler/blob/master/deploy.yml#L1-L29

2018-11-13

btai

08:17:46 AM

thanks

btai

08:27:14 AM

sorry, still new to k8s

btai

08:27:43 AM

hypothetically if i create a cluster role binding with the namespace and name that matches the job, that should work?

Erik Osterman (Cloud Posse)

06:03:04 AM

More or less

Erik Osterman (Cloud Posse)

06:03:51 AM

I don’t know the specific matching selectors that are available
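
One clarification on the matching: RBAC bindings do not match on the job's name. Their subjects are users, groups or service accounts, so the usual pattern is to give the job its own service account (via `spec.template.spec.serviceAccountName`) and bind a role to that account. The sketch below generates such manifests as JSON, which `kubectl apply -f` accepts. All names, resources and verbs are placeholders chosen for this example, not taken from the linked repo.

```python
import json

RBAC_API = "rbac.authorization.k8s.io/v1"


def rbac_for_job(namespace, service_account, resources, verbs, api_groups=("",)):
    """Build a ServiceAccount, Role and RoleBinding for a job's pods."""
    role_name = f"{service_account}-role"
    account = {
        "apiVersion": "v1",
        "kind": "ServiceAccount",
        "metadata": {"name": service_account, "namespace": namespace},
    }
    role = {
        "apiVersion": RBAC_API,
        "kind": "Role",
        "metadata": {"name": role_name, "namespace": namespace},
        "rules": [
            {
                "apiGroups": list(api_groups),
                "resources": list(resources),
                "verbs": list(verbs),
            }
        ],
    }
    binding = {
        "apiVersion": RBAC_API,
        "kind": "RoleBinding",
        "metadata": {"name": f"{service_account}-binding", "namespace": namespace},
        # The subject is the service account, not the job itself.
        "subjects": [
            {"kind": "ServiceAccount", "name": service_account, "namespace": namespace}
        ],
        "roleRef": {
            "apiGroup": "rbac.authorization.k8s.io",
            "kind": "Role",
            "name": role_name,
        },
    }
    return {"apiVersion": "v1", "kind": "List", "items": [account, role, binding]}


# Hypothetical example: let a job in "default" read and patch deployments.
manifests = rbac_for_job(
    "default", "scaler-job", ["deployments"], ["get", "list", "patch"],
    api_groups=("apps", "extensions"),
)
print(json.dumps(manifests, indent=2))
```

A namespaced Role and RoleBinding are used here because they grant less than a ClusterRoleBinding. The cluster-wide variants have the same shape minus the namespace on the role.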

Erik Osterman (Cloud Posse)

06:04:28 AM

This is neat. Copy secrets from a centralized system of record. https://github.com/mittwald/kubernetes-replicator/

mittwald/kubernetes-replicator

Kubernetes controller for synchronizing secrets & config maps across namespaces - mittwald/kubernetes-replicator

2018-11-14

Erik Osterman (Cloud Posse)

08:55:20 PM

How are you guys handling busy helm deployments where the tiller is busy attending to other deployments…

Error: could not find a ready tiller pod

Erik Osterman (Cloud Posse)

08:55:32 PM

@Max Moon @dustinvb

Erik Osterman (Cloud Posse)

08:59:16 PM

https://github.com/helm/helm/pull/3464

Add --replicas option for Tiller HA fixes #2334 by onorua · Pull Request #3464 · helm/helm

Introduce --replicas option to configure amount of Tiller instances on the cluster. Fixes #2334. The next PR will be about distributed lock, this one is just exterior.

dustinvb

08:59:24 PM

I haven’t run into this scaling issue yet.

Erik Osterman (Cloud Posse)

08:59:26 PM

--replicas option looks nice

Erik Osterman (Cloud Posse)

09:10:29 PM

@michal.matyjek @Daren have you run into this?

Max Moon

09:20:49 PM

I have not run into this yet either

Daren

09:53:49 PM

I have not

michal.matyjek

10:02:54 PM

not yet

michal.matyjek

10:03:04 PM

how many deployments are we talking about?

Erik Osterman (Cloud Posse)

11:22:05 PM

just concurrency

Erik Osterman (Cloud Posse)

11:22:17 PM

so we’re running helm on every PR synchronization for unlimited staging environments

Erik Osterman (Cloud Posse)

11:22:20 PM

so we’re getting it

Erik Osterman (Cloud Posse)

11:22:35 PM

e.g. 2 developers push at around the same time
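
Nobody in the thread describes a fix beyond the `--replicas` PR, so the following is only one possible CI-side workaround, sketched under the assumption that the error is transient: retry the helm call with exponential backoff when the output contains the "could not find a ready tiller pod" message. The function names are made up for this example.

```python
import subprocess
import time

TILLER_BUSY = "could not find a ready tiller pod"


def run_with_retry(operation, attempts=5, base_delay=1.0, sleep=time.sleep):
    """Call operation(); retry with exponential backoff only on the tiller error."""
    for attempt in range(1, attempts + 1):
        try:
            return operation()
        except RuntimeError as err:
            if TILLER_BUSY not in str(err) or attempt == attempts:
                raise  # unrelated failure, or out of attempts
            sleep(base_delay * 2 ** (attempt - 1))


def helm(args):
    """Run the helm CLI and raise RuntimeError with stderr on failure."""
    result = subprocess.run(["helm", *args], capture_output=True, text=True)
    if result.returncode != 0:
        raise RuntimeError(result.stderr.strip())
    return result.stdout
```

A pipeline step could then call something like `run_with_retry(lambda: helm(["list"]))`. Retrying hides the symptom rather than removing the contention, so it is a stopgap at best.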

2018-11-19

sarkis

04:54:08 PM

hey all - curious what the verdict is on kiam vs kube2iam… it seems like kiam was created to address some issues with kube2iam - is kiam the way to go these days?

Erik Osterman (Cloud Posse)

07:32:05 PM

@sarkis yea, kube2iam is dead and should not be used. It’s a massive liability to even deploy in an AWS account. If you run more than N hosts (N ~10), you’ll DoS AWS APIs and they rate limit you.

Erik Osterman (Cloud Posse)

07:33:10 PM

kiam addresses this by having a client/server model. clients run on all nodes (agents), and talk to the server. the server is responsible for fetching the credentials which reduces rate of requests

Erik Osterman (Cloud Posse)

07:33:13 PM

it also caches
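
To illustrate why a central server with a cache cuts down the request rate, here is a toy model: many node agents ask one server for role credentials, and the server only goes upstream when it has nothing fresh cached. This is not kiam's code; the class, the TTL value and the fake fetch function are all made up for illustration.

```python
import time


class CachingCredentialServer:
    """Toy server that fetches credentials per role and caches them for a TTL."""

    def __init__(self, fetch, ttl_seconds=900.0, clock=time.monotonic):
        self._fetch = fetch          # callable(role) -> credentials (the upstream API)
        self._ttl = ttl_seconds
        self._clock = clock
        self._cache = {}             # role -> (expires_at, credentials)
        self.upstream_calls = 0

    def credentials_for(self, role):
        now = self._clock()
        cached = self._cache.get(role)
        if cached is not None and cached[0] > now:
            return cached[1]         # still fresh: no upstream request
        self.upstream_calls += 1
        credentials = self._fetch(role)
        self._cache[role] = (now + self._ttl, credentials)
        return credentials


# Ten hypothetical node agents asking for the same role.
server = CachingCredentialServer(lambda role: {"role": role, "token": "fake"})
for _agent in range(10):
    server.credentials_for("app-role")
print("upstream calls:", server.upstream_calls)
```

Without the shared server, each of the ten agents would have made its own upstream request, which is the pattern described above as hitting rate limits.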

Erik Osterman (Cloud Posse)

07:33:43 PM

I think there’s been some frustration related to the rate of development on Kiam, but the worst bugs are fixed.

Erik Osterman (Cloud Posse)

07:33:56 PM

Also, I don’t know of any alternatives to kiam and kube2iam for AWS

sarkis

07:48:04 PM

thanks @Erik Osterman (Cloud Posse)!