SweetOps #kubernetes for May, 2019

ahhh….might be this:
This could be because the cluster was created with one set of AWS credentials (from an IAM user or role), and kubectl is using a different set of credentials.

I created it via CI

Andriy Knysh (Cloud Posse)

06:43:09 PM

this was the issue?

johncblandii

06:44:13 PM

I think so

johncblandii

06:44:27 PM

https://docs.aws.amazon.com/eks/latest/userguide/troubleshooting.html#unauthorized

Amazon EKS Troubleshooting - Amazon EKS

This chapter covers some common errors that you may see while using Amazon EKS and how to work around them.

johncblandii

06:45:23 PM

and the kubectl apply on CI failed because kc isn’t available on there (yet)

johncblandii

06:45:54 PM

FYI @wbrown43 ^

johncblandii

07:09:02 PM

confirmed. used the CI users creds and it worked as expected

btai

09:13:39 PM

@johncblandii you’ll need to update your authenticator configmap to allow other roles/users

johncblandii

09:38:52 PM

just got through that part, @btai.

johncblandii

09:39:03 PM

my last eks was a local install i did so i did not realize this was a rule

btai

09:42:37 PM

@johncblandii yeah took me half a day just trying to figure out how to get aws-iam-authenticator working and I ran into the same issues as you did haha

btai

09:43:05 PM

but no problems since

johncblandii

09:43:07 PM

to which I spent the same half-day (while in 5 hours of meetings back-to-back-to-back)

johncblandii

09:43:08 PM

johncblandii

09:43:19 PM

now my nodes aren’t connecting so onto issue #3; lol

btai

09:44:28 PM

did you add the role for the worker nodes to your config map as well?

johncblandii

09:44:55 PM

yup. the cp module does it

johncblandii

09:48:51 PM

going to nix one and let the scaling kick off a fresh one now that the map is applied

johncblandii

10:01:24 PM

so no public ip seems to have been the issue

johncblandii

10:01:31 PM

started one w/ a public ip and voila

btai

01:02:08 AM

@johncblandii are you talking about public ip for your worker nodes?

johncblandii

03:25:08 AM

yup

johncblandii

03:25:37 AM

our other cluster didn’t have public, but when I upgraded them to 1.12 i had to do the same

(unsure if that’s related, but i did notice that)

btai

05:19:29 PM

@johncblandii fwiw, i didnt have to make my worker nodes public. im still on 1.11 but i cant imagine that would change in 1.12

johncblandii

05:19:58 PM

i hear you. that’s just what i noticed when i moved to .12

Jeremy G (Cloud Posse)

06:17:22 PM

I have a cluster in AWS in 3 availability zones, with 3 masters, but only 2 nodes. kops put both nodes in the same AZ? Is this a bug? How do I get kops to spread the nodes evenly across AZs?

Erik Osterman (Cloud Posse)

06:18:04 PM

it’s not a kops thing

Erik Osterman (Cloud Posse)

06:18:28 PM

compare how the master node pools are created to how the worker node pools are created

Erik Osterman (Cloud Posse)

06:18:40 PM

that’s how to ensure more even distribution

Erik Osterman (Cloud Posse)

06:19:11 PM

AWS will make “best effort” to allocate instances evenly, but no guarantee

Erik Osterman (Cloud Posse)

06:19:26 PM

the only way to have a “guarantee” is to create node pools tied to exactly one AZ

Issif

06:34:26 PM

for this purpose, we create 3 ASG with only one master inside

Erik Osterman (Cloud Posse)

06:34:48 PM

precisely…

Jeremy G (Cloud Posse)

09:18:08 PM

Yes, kops creates an instance group per zone for the masters, but just 1 instance group for all the nodes.

Jeremy G (Cloud Posse)

10:08:33 PM

So it turns out the bigger issue is that AWS autoscale group does launching and zone balancing separately, and to do zone balancing it has to launch a new instance before deleting the old one. Well, we had run up against our instance/type limit for the region, so it could not do zone balancing.

Erik Osterman (Cloud Posse)

10:08:54 PM

oh fascinating

Erik Osterman (Cloud Posse)

10:08:57 PM

good sluething

wbrown43

06:50:41 PM

@wbrown43 has joined the channel

2019-05-07

johncblandii

07:04:45 AM

Is there a clean way to get the security group created for an LB so I can assign it to the workers SG to approve traffic?

johncblandii

07:05:11 AM

The LB is created through the helm deploy.

Andriy Knysh (Cloud Posse)

01:48:27 PM

using this https://www.terraform.io/docs/providers/aws/d/security_group.html and query by filter or tags?

AWS: aws_security_group - Terraform by HashiCorp

Provides details about a specific Security Group

Erik Osterman (Cloud Posse)

02:50:24 PM

https://github.com/stakater/Whitelister

stakater/Whitelister

A tool to white list node and developer IPs for kubernetes. - stakater/Whitelister

Erik Osterman (Cloud Posse)

02:50:54 PM

I would pursue a k8s native solution rather than trying to fuse terraform with helm

Erik Osterman (Cloud Posse)

02:51:39 PM

Also, IP whitelisting should be used as a last resort. Identity Aware Proxies is ala keycloak is a better approach

btai

05:21:07 PM

alb ingress controller creates you an ALB and the necessary security groups and assigns them to access your workers

johncblandii

05:21:52 PM

@Andriy Knysh (Cloud Posse) there isn’t enough on the SG to query that way. it has the [k8s.io/](http://k8s.io/)… tag, but it is not specific.

johncblandii

05:22:49 PM

@Erik Osterman (Cloud Posse) this isn’t fusing helm and tf. it is the SG created by TF, but I’m mainly just adding an SG record so it is mainly AWS infrastructure networking.

Andriy Knysh (Cloud Posse)

05:24:27 PM

if you go that route, you can filter by name (the resource has some name) and not tags

Andriy Knysh (Cloud Posse)

05:24:35 PM

or add your own specific tag

Erik Osterman (Cloud Posse)

05:24:52 PM

I think I lack context of where you are trying to do this?

Erik Osterman (Cloud Posse)

05:25:39 PM

“Is there a clean way from XXXXXXX to get the security group created for an LB by ZZZZZZ so I can assign it to the workers SG in YYYYYYY to approve traffic?”

johncblandii

03:27:39 PM

you may be technically right w/ fusing them. i’m technically wanting a value from k8s so i can configure the AWS SG to allow communication.

The SG is handled within TF manually

btai

06:31:03 PM

@johncblandii what type of LB are you using, if youre using an ALB I would suggest alb ingress controller as it does all that for you. (the downside is when you tear down your cluster, it wont clean up for you)

johncblandii

08:45:30 PM

it automatically used a classic elb (helm install)

btai

12:08:24 AM

what helm chart @johncblandii

johncblandii

03:05:43 AM

Twistlock

johncblandii

11:41:04 PM

welp…bitten by the “providers cannot be dynamically initialized” issue

2019-05-08

2019-05-09

rms1000watt

09:45:57 PM

Dang, how do we get Curtis Mattoon into cloud posse slack? https://github.com/cmattoon/aws-ssm/pull/29

Added log-level functionality by rms1000watt · Pull Request #29 · cmattoon/aws-ssm

I didn’t see any other way to set the log level. So here it is!

rms1000watt

09:46:41 PM

This tool works pretty good. But just curious if you peeps have any other methods for dynamically added k8s secrets from SSM

Erik Osterman (Cloud Posse)

09:47:26 PM

not from SSM

Erik Osterman (Cloud Posse)

09:47:38 PM

have you seen @mumoshu’s ASM operator?

rms1000watt

09:47:50 PM

nope, I shall take a look-see

Erik Osterman (Cloud Posse)

09:48:06 PM

i think extending that to support SSM would be nice

mumoshu

10:00:34 PM

you reminded me that we had the exact issue for it! https://github.com/mumoshu/aws-secret-operator/issues/14

Use parameter store now that higher throughput is available? · Issue #14 · mumoshu/aws-secret-operator

AWS recently added the capability to increase throughput for SSM parameter store: https://docs.aws.amazon.com/systems-manager/latest/userguide/parameter-store-throughput.html Is there a chance aws-…

Erik Osterman (Cloud Posse)

09:48:13 PM

or creating a separte one

rms1000watt

09:49:15 PM

https://github.com/mumoshu/aws-secret-operator (for the others in the channel)

mumoshu/aws-secret-operator

A Kubernetes operator that automatically creates and updates Kubernetes secrets according to what are stored in AWS Secrets Manager. - mumoshu/aws-secret-operator

rms1000watt

09:51:43 PM

Why not use AWS SSM Parameter Store as a primary source of secrets?

Pros:

Parameter Store has an efficient API to batch get multiple secrets sharing a same prefix.

Cons:

Its API rate limit is way too low. This has been discussed in several places in the Internet:

rms1000watt

09:52:00 PM

However, they just updated the rate limit to 1k req/s

rms1000watt

09:52:06 PM

so it might be a non-issue now

rms1000watt

09:52:35 PM

Also, you can set the limit and incur costs. Haven’t actually clicked this before.. lets see what happens

rms1000watt

09:52:55 PM

rms1000watt

09:53:26 PM

Ohhh, this is how to you get 1k: https://docs.aws.amazon.com/systems-manager/latest/userguide/parameter-store-throughput.html

You can increase the limit to 1,000 TPS on the Settings tab. Increasing the throughput limit incurs a charge on your AWS account.

rms1000watt

09:55:43 PM

$0.05 per 10,000 Parameter Store API interactions k.. I’ll stop spamming

Erik Osterman (Cloud Posse)

09:56:48 PM

that’s great; didn’t know they increased the limit

mumoshu

09:58:32 PM

I thought secretsmanager had the same amount of charge

rms1000watt

09:59:07 PM

secretmanager i think is $1/mo/secret. Lemme google a littttle

rms1000watt

10:00:02 PM

whoops.. $0.40/mo

rms1000watt

10:00:32 PM

PER SECRET PER MONTH
$0.40 per secret per month. For secrets that are stored for less than a month, the price is prorated (based on the number of hours.)

PER 10,000 API CALLS
$0.05 per 10,000 API calls.

rms1000watt

10:00:59 PM

loren

11:34:25 PM

Also good to cache your secrets, to avoid extra API calls and rate limits… https://aws.amazon.com/about-aws/whats-new/2019/05/Secrets-Manager-Client-Side-Caching-Libraries-in-Python-NET-Go/

rms1000watt

05:23:59 AM

this is interesting

rms1000watt

05:25:40 AM

curious how it works in detail. Like, does it make your microservice stateful? Or does it put the cache local to your cluster? Or is aws handling all the caching for us automagically?

The go SDK code looks straight forward though. Awesome find!

rms1000watt

05:23:06 AM

https://github.com/cmattoon/aws-ssm/pull/30 fixing a bug in aws-ssm if anyone else was considering to use it

Added next token to getparameterbypath for secrets > 10 by rms1000watt · Pull Request #30 · cmattoon/aws-ssm

The Go SDK for GetParameterByPath limits to 10 values in the response. This should grab them all.

Erik Osterman (Cloud Posse)

05:31:19 PM

how does it look when you want many parameters?

kind: Secret
metadata:
  name: my-secret
  annotations:
    aws-ssm/k8s-secret-name: my-secret
    aws-ssm/aws-param-name: my-db-password
    aws-ssm/aws-param-type: SecureString

e.g. /db/*

Added next token to getparameterbypath for secrets > 10 by rms1000watt · Pull Request #30 · cmattoon/aws-ssm

The Go SDK for GetParameterByPath limits to 10 values in the response. This should grab them all.

Erik Osterman (Cloud Posse)

05:31:45 PM

The name of the AWS SSM Parameter. May be a path.

Erik Osterman (Cloud Posse)

05:31:49 PM

i guess that answers it

Erik Osterman (Cloud Posse)

05:32:01 PM

but still curious. i never really kicked the tires on aws-ssm

Erik Osterman (Cloud Posse)

06:57:26 PM

(ultimately, client wanted per-service access controls so we went with Chamber +S3 + IAM + KIAM)

rms1000watt

09:34:05 PM

@Erik Osterman (Cloud Posse)

apiVersion: v1
kind: Secret
metadata:
  name: my-secret-name
  annotations:
    aws-ssm/k8s-secret-name: my-secret-name
    aws-ssm/aws-param-name: {{ .Values.ssm_path }}
    aws-ssm/aws-param-type: Directory
data: {}

Where ` .Values.ssm_path == /directory/within/ssm`

Erik Osterman (Cloud Posse)

09:34:46 PM

Ah, thx!

rms1000watt

09:34:57 PM

(lol, sorry about the delay)

Erik Osterman (Cloud Posse)

09:35:06 PM

how’s the helmfile PR coming along?

rms1000watt

09:35:42 PM

stale at the moment. been a bit busy. basically I didn’t consider multiple files

rms1000watt

09:36:11 PM

and there’s some chicken/egg issue about when the template-rendering happens and when to reference a file

rms1000watt

09:36:22 PM

so I just need to hit my head a little harder on it

Erik Osterman (Cloud Posse)

09:36:36 PM

maybe that will be simpler if they decouple the multi-phase rendering

rms1000watt

09:38:31 PM

Possibly. I thought multi-phase rendering was needed for template in template situations

2019-05-10

2019-05-11

Exequiel Barrirero

11:25:19 PM

https://www.sentialabs.io/2018/10/21/Integrating-EKS-with-other-AWS-services.html

Exequiel Barrirero

11:25:46 PM

Exequiel Barrirero

11:26:46 PM

Interesting approach for -> Deploying API Gateway in front of EKS / K8s Kops Clusters inside VPC private subnets And many other useful info about Integrating EKS with other AWS Services

2019-05-15

davidvasandani

05:05:04 PM

https://github.com/containership/konstellate

containership/konstellate

Free and Open Source GUI to Visualize Kubernetes Applications. - containership/konstellate

oscarsullivan_old

05:19:31 PM

Thanks I like this. For #terraform there’s also https://github.com/camptocamp/terraboard

camptocamp/terraboard

A web dashboard to inspect Terraform States - camptocamp/terraboard

davidvasandani

10:40:48 PM

Thanks for sharing @oscarsullivan_old! This looks really neat. You should share it in the #terraform channel.

camptocamp/terraboard

A web dashboard to inspect Terraform States - camptocamp/terraboard

Vidhi Virmani

12:57:13 AM

I am trying to setup kubernetes dashboard on AWS EKS cluster. I am able to setup the dashboard but facing a small issue with certs. I want to use aws certificate arn with the dashoard as an argument with command

kubectl apply -f <https://raw.githubusercontent.com/kubernetes/dashboard/v1.10.1/src/deploy/recommended/kubernetes-dashboard.yaml>

is this possible?

2019-05-16

nutellinoit

12:52:15 PM

To anyone that is tempted to use t3a or m5a instances on an EKS cluster, don’t

nutellinoit

12:52:20 PM

https://github.com/awslabs/amazon-eks-ami/issues/262

Support for t3a, m5ad and r5ad instance types is missing · Issue #262 · awslabs/amazon-eks-ami

What would you like to be added: Support for t3a, m5ad and r5ad instance types. Why is this needed: AWS had added new instance types, and the AMI does not currently support them.

nutellinoit

12:52:54 PM

there is an incompatibility on calculating number of eni available

endofcake

01:30:32 AM

https://reactiveops.com/blog/introducing-polaris-keeping-your-kubernetes-clusters-healthy

Introducing Polaris: Keeping your Kubernetes Clusters Healthy - Reactive Ops

We started ReactiveOps with a simple vision: transform infrastructure operations by leveraging decades of large-scale operations and product experience.

timduhenchanter

02:57:44 AM

Scale on queue depth

https://github.com/kedacore/keda

kedacore/keda

KEDA is a Kubernetes-based Event Driven Autoscaling component. It provides event driven scale for any container running in Kubernetes - kedacore/keda

2019-05-17

Issif

12:17:54 PM

https://github.com/reactiveops/polaris

reactiveops/polaris

Validation of best practices in your Kubernetes clusters - reactiveops/polaris

Sandeep Kumar

01:12:33 PM

How do we generate a wildcard certificate using kubernetes kind:managedCertificate, trying with below method but not successful apiVersion: networking.gke.io/v1beta1 kind: ManagedCertificate metadata: name: example-certificate spec: domains: - *.example.net

Please let me know if there is any documentation/suggestions to create a wild card certificate with expiry date mentioned in it

sarkis

11:00:32 PM

^ Polaris looks really interesting… I’m going to try to get it going this weekend see if it’s useful… any thoughts on it yet if someones already set it up?

sarkis

11:22:04 PM

Couldn’t wait for the weekend testing it now… it offers some nice checks… I can see this becoming more and more useful as more checks/best practices are added …

2019-05-19

James D. Bohrman

05:10:02 AM

Hey all! I’m having an issue building my example-voting-app with Codefresh.

I added the variable for KUBE_CONTEXT but I keep getting an error that throws:

error: no context exists with the name: "gke_example-voting-app-240610_us-east1-c_example-votin
g-app".                                                                                        
[SYSTEM] Error: Failed to run freestyle step: Running Helm Upgrade; caused by NonZeroExitCodeEr
ror: Container for step title: Running Helm Upgrade, step type: freestyle, operation: Freestyle

Erik Osterman (Cloud Posse)

05:22:57 AM

KUBE_CONTEXT should be the name of a kubernetes integration in codefresh

Erik Osterman (Cloud Posse)

05:23:06 AM

it would seldom, if ever have the app name in it

Erik Osterman (Cloud Posse)

05:23:26 AM

https://codefresh.io/docs/docs/deploy-to-kubernetes/add-kubernetes-cluster/

Add Kubernetes Cluster

How to connect your Kubernetes cluster to the Codefresh dashboard

James D. Bohrman

06:12:12 AM

Got it thanks!

James D. Bohrman

05:10:28 AM

I ran kubectl get context in my GKE shell and got:

gke_example-voting-app-240610_us-east1-c_example-voting-app

James D. Bohrman

05:10:42 AM

I put that as my KUBE_CONTEXT variable and can’t figure what I’m doing wrong. The docs say to put KUBE_CONTEXT as “Your friendly Kubernetes Cluster Name” I’ve also tried “example-voting-app” as the context variable. Which is the EKS cluster name. No dice there either.

2019-05-20

aaratn

09:16:06 AM

Can anyone help me with aws alb loadbalancer with helm chart ? Any samples that I can refer ?

sarkis

06:12:25 PM

https://github.com/flant/werf

flant/werf

Werf (previously known as dapp) helps to implement and support Continuous Integration and Continuous Delivery - flant/werf

2019-05-22

Pablo Costa

06:17:19 PM

https://aws.amazon.com/about-aws/whats-new/2019/05/amazon-eks-simplifies-kubernetes-cluster-authentication/

bye bye aws-iam-authenticator

Andriy Knysh (Cloud Posse)

06:20:13 PM

finally

btai

06:49:50 PM

ergg.. i spent a good part of a day understanding how it works/getting it to work w/my eks cluster spun up in tf

Erik Osterman (Cloud Posse)

06:30:50 PM

Public/Free Office Hours with Cloud Posse starting now!!

https://zoom.us/j/684901853

btai

06:48:44 PM

anyone try federation yet?

2019-05-24

Kevin Gimbel

12:55:31 PM

Hey all, I’ve a question and I can’t seem to find an answer. I’m running an AWS EKS cluster with two Nodes, each Node in EKS has a restriction of 20 Pods per Node. The Nodes are auto scaled and shut down each night and started in the morning since it’s just a test / staging system at the moment. However, one Node is always full (20/20 Capacity) while the other runs 4/20. We want to run a DaemonSet with filebeat for log aggregation but cannot ensure it runs on both nodes because one is full.

Is there a way I can (easily) ensure the DaemonSet is scheduled before all other pods? Or can I reserve a spot / space on a Node for a specific Pod, Deployment, or DaemonSet?

Kevin Gimbel

12:56:13 PM

I would like to avoid configuration overhead. I’ve already read about Affinity and Anti-Affinity but I’m not sure if this can help me

Kevin Gimbel

01:16:59 PM

Someone in the Kubernetes Slack answered my question, looks like this is it: https://kubernetes.io/docs/concepts/configuration/pod-priority-preemption/

Pod Priority and Preemption

Erik Osterman (Cloud Posse)

04:44:17 PM

Yes, this is what you want to look into.

2019-05-29

Vidhi Virmani

12:49:45 AM

Hi all,

Is there anyone who has setup kubernetes dashboard on EKS using istio ingress gateway? I am facing some issues where my dashboard crash after 4 mins. I am not sure if its a good idea to use istio ingress gateway to run kubernetes-dashboard. Any help is appreciated

Vidhi Virmani

03:30:08 AM

It is fixed now. I had to provide few configs in istio

Erik Osterman (Cloud Posse)

04:33:38 AM

@Vidhi Virmani how are you securing it?

johncblandii

11:29:09 PM

(comment just to monitor response)

Vidhi Virmani

05:26:48 AM

@Erik Osterman (Cloud Posse) I am currently allowing very few users to access the dashboard using aws-iam-authenticator.