#geodesic (2018-08)
Discussions related to https://github.com/cloudposse/geodesic
Archive: https://archive.sweetops.com/geodesic/
2018-08-01
2018-08-02
damn, I spun up an AWS Workspace for Windows 10 not realizing it doesn’t support Hyper-V, so no Docker. =(
heh, and no “Windows Server 1709” support yet for “AWS Workspaces”, so no WSL even.
=(
@Sebastian Nemeth i wanted to try this to test Geodesic on WSL/Docker
@rohit.verma going back to your question of simplifying IDE integration with geodesic
does running `mount --bind /localhost/Dev/cloudposse/terraform-root-modules/ /conf` inside of geodesic make things any better for you?
I’ve been using this today and it really has helped for this specific use-case
For example, I’m working on some kops automation. So I run:
mount --bind /localhost/Dev/cloudposse/terraform-root-modules/aws/kops /conf/kops
(replace Dev/cloudposse/terraform-root-modules/ with the path to your root-modules folder)
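A rough sketch of that bind-mount workflow (the paths are examples; substitute your own checkout under /localhost):
```
# Sketch only: overlay a host checkout onto the path geodesic reads from.
# SRC/DST are example paths, not defaults shipped with geodesic.
SRC=/localhost/Dev/cloudposse/terraform-root-modules/aws/kops
DST=/conf/kops

mkdir -p "$DST"
mount --bind "$SRC" "$DST"   # edits on the host are now visible at /conf/kops

# ...iterate with terraform inside geodesic...

umount "$DST"                # remove the overlay when finished
```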
I’ve started getting these errors. I suspect it could be related to bash not responding to kill -WINCH $$ within some kind of deadline
That’s crazy talk. POSIX signals were designed to be deadlock free. I’ve never heard of a signal deadline. Have you?
I don’t know the internals well enough anymore. Not strictly speaking of POSIX signals, though I have experienced something happening to a process that didn’t acknowledge a signal, but I confess it’s been 15 years since I was that low level and I’m probably entirely off.
I just don’t know what to attribute this behavior to
I think what I might be thinking of (confusing it with) is where signal handling has been used for heartbeating a process. If it doesn’t respond, then the process is reaped.
@tamsky Have you seen this before?
i started getting the errors too (after some period of inactivity in geodesic)
I’ve noticed strange behavior after long period of inactivity but was assuming it had to do with my aws-vault session expiring
what: Only call kill -WINCH when dimensions of screen change. why: Theory is that it contributes to this error… “I suspect it could be related to bash not responding to kill -WINCH” within s…
2018-08-03
2018-08-04
[2] Stopped indicates the process received a SIGTSTP
I should smile along with my crazy talk.
https://github.com/cloudposse/geodesic/pull/210#pullrequestreview-143392008
so in marked contrast to my bash shell’s default shopt, geodesic does have checkwinsize on…
so unless that setting changed only recently – I think that the shopt should have been solving this problem all along.
And I’m thinking: “where have you been all my life, checkwinsize?” Not having it forced me to learn the kill -WINCH trick in the meantime.
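For anyone following along, a minimal sketch of the two approaches being discussed (assumes an interactive bash shell; nothing geodesic-specific):
```
# Minimal sketch, interactive bash assumed.
shopt checkwinsize      # show whether the option is currently enabled
shopt -s checkwinsize   # when set, bash re-reads the terminal size after
                        # each external command and updates LINES/COLUMNS

# manual workaround when the resize signal never reaches this shell:
kill -WINCH $$          # tell bash the window dimensions changed
```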
The checkwinsize wasn’t working for me, but maybe that was masked by other problems
For example the ones you previously fixed
maybe docker run doesn’t pass WINCH to subshells?
because the terminal’s window change signal needs to propagate down this entire chain: (iterm/xterm) -> local shell -> docker run -> geodesic shell
Oh fascinating. Didn’t know that either
I guess it does propagate… here’s one where the signal was killing apache if docker run -it was used:
https://github.com/docker-library/php/issues/64
When making a derived image from php:5.6-apache, when the server starts, at the slightest movement it stops, and gives me this message: [Wed Jan 21 2005.736731 2015] [mpm_prefork:notice] [pid 1…
Regarding the other problem, with sigstop it seems to happen when AWS session expires. Probably related to aws-vault usage.
yes, that sounds like aws-vault trying to read/write stdin/out and being blocked
Yes, I think it is related to stdin
I just scanned their issue queue and didn’t see anything related.
next time this happens to someone, can they please run and report back:
pstree -p ; for i in $(jobs -p) ; do echo $p ; ls -l /proc/$i/fd ; done;
as well as the [N] Stopped aws-vault exec ...
Will do!
Man… can’t say how much I appreciate your insights. In the short time since you’ve joined, I’ve learned a lot of little tricks from you.
@tamsky this is the output.
pid 8 is aws-vault in server mode (mock metadata api)
2018-08-06
doh, pstree -p ; jobs -l; for i in $(jobs -p) ; do echo $i ; ls -l /proc/$i/fd ; done;
$p was in my example – should have been $i
oops, I missed that too.
will try again next time.
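For readability, the same diagnostic as the corrected one-liner above, just split out and annotated (no change to what it runs):
```
# Same commands as the one-liner, annotated.
pstree -p                 # process tree inside the geodesic shell
jobs -l                   # stopped/background jobs with their PIDs
for i in $(jobs -p); do
  echo "$i"
  ls -l /proc/$i/fd       # file descriptors the stopped job still holds open
done
```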
2018-08-07
@Sebastian Nemeth just cut 0.13.0, which adds WSL (Windows) support
Can you give that a shot?
Will do!
Hey man - so this still isn’t solving the problem convincingly, I think… There’s just one problem I can see…
The problem is that it’s common for WSL users to change the mount path for their local drives from /mnt/c to just /c (for example), which makes a lot of things easier for us. But this causes the new geodesic to fail with:
/usr/local/bin/root.potato.com: line 91: /mnt/c/Windows/System32/cmd.exe: No such file or directory
E.g. line here: https://github.com/cloudposse/geodesic/commit/a096ddf28314f0d7c9423f61b8516853663b4d24#diff-499f40d14b68a5dc159a3d3ebc5c4870R91
Looks like it’s looking for cmd.exe under /mnt - however, cmd.exe is something that should always be in PATH in WSL, so it might be fine to omit the path and just use cmd.exe everywhere?
- Add user environment preserving * fix(*): add env variables for changing $HOME varaible(for wsl) * fix(wrapper-on-wsl): Now windows and linux usernames get dynamically * refactor(wrapper…
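If that assumption holds (WSL’s default behaviour of appending the Windows directories to PATH), a sketch of the PATH-based lookup might look like this; the fallback path below is just the stock default, not anything geodesic ships today:
```
# Sketch only: prefer whatever cmd.exe is on PATH, fall back to the stock
# WSL location. Assumes appendWindowsPath has not been disabled.
CMD_EXE="$(command -v cmd.exe || true)"
CMD_EXE="${CMD_EXE:-/mnt/c/Windows/System32/cmd.exe}"

"$CMD_EXE" /c "echo %USERPROFILE%"   # example use: ask Windows for the user profile path
```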
The location of the mounted drives can be obtained from /etc/wsl.conf under [automount] > root.
https://docs.microsoft.com/en-us/windows/wsl/wsl-config#set-wsl-launch-settings
Reference listing and configuring multiple Linux distributions running on the Windows Subsystem for Linux.
@Erik Osterman (Cloud Posse)
Added a PR here: https://github.com/cloudposse/geodesic/pull/214
It uses a regex to look up the correct root mount path from wsl.conf. I tested the script on my system, and it works - however I wasn’t able to test the whole build.
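Not the exact script from that PR, but a sketch of the idea, assuming the [automount] root key is the only "root =" line in /etc/wsl.conf:
```
# Sketch: read the [automount] root setting, defaulting to the stock /mnt/.
# A stricter parser would scope the match to the [automount] section.
WSL_MOUNT_ROOT="$(awk -F'=' '/^[[:space:]]*root[[:space:]]*=/ { gsub(/[ \t]/, "", $2); print $2 }' /etc/wsl.conf 2>/dev/null)"
WSL_MOUNT_ROOT="${WSL_MOUNT_ROOT:-/mnt/}"

echo "Windows drives are mounted under: ${WSL_MOUNT_ROOT}"
```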
2018-08-08
2018-08-09
2018-08-15
@Dylan has joined the channel
2018-08-17
Lots of great UX fixes were merged today
- check for duplicate syslog-ng
- fancier banner
- prompt line-wrapping
- ^C ssh-agent no longer aborts subsequent scripts
2018-08-19
some discussion right now in #release-engineering related to #geodesic
2018-08-21
@tarrall has joined the channel
2018-08-22
@Adam has joined the channel
OK carrying on from #announcements
First up… we’ve followed the cold-start process and now have root.example.com, prod.example.com etc. repos, accounts stood up, and a k8s cluster up in prod. Which is cool, but the current Dockerfile is basically an intermingling of configuration and code, making it awkward to update it to track the “upstream” versions (e.g. prod.cloudposse.co/Dockerfile). Are there plans to extract the configuration into a separate file in order to make the existing repo more usable long-term, or is this intended more as a “here’s an example of how you might glue this all together” repo rather than a tool you’d use directly?
And yeah I’m picking up where Jonathan left off, or at least trying to
> the current Dockerfile is basically an intermingling of configuration and code, making updating it to track the “upstream” versions awkward
so there is a lot of versioning going on
(binaries, images, charts, helmfiles, modules)
which versions are you referring to?
hi @tarrall, welcome
maybe I asked this the wrong way, let me rephrase. When I got here, they had a copy of prod.cloudposse.co that was obviously from several commits ago, and things didn’t seem quite right, so I figured hey let me check out the latest version before I try to troubleshoot too far.
However our Dockerfile (and yours) has lines like this:
ENV TF_VAR_account_id="12345"
ENV TF_VAR_namespace="example"
ENV TF_VAR_stage="prod"
ENV TF_VAR_domain_name="prod.example.com"
ENV TF_VAR_zone_name="prod.example.com."
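(Context on why those lines tie the image to one environment: Terraform treats any TF_VAR_<name> environment variable as the value of the input variable <name>. A trivial illustration with made-up values:)
```
# Illustration only, made-up values: baking TF_VAR_* into the image means
# terraform inside that image is hard-wired to one namespace/stage/account.
export TF_VAR_namespace="example"
export TF_VAR_stage="prod"
export TF_VAR_domain_name="prod.example.com"

terraform plan   # picks up namespace/stage/domain_name without any -var flags
```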
aha, gotcha!
yes, so the reference architectures IMO are designed to be hardforked
which means I need to examine your Dockerfile, copy the relevant changes into ours, instead of having a portable Dockerfile with that stuff elsewhere & pulled in
I really don’t think it makes sense to try to use ours verbatim
OK gotcha
they are an example of how to use all of our tools
it’s an example of what we do and how we do it for our customers
additionally, terraform-root-modules is also more of a set of highly functional examples
you can definitely reference them and use them, but what you run in AWS will be different from what other people run. they are basically examples for how to invoke all our terraform modules. . . a demonstration of how we use it.
and then question #2 is … let’s say I want to slap an RDS instance in here — maybe RDS postgres, maybe Aurora. I see cloudposse/terraform-aws-rds-cluster and that’s likely where I’d start. What would the recommended approach be here? I could copypasta that into prod.example.com/rds-cluster and then have the Dockerfile pull that in, but that’s kinda no bueno — if my stage and prod envs both end up with RDS clusters, I should be sharing the same code for both.
ok, good question
so there are a few concepts here. let me try to explain.
- geodesic is our base image. that distributes the tools. so that’s our “opinionated” toolchain.
- then there are “geodesic modules” which are basically those reference architectures. those implement some architecture using the tool chain.
- then there are terraform-root-modules. those are basically a collection of patterns. usually those patterns are highly specific to your organization. for example, you would have a way of defining the infrastructure for your “API service”
this is basically our “MVC” of infrastructure.
let’s take your example.
You want to add an RDS cluster. How you do this is specific to your organization. You may choose postgres or mysql. You have some opinions on the parameter groups. You have some requirements for security groups, etc.
You add that to terraform-root-modules. The root modules have no “identity”.
root modules are versioned. also, we like to build a container for the root modules so we can easily copy that stuff around between images.
Now, to invoke that RDS database, you pull that into the prod.example.com image. this achieves many things:
- it’s super DRY
- it’s versioned infrastructure
- separation of concerns. the prod.example.com repo defines all parameters to run in that environment. this is basically the “identity” layer.
yup I like the versioning approach there — saves drama around your TF module’s interface changing
then there’s the question: how do we develop?
i mean, if you have to push changes to terraform-root-modules, rebuild the image, then rebuild your current account repo (staging.example.com) every time you make a change, you’ll NEVER get done.
for this reason, when we develop, we cd /localhost/path/to/my/root-modules/ and do all iteration there until we achieve the desired outcome
then commit/push that, open a PR against master in terraform-root-modules, merge that after approval, tag a release, and subsequently distribute that release across the various stages as needed.
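A sketch of that iteration loop as it looks from inside a geodesic shell (paths and module names here are examples, not a prescribed layout):
```
# Example paths/modules only.
cd /localhost/Dev/acme/terraform-root-modules/aws/backing-services

init-terraform     # geodesic helper that wires up the remote state backend
terraform plan
terraform apply

# once it works: commit, push, PR against terraform-root-modules, tag a
# release, then bump that tag in staging.example.com / prod.example.com.
```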
also, if you use dependabot.com it’s pretty cool - you can get these updates as PRs automatically
@tarrall we already have your example as an example https://github.com/cloudposse/terraform-root-modules/blob/master/aws/backing-services/aurora-postgres.tf
terraform-root-modules - Collection of Terraform root module invocations for provisioning reference architectures
(we’re also working on CI/CD of everything - but it will be a bit before that’s fully baked)
and here’s how we pull it:
prod.cloudposse.co - Example Terraform/Kubernetes Reference Infrastructure for Cloud Posse Production Organization in AWS
staging.cloudposse.co - Example Terraform Reference Architecture for Geodesic Module Staging Organization in AWS.
OK makes sense. I’m inclined here, I think, to have our own root modules container which is separate from yours, and develop in there…
exactly. that’s what I would recommend.
also, we have our https://github.com/cloudposse/helmfiles
helmfiles - Comprehensive Distribution of Helmfiles. Works with helmfile.d
@Andriy Knysh (Cloud Posse) thanks! Somehow all my google-fu was able to turn up was your TF module, not the one that was in root-modules
You might like to fork these too
(also, we welcome all PRs - so if you find/fix bugs, develop cool new things, would love to see it)
I’m waaaaaaay too much of a k8s n00b to want to fork someone’s “young” repo and develop on that. Good odds y’all will find and fix bugs & improve workflow faster than I can, which means I’m better off being able to follow your repo rather than blazing my own trail
so yeah PRs fersure, once I’ve got a vague clue of what I’m doing
haha, sounds good! you’re on a fast track.
BTW one other minor thing — I think AWS well-architected (or whatever they’re calling it these days) normally recommends a separate “identities” account where the humans are managed, rather than managing those out of the master account. At least, that’s what I did at my last place, and I kinda liked that because IMO the master account should be locked down hard. Might be something to consider adding to the reference architecture, though I realize it may be overkill for many places.
I think what we call root.example.com is that
(though I think we should rename ours to identity.example.com)
Aaaaah. Yeah to renaming, because I think we ended up with root.example.com == master. Not positive.
hrmmmm ok, so I need to re-review the well-architected doc
Heh I was at that re:Invent talk…
one of only like 3-4 I managed to make last year
Perhaps we have something we should rethink there. I’ve always treated root = identity = master
but deploy nothing other than identity in it
then prod, staging, audit (~security), dev, testing accounts
which share nothing
and identity delegates to those accounts
i need to adjust my mental model for how it would look if master != identity
so is master just billing?
yeah on identity delegating, fersure. I like having the “master account” (payer account, and where the service control policies are defined) separate from the “identities account” (where humans are defined)
master is root/Org
identity is our current root.cloudposse.co
ok cool so mostly just I got confused by the naming convention
something like that. yea, naming is hard.
2 hard things in CS, right? Naming things, cache invalidation, and counting
we’ve had a lot of discussion around this internally (fwiw) - we know we need to change root to something, or to rename terraform-root-modules (which has no relationship to it)
i’m inclined to rename root.cloudposse.co to identity.cloudposse.co and introduce a new master.cloudposse.co or billing.cloudposse.co
so we do DNS zone delegation from identity as well
where would that belong?
yea, there we have two diff hierarchies: AWS and DNS
Re DNS, oh man, that’s one of those that’s gonna be super company dependent right? I mean, for some places using Route53 maybe they do a zone per account, other places maybe have a single shared zone with cross-account access…
root of DNS is cloudposse.co
> I mean, for some places using Route53 maybe they do a zone per account, other places maybe have a single shared zone with cross-account access…
=(
LOLOLOL but when you migrated to the cloud LOOONG before AWS had orgs…
haha - yea, we’re probably going remain very strict about “share nothing”
you might happen to be proud of having almost everything migrated out of Classic
(we even recommend using different TLDs per account in some situations)
Yup, reasonable fersure. I’ve always liked at least separating “customer-facing” TLD from “internal ops” TLD
example.com / example.net kinda thing
yes - that’s a must
we call the customer facing one the “vanity domain”
so we provision the root DNS zone (e.g. cloudposse.co) in master.cloudposse.co?
An alternative there might be to have an account dedicated to “shared infrastructure.” I can’t decide if it is just massive overkill to have that as a separate account from the “identities” account or not… identities is certainly an instance of “shared infra”.
yes that’s possible
what we’ve found out after many iterations is that there is no perfect solution for this
Yup
you touch/change something in one place, you get a lot of issues in other places
And k8s is new enough that I’m confident that in a year or two, the “best practices” there today will be a laughingstock. This just based on my past experience with seeing workflows mature on Chef and Terraform…
that’s actually one of the main reasons we created https://github.com/cloudposse/terraform-root-modules and https://github.com/cloudposse/helmfiles - to introduce some patterns for TF and k8s
terraform-root-modules - Collection of Terraform root module invocations for provisioning reference architectures
they are not perfect, but at least we have the same structure between projects and consistent naming (which is hard)
2018-08-23
Latest “probably a n00b mistake I’m making” issue — init-terraform in terraform-root-modules/aws/ecr is erroring…
will cut/paste the error here in a sec
✓ (flowtune-prod-admin) ecr ⨠ init-terraform
Mounted buckets
Filesystem Mounted on
flowtune-prod-terraform-state /secrets/tf
Initializing modules...
- module.kops_ecr_app
  Getting source "git::https://github.com/cloudposse/terraform-aws-kops-ecr.git?ref=tags/0.1.0"
- module.kops_ecr_user
  Getting source "git::https://github.com/cloudposse/terraform-aws-iam-system-user.git?ref=tags/0.3.0"
- module.kops_ecr_app.label
  Getting source "git::https://github.com/cloudposse/terraform-null-label.git?ref=tags/0.3.3"
- module.kops_ecr_app.kops_metadata
  Getting source "git::https://github.com/cloudposse/terraform-aws-kops-metadata.git?ref=tags/0.1.1"
- module.kops_ecr_app.kops_ecr
  Getting source "git::https://github.com/cloudposse/terraform-aws-ecr.git?ref=tags/0.2.6"
- module.kops_ecr_app.kops_ecr.label
  Getting source "git::https://github.com/cloudposse/terraform-null-label.git?ref=tags/0.3.1"
- module.kops_ecr_user.label
  Getting source "git::https://github.com/cloudposse/terraform-null-label.git?ref=tags/0.3.1"
Initializing the backend...
Successfully configured the backend "s3"! Terraform will automatically
use this backend unless the backend configuration changes.
Error: output 'registry_url': "repository_url" is not a valid output for module "kops_ecr"
Error: output 'repository_name': "name" is not a valid output for module "kops_ecr"
Error: output 'kops_ecr_app_registry_url': "repository_url" is not a valid output for module "kops_ecr_app"
Error: output 'kops_ecr_app_repository_name': "name" is not a valid output for module "kops_ecr_app"
this is with terraform-root-modules:0.5.3 and cloudposse/geodesic:0.16.0
@tarrall I think this is fixed in an upcoming PR
What: Disabled default ecr. That can be a BREAKING CHANGE for some projects that use the default ecr. Why: The default ecr does not make sense for custom projects that need names for ecr.
yeah I was thinking that was likely, despite the misleading title
yea, the PR should be updated
code review changed the nature of the PR
btw, definitely recommend forking or creating your own root modules sooner rather than later
are you guys on codefresh?
yeah
not on codefresh no
I updated the PR description
repository_name was renamed (easy fix on your side)
and yeah time to fork. Kinda in a “chicken and egg” situation where I’m just starting to set up services — no build server yet, we have bitbucket for code (shockingly bad but this should surprise no one, it’s atlassian after all), etc. All of this “build and publish a Dockerfile” workflow would be easier if I wasn’t in the middle of trying to set up ECR to … publish Dockerfiles
yea, coldstart problems..
I’m just glad I already have a few years of experience in arguing with Terraform, or I’d probably be outside yelling at the cloud
(you can possibly use dockerhub automated builds)
since I’m comfortable with vanilla terraform, including magic tricks like data sources / finding stuff by tags, I’m gonna just do a combination of “copy and modify” on your code and just rolling my own from scratch to get this going. Some experience writing Dockerfiles but less with the day-to-day workflow stuff like dockerhub, compose etc
2018-08-24
@Erik Osterman (Cloud Posse) Do you know if there are any known issues with upgrading the nginx ingress image using the cloudposse/nginx-ingress chart? Just curious, I have an edge case scenario that results in a (known) race condition issue in 0.11
So it should be relatively straight forward
I think we recently added a helmfile for the official ingress
basically, our ingress gives you the fancy 404/500 pages
but I can see why you’d want to move to the official one
(we started ours before there was an official one)
right right
helmfiles - Comprehensive Distribution of Helmfiles. Works with helmfile.d
so long as the ingress class is the same, it should be a drop-in replacement
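A quick, rough way to double-check that before switching charts (annotation names can vary by controller version, so this is just a grep, not a definitive test):
```
# Rough check: list which ingress class existing Ingress resources declare.
kubectl get ingress --all-namespaces -o yaml | grep -i 'ingress.class'
```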
but definitely test in staging first!
HAH
who has time for that!
the stable ingress comes with prometheus exporters for monitoring
so it integrates nicely with grafana
i don’t mind being on the CP chart (we use the error pages), was moreso curious if you knew anyone using say… 0.13 nginx ingress
not yet =/ - as in i don’t know if anyone has tried upgrading
okay cool, no worries!
btw, what did you guys decide to do for monitoring?
still auditioning companies, i was in Ireland for two weeks in July so the monitoring got re-prioritized for.. next week
ok
we’ve gotten grafana working with the autodiscovery of dashboards in configmaps
it’s so freggin sweet
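For anyone curious what that looks like in practice, here is a sketch assuming the sidecar-style autodiscovery pattern, where Grafana picks up ConfigMaps carrying a well-known label (the exact label name is chart configuration; the name, namespace, and grafana_dashboard=1 below are just examples):
```
# Example only: dashboard name, namespace, and label are assumptions.
kubectl -n monitoring create configmap my-service-dashboard \
  --from-file=my-service.json=./dashboards/my-service.json
kubectl -n monitoring label configmap my-service-dashboard grafana_dashboard=1
```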
we’ve got the portal up and rocking in both stage and prod, it’s soooo nice
i think that alone is a BIG motivator to use it
maybe not for everything, but definitely as a first line of monitoring
right right, i dig it!
have you tried using the portal with argo yet?
should be really easy
not yet, we actually found our first use case for argo last week
so that should be coming along… swiftly
what: Add cloudflare ingress controller (aka argo / access). why: Expose services inside kubernetes securely and speedily using Argo tunnels
as per usual, saving me time!
it’s slightly out of date
also, re: race condition, will put some deets in #kubernetes for other folks?
yea, that would be cool
or even open up an issue in our helmfiles or charts repos
sounds good
will do that in a bit
2018-08-27
Here’s the slide I’ve been looking for that came from 2017 re:Invent talk on architecting security and governance across a multi-account strategy (SID331)
It’s very close to our reference architectures.
The “Enterprise Accounts” are more decomposed