#geodesic (2021-04)
Discussions related to https://github.com/cloudposse/geodesic
Archive: https://archive.sweetops.com/geodesic/
2021-04-08
I am new to SweetOps and trying to get my way around the reference-architecture repo. I can see mentions of the accounts, repos, and assets subfolders and couldn’t understand how they are supposed to be used. Any help will be appreciated.
please don’t start there - that repo is in limbo
https://docs.cloudposse.com is the best place to start for now
multi-account documentation is not available
figured out, how to use reference architecture with atmos, and components. And looks good so far
@Erik Osterman (Cloud Posse) is multi-account support/documentation available in the paid offering?
yes
in fact, one of our most recent additions to the team has run through it in the last week and brought up a new customer environment despite never having touched it before.
Very cool thanks
2021-04-10
2021-04-23
Hi all. I am running through the tutorial for atmos to understand how to port our old multi-account structure to the new workflow. Am I correct in understanding that all of the state files will now go into the root account?
yes, that is how we use it
one bucket with subfolders, basically
Thanks @jose.amengual. Right, that’s what I inferred from running through the tutorial. Just wanted to make sure that is the intended setup.
Does this path look correct to you, for a component state file in an account / env? `s3://acme-ue2-tfstate-key-hookworm/env:/ue2-root/`
Seems `env:` is incorrect?
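For reference, Terraform’s S3 backend composes the state object key as `<workspace_key_prefix>/<workspace>/<key>`, and `env:` is the default `workspace_key_prefix`, which is where the literal `env:` in the path comes from. A tiny sketch of that key construction:

```python
# Sketch of how Terraform's S3 backend composes the state object key.
# "env:" is the backend's default workspace_key_prefix.
def s3_state_key(key: str, workspace: str, workspace_key_prefix: str = "env:") -> str:
    if workspace == "default":
        return key  # the default workspace stores state at the bare key
    return f"{workspace_key_prefix}/{workspace}/{key}"

print(s3_state_key("terraform.tfstate", "ue2-root"))
# env:/ue2-root/terraform.tfstate
```

Setting `workspace_key_prefix` in the backend config replaces `env:` with your own prefix.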
mmmmm
I’m actually on my way out, so I will look into it later. But it seemed like a missing interpolation for some var.
yes I think you are missing something
we use `workspace_key_prefix`
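For context, a typical S3 backend config using `workspace_key_prefix` might look like the sketch below; the bucket and lock-table names are hypothetical, not taken from this thread:

```hcl
# Illustrative S3 backend config; bucket/table names are hypothetical.
terraform {
  backend "s3" {
    bucket = "acme-ue2-root-tfstate"
    key    = "terraform.tfstate"
    region = "us-east-2"

    # Workspaces are stored under <bucket>/<workspace_key_prefix>/<workspace>/<key>
    # instead of under the default "env:" prefix.
    workspace_key_prefix = "tfstate"

    dynamodb_table = "acme-ue2-root-tfstate-lock"
    encrypt        = true
  }
}
```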
OK, I ran the stock tutorial. I found one issue, so perhaps this is another minor one. I will put a PR up once I try that change.
@Igor Rodionov has written a script to help migrate it for one of our customers
Is that something you could share @Erik Osterman (Cloud Posse) ?
@Joe Hosteny Hello. I can share the script on monday - I have to remove project specific settings and edge cases
Thanks @Igor Rodionov. That would be great. I am working on trying to port one of my orgs (built with the reference arch) to the new structure, starting with the `account` module. I am taking notes, and am happy to contribute those back as well if it is helpful to others.
I ran into the above issue as well (with `env:` leaking into the S3 bucket path). Defining `workspace_key_prefix`, and making sure there was an `environment` and `stage` set in the top-level YAML files, seemed to do the trick.
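As a sketch, a top-level stack YAML with `environment` and `stage` set might look like this; all names and values are illustrative, not from the thread:

```yaml
# Hypothetical top-level stack config; values are illustrative.
vars:
  namespace: acme
  environment: ue2   # short region code, interpolated into the state bucket path
  stage: root

terraform:
  vars: {}
  backend_type: s3
  backend:
    s3:
      bucket: acme-ue2-root-tfstate
      workspace_key_prefix: tfstate
      region: us-east-2
      encrypt: true
```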
fwiw, I also have a bunch of tutorial notes which may be helpful to some – not sure where I should upload them, however, so they do not get in the way.
@marc slayton thanks for your help yesterday. I have some notes too - happy to set up a shared repo if you would like to record those somewhere in the interim, and I can add mine as well
I have an example set of configs that got me to a successful multi-account spin up the first time (beginner’s luck, I expect). I am hoping to finish a procedure which works reproducibly, even if the initial tfstate-backend gets overwritten. This is a common gotcha, I think – an easy mistake to make in the initial config.
On my second multi-account spin up, I made the mistake of using the same bucket and workspace definitions for the tfstate-backend initialization and for the root account spin up. This had the effect of overwriting my initial tfstate and left me needing to reimport the root account.
I think the problem can be corrected by importing the root account definition – but as you know, Atmos has a bug in the terraform-core.variant file, which is blocking me from testing the workaround.
It’s a little embarrassing, but learning how to recompile atmos has taken longer than I thought. Not much of a roadmap there just yet, and there are lots of unfamiliar tools (for me anyway), and fiddly details. As soon as I have a fresh compile of Atmos, I will forward you the procedure that worked, and also the workaround for the gotcha. If I can’t figure that out today, then I will forward what I have and cross my fingers that it works for you.
Hi Marc, I was able to do this with the Dockerfile in the `examples` directory in the `atmos` repo. I then just forked the repo for the module changes and pointed it to my branch with the fix for the region issue. Let me see if there are any other changes from that.
```diff
diff --git a/atmos/modules/terraform/terraform-core.variant b/atmos/modules/terraform/terraform-core.variant
index e0f0556..fdb9321 100644
--- a/atmos/modules/terraform/terraform-core.variant
+++ b/atmos/modules/terraform/terraform-core.variant
@@ -220,8 +220,7 @@ job "terraform provider override" {
     <<-EOS
     provider:
       aws:
-      - region: "${opt.region}"
-        assume_role:
+      - assume_role:
           role_arn: "${opt.role}"
     EOS
     ,
```
The first patch is (obviously) my copy of the Dockerfile. The second is `main.variant` in the `cli` subdir. You can import with these changes.
The `make docker/build` will export a binary with the new options, so `atmos` will have the fix when run in the built container.
The `tfstate-backend` component also has bucket versioning enabled. It saved me once when I manually deleted the wrong workspace’s statefile going through this.
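That recovery depends on the state bucket having versioning turned on; in Terraform it is just a versioning block on the bucket resource. A minimal sketch, using the pre-4.0 AWS provider syntax that was current at the time and a hypothetical bucket name:

```hcl
# Sketch: versioned state bucket, AWS provider v3-style syntax.
resource "aws_s3_bucket" "tfstate" {
  bucket = "acme-ue2-root-tfstate" # hypothetical name

  versioning {
    # Lets you restore a statefile that was deleted or overwritten by mistake.
    enabled = true
  }
}
```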
Hey Joe – all the above makes sense to me, but there is still something obvious I seem to be missing. I have a similar repository to the one you mention, only mine is called ‘gangofnuns’. I use the same name for my forked repository on github:
https://github.com/gangofnuns/atmos/releases/tag/v0.17.2-beta.1
I can build a binary and upload it, given a few extra variables on the command line, however, when I use Geodesic to pull down my version of the docker image, it contains no atmos at all. So clearly, I have the flow incorrect in some way. This is a good exercise though, as I’m sure this is something other people are wondering about. It’s complicated, and there’s no roadmap – I’ll keep digging this afternoon/eve.
Problem appears to be here, when I run the build (NOTE: had to add a variable to get it working with the docker registry.)
```
DOCKER_IMAGE_NAME=gangofnuns/atmos make docker/build
 => ERROR [5/8] RUN apk add atmos@gangofnuns        1.1s
------
 > [5/8] RUN apk add atmos@gangofnuns:
#9 0.983 WARNING: The repository tag for world dependency 'atmos@gangofnuns' does not exist
#9 0.983 ERROR: Not committing changes due to missing repository tags. Use --force-broken-world to override.
------
executor failed running [/bin/sh -c apk add atmos@gangofnuns]: exit code: 99
make: *** [docker/build] Error 1
```
What do you mean by “use Geodesic to pull down my version of the docker image”?
The flow is a little complicated, but here’s how I understand it right now:
1.) pull down the github repository for cloudposse/atmos.
2.) make the code changes we talked about.
3.) Create a new docker container with the code changes. Upload to a different docker registry.
4.) Change the reference in the Geodesic container build to pull in the new container you just uploaded to the registry.
5.) Run ‘make all’ from within a repo such as the ‘multi account reference’ repo.
With an edit to the Dockerfile, this should pull down the atmos container I built above, and use it to create the Geodesic container.
Even as I write this, it seems way too complicated. I must be missing something obvious about the intended flow.
I’m a bit confused by what your new docker container is, vs. the reference in the “Geodesic container build”. I am only using one container, the one in the example repo. The line `RUN variant2 export binary $PWD $CLI_NAME` creates `atmos`, or whatever you call it if you change that name. So I do a `make docker/build`, `sudo make install`, and then run `atmos.mydomain.com`, and if you do `which atmos` when that runs, you will see one in /usr/local/bin/. `CLI_NAME` in the Dockerfile controls the exe name, whereas the installable container name is set by `APP_NAME` in the Makefile.
That is, I set `APP_NAME` in the Makefile to `atmos.<somedomain>.com`, since I will have multiple versions of the geodesic container, one for each set of accounts managed (since we have multiple segregated projects).
So, from my bash command line, I run `atmos.crl.com`, which starts the container built from that Dockerfile, and then `atmos` from the command prompt inside that running container.
It sounds like you might be using two dockerfiles, and having one based on another? If so, is it possible you didn’t copy the binary to the latter stage?
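If two Dockerfiles (or two stages) are involved, the exported binary has to be copied into the final image explicitly or it simply won’t be there. A hedged sketch of what that copy could look like; the base image tag and paths are assumptions, not taken from the thread:

```dockerfile
# Hypothetical two-stage build: export the atmos binary in a builder stage,
# then copy it into the final Geodesic-based image.
FROM cloudposse/geodesic:latest AS builder
COPY . /src
WORKDIR /src
RUN variant2 export binary $PWD atmos

FROM cloudposse/geodesic:latest
# Without this COPY, the final image contains no atmos binary at all.
COPY --from=builder /src/atmos /usr/local/bin/atmos
```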
I’ve got to run for the evening, but if you are around tomorrow, send me a PM and perhaps we can do a quick zoom?
RE: Geodesic container build – I’ve been following the instructions for building a multi-account stack. The instructions say: “Go get this repository, with these specific components, and stack definitions”. That repository comes with a Makefile and a Dockerfile for building Geodesic. That’s the container I’m using, and where I’ve been assuming I’d need to install the newly recompiled atmos.
Yes, I will have worked through my confusion by then. (So embarrassed.) And in any case, I will send you what I have in terms of procedures for multi-account spin up. RE: Zoom - have you tried Gather Town instead? If not, no problem – I can do Zoom too. The password for the Gather space in the link is “so-foo-bar”. Works in any Chrome browser. Cheers –
Thanks, that call cleared up what you are doing
Appreciate the extra time, Joe – many thanks.
Sorry I didn’t get to show you Gather Town, but maybe some other day. It is a fun step beyond Zoom, imho. Will send some preliminary spin up notes this eve. I am in California, btw, so it might be late. Cheers –
@Joe Hosteny - here’s my original template for a multi-account run. Notice that initially, I set the tfstate-backend to local, not S3. I later copied it to S3 in the master account, after accounts were spun up. I have more procedural notes, but it’s a bit late. I’ll have to finish organizing those for tomorrow. Enjoy!
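The local-first bootstrap amounts to starting with no `backend` block (Terraform defaults to local state), applying to create the state bucket, then adding the S3 backend and running `terraform init -migrate-state` to copy the local state up. The block added in that second step is just the standard one; names here are hypothetical:

```hcl
# Added only after the bucket and lock table exist; hypothetical names.
terraform {
  backend "s3" {
    bucket         = "acme-root-tfstate"
    key            = "tfstate-backend/terraform.tfstate"
    region         = "us-east-2"
    dynamodb_table = "acme-root-tfstate-lock"
    encrypt        = true
  }
}
```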
Hey Joe – here’s what I have thus far from a notes perspective. Hope to add more this evening after some other weekly chores. Still very much a work in progress. Cheers –
^^^ quick caveat – these are embarrassingly incomplete. Slowly trying to improve. Very sorry for delays.
Quick update: With a change to terraform-core.variant and a rebuild of atmos, I was able to import the master account, and thus get my second multi-account reference architecture up and running, despite having botched the tfstate-backend setup. So now I have two procedures that seem to work. This is using terraform 0.14.10.
Important note: I ran into this bug after importing my master account. The workaround was to add:

```hcl
# There is no AWS Organizations API for reading role_name
lifecycle {
  ignore_changes = [iam_user_access_to_billing]
}
```

to `components/terraform/accounts/main.tf`, which seems to have worked without issue. I’ve voted up the bug fix. Hopefully that one gets looked at soon.
Hi @marc slayton - made a lot of progress over the weekend, including finishing deploying core iam / dns / cloudtrail across all accounts, and importing a number of pre-existing resources created via the old ref arch. Happy to sync at some point if it is helpful.
I would most definitely be interested. I’m looking for sort of a ‘hit list’ of things I would need to address while backporting existing modules.
With such a list, or a condensed series of liner notes, I could probably make decent progress on backporting further modules.
2021-04-28
2021-04-29
I am working through some of the `terraform-aws-components` currently. I understand there are changes coming, both from looking at the branch activity and from comments made here. Which branch would currently be best to work off of when copying down a single component: `master`, `all-new-components`, or the per-module `upstream-<name>` branches? I’m not too concerned with things not being 100% stabilized, just looking to minimize disruptions later as things are rolled out to master.
@Joe Hosteny, `all-new-components` I believe would be the spot. That is where all the various upstream branches have been merged in the short term.
Yup, thanks @Matt Gowie. I’ve found a number of minor issues that I’ve been able to work around. I haven’t filed tickets, since I am not sure if I am doing something wrong, or where to best put some of them (e.g., one could be a variant2 issue, or the way it is being used by atmos). Additionally, some look to be due to changes between that branch and master. If you would like, I can send you a list when I am done (or perhaps do a quick zoom) to see where to best route some of those issues.
I’ve been able to import my old ref arch org and sub accounts so far, create dns and identity accounts via atmos, create the account and account-map state, and am working through iam-primary-roles.
Good stuff @Joe Hosteny! That’s awesome. Yeah I’d definitely love to chat about what you’ve found and how you’re liking it so far. As I’m sure @Erik Osterman (Cloud Posse) would as well.
As to where to file issues… it does depend. Collecting them and then chatting through it makes sense — Let’s do that as I’m sure I can direct you where to go
I’ll send you a link to grab some time on my calendar whenever is convenient for you.
I can already tell this will make my life about 100x easier…once the migration is done
Yeah, I think this will be a very impressive and sustainable way to build terraform projects once it gains a bit more community steam. I’ve definitely benefited from it a ton already.