SweetOps #release-engineering for July, 2020

CI/CD Discussions

Archive: https://archive.sweetops.com/release-engineering/

2020-07-01

sweetops

07:35:24 PM

Anyone know if it’s possible to use github branch protections to ensure that a PR to branch-a must come from branch-b?

sweetops

07:35:41 PM

i’m thinking not

Erik Osterman (Cloud Posse)

07:39:36 PM

yes & no. You can trivially create a github action that checks the source branch and target branches.

Erik Osterman (Cloud Posse)

07:39:47 PM

You can fail the build if they aren’t what you want them to be.

Erik Osterman (Cloud Posse)

07:39:56 PM

then use github branch protections to require this github action build to pass before merging..

sweetops

07:41:53 PM

okay, i’m with you.

sweetops

07:42:07 PM

I guess it’s finally time for me to explore github actions

Erik Osterman (Cloud Posse)

09:57:20 PM

it’ll be well worth it for these simple things.

2020-07-02

2020-07-03

Karoline Pauls

10:35:07 AM

If anyone is interested in the Jenkins follow-up.

Karoline Pauls

10:36:59 AM

in general case - beware of solutions that look BEST on a comparison chart - they may tick every single feature but the chart won’t tell you how well those features are integrated

Karoline Pauls

10:38:22 AM

https://issues.jenkins-ci.org/browse/JENKINS-62250?focusedCommentId=390581#comment-390581

this for example - the plugin-based architecture means there’s no strong unifying design

Karoline Pauls

10:38:45 AM

if you push too hard, things show their limits, mostly manifesting as stack traces

Karoline Pauls

10:51:29 AM

too many builds in a matrix - Groovy codegen failure

need a button to deploy any branch at any time (worked fine on GitLab) - if you use a global agent, it will remain in use… because the job is technically paused, you have to use agent none

testing a docker container built in a previous step? - great, how do you tag it? BRANCH-HASH-BUILD_ID - great, BUILD_ID will change if you restart a stage. BRANCH-HASH? - great, but you still need a global agent, otherwise you won’t be able to use the HASH part in a k8s agent pod template…

Oh, you’re a smart one, like me - you’ll do agent any to fetch the repo and set GIT variables and nest agent kubernetes inside, which can use git variables in its template! Congrats, now put it in a build matrix, so you can more reliably trigger the deadlock you’ve just constructed!

I’d really like to see https://github.com/laszlocph/woodpecker to take off. I don’t have time to write my own CI, although I’ve written a scheduler and dashboard once before.

laszlocph/woodpecker

An opinionated fork of the Drone CI system. Contribute to laszlocph/woodpecker development by creating an account on GitHub.

Karoline Pauls

06:33:10 PM

also how does it manage to take several minutes to start is beyond me and i’m scared to strace it

2020-07-06

joshmyers

12:52:53 PM

joshmyers

12:53:01 PM

Small one for ya - https://github.com/cloudposse/github-commenter/pull/19

Add GHE functionality by joshmyers · Pull Request #19 · cloudposse/github-commenter

what Adds Github Enterprise functionality why So we can use the awesome github-commenter with GHE. testing Running the thing now returns the comment ID rather than memory address: 2020/07/06 13…

Erik Osterman (Cloud Posse)

08:39:55 PM

Thanks! LGTM - gonna have @Andriy Knysh (Cloud Posse) review

Add GHE functionality by joshmyers · Pull Request #19 · cloudposse/github-commenter

what Adds Github Enterprise functionality why So we can use the awesome github-commenter with GHE. testing Running the thing now returns the comment ID rather than memory address: 2020/07/06 13…

joshmyers

08:23:03 AM

Cheers

joshmyers

02:57:03 PM

@Andriy Knysh (Cloud Posse) Let me know if there is anything I can do to help get that released

Andriy Knysh (Cloud Posse)

03:03:46 PM

looking now

joshmyers

03:04:12 PM

Thank you so much!

Erik Osterman (Cloud Posse)

03:07:14 PM

Merged - release 0.7.0

Erik Osterman (Cloud Posse)

03:07:17 PM

images are building

Erik Osterman (Cloud Posse)

03:07:23 PM

https://github.com/cloudposse/github-commenter/runs/846123382

cloudposse/github-commenter

Command line utility for creating GitHub comments on Commits, Pull Request Reviews or Issues - cloudposse/github-commenter

joshmyers

03:11:41 PM

Erik Osterman (Cloud Posse)

05:24:18 PM

thanks josh for the contribution! glad you were able to use our github-commenter ! =)

joshmyers

08:11:12 PM

Using it for a conftest Atlantis integration along with github-status-updater

joshmyers

08:11:19 PM

Thanks folks

Erik Osterman (Cloud Posse)

09:20:15 PM

That’s rad!

joshmyers

07:44:37 AM

joshmyers

08:26:14 AM

github-status-updater updates the status to failed so is no longer mergeable, Atlantis won’t apply.

2020-07-07

Andrew Nazarov

07:29:27 AM

Any tips how you effectively switch off some functionality on dev/staging? Like you don’t want to send emails to real users from dev and staging envs or something similar (btw there is a nice service called Mailtrap to achieve this, but sending emails is not the only case). Recently we’ve got a requirement that a certain file with a cryptic name should be created by hand on every production instance (a set of services) to be sure that production functionality can’t be accidentally activated on dev or staging and env vars are not even an option.

Tim Birkett

08:40:13 AM

Why are env vars not an option? How are your applications / services configured? Typically, you’d have feature flags in your applications config to turn things on and off. You should just be able to configure services differently throughout environments.

Andrew Nazarov

10:37:21 AM

Almost everything is configured via env vars, sure. But some of the functionality is disabled/locked down if a certain file doesn’t exist (there is a method in the code verifying this), and there are people who don’t want to consider anything else, because they think env vars can be easily messed up.

The obvious idea is to have dev functionality by default and only if some certain flags are set explicitly the production features are activated. But we failed to agree upon the env vars approach.

I was wondering if somebody has such strict requirements somehow and possible solutions like double verifications or something. This manual file creation is unsuitable for dynamic infrastructures and seems a bit odd.

roth.andy

01:24:36 PM

feature flags

Lee Skillen

10:33:07 AM

Be careful with dev-by-default. If you release to production but have failed to override the defaults, you might end up with something like (extreme example), your production instances running in debug mode. We take the opposite approach, and everything is (endpoints aside) configured for prod-by-default. Basically the default configuration is a reasonable best practice, then we have a separate dev and staging configuration baseline. The way that developers run the service makes it load the dev baseline, and they can customise it based on environment, set by the individual developer. In terms of turning on/off functionality; a lot of prod-like things just won’t work until configured because we’ve separated the app from the services it requires. Local devs run those services alongside it. Production is already running them as separate layers, and is configured as part of IaC.

Andrew Nazarov

12:02:15 PM

Thanks for the clarification! Yes, I was wrong in my definition. It was not literally “dev-by-default”, but let’s call it “reasonable defaults” when all stuff like debug modes are deactivated and so on.

2020-07-08

2020-07-14

Zach

04:32:23 PM

Anybody using SemaphoreCI? Or have opinions on it?

2020-07-15

roth.andy

01:04:49 AM

I am the Jenkins Master!!!!

Joe Niland

01:08:54 AM

Nice!

bradym

01:13:38 AM

very cool

wannafly37

12:30:56 PM

Is this JenkinsX?

roth.andy

12:34:33 PM

No. Regular Jenkins OSS with the blue ocean plugin

Jonathan Marcus

12:45:07 PM

What are those zPlaceholder nodes?

roth.andy

12:48:26 PM

There’s a bug in the blue ocean UI where if some stages run in parallel and some don’t all of the stages will appear sequentially rather than being tiered.

roth.andy

12:49:17 PM

My fix was to just have all stages set to parallel and add a 2and placeholder stage to each one that only has 1 stage in it

Zach

02:17:56 PM

“Working as intended, workaround is available” Jenkins Team — probably

Erik Osterman (Cloud Posse)

05:35:42 PM

@johncblandii

johncblandii

12:01:06 AM

I haven’t seen that bug. I’ve used s - s - p - s - p before, but maybe that was because it was CloudBees which was a lot more solid than OSS

2020-07-16

Zach

10:21:43 PM

what exactly are the limits of running self managed gitlab CE in our cloud environment? I can’t seem to find a good document on CE vs EE

Joe Niland

10:39:18 PM

I have a client using it. Are there any specific limitations you’re concerned about?

Zach

10:40:48 PM

Ah well I finally found the page, just wasn’t googling well enough https://about.gitlab.com/pricing/self-managed/feature-comparison/

Self-Managed Feature Comparison attachment image

Code, test & deploy with GitLab. Everyone can contribute!

Zach

10:41:24 PM

Mostly just looking for whether there was some critical “oh this won’t work for you” feature as a CI/CD tool

Zach

10:42:00 PM

And it looks like the CE version won’t connect with external repos (github or bitbucket)? So that’s a non starter

Joe Niland

10:42:22 PM

I see. I’ve built a few pipelines on CE and haven’t hit any limitations. But they are all in on GitLab CE.

Joe Niland

10:43:26 PM

Yeah looks like Premium+ only

Zach

11:03:00 PM

hmm we might only need the starter plan anyways if we just hosted with them. We’re deep into Jira already so I don’t need their project/issue management piece

Joe Niland

11:12:29 PM

I think they’re fine for git and ci/cd. The Kubernetes integration is pretty awesome.

Zach

11:13:53 PM

I’m trying to get us off of a (failing) jenkins setup but we have a few ‘special cases’ we’re trying to find compatibility for, and we’re all EC2 ASG deployments right now

Zach

11:14:08 PM

Kubernetes probably way more complicated than my org needs

Joe Niland

11:14:57 PM

What are the special cases and do you use awscli to deploy?

Zach

11:18:29 PM

we have a few ‘jobs that trigger other jobs’ pipelines for a couple of apps, and some large/long running tests. We’ve been looking at travis, circle and semaphore so far. They’re close but don’t get us 100% without a lot of pain. Our deploys are AMIs into ASGs and done via terraform, when there’s a new AMI we B/G the entire ASG

Zach

11:19:19 PM

there’s some aspects to it I’m working to change because there’s a lot of pain in a few places

Joe Niland

11:45:12 PM

What is the pain exactly? Lately worked on triggering other pipelines within Azure DevOps.

roth.andy

11:46:51 PM

There’s a bunch of good stuff in Starter that you don’t get in CE. One of the biggest ones for me is with CE there is really no way to make someone wait for a green pipeline run before merging an MR. There is also no way to require code review approvals. That alone is worth $4/dev/month to me.

Zach

11:47:27 PM

Yah I think the starter would work, I’m kicking off a trial now

Zach

11:47:35 PM

we’re just evaluating a lot of options right now

Zach

11:49:36 PM

@Joe Niland we B/G the entire system at once, with a pretty lengthy release cycle. There’s some historical reasons for that but I”m trying to change it. It requires a lot of manual effort (automated via terraform but its not true “CID”) due to some decisions on how that B/G would be accomplished, which are now difficult to engineer back out

Joe Niland

01:22:16 AM

@Zach makes sense.

Zach

09:04:03 PM

Can’t tell if I’m reading the gitlab docs correctly, they seem to be a bit of a messy overlap between hosted/self-hosted …. If using hosted gitlab, I can use a self-hosted runner in my own cloud? Are these able to be spun up on demand by integration with gitlab?

Zach

02:31:02 AM

Private go modules are the devil with hosted CI services

2020-07-17

2020-07-19

Zach

04:02:40 PM

Folks using azure devops - with the self hosted build agents, is there a way to do this (in AWS preferrably) so that the agent is created ‘on demand’ as part of an EC2 ASG or Fargate task? We have a few very long running, and expensive compute resource jobs currently in jenkins, i wouldn’t want to just leave those instances running 24/7 if were to migrate to Azure DevOps. Their docs seem more focused on just having some permanent pool of these agents up.

Mark

08:32:10 AM

So what I have done quite recently which might cater for you is add an agent to our bastion/control node to be the interface for azure pipelines to interact with.

I used Terraform and for the aws_instance resource there is a property called user_data which will execute a custom cloudconfiguration script which actually creates an agent for us with a name controlled by a variable in the pipeline. So in the next job of the pipeline, I was able to set the agent dynamically based on the pipeline variable.

Zach

01:36:30 PM

ah ha! yes thats exactly the sort of thing I was wanting to do

Zach

01:36:58 PM

So you have 1 “permanent self-hosted agent” always running on a cheap node, and you use that one to spin up the others

Zach

01:38:05 PM

And all the agents need a public IP correct?

Mark

02:02:24 PM

Yeah pretty much. When I need a new environment I first use the azure provided agent to create my new agent (terraform). From then, it’s over to my agent to do whatever work it needs to do.

pool:
    name: terraformed-envs
    demands: env -equals $(TF_VAR_ENV_NAME)

Actually a fair callout regards the public ip, the instance does have one but I never thought it required a public ip, it has one for a different reason. I would imagine it would require a public ip though given the networking across IAAS’s.

Zach

02:03:29 PM

I assume you have some logic in the ‘init agent’ job to wait for the ‘build agent’ to actually become available?

Zach

02:04:15 PM

re: public ip, I think the Azure docs said the self-hosted agents have to be reachable on 443 to communicate status back

Mark

02:04:59 PM

Yeah there is classic polling logic intermediary

Mark

02:05:06 PM

does this agent exist…………….

Zach

02:06:37 PM

Ah, so you poll the azure api

Zach

02:07:09 PM

Super helpful thanks - this proves out some issues we had with dumping jenkins in favor of other solutions

Mark

02:08:30 PM

Yeah given our whole ecosystem.. this is one of the smaller tradeoffs I’m dealing with I believe we did something similar dumping jenkins for this

2020-07-20

2020-07-21

2020-07-23

wannafly37

03:21:19 PM

Trying to decide where to store helm values and ci/cd information for charts; Am thinking deploy to staging on merge to master and deploy to prod on github release, but that feels cumbersome with a monorepo due to having to filter on paths and put names in releases, but it feels a little wrong to have an entire repo with just 2 files in it for helm deploys.

Erik Osterman (Cloud Posse)

02:20:58 AM

Have you considered #helmfile?

Erik Osterman (Cloud Posse)

02:21:29 AM

With helmfile you can easily pull values stored from tons of locations. Disk, buckets, terraform state, vault, SSM, etc.

wannafly37

11:43:59 AM

Thanks, I hadn’t - helmfile looks interesting but not sure how it’d help my scenario. Right now I’m looking at doing named github release tags [email protected] for example, and having the pipeline parse the tag to know what to run. This way all files could live in the same repo, the other “easier” option is a repo for each, but in the external-dns example the repo would literally have just a values file and the yaml for the pipeline

shamil.kashmeri

08:25:26 PM

helmfile can help with that, and, you can do that with helm-git

shamil.kashmeri

08:28:33 PM

helm-git if youre talking about deploying charts from a git repo with tags as chart versions

#release-engineering (2020-07)

All things CI/CD. Specific emphasis on Codefresh and CodeBuild with CodePipeline.

2020-07-01

2020-07-02

2020-07-03

2020-07-06

2020-07-07

2020-07-08

2020-07-14

2020-07-15

2020-07-16

2020-07-17

2020-07-19

2020-07-20

2020-07-21

2020-07-23

2020-07-24