#aws (2021-09)
Discussion related to Amazon Web Services (AWS)
Archive: https://archive.sweetops.com/aws/
2021-09-01
2021-09-02

Hi all, has anyone ever used the EC2 serial console connection? I am getting this message while trying to use it on all of our instances

What EC2 instance type?

Error message indicated you are not using a nitro based instance

EC2 instance types comprise varying combinations of CPU, memory, storage, and networking capacity. This gives you the flexibility to choose an instance that best meets your needs.

Thank you Conor. If I understand correctly, this Nitro system is for slightly more expensive EC2 instance types; as we use the smallest possible t2, I don't see it in the table

T3 instances are supported so you could possibly change the instance type. Also https://docs.aws.amazon.com/systems-manager/latest/userguide/session-manager.html is a great alternative option
Manage instances using an auditable and secure one-click browser-based interactive shell or the AWS CLI without having to open inbound ports.
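For reference, a minimal sketch of that flow once the SSM agent and an instance profile with SSM permissions are in place (the instance ID is a placeholder, and the CLI side needs the Session Manager plugin installed):
# Opens an interactive shell via Session Manager, no inbound ports required.
aws ssm start-session --target i-0123456789abcdef0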

Thank you Conor. The reason I am asking all these things is that every month, as we install some Python-related packages, our server crashes, and none of the four methods works: EC2 Instance Connect and SSH client should work out of the box, but they are unreachable once the server has crashed; Session Manager would have to be configured by me; and EC2 Serial Console says our instance is not compatible (it wants Nitro)

T3 should be the same price as T2. You can even use t3a which is even cheaper

If the server is crashing that easily it's likely undersized

the day of the crash it reached 34% CPU usage, but I don't see anything else weird about utilization

You can see your CPU credit balance is getting very low (at that point performance will be throttled). This is not the cause of your issue in this particular case, but it's worth watching

It's likely memory exhaustion

What instance type is it exactly?

t2.micro

Not really suitable for most production workloads unless they are incredibly small / bursty

This is an “insert credit card” fix

Try a t2.medium for a while and see how it goes

what if I change to t3a.micro like Alex suggested above? BTW, how is it possible to have double the CPU but be even cheaper? Is it because of the switch to AMD?

I doubt it will make any difference if memory consumption is the problem

Lists the on-premises and additional Amazon EC2 metrics made available by the CloudWatch agent.

Aha, now I noticed that RAM usage is not showing in any graphs, right?

by the way, the serial console didn't help either, only a black screen appears


there is a bar going up to the right of the screen

Hello everybody!
AWS DocumentDB related question - https://github.com/cloudposse/terraform-aws-documentdb-cluster
Can anyone please help me configure my terraform module to NOT create a new parameter group, but instead use the default one provided by AWS (or any previously created param group)? There is no mention in the docs on how to do this, only a way to pass parameters for the module to create a new one.

You might need to patch the module. Why is it a problem to use a custom parameter group?
If you use the default one, applying parameter changes in future will require you to first apply a custom parameter group, which will cause downtime

Hello Alex. These are clusters that will almost never need a parameter change. My boss is kinda OCD about having N amount of parameter groups (and any other unnecessary resource) laying around when all the clusters (more than 30) use the same params.

that’s too bad you have an irrational boss

It’s helpful to have a custom param group if you ever need a custom param in the future…
Did you want to use an existing param group instead? Or simply not use a param group at all? I think expanding the module to use an existing param group and existing subnet group would be a nice feature
If you want to put in a pr, you can start here
Terraform module to provision a DocumentDB cluster on AWS - terraform-aws-documentdb-cluster/main.tf at 5c900d9a2eaf89457ecf86a7b96960044c5856f4 · cloudposse/terraform-aws-documentdb-cluster
2021-09-03

Hi People, anyone ever had this issue with the AWS ALB Ingress controller:
failed to build LoadBalancer configuration due to failed to resolve 2 qualified subnet with at least 8 free IP Addresses for ALB. Subnets must contains these tags: 'kubernetes.io/cluster/my-cluster-name': ['shared' or 'owned'] and 'kubernetes.io/role/elb': ['' or '1']. See <https://kubernetes-sigs.github.io/aws-alb-ingress-controller/guide/controller/config/#subnet-auto-discovery> for more details.
So there are three subnets with the appropriate tagging and plenty of free IPs; I could not yet find the reason why it is complaining about the subnets

Other thread in #kubernetes
2021-09-07

hi guys, is there any possible way to automate enabling the EC2 serial console connection on every new EC2 instance I spin up? The commands I am executing for Ubuntu instances are the following:
sudo -i
vi /etc/ssh/sshd_config   # go down and edit the line: PasswordAuthentication yes
# save with :wq!
systemctl restart sshd
passwd   # input the password 2 times

you can play with the cloud-init or user data section of your instance
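For example, a minimal user-data sketch that mirrors the manual steps above on Ubuntu (the password is a placeholder you'd replace; note the serial console login is a getty prompt, so the OS user's password is what actually matters, while the sshd change only affects SSH):
#!/bin/bash
# Enable SSH password authentication (mirrors the manual sshd_config edit).
sed -i 's/^#\?PasswordAuthentication .*/PasswordAuthentication yes/' /etc/ssh/sshd_config
systemctl restart sshd
# Set a password for the default user so the serial console login works.
echo 'ubuntu:CHANGE_ME' | chpasswd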

Does the virtual console really use sshd??

I would assume a virtual console is using a tty, and bypassing ssh

@Almondovar yeah, the EC2 serial console is serial access, not SSH. A handful of EC2 AMIs come preconfigured for it (e.g. Amazon Linux, and I think Ubuntu 20). You also need to turn on the service at the AWS account level and use an IAM role/user permissioned to use the service.
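If it helps, that account-level toggle can be done from the CLI; a small sketch (assuming the caller has the required EC2 permissions):
# Serial console access is off by default and is enabled per account, per region.
aws ec2 enable-serial-console-access
aws ec2 get-serial-console-access-status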

Hi Carlos, do I understand correctly that the steps I performed are not necessary to enable the serial console connection? TBH, once I followed them they instantly allowed access to the console connection

@Almondovar hey, missed your IM, yes, that is my understanding. But if the changes made worked, then even better

Does anyone know of anything similar to https://github.com/sportradar/aws-azure-login but written in Go?

Have you checked out using Leapp instead?


Leapp grants to the users the generation of temporary credentials only for accessing the Cloud programmatically.

We used to use all kinds of scripts, hacks, and tools but leapp has replaced them for us

It’s an open source electron app distributed as a single binary

Cc @Andrea Cavagna

interesting @Erik Osterman (Cloud Posse) you just use the free one?

Yup

Leapp is free for anyone; it's an open source project. We are going to close the AzureAD-to-AWS federation pull request this week and cut a release. For any further questions, @Steve Wade (swade1987) feel free to text me :)

The only paid offering for now is enterprise support for the open source project, provided by the maintainers of the app.
Btw @Erik Osterman (Cloud Posse), I promise that in the coming weeks I will participate in an office hours session so we can answer any questions about Leapp!

I currently have a script (see below) but it seems a little hacky …
#!/usr/bin/env bash
AWS_PROFILE=${1}

AZURE_TENANT_ID="<redacted>"
AZURE_APP_ID_URI="<redacted>"
AZURE_DEFAULT_ROLE_ARN="arn:aws:iam::<redacted>:role/platform-engineer-via-sso"
AZURE_DEFAULT_DURATION_HOURS=1

# Make sure the user has the necessary tooling installed.
if ! which ag > /dev/null 2>&1; then
  echo 'Please install the_silver_searcher.'
  exit 1
fi

# Run the configuration step if not already configured.
# shellcheck disable=SC2046
if [ $(ag azure ~/.aws/config | wc -l) -gt 0 ]; then
  printf "Already configured, continuing ...\n\n"
else
  printf "Use the following values when asked for input ... \n"
  printf "Azure Tenant ID: %s\n" "${AZURE_TENANT_ID}"
  printf "Azure App ID URI: %s\n" "${AZURE_APP_ID_URI}"
  printf "Default Role ARN: %s\n" "${AZURE_DEFAULT_ROLE_ARN}"
  printf "Default Session Duration Hours: %s\n\n" "${AZURE_DEFAULT_DURATION_HOURS}"
  docker run -it -v ~/.aws:/root/.aws sportradar/aws-azure-login --configure --profile "$AWS_PROFILE"
fi

# Perform the login.
docker run -it -v ~/.aws:/root/.aws sportradar/aws-azure-login --profile "$AWS_PROFILE"
printf "\nMake sure you now export your AWS_PROFILE as %s\n" "${AWS_PROFILE}"
2021-09-08

Does anyone know if it's possible to reserve/allocate a small pool of consecutive public/Elastic IP addresses on AWS? I've been searching the documentation with no luck

Has anybody used AWS Config advanced queries (basically, pulling aws describe data using SQL)? I'm trying to pull config data using the AWS CLI, then throw it into a CSV or some other datastore for querying.
aws configservice select-aggregate-resource-config \
  --configuration-aggregator-name AllAccountsAggregator \
  --expression "
    SELECT
      resourceId,
      resourceName,
      resourceType,
      tags,
      relationships,
      configuration.storageEncrypted,
      availabilityZone
    WHERE
      resourceType = 'AWS::RDS::DBInstance'
      AND configuration.engine = 'oracle-ee'
      AND resourceName = 'rds-uat-vertex-9'
  " \
  | jq -r '.'
.. I'm having problems parsing the outputs; this is mainly a jq problem.
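One wrinkle worth noting here: Results comes back as an array of JSON strings, so each element has to go through fromjson before you can extract fields. A minimal sketch for getting a CSV out (the field selection is illustrative):
aws configservice select-aggregate-resource-config \
  --configuration-aggregator-name AllAccountsAggregator \
  --expression "SELECT resourceId, resourceName, resourceType WHERE resourceType = 'AWS::RDS::DBInstance'" \
  | jq -r '.Results[] | fromjson | [.resourceId, .resourceName, .resourceType] | @csv'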

We should chat tomorrow, I have some code in ConsoleMe that parses the nested json config returns


When using Shield, is it best to put protections on a Route53 zone, or on an ALB that the zone points to, or both?
And then the same question with Route53 pointing to CloudFront, API Gateway, etc.
2021-09-09

before I start writing my own … does anyone know of a lambda that takes an RDS snapshot and ships it to S3?

wild, I’m doing that right now.

mysql? postgres? sqlserver? oracle?

rds snapshot or a db-native (like pg_dump or mysqldump)?

@Matthew Bonig mysql and rds snapshot

gotcha. So what I ended up doing was writing a lambda that did the dump (using pg_dump) and then streamed that to a bucket. I then have another process that reads that object from the bucket and restores it in another database. Nothing is packaged up nicely for distribution yet, but so far it seems to be working ok.

The plan is to have that Lambda be part of a state machine that will backup and restore any database requested from one env to another.

nice, that's pretty cool

the dump into S3 makes sense as a lambda

what did you write it in?

only concern is that you can’t get the entire database in 15 minutes =-/

nodejs

you can’t get it?

yes, I can in my case. I was just saying my only concern about using a lambda is that the database is so big that it couldn’t be dumped and uploaded to s3 within 15 minutes.

generally the runs I’ve been doing in a fairly small database were getting done in just a few minutes (with an 800mb dump file) so we’ll probably be fine. But if you’re trying to do this for some 3 tb database, you’re going to have a bad time with Lambdas

Did you notice much size difference between a MySQL dump to S3 and just taking a snapshot, @Matthew Bonig? My boss seems to think a dump is over-engineering it for some reason

I’ll paste you his points here when I’m at my laptop in about 15 mins if you’re still around

We won't be retaining the snapshots. The RDS snapshots we should use are the automated ones, which we have to keep for contractual and compliance reasons anyway, so there is no additional cost.
Moving this into a sqldump would be over-engineering in my view. RDS has the ability to export a snapshot directly to S3, so why reinvent the wheel? Let's face it, AWS is pretty good at this stuff, so we should leverage their backup tooling where possible.
The snapshots moved into S3 will need to be retained indefinitely due to the contractual wording… this is being worked on, but won't change any time soon.
I also don't want to have a split between HelmReleases and TF. If we can manage this all in one place (which we can) it feels better than splitting it out. As a consumer, having to deploy the infra and then also deploy a HelmRelease feels clunky. Whereas deploying just the RDS instance and its backup strategy as a single unit would be more intuitive.

I proposed using a CronJob in our EKS clusters to facilitate the backup

In my case, postgres, so pg_dump. But that was pretty large, since it's totally uncompressed SQL.
I had looked into snapshot sharing, but since the db was encrypted with a KMS key I couldn't share with the other account, I couldn't ship it that way.
Should have looked more for the native S3 integration, but didn't. Will look now. I don't know how the encryption would work, though. I would assume shipping it to S3 keeps the data encrypted (and needing the same key as the RDS instance)

I use CronJobs in a cluster to back up a MySQL and a MongoDB. Works great.

oh man, totally should have done this s3 export.
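For reference, the native export is a single call; a sketch with placeholder identifiers throughout (the IAM role needs write access to the bucket, and a KMS key is required because exports are always encrypted; output lands as Parquet):
aws rds start-export-task \
  --export-task-identifier my-snapshot-export \
  --source-arn arn:aws:rds:us-east-1:123456789012:snapshot:my-snapshot \
  --s3-bucket-name my-export-bucket \
  --iam-role-arn arn:aws:iam::123456789012:role/rds-s3-export-role \
  --kms-key-id arn:aws:kms:us-east-1:123456789012:key/my-key-id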

I wrote one for Elasticache, to ship elasticache snapshot to another account and restore it. I’ll put together a gist for you. It’s not RDS, but there may be similar semantics.

Actually, scratch that. I haven’t completed open sourcing it, apologies about the false start.

RDS Q: I made a storage modification, but accidentally set it to apply in the maintenance window. How can I turn around and force it to apply immediately? I'm in storage-full status.

Make another change and tell it to apply immediately

thanks @Alex Jurkiewicz, AWS support pretty much told me the same thing. They said to do it via CLI, not console.
aws rds modify-db-instance \
  --db-instance-identifier my-db-instance-01 \
  --allocated-storage 200 \
  --max-allocated-storage 500 \
  --apply-immediately

hey, not sure whether to post here or in #terraform… has anyone been able to create an RDS read replica in a different VPC via Terraform? I have been stuck on this for a few days… getting the error:
Error creating DB Instance: InvalidParameterCombination: The DB instance and EC2 security group are in different VPCs.

I am able to apply the desired config through the console, but not through Terraform sadly

maybe this will help https://stackoverflow.com/questions/53386811/terraform-the-db-instance-and-ec2-security-group-are-in-different-vpcs
i am trying to create a vpc with public and private subnet along with Aurora mysql cluster and instance in same vpc with custom security group for RDS. i’ve created vpc (public/private subnet, cus…

looks like one of them is using the default VPC
2021-09-10

Hi people, I wanted to ask about experiences upgrading EKS Kubernetes versions. I recently did an upgrade from 1.19 to 1.20. After the upgrade some of my workloads are experiencing weird high CPU spikes. But correlation does not equal causation, so I wanted to ask if anyone here has experienced something similar.

The only change I can think of that could cause this is the Docker deprecation: https://kubernetes.io/blog/2020/12/02/dockershim-faq/
But that's not included in 1.20 by default; you have to switch runtimes separately in a node group. So if you followed the release notes and did that, it must be it.
This document goes over some frequently asked questions regarding the Dockershim deprecation announced as a part of the Kubernetes v1.20 release. For more detail on the deprecation of Docker as a container runtime for Kubernetes kubelets, and what that means, check out the blog post Don’t Panic: Kubernetes and Docker. Why is dockershim being deprecated? Maintaining dockershim has become a heavy burden on the Kubernetes maintainers. The CRI standard was created to reduce this burden and allow smooth interoperability of different container runtimes.

Other than that the k8s version itself (the control plane) has no effect on workload resource consumption, it’s involved only during CRUD of the yamls.

It must be something else - the AMI version of a worker, the runtime, instance type of a worker, and so on
2021-09-11
2021-09-13

Hi all, hope all is well with you! Do you know if there is any web application that makes it easier to navigate AWS S3?

easier in regards to what, you mean like for a public bucket?

Yes! I need to open up AWS S3 to people in the marketing department

These are people who have no technical knowledge and they need to have the option to download a full AWS s3 folder

There's this, which lets you browse a bucket: https://github.com/awslabs/aws-js-s3-explorer
AWS JavaScript S3 Explorer is a JavaScript application that uses AWS's JavaScript SDK and S3 APIs to make the contents of an S3 bucket easy to browse via a web browser. - GitHub - awslabs/aws-…

I recently tested AWS s3 explorer, but it doesn’t have the option to download a full folder.


this guy seems to have a fork where you can select multiple files https://github.com/awslabs/aws-js-s3-explorer/pull/86
Issue #, if available: Description of changes: Add download button to header (only shows when items are selected) Enable download of multiple files at once in a ZIP folder - select items and click…

2021-09-14

Trying to describe instances with /usr/local/bin/aws ec2 describe-instances --instance-ids i-sssf --region --output text --debug and got this:
nmkDIykR/VMOgP+bBmVRcm/QWkCbquedU53R9SAv9deDrjkWkLKuPEnHgu57eGq55K1nFTAVhJ2IG5u5C2IuNKCskgAqz6+JH5fMdlAhYtAzw6FTv+YTi9DFhJaBA9niDk+n2lNhtx/iIbDRNGGCrMXuQbU5hPeHy8ijY6g==', 'Authorization': b'AWS4-HMAC-SHA256 Credential=ASIAUXKPUFZ7UOBXM3GN/20210914/eu-west-1/ec2/aws4_request, SignedHeaders=content-type;host;x-amz-date;x-amz-security-token, Signature=a8d69a78cbf6ac49ba9cc7774d5e9625ec8a2843e7eedeaba2630da7a4a41e1f', 'Content-Length': '76'}>
2021-09-14 14:34:51,592 - MainThread - urllib3.connectionpool - DEBUG - Starting new HTTPS connection (1): ec2.eu-west-1.amazonaws.com:443
it's a private EC2 instance, why can't I get the output?
netstat -tnlp | grep :443
tcp 0 0 0.0.0.0:443 0.0.0.0:* LISTEN 1013/nginx: maste

output? what do you mean?

I mean when I run the aws ec2 describe-instances command, I would like to get a result

do you have a firewall or something that could be blocking connections?

I think the problem is that it's a private EC2 instance, right? It doesn't have a public IP address. Instance metadata is reachable:
http://169.254.169.254/latest/meta-data/
Using the curl command I got a result

the instance should have internet and it should be able to hit the api

it has nothing to do with the public ip

but usually to get metadata from within an instance you use this address http://169.254.169.254/latest/meta-data/

no need to run the cli for that
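If the instance enforces IMDSv2, the curl needs a session token first; a quick sketch:
# Fetch a session token, then pass it on metadata requests.
TOKEN=$(curl -s -X PUT "http://169.254.169.254/latest/api/token" -H "X-aws-ec2-metadata-token-ttl-seconds: 21600")
curl -s -H "X-aws-ec2-metadata-token: $TOKEN" http://169.254.169.254/latest/meta-data/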
2021-09-15

does anyone have a clean way of authenticating (via kubectl) to EKS when using Azure AD as the OIDC identity provider? Not sure if people have hooked up Dex with Gangway to provide a UI for obtaining tokens?

We have an open enhancement in Leapp; maybe it can help you: https://github.com/Noovolari/leapp/issues/170
Is your feature request related to a problem? Please describe. I am a user of kubernetes and of kubectl and eks. At present, kubectl references the aws binary for authentication, which expects cert…

Odd, I'm using kubectl and Leapp just fine right now… oh, this is to have kubectl ask Leapp directly. Huh.

Hi @Steve Wade (swade1987)! May I ask you how you have federated Azure AD to AWS?

@Eric Villa I have a blogpost on it https://medium.com/p/how-to-configure-azure-ad-as-an-oidc-identity-provider-for-eks-53337203e5cd?source=social.tw&_referrer=twitter&_branch_match_id=967466935918883996

Ok thank you! I’ll check it out

does anyone know if there is a recommended approach to alerting on failed RDS snapshot-to-S3 exports?

CloudWatch Events?

A Lambda function to send an SNS notification to a communication channel like Teams or Slack?
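A hedged sketch of the EventBridge side (the broad aws.rds pattern is an assumption; check the RDS documentation for the exact export-task event IDs and narrow the pattern before relying on it):
# Route RDS events to an SNS topic (placeholder ARN); the topic's resource
# policy must allow events.amazonaws.com to publish.
aws events put-rule \
  --name rds-export-task-alerts \
  --event-pattern '{"source":["aws.rds"]}'
aws events put-targets \
  --rule rds-export-task-alerts \
  --targets 'Id=sns-alerts,Arn=arn:aws:sns:us-east-1:123456789012:alerts'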
2021-09-16

I'm trying out Kinesis using CloudFormation. I'm getting failed invocations when my scheduler invokes the Lambda, but nothing is showing up in CloudWatch Logs. Any ideas how to handle/fix this?
AWSTemplateFormatVersion: "2010-09-09"
Description: "Template for AWS Kinesis resources"
Resources:
  DataStream:
    Type: AWS::Kinesis::Stream
    Properties:
      ShardCount: 1
      RetentionPeriodHours: 24
      Name: !Sub ${AWS::StackName}
  Lambda:
    Type: AWS::Lambda::Function
    Properties:
      Role: !Sub arn:aws:iam::${AWS::AccountId}:role/service-role/lambda_basic_execution
      Runtime: python3.6
      FunctionName: !Sub ${AWS::StackName}-lambda
      Handler: index.lambda_handler
      Code:
        ZipFile: |
          import requests
          import boto3
          import uuid
          import time
          import json
          import random

          def lambda_handler(event, context):
              client = boto3.client('kinesis', region_name='${AWS::Region}')
              partition_key = str(uuid.uuid4())
              response = requests.get('https://randomuser.me/api/?exc=login')
              if response.status_code == 200:
                  data = json.dumps(response.json())
                  client.put_record(
                      StreamName='{AWS::StackName}',
                      Data=data,
                      PartitionKey=partition_key
                  )
                  print("Data sent to Kinesis")
              else:
                  print('Error: {}'.format(response.status_code))
  Schedule:
    Type: AWS::Events::Rule
    Properties:
      ScheduleExpression: "rate(1 minute)"
      State: ENABLED
      Targets:
        - Arn: !GetAtt Lambda.Arn
          Id: "TargetFunctionV1"
          Input: '{}'
  LogGroup:
    Type: AWS::Logs::LogGroup
    Properties:
      LogGroupName: !Sub /aws/lambda/${AWS::StackName}-lambda
      RetentionInDays: 7
  LogStream:
    Type: AWS::Logs::LogStream
    Properties:
      LogGroupName: !Ref LogGroup
      LogStreamName: !Sub /aws/lambda/${AWS::StackName}-lambda
  PermissionsForEventsToInvokeLambda:
    Type: AWS::Lambda::Permission
    Properties:
      FunctionName: !GetAtt Lambda.Arn
      Action: lambda:InvokeFunction
      Principal: events.amazonaws.com
      SourceArn: !GetAtt DataStream.Arn

Are you checking the whole log group?

You are creating a log stream, but Lambda won't use it (Lambda creates its own log streams)
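One more thing worth checking (an educated guess from the template above, not a confirmed diagnosis): the Lambda permission's SourceArn points at the Kinesis stream, but the caller is the Events rule, so the scheduled invocations would be denied before any log is written. Also note that the ${AWS::Region} and {AWS::StackName} placeholders inside ZipFile are not substituted unless the whole block goes through !Sub, and requests is not bundled in the Lambda Python runtime. A sketch of the corrected permission:
PermissionsForEventsToInvokeLambda:
  Type: AWS::Lambda::Permission
  Properties:
    FunctionName: !GetAtt Lambda.Arn
    Action: lambda:InvokeFunction
    Principal: events.amazonaws.com
    SourceArn: !GetAtt Schedule.Arn  # the rule's ARN, not the Kinesis stream's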
2021-09-17

Hi,
Is it possible to put a custom endpoint in front of the AWS Kinesis signaling endpoint (kinesis.us-east-1.amazonaws.com)?
I tried installing nginx on an EC2 instance as a reverse proxy (customendpoint -> kinesis.us-east-1.amazonaws.com) and used certbot to issue a certificate for my custom endpoint, but the app is giving https://<custom-domain>/describeSignalingChannel 404 (Not Found)
Thanks
2021-09-19

Hi All, anyone know why my targets are stuck?

Target registration is in progress

it's been trying to register for over an hour now.

any fix/solution will be appreciated.

Are the health checks passing?

Have you spot checked the health checks as being valid/working?

currently what it looks like

it was failing earlier.

now it is stuck on registering

I would suggest reaching out to their support if you haven’t already. Likely they will be able to spot the problem easily. Could very well be on their end too.

does your exec role have permissions to pull the image?

@Darren Cunningham sorry what exec role?

The task execution role grants the Amazon ECS container and Fargate agents permission to make AWS API calls on your behalf. The task execution IAM role is required depending on the requirements of your task. You can have multiple task execution roles for different purposes and services associated with your account.

additionally, if you're pulling from a private ECR, double check the policy on the ECR
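If the execution role is missing the standard pull/logging permissions, attaching the AWS-managed policy is a quick check; a sketch (the role name here is the common default, adjust to yours):
aws iam attach-role-policy \
  --role-name ecsTaskExecutionRole \
  --policy-arn arn:aws:iam::aws:policy/service-role/AmazonECSTaskExecutionRolePolicy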
2021-09-20

Hi, has anyone run the tfstate backend module with Terraform 1.0.7?

│ Error: Unsupported argument
│
│ on main.tf line 8, in module "tfstate_backend":
│ 8: force_destroy = true
│
│ An argument named "force_destroy" is not expected here.
╵
╷
│ Error: Unsupported argument
│
│ on main.tf line 10, in module "tfstate_backend":
│ 10: bucket_enabled = var.bucket_enabled
│
│ An argument named "bucket_enabled" is not expected here.
╵
╷
│ Error: Unsupported argument
│
│ on main.tf line 11, in module "tfstate_backend":
│ 11: dynamodb_enabled = var.dynamodb_enabled
│
│ An argument named "dynamodb_enabled" is not expected here.
╵
╷
│ Error: Unsupported argument
│
│ on main.tf line 13, in module "tfstate_backend":
│ 13: context = module.this.context
│
│ An argument named "context" is not expected here.

hrmmm not following. master should be compatible too

Sorry it was not about master/main

You are correct

was using master

that is why
2021-09-21

JQ question: I want to get just the environment tag out of a set of RDS instances' tags (pulled from AWS Config advanced queries). Does anybody know how to pull out just the value of the "env" tag for each instance?
aws configservice select-aggregate-resource-config \
  --expression "
    SELECT tags
    WHERE resourceType = 'AWS::RDS::DBInstance'
  " | jq -r '.Results[]' | jq -r .tags
[
  {
    "value": "MON/01:00",
    "key": "auto-schedule-start"
  },
  {
    "value": "prod",    <==== I ONLY WANT THIS
    "key": "env"
  }
]
[
  {
    "value": "dev",    <==== I ONLY WANT THIS
    "key": "env"
  },
  {
    "value": "daily",
    "key": "backup"
  }
]
my jq attempt:
| jq 'select(.key="env").value'
.. but it's returning all values, not just the "env" tags. Any jq folks here who can assist? =]
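The likely culprit: in jq, = is assignment and == is comparison, so select(.key="env") rewrites every element instead of filtering. A sketch of the corrected pipeline (also folding the two jq calls into one via fromjson):
aws configservice select-aggregate-resource-config \
  --expression "
    SELECT tags
    WHERE resourceType = 'AWS::RDS::DBInstance'
  " | jq -r '.Results[] | fromjson | .tags[] | select(.key == "env") | .value'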

I used to use jq with awscli a lot, but then I switched to their native JMESPath support using the --query option.
See if something like this works for you:
https://github.com/aws/aws-cli/issues/621#issuecomment-36314975
Trying to output only specific tag values from describe-instances using the --query, for example aws idis-eu-west-1 ec2 describe-instances --query "Reservations[].Instances[].{ID:InstanceId, TA…

I saw that syntax and thought about it, but the AWS Config advanced query uses a simplified SQL syntax and HAS to dump the entire tags object; I then have to filter using jq. Thank you for the link tho! Super useful in other ways.

Anyone here have any experience with setting up privatelink for fargate instances to pull images from ecr?

Yes, what's the issue you're running into?

ECR has two endpoints you need to add, api and dkr. Once you create the endpoints, you just need to add a security group allowing 443 and make sure Fargate can access it

hmm, I've done exactly that, but it seems the images are still being pulled through my public NAT

You'll probably also need to add S3; that can be a gateway endpoint
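For reference, a sketch of all three endpoints with placeholder IDs (region matching this thread; image layers are actually pulled from S3, hence the gateway endpoint):
aws ec2 create-vpc-endpoint --vpc-id vpc-0abc123 --vpc-endpoint-type Interface \
  --service-name com.amazonaws.ap-southeast-2.ecr.api \
  --subnet-ids subnet-0a subnet-0b --security-group-ids sg-0abc123 --private-dns-enabled
aws ec2 create-vpc-endpoint --vpc-id vpc-0abc123 --vpc-endpoint-type Interface \
  --service-name com.amazonaws.ap-southeast-2.ecr.dkr \
  --subnet-ids subnet-0a subnet-0b --security-group-ids sg-0abc123 --private-dns-enabled
aws ec2 create-vpc-endpoint --vpc-id vpc-0abc123 --vpc-endpoint-type Gateway \
  --service-name com.amazonaws.ap-southeast-2.s3 --route-table-ids rtb-0abc123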

so I've got 2 interface endpoints and a gateway

Did you add the private subnets on the ecr interface endpoints

yeah, all my private subnets are on both of the ecr interface endpoints

same SG’s too

Did you enable private hosted zone on the interface endpoint

used the same policy as recommended by the docs, and my private DNS looks to be correct too (*.dkr.ecr.ap-southeast-2.amazonaws.com)

Is this a dev env

yeah

Like can you temporarily remove the route to the nat and see if the ecr image still pulls for container

I’ll give that a go now

Yea I'm curious, it should be working from what you've set up

One of these blog posts may or may not help
https://aws.amazon.com/blogs/compute/setting-up-aws-privatelink-for-amazon-ecs-and-amazon-ecr/

How to reduce NAT gateway charges by creating a PrivateLink between ECR and ECS containers.

Amazon ECS and Amazon ECR now have support for AWS PrivateLink. AWS PrivateLink is a networking technology designed to enable access to AWS services in a highly available and scalable manner. It keeps all the network traffic within the AWS network. When you create AWS PrivateLink endpoints for ECR and ECS, these service endpoints appear […]

What platform version are you using for fargate

3 or 4

1.4

I've checked out both of those blog links haha, there must be a misconfiguration somewhere else that I'm not seeing

Yea I’m trying to think of any gotchas, I’m not at my computer right now so can’t visually run through my setups

sitting here waiting for Terraform Cloud to see the commit, not realising I didn't push yet

alrighty, did some stuff

turned off the route for the private subnets, added the CloudWatch private endpoint, and got it to seemingly pull from the private endpoint

Nice
2021-09-22

Hi folks, is it possible on EKS to set the desired capacity in node groups to zero by default and increase it "automatically" as soon as a new service is deployed?
Currently I have a service (DWH) which runs daily for around 2 hours on an m5d.8xlarge instance and then becomes idle. I would like to avoid having that instance running for many hours without using it (currently trying to reduce costs).

Cluster AutoScaler can do that!
EKS-specific docs are at https://docs.aws.amazon.com/eks/latest/userguide/cluster-autoscaler.html and https://www.eksworkshop.com/beginner/080_scaling/
Autoscaling components for Kubernetes. Contribute to kubernetes/autoscaler development by creating an account on GitHub.
The Kubernetes Cluster Autoscaler automatically adjusts the number of nodes in your cluster when pods fail or are rescheduled onto other nodes. The Cluster Autoscaler is typically installed as a Deployment in your cluster. It uses leader election
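A sketch of the knobs involved, with placeholder names (the two ASG tags are what cluster-autoscaler's auto-discovery looks for; "my-cluster" and the ASG name are hypothetical):
# Let the node group scale to zero and tag its ASG for autoscaler discovery.
aws autoscaling update-auto-scaling-group \
  --auto-scaling-group-name my-nodegroup-asg --min-size 0 --desired-capacity 0
aws autoscaling create-or-update-tags --tags \
  "ResourceId=my-nodegroup-asg,ResourceType=auto-scaling-group,Key=k8s.io/cluster-autoscaler/enabled,Value=true,PropagateAtLaunch=true" \
  "ResourceId=my-nodegroup-asg,ResourceType=auto-scaling-group,Key=k8s.io/cluster-autoscaler/my-cluster,Value=owned,PropagateAtLaunch=true"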

Thanks @Vlad Ionescu (he/him)!

hi guys, I want to enable IAM authentication in MariaDB, but I have the feeling that it's not supported. Am I right? Or is it because the DB is not publicly accessible? As you can see in the screenshot, the right one is MySQL and IAM auth is enabled, but the left one is MariaDB and I don't even see the option to enable it…

Reading https://docs.aws.amazon.com/AmazonRDS/latest/UserGuide/UsingWithRDS.IAMDBAuth.html, it only lists MySQL and Postgres support, not MariaDB. I'm sure AWS support can confirm it, but that's likely why. Don't think the public IP has to do w/ it

Thank you for confirming Michael

CloudWatch alarm SNS question: I want to send CloudWatch alarms to multiple destinations (2 MS Teams channels & PagerDuty). All use a webhook.
Based on tutorials, it seems I make an SNS topic and a Lambda to translate/send a message to the webhook.
My question: do I need 3 separate Lambdas to handle each destination? Or is there some other best practice / tool I should be using?

You don’t need separate Lambdas, but I would typically encourage it. Being that the intent of Lambda is Function as a Service, it’s ideal when a Lambda does not have too much baked into it. The more you pack into a single Lambda (1) it’s inherently going to run slower, which then has cost implications as you scale (2) increases code complexity, which makes testing all scenarios harder (3) depending on the runtime, can make bundling the Lambda a PITA.
If I was going to implement something like this, I would create two Lambdas: 1 for PagerDuty & 1 for MS Teams and then determine the best way to filter SNS Messages accordingly.
However, your use case sounds like one that others should have solved so I’m betting there is a tool or OSS project out there. Typically the teams that I’ve worked on have solved similar use cases with Datadog or New Relic. Those platforms are great, however they come with a significant investment…both time and money. Each of the platforms will have “Get started in 10 minutes” but that’s about as honest as 6 minute abs.

Thanks. I am now bundling 2 Teams Lambdas (multiple webhook environment variables, and the core function doing the notification for each). So my function names basically should be:
• cloudwatch-alarms-to-sns-to-teams
• cloudwatch-alarms-to-sns-to-pagerduty

We also have Prometheus, but it's a bit of a black box to me as a practitioner (i.e. using it requires involvement from my platform/monitoring team). I need a point solution, and honestly want to know the "AWS way" of doing things.

PagerDuty does not require JSON payload handling, so I have usually recommended customers (who have PagerDuty) funnel all alerts through PagerDuty; you can make rules there and push the data to MS Teams or Slack.
MS Teams and Slack use webhooks, so the data needs to be structured, which is what the Lambda does.
Whereas PagerDuty has CloudWatch integrations, so it can ingest the JSON output from CloudWatch as is.

ah – so what you’re saying is
- Cloudwatch alarms to SNS topics
- SNS topics to PAGERDUTY (easier integration)
- PAGERDUTY to Teams (integration guide)

Yup, spot on. So there are no Lambdas to manage, and you can create event rules in PagerDuty.
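A sketch of the SNS side (the endpoint is the integration URL PagerDuty generates for you; the key shown is a placeholder):
# CloudWatch alarm -> SNS topic -> PagerDuty's CloudWatch integration endpoint.
aws sns subscribe \
  --topic-arn arn:aws:sns:us-east-1:123456789012:cloudwatch-alarms \
  --protocol https \
  --notification-endpoint "https://events.pagerduty.com/integration/<integration-key>/enqueue"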
2021-09-23
2021-09-24

Hi, anyone have an idea how long it'll take to restore automatic backups for Postgres RDS? I have 4 that have been running for a while. I've also restored snapshots, which are already running.

how big is it?

inserts are not in parallel in Postgres AFAIK

and there is no S3 import in RDS for postgres

to give you an idea, 500 GB will take like 10 hours or more

but depends on size, IO, etc.

@Fabian: trying to dissect your statement. You've already restored (manual) snapshots successfully, but here you're trying to restore automatic snapshots (i.e. those that are taken by AWS nightly)? There typically should be no difference. What I've seen is that you're supposed to open a ticket with AWS support; they can probably see what's going on in the background using their API calls

depends on the size of the backup (maybe storage class, though probably not, as I don't think you can change this with automatic backups) and the size of the instance you've requested

Any rough idea?

I’ve been restoring for 1h now

What is the recommended approach for alerting to Slack on a failed Lambda invocation? I have written an RDS snapshot-to-S3 Lambda that fires from an event rule, but I want to know when it fails.

I feel like the lambda would need to do the error handling… Or maybe, create a metric from a log filter… https://docs.aws.amazon.com/AmazonCloudWatch/latest/logs/MonitoringLogData.html
Create metric filters with CloudWatch Logs and use them to create metrics and monitor log events using CloudWatch.

DLQ -> SNS -> SQS & Lambda. The Lambda handles the notification and SQS becomes the "repo" of messages that need to be manually remediated. As you identify automatic remediation opportunities, you could add a Lambda to process the messages from the SQS queue. You should also then add a filter to the notification Lambda, since you don't need to get alerted for events you can automatically remediate.
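For the first hop, a sketch of pointing the Lambda's DLQ at an SNS topic (function name and ARN are placeholders):
# Failed async invocations land on the topic after retries are exhausted.
aws lambda update-function-configuration \
  --function-name rds-snapshot-export \
  --dead-letter-config TargetArn=arn:aws:sns:us-east-1:123456789012:lambda-dlq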
2021-09-25
2021-09-26
2021-09-27

Hey, how can I specify EBS storage for brokers in the AWS MSK module? https://github.com/cloudposse/terraform-aws-msk-apache-kafka-cluster
Terraform module to provision AWS MSK. Contribute to cloudposse/terraform-aws-msk-apache-kafka-cluster development by creating an account on GitHub.

got it https://github.com/cloudposse/terraform-aws-msk-apache-kafka-cluster/blob/master/variables.tf#L16
Terraform module to provision AWS MSK. Contribute to cloudposse/terraform-aws-msk-apache-kafka-cluster development by creating an account on GitHub.

Hi all, thanks for the amazing work. Does anyone have experience with VPN access to multiple regions using AWS Transit Gateway? I cannot find an example of how to set this up. I am trying to wire up ec2_client_vpn with a transit gateway in Terraform.

- Create a VPC; call it "VPN Client VPC". It can be a small VPC (/28). Do not deploy any workloads in this VPC.
- Deploy AWS Client VPN into this VPC.
- Attach the VPN Client VPC to the TGW as a VPC spoke. Set appliance mode enabled for symmetric routing (attach the subnets, multi-AZ, where the Client VPN is deployed), and add 0.0.0.0/0 -> TGW-id in the VPC route tables.
- Attach VPC B from another region to the TGW, and modify its VPC route table to 0.0.0.0/0 -> TGW-id.
- If you're only using the default TGW route domain with auto propagation and auto association, then any client originating from the Client VPN should be able to ping resources in another region's spoke attachment.

Typing this off the top of my head

Thanks @msharma24 !

Sorry, I don't have a working example

@Eric Steen I have recently built an AWS Network Firewall with Transit Gateway; you could easily fork it and replace the firewall VPC with the VPN Client VPC and it should work: https://github.com/msharma24/terraform-aws-network-firewall-deployment-models/tree/main/centralized
Deployment models for AWS Network Firewall with Terraform - terraform-aws-network-firewall-deployment-models/centralized at main · msharma24/terraform-aws-network-firewall-deployment-models

Thanks @msharma24

Hello, what do you use to terminate/drain/remove nodes that are in Unready state on AWS EKS?

like the node termination handler?

Yes

Gracefully handle EC2 instance shutdown within Kubernetes - GitHub - aws/aws-node-termination-handler: Gracefully handle EC2 instance shutdown within Kubernetes

we deploy this fwiw

Yes i was looking for this ;)

Because we're getting unready nodes on AWS nowadays
2021-09-28
2021-09-29

My CodeBuild jobs suddenly stopped working. I'm using Docker inside CodeBuild and it was working well, but suddenly I'm now seeing ERROR: Cannot connect to the Docker daemon at unix:///var/run/docker.sock. Is the docker daemon running?
Is anyone experiencing a similar issue?

My buildspec.yml looks like so; it's failing in the install phase:
version: 0.2
phases:
  pre_build:
    commands:
      - echo prebuild commands
  install:
    commands:
      - nohup /usr/bin/dockerd --host=unix:///var/run/docker.sock --host=tcp://127.0.0.1:2375 --storage-driver=overlay2 &
      - timeout 60 sh -c "until docker info; do echo .; sleep 1; done"
  build:
    commands:
      - ls
      - pwd
      - docker login -u $DOCKERHUB_USER -p $DOCKERHUB_TOKEN
      - git clone {repository_url} code
      - cd code
      - git checkout {branch}
      - dg project generate --name {project_name}

I'm using a custom image and I confirmed it's running in privileged mode

huh, never seen that nohup call before. Why do you do that?

It’s based on https://docs.aws.amazon.com/codebuild/latest/userguide/sample-docker-custom-image.html
Provides information about the Docker in custom image sample that is designed to work with AWS CodeBuild.

UPD: So it turns out that I had to add VOLUME /var/lib/docker to the Dockerfile, because I am using a custom image and CodeBuild has moved from Amazon Linux v1 to Amazon Linux v2

Question on event subscriptions: I'm looking at RDS event subscriptions to try to connect to PagerDuty. Are there event subscriptions for services OTHER than RDS? I see DocumentDB and DMS, but I don't see things like EC2 or ALB.. do they exist?

Potentially because they’re managed services, with the user having no/not much control over these services.
I think you can tap into aws.health as a provider in CloudTrail to get notifications for other services.

it depends on the service. Check the EventBridge AWS schema registry and you can figure out what sort of events are published

thanks. It was more that I was expecting to find a ton of other tutorials for "ec2 event subscription", "fargate event subscription", "eks event subscription", only to find that it's really just RDS, DMS, and DynamoDB