Position: Lead AWS Cloud Platform Engineer
Location: Reston, VA (Hybrid)
Duration: Long term
Interview: Round 1 by video call; round 2 in office
PURPOSE:
- Seeking an experienced and dynamic AWS Cloud Platform Engineer to provide hands-on technical expertise supporting our AWS footprint, including Security and Guardrails.
- As an AWS Cloud Platform Engineer, you will be responsible for designing, implementing, and integrating our cloud platform resources on Amazon Web Services (AWS).
- You will collaborate with a team of talented cloud engineers and technology teams to deliver scalable, reliable, and secure cloud solutions that support our company's growth and innovation.
- Defines, designs, and develops system requirements. Performs trade-off analysis of performance, life-cycle cost, risk, producibility, and other system or program requirements.
- Assesses architecture and current hardware limitations, defines and designs system specifications, and evaluates input/output processes and working parameters for hardware/software compatibility.
- Coordinates design of subsystems and integration of total system. Defines system support requirements, including monitoring, capacity, staffing, and patching/updating.
- Analyzes and resolves program support deficiencies. Conducts independent technical investigations in systems design.
ESSENTIAL FUNCTIONS:
30% Installs, tunes, upgrades, troubleshoots, and maintains all computer systems relevant to the supported applications, including all tasks necessary to perform operating system administration, user account management, disaster recovery strategy, and networking configuration.
25% Develops and implements techniques to prevent system problems, troubleshoots incidents to recover services, and supports root cause analysis.
20% Evaluates new systems by performing in-depth tests, including end-user reviews. Researches software and related products to support recommendations and purchasing. Determines systems integration issues by evaluating components; developing and completing performance tests; analyzing test data; studying project requirements; analyzing user and potential user input; evaluating similar and related products and systems. Develops system automation and system integration of business processes.
15% Improves engineering job knowledge by attending educational workshops; reviewing professional publications; establishing personal networks; benchmarking state-of-the-art practices; participating in professional societies.
10% Acts as a mentor for junior and senior team members.
Preferred Qualifications:
- Minimum of 10 years of IT experience, of which at least 5 years must be in AWS Cloud Platform engineering and administration.
- Strong leadership experience driving transformation initiatives.
- Experience building robust middleware environments.
- Must have strong hands-on knowledge of the AWS platform and services, including but not limited to VPC networking, Direct Connect, Subnets, NACLs, Security Groups, EC2, S3, IAM, ELBs, Lambda, CloudWatch, CloudTrail, and EKS.
- Must have current, hands-on implementation and production-level experience in AWS Cloud.
- Hands-on experience with automation and infrastructure provisioning is a must. Our goal is to provision infrastructure only with code and Policy as Code.
- Must be familiar with Terraform automation, Ansible playbooks, and Python code.
- Experience with AWS CloudFormation and CDK is required.
- Must have hands-on experience writing Lambda functions, preferably in Python (Boto3).
- Must be well versed in writing Linux Bash scripts.
- At least one AWS certification is required.
- Hands-on experience with Containerization and Amazon EKS is a big plus.
- A strong understanding of various DevOps toolchains, including Git/Repo, Crucible, Jenkins, etc.
- Solid understanding of and experience with a CI/CD toolchain.
Qualifications:
- 10 years of overall experience in IT, including a hands-on development and systems engineering background.
- Bachelor's degree in Computer Science, Information Technology, or a related field; master's degree preferred.
- 5-10 years of experience in cloud engineering and automation with a focus on AWS cloud services.
- Minimum of 8-10 years of IT experience, of which at least 5 years must be in AWS Cloud Automation and Administration.
- 3-5 years of experience in a Site Reliability Engineering role
- 3 years of experience implementing containerization (Kubernetes), cloud technologies (AWS, Azure, Google, etc.), DevOps toolchains (Jenkins, Artifactory, Bitbucket, etc.), and technical patterns (IaC, Automated Provisioning/Release, CI/CD, etc.).
- Must be well versed in Terraform automation, Ansible playbooks, and Python code.
- In-depth knowledge of AWS services and solutions, including but not limited to EC2, S3, RDS, Lambda, VPC, IAM, CloudFormation, and CloudWatch.
- Strong understanding of cloud architecture principles, design patterns, and best practices for building scalable, resilient, and secure cloud environments.
- Proficiency in infrastructure as code (IaC) tools such as Terraform or AWS CloudFormation.
- Exposure to artificial intelligence patterns, machine learning, and engineering of AWS solutions.
- Good knowledge of leveraging Amazon Kendra, SageMaker, Lambda, Bedrock, RAG models, etc.
- Excellent communication, collaboration, and leadership skills, with the ability to interact effectively with technical and non-technical stakeholders.
- At least one AWS certification (e.g., AWS Certified Solutions Architect, AWS Certified DevOps Engineer) is required.
- Experience working in an Agile/Scrum environment preferred.
- Solid understanding of Software coding techniques and experience with
- Hands-on experience with CI/CD pipelines, containerization technologies (e.g., Docker, Kubernetes), and serverless computing is a plus.
- Full spectrum of software engineering (build, integration, test, release, and deployment) leveraging Python.
- Experience developing and/or challenging engineering solutions and practices, and collaborating with peers within and outside the immediate team, including customers (developers, architects, and engineers).
- At least one AWS Cloud certification is required.
- FinOps certification is a plus.
Roles & Responsibilities:
- Communicates architectural decisions, plans, goals, and strategies while highlighting short-term trade-offs vs. long-term commitments and costs.
- Engage in and improve the end-to-end lifecycle of services, from inception and design through deployment and operations.
- Establish automation capabilities leveraging Cloud native solutions to improve the Developer experience.
- Support activities including system design consulting, developing software platforms and frameworks, capacity planning, and launch reviews.
- Willingness to roll up your sleeves, troubleshoot difficult issues, and engage the customer.
- Willingness to learn new AWS Services and other technologies as required.
- Drive system scalability and sustainability by leveraging automation, and strive to improve our systems with changes that increase reliability and velocity.
- Experience with Enterprise Cloud transformation and migration efforts.
- Actively participate and help guide customers on using Cloud-native design and architecture patterns.
- Provide consultation on technology infrastructure planning and engineering for assigned systems; assess the implications of technology strategies on infrastructure capabilities.
- Establish strategies to migrate Legacy applications by conversion to multiple Microservices and hosting on AWS Cloud platform.
- Leverage cloud-native architecture components, including containers, immutable infrastructure, microservices, service mesh, etc., to build highly available and fault-tolerant applications.
- Conduct research on the global technology trends and their applicability to products in support of our internal development teams and business initiatives.
- Promotes and ensures modern application design, applies engineering best practices in the development and operations life cycle, and mitigates vulnerabilities.
- Monitors and manages the stability, availability, and performance of enterprise systems and platforms across IT domains (e.g., systems, network, storage, security) by analyzing systems to identify problems, trends, and opportunities for improvement.
- Automate the end-to-end process of maintaining (patching and upgrading) our AWS Cloud ecosystem.
- Makes data-driven recommendations and decisions and continuously improves the overall efficacy and efficiency of our software delivery capabilities.
- Mentoring peers as well as engaging with others across teams and socializing solutions.
Strong skills are desired in each of the following areas:
Development: Experience programming with one or more languages: Python, Java, Groovy, Go, etc.
IaC Tools for Platform Automation: Strong skills and experience in at least one of: Ansible, Terraform, AWS CloudFormation, CDK.
Containers: Docker or other OCI-certified containers is a plus.
Container Orchestration Platform: Experience with Kubernetes, AWS EKS, or AWS ECS is a plus.
CNI Plugins: Calico, Flannel, Weave Net, etc.
Service Mesh: Istio, AWS App Mesh, OpenShift Service Mesh, etc.
Container Security Tools: Twistlock, Sysdig, Aqua, etc. is a plus.
Platform Monitoring, Observability & Performance Tools: Nginx, New Relic, AppDynamics, Datadog, Thanos, Jaeger, LogDNA, etc.
DevOps Tools: Git/Repo, Crucible, Bitbucket, Jira, Ansible, Puppet, Jenkins, ArgoCD, Bamboo, Maven, Artifactory, Nexus, etc.
Knowledge, Skills and Abilities (KSAs):
- Knowledge of programming languages and web-based technologies.
- Ability to collaborate to solve technical problems across teams.
- Expert - Excellent communication skills, both written and verbal.
- Expert - The incumbent is required to immediately disclose any debarment, exclusion, or other event that makes them ineligible to perform work directly or indirectly on Federal health care programs.
- Must be able to effectively work in a fast-paced environment with frequently changing priorities, deadlines, and workloads that can be variable for long periods of time.
- Must be able to meet established deadlines and handle multiple customer service demands from internal and external customers within set expectations for service excellence.
- Must be able to effectively communicate and provide positive customer service to every internal and external customer, including customers who may be demanding or otherwise challenging.
Thanks & Regards
--
LAXMAN
Team Lead - Talent Acquisition
KMM Technologies Inc.
CMMI Level 2, ISO 9001, ISO 20000, ISO 27000 Certified