Azure SRE Engineer
Chicago, IL
Onsite
Responsibilities:
Cloud Technologies:
- Design and implement cloud infrastructure services for IaaS, PaaS, CaaS, and SaaS applications. Manage and optimize cloud infrastructure services on Azure and AWS.
Automation and Scripting:
- Tools: PowerShell, Python, Bash.
- Services: Jenkins, GitHub Actions, GitLab; Harness is an advantage.
- Automate tasks and streamline processes using various scripting languages and tools.
CI/CD Pipeline Management:
- Tools: Azure DevOps, Jenkins, GitHub Actions, ArgoCD.
- Services: Continuous Integration and Continuous Deployment (CI/CD) pipelines.
- Implement and manage CI/CD pipelines to ensure smooth software delivery.
Infrastructure as Code (IaC):
- Tools: Terraform, Ansible.
- Services: Automating infrastructure provisioning and management.
- Use IaC tools to automate the provisioning and management of infrastructure.
Identity and Access Management:
- Azure: Azure Active Directory, Role-Based Access Control (RBAC).
- AWS: AWS IAM, AWS SSO.
- Manage identity and access controls to ensure secure access to resources.
Monitoring and Logging:
- Tools: New Relic, Azure Monitor, Application Insights, Log Analytics Workspace (KQL), Prometheus, Grafana.
- Services: Setting up dashboards for reactive and proactive monitoring and logging.
- Set up and manage monitoring and logging tools to ensure system health and performance.
Cloud Security and Threat Management:
- Azure: Azure Security Center, Network Security Groups (NSGs), Azure Firewall, Private Endpoints, etc.
- Implement security measures and manage threats to protect cloud resources, leveraging PRISMA.
Data Pipeline Design and Implementation:
- Azure: Azure Data Factory, Azure Synapse Analytics, Databricks.
- Design and implement data pipelines for processing and visualization.
Collaboration and Communication:
- Tools: Microsoft Teams, Jira, Confluence.
- Services: Effective communication and collaboration within a distributed team.
- Foster effective communication and collaboration within a geographically distributed team.
Documentation:
- Tools: Lucidchart, Microsoft Office 365, Jira/Confluence.
- Services: Creating and maintaining up-to-date documentation.
- Create and maintain comprehensive documentation for systems and processes.
- Create software architecture and design documentation for the supported solutions, covering overall best practices and patterns.
AI/ML Integration:
- Tools: Azure Machine Learning, Azure OpenAI Service.
- Services: Leveraging AI/ML technologies for automation and optimization.
- Integrate AI/ML technologies to enhance automation and optimize workflows.
Security and Compliance:
- Services: Implementing security measures and ensuring compliance with industry standards.
- Ensure systems and processes comply with security standards and regulations.
Technical Support and Troubleshooting:
- Tools: Azure Support, AWS Support.
- Services: Providing technical support and troubleshooting.
- Provide technical support and troubleshoot issues to maintain system reliability.
- Work directly with the technology and engineering teams to integrate data processing and business objectives.
- Monitor and optimize data performance, uptime, and scale; maintain high standards of code quality and thoughtful design.
- Support the team with technical planning, design, and code reviews, including peer code reviews.
- Provide architecture and technical knowledge training and support for the solution groups.
Continuous Learning and Improvement:
- Approach: Staying current with emerging trends and technologies and continuously improving skills and processes.
- Stay updated with the latest trends and continuously improve skills and processes.
- Conceptualize PoCs and Architecture as a Service.
- Stay current with emerging trends in GenAI technologies and make recommendations as needed to help the organization innovate.
- Proactively plan complex projects from scope/timeline development through technical design and execution.
- Demonstrate leadership through mentoring other team members.
Skills
Mandatory Skills: AppDynamics, Azure Monitor, Chaos Monkey, Chaos Testing, Cloudflare, CloudWatch, CyberArk, Datadog, Dynatrace, Git, Grafana, Jenkins, New Relic, Observability, Prometheus, Python, Reliability Patterns, Shell Scripting, Site24x7, Splunk, Sumo Logic, Threat Modeling