Our client provides an online student experience platform.
The position is based in Cambridge, MA, and is hybrid (in office 2-3 days per month).
Responsibilities:
Automation and Efficiency: Automate manual tasks for developing and deploying code and data by implementing continuous integration and continuous deployment frameworks. This involves creating software deployment strategies and automating tasks through scripting, coding, orchestration tools, and Infrastructure-as-Code (IaC) practices. Key initiatives include migrating to Terraform for new services, cleaning up existing infrastructure, consolidating CI/CD pipelines (likely onto GitHub Actions with self-hosted runners), and streamlining QA automation.
Strategy and Architecture: Develop and execute a global DevOps/Cloud strategy aligned with business objectives, scalability, and technology choices. This includes designing, building, managing, and improving core infrastructure.
Reliability and Performance: Ensure high availability, performance, and reliability of SaaS services through effective monitoring, alerting, and disaster recovery planning. This also involves identifying and implementing data storage methods and assisting with the observability stack project to accelerate solution implementation.
Security and Compliance: Collaborate with the security team to conduct regular security assessments and audits, ensuring compliance with industry regulations and standards relevant to SaaS offerings, including SOC 2 audits. This also involves overseeing infrastructure best practices, restricting privileges, and reducing access for creating items in production environments, in line with DevSecOps principles.
Troubleshooting and Optimization: Debug production issues across various services and levels of the stack, improve system observability and traceability, and implement monitoring solutions to identify and resolve production issues efficiently.
Lead and Mentor: Guide and mentor a team of DevOps/Cloud engineers, fostering collaboration and continuous improvement in the delivery pipeline, including through training sessions for junior team members.
Communication and Collaboration: Act as a liaison between the DevOps/Cloud team and other technical and non-technical teams, improving communication and interaction between development and operations so that information is conveyed clearly and teams perform well.
Continuous Improvement: Stay abreast of industry trends and best practices, researching, testing, and applying new techniques to software development projects. This includes promoting, documenting, and implementing technologies and processes that enhance developer productivity.
Technologies:
Cloud Providers: Expertise in AWS is a must-have.
Infrastructure as Code: Strong knowledge of Terraform.
CI/CD Tools: Strong knowledge of CI/CD tools, with a preference for GitHub Actions.
Observability Stacks: Experience with observability stacks (LGTM, Elastic, Datadog).