Role : IT Engineer
Location : Reston VA Hybrid
Shifts: Candidates should be open to rotational shifts or a constant night/day shift based on the roster (like GOC).
Duration : Oneyear contract
Candidates will support the following key areas:
- Production support
- Application & Cloud Infrastructure support
- Incident management
- Command center
- Production stability
- Observability
- Strong administrative experience in AWS
- Oncall 24x7 support
- Experience with ticketing systems such as Jira ServiceNow etc.
- Experience with monitoring tools and Splunk
- Strong written and interpersonal skills
Key Responsibilities:
- Provide support for complex or specialized application or infrastructure tasks incidents changes and requests.
- Coordinate and manage changes to the production environment while ensuring safety and soundness.
- Lead operational implementation and maintenance of complex IT infrastructure and application projects.
- Troubleshoot and resolve advanced and complex system service and application issues.
- Provide guidance and training to junior team members.
- Identify and drive automation initiatives.
- Propose and implement system performance enhancements.
- Resolve monitoring and alerting issues including threshold updates and new monitors.
- Handle compliance activities including access and password management.
- Handle escalated incidents and lead critical incident response efforts and root cause analysis.
- Lead critical application releases and implementations.
- Update configurations including testing and peer reviewing.
- Create/modify code scripts and monitors to resolve or prevent incidents.
- Collaborate with teams to enhance monitoring tools and processes.
- Provide reporting and analysis on businessimpacting incident trends.
Skills Required:
- Good communication skills (oral and written).
- Attention to detail and multitasking ability.
- Proven experience with:
- Unix/Linux
- AWS cloud platforms (Certification required)
- Advanced scripting and automation skills
- Monitoring tools like Extrahop SolarWinds and Catchpoint
- Analyzing dashboards and reporting tools to identify trends and patterns
- Handson experience with Splunk and other transactionlevel monitoring tools.
Top MustHave Skills:
- TIL Certification is required.
- Proficiency in Office products (Excel Word Outlook).
Education/Experience:
- 510 years of cloud operations and engineering experience.
- Bachelors degree.