DevOps/SRE

Job Location: Austin, TX - USA

Monthly Salary: Not Disclosed
Posted on: 14 hours ago
Vacancies: 1

Job Summary

DevOps/SRE Role

Job Description:

Location: Onsite in Austin, TX
Duration: 6-month contract-to-hire


POSITION SUMMARY:
The Engineer will support data center growth using home-grown and third-party monitoring and management solutions, automation frameworks, and process documentation. Enhance internal tools and automation using Python and shell scripting. Develop and maintain complex SQL queries for data correlation across multiple data sources in Grafana. Integrate code analysis tools to improve software quality and security. Collaborate with team members to improve operational efficiency and monitoring visibility. Help the team with PDU/CDU/power shelf onboarding and provisioning.

RESPONSIBILITIES:

  • Design and Implement Monitoring Solutions: Design, develop, and implement robust monitoring, logging, and alerting systems using tools like Prometheus and Grafana to ensure high availability and performance of power and cooling production systems in data centers.
  • Log Management and Data Analysis: Use Splunk for log management and data analysis, and build dashboards to troubleshoot issues and identify performance bottlenecks.
  • Software Development and Automation: Write clean, efficient, and maintainable code in Python and other relevant languages to automate monitoring tasks and build integrations between various systems.
  • Database Management: Work with SQL and potentially NoSQL databases to manage time-series data and optimize database performance related to observability systems.
  • Version Control and Collaboration: Use Git for version control and collaborate effectively with development, operations, and product teams to integrate observability best practices into the software development lifecycle.
  • Troubleshooting and Incident Response: Participate in on-call rotations, respond to incidents, and conduct postmortem analysis to prevent future issues.
  • Create documentation for existing processes.

REQUIREMENTS:

  • Education: Bachelor's degree in Computer Science, Engineering, or a related technical field, or equivalent experience.
  • Experience: Relevant professional experience as a Software Engineer or Observability Engineer in a production environment.
  • Technical Skills:
    • Proficiency in programming languages, specifically Python.
    • Hands-on experience with observability tools, including Prometheus and Grafana.
    • Experience with log analysis platforms like Splunk.
    • Strong understanding of SQL and database management principles.
    • Familiarity with version control systems, especially Git.
    • Experience hosting application services in Kubernetes clusters.
    • Experience with cloud platforms (AWS, GCP, Azure) and containerization technologies (Docker, Kubernetes) is a plus.
    • Experience updating NetBox inventory through its API.
  • Soft Skills:
    • Strong problem-solving and analytical abilities.
    • Excellent communication skills and the ability to work in a collaborative team environment.
    • Demonstrated ability to work independently and manage ambiguity.