Job Title: Sr. Specialist Software Reliability Engineer
Job Code: 22686
Job Location: Tulsa OK (Onsite)
Schedule: 9/80
Job Description:
The Senior Specialist Software Reliability Engineering position develops and executes Software Reliability Engineeirng (SWRE) program and procedures. The Software Reliability Engineering role ensures integration of the SWRE effort with traditional hardware related reliability efforts (e.g. Failure Predictions Failure Modes Effects and Criticality Analysis (FMECA) Reliability Block Diagrams (RBD) Fault Tree Analysis (FTA) Root Cause Analysis (RCA) Failure Reporting & Corrective Action System (FRACAS) and BuiltinTest (BIT) and Testability Analysis). The role will create professional datadriven presentations for briefing technical decisions or recommendations both to internal stakeholders and external vendors/customers. The position will recommendchanges to software code architecture or processes to increase software reliability as well as researchingnew SWRE methods for effectiveness to specific applications such as SW Reliability Growth.
Essential Functions:
- Ability to obtain and maintain a DoD security clearance.
- Creates and implements SWRE requirements throughout all phases of the systems lifecycle (design production operations and sustainment).
- Creates and identifies failure definitions and scoring criteria across various software development and product fielding phases.
- Develops hierarchical Software Bill of Materials (BOM) (SBOM) collaborating with Cyber resources.
- Performs and facilitates Software Failure Modes and Effects Analysis (SFMEA) meetings and documentation. Highlevel failure modes a SFMEA may catch would relate to requirements functionality design timing sequence/logic/state data exception handling interfaces (including UI/UX) coding memory and algorithm defects.
- Creates behavior models and state diagrams for analysis using Markov Chains or similar methods.
- Participates in software design meetings and code reviews from multiple viewpoints (e.g. Functional Interface Detailed Maintenance Usability Serviceability Vulnerability Process)
- Tracks software defects and failures over time with the ability to create actionable insights metrics and investigate with Software Root Cause Analysis (RCA) approaches. Reports actionable insights and reliability recommendations back to engineering and the program management organization.
- Calculates Software Failure Rate / Mean Time Between Failures (MTBF) utilizing one or more of the methods described in IEEE1633 to include in terms of Effective 1k Source Lines of Code (EKSLOC)
- Promotes a DesignforTest mindset within the software development organization.
- Integrates with MBSE Modeling to incorporate R&M paramters into modeling and simulation tools.
- Collaborate with Software team for SFMEA development root cause analysis of field observations of software bugs faults or failures.
- Works effectively and efficiently in a virtual environment with tools such as MS Teams Atlassian Jira etc.
- Proficiency in advanced Excel methods (data structures pivot tables manipulation transformation and visualization) is highly preferred.
- Technical understanding of electronic components / computer hardware concepts is a plus.
- Technical understanding of troubleshooting complex failure modes which include the interactions between software and hardware is a plus.
- Ability to interpret and work in a Model Based Systems Engineering (MBSE) environment to understand the system deisgn and contribute to system modeling in Cameo.
- Proactive mindset with the ability to anticipate potential Reliability challenges and develop effective mitigation strategies.
- Possesses strong understanding and use of reliability software tools (e.g. ReliaSoft Relex (PTC Windchill) Reylence).
- Statistical analysis experience using software tools such as MiniTab JMP R MatLab or similar.
- Experience in data mining using business tools SQL PowerBI Tableau Alteryx or similar data visualization tool.
Qualifications:
- Bachelors Degree and minimum 6 years of prior relevant experience. Graduate Degree and a minimum of 4 years of prior related experience. In lieu of a degree minimum of 10 years of prior related experience.
- 4 years experience in Software Systems and/or ReliabilityEngineering
- Must be a U.S. citizen.
Preferred Additional Skills:
- Active DoD security clearance.
- Degree in Electrical/Mechanical Engineering Systems Engineering Computer Science Mathematics Physics or similar educational background preferred.
- Knowledge and experience in the Aerospace/Defense sector.
- Understands failure modes and mechanisms at a physical level (e.g. mechanical electrical thermal optical) and at the software level (i.e. BuiltinTest (BIT) interfaces etc).
- Calculates BuiltinTest (BIT) Fault Detection Rate (FDR) Fault Isolation Rate (FIR) and False Alarm Rate (FAR) to support Testability analysis.
- Collaborated with Software Development team for Design for Test with BuiltinTest (BIT) / BuiltinTest Equipment (BITE).
- Promotes a culture of Reliability Centered Maintenance (RCM) implements Predictive Maintenance (PdM) processes and recommends both logistical and engineering opportunities to improve longterm sustainability of the systems through a mission readiness lens. Knowledge of Prognostics Health Management (PHM) or Condition Based Maintance plus (CBM) is a plus.
- Participates in safety analyses including Human Machine Interface (HMI) and other Human Factors design pricipals as it relates to mission operation and success.
- Familiarity with Agile development methods (e.g. Scrum SAFe) DevSecOps CI/CD containerization.
- Knowledge and experience in advanced Optical Systems EO/IR Sensors or ISR applications.
- Statistical methods and distributions associated with reliability engineering (e.g. exponential Weibull).
- Reliability prediction methodologies (e.g. MIL217 VITA 51.
- FRACAS experience.
- Integrated Logistics Support (ILS) / Integrated Product Support (IPS) experience.
- Verification and Validation (V&V) experience or Integration and Test (I&T) experience.
- Environmental Testing / Qualification (ESS) experience.
- Knowledge of databases and data mining / big data analytics / predictive analytics.
- Knowledge of any of the following documents: MILSTD2155 MILHDBK 217 MILHDBK 338 MILSTD 1629 MILHDBK 470 TAHB0009 SAE ARP5580 SAE 1025
- Experience working under an AS9100 / ISO9001 Quality Management System.
- ASQ Certified Reliability Engineer (CRE) Certification or similar.
- Excellent oral and written communication skills.
- Strong organizational and time management skills.
- Be dynamic and adaptable to change.
- Have a strong desire to learn possess good interpersonal skills and decision making/critical thinking skills.
- The essential physical requirements of the position will be reviewed with the applicant during the interview.
#LIAS1
Required Experience:
Senior IC