Senior Systems Engineer – Mechanical

Fleet Data Centers

Not Interested
Bookmark
Report This Job

profile Job Location:

Mercer Island, WA - USA

profile Monthly Salary: $ 120000 - 150000
Posted on: 30+ days ago
Vacancies: 1 Vacancy

Job Summary

Position Overview:

We are seeking a Senior Systems Engineer Mechanical to own the detailed design validation analysis and ongoing optimization of Fleets data center cooling topology from campus level through the rack. This role requires a deep understanding of Fleet data center cooling topology including air-side and liquid-side systems (fan walls CRAHs/CRACs chillers dry coolers pumps heat exchangers distribution manifolds in-rack cooling components) and how these components interact under a variety of operating and failure scenarios.


The ideal candidate will pair strong mechanical engineering fundamentals with practical data center cooling experience ensuring that the air-to-liquid mix and cooling configuration for each deployment match rack layouts and rack SKUs that CFD and failure-mode simulations are routinely used to de-risk deployments and that cooling system behavior is well understood and systematically improved. This role is accountable for end-to-end thermal system integrity including aisle-level optimization fan wall octet configuration failure-mode simulations (e.g. CRAC outage dry cooler outage) and impact assessment for infrastructure upgrades and expansions with the goal of optimizing uptime SLAs and minimizing cooling stranding.


We have a hybrid policy and candidates can sit in Seattle WA Denver CO Austin TX or Alexandria VA


Responsibilities


Cooling Topology Ownership & Rack-Level Alignment

  • Develop and maintain a deep understanding of Fleet data center cooling topology including:
    • Air-side systems: fan walls CRAHs/CRACs air handlers ducting containment filters.
    • Liquid-side systems: chillers dry coolers pumps CDUs heat exchangers headers/manifolds valve trains.
    • Rack-level solutions: liquid-cooled cold plates rear-door heat exchangers in-rack manifolds hybrid air/liquid configurations.
  • Determine the air-to-liquid mixneeded to support a given rack layout and density considering rack SKUs aisle configuration containment and site constraints.
  • Ensure cooling topology and capacity at room/aisle level support current and forecast rack deployments and density targets.

Rack SKUs Cooling Requirements & Data Accuracy

  • Understand the air and liquid cooling requirements for each rack SKU including:
    • Inlet temperature and humidity ranges.
    • Liquid flow pressure and temperature ranges for cold plates and rear-door heat exchangers.
  • Maintain structured mapping from rack SKUs to:
    • Required airflow per rack/aisle.
    • Required liquid flow per rack/manifold/loop.
    • Special constraints (e.g. mixed air/liquid aisles max ΔT).
  • Ensure specifications and counts for cooling components (fan wall modules CRAHs/CRACs CDUs pumps valves manifolds piping sizes coils) are accurate documented and provided to capacity planning and procurement.

CFD & Thermal Analysis

  • Perform CFD analysis at room and aisle level to:
    • Validate that planned rack placement does not create hot spots.
    • Confirm that airflow patterns pressure profiles and temperature distributions are within allowable limits.
    • Identify and mitigate cooling stranding where cooling capacity exists but cannot be effectively delivered to IT load because of placement or topology.
  • Use CFD and thermal modeling tools to:
    • Evaluate different rack arrangements and containment strategies.
    • Test sensitivity to changes in IT load fan speeds supply temperatures and air-to-liquid mix.
    • Quantify margin to thresholds (e.g. maximum rack inlet temperature maximum component temperatures).
  • Translate CFD results into actionable design rules placement constraints and deployment guidelines for capacity planners and operations.

Aisle-Level Optimization & Fan Wall Configuration

  • Optimize cooling for each aisle based on:
    • Actual and forecasted IT load distribution.
    • Air-to-liquid split for the racks in that aisle.
    • Containment strategy (cold aisle hot aisle full containment partial containment).
  • Recommend fan wall octet configurations (and other fan wall module configurations) per deployment to:
    • Deliver required airflow and pressure with redundancy.
    • Maintain redundancy and margin for failure and maintenance scenarios.
    • Minimize energy use while maintaining thermal headroom.
  • Work with operations to tune setpoints (supply temperatures fan speeds differential pressure chilled water temperatures) to support uptime SLAs and reduce cooling stranding and over-provisioning.

Failure Mode Simulations & Uptime Optimization

  • Conduct failure mode simulations and analyses for mechanical systems including at minimum:
    • CRAC/CRAH outage scenarios (single unit or multiple simultaneous failures).
    • Dry cooler outage or degraded performance scenarios.
    • Pump failures valve failures and partial loss of liquid loops.
  • Evaluate for each scenario:
    • Transient and steady-state temperature excursions at the rack and component level.
    • Time-to-threshold (how long before violating safe temperature limits).
    • Impact on redundancy load shedding requirements and achievable uptime.
  • Use results to:
    • Recommend design improvements (additional redundancy loop segmentation capacity rebalancing).
    • Define operational responses and MOPs (e.g. load shedding priorities setpoint changes).
    • Optimize uptime SLAs while minimizing cooling stranding especially in mixed air/liquid deployments and high-density aisles.

Infrastructure Upgrades & Expansion Impact Analysis

  • Lead or support infrastructure upgrades and expansion impact analyses for cooling systems including:
    • Adding or resizing fan walls CRAHs/CRACs dry coolers chillers pumps CDUs and distribution headers.
    • Increasing liquid cooling fraction as AI-heavy racks grow in share.
    • Changing setpoints or operating modes (e.g. different supply temperatures economization strategies).
  • Quantify for proposed changes:
    • Effect on current and future thermal capacity and headroom.
    • Changes in aisle-level and room-level airflow / liquid flow distribution.
    • Impact on PUE water usage and operating costs.
  • Provide mechanical engineering input into MOPs and risk assessments for any cooling system change that could impact live IT load.

Cross-Functional Collaboration & Documentation

  • Partner with capacity planners rack design teams site operations facilities engineering and procurement to ensure:
    • Cooling design and capacity assumptions are aligned with rack deployment plans and SLAs.
    • Air-to-liquid decisions are integrated into forecast models and program timelines.
  • Produce and maintain clear design guides reference one-lines piping schematics and airflow diagrams for:
    • Baseline Fleet data halls.
    • High-density / AI-specific deployments.
  • Contribute mechanical content to internal standards and playbooks covering:
    • Cooling topology design rules.
    • CFD analysis methodologies and acceptance criteria.
    • Failure mode simulation procedures and reporting standards.


Required Qualifications

  • Bachelors degree in Mechanical Engineering or a closely related engineering discipline.
  • 6 years of experience in data center mechanical engineering mission-critical HVAC design or thermal systems engineering for large industrial or technology facilities.
  • Demonstrated deep understanding of data center cooling topologies including both air-cooled and liquid-cooled architectures (fan walls CRAHs/CRACs chillers dry coolers pumps heat exchangers CDUs manifolds containment systems).
  • Hands-on experience performing and interpreting CFD analysis for data halls or similar mission-critical environments with a track record of using CFD results to drive design changes and rack placement decisions.
  • Proven ability to:
    • Determine appropriate air-to-liquid mix for given rack layouts and densities.
    • Assess and optimize thermal performance at rack aisle and room levels.
    • Identify and remediate hot spots and cooling stranding.
  • Experience designing or analyzing failure modes for cooling systems (e.g. CRAC/CRAH outage dry cooler/chiller degradation pump or valve failures) and translating results into design and operational mitigations.
  • Strong analytical and problem-solving skills with the ability to connect thermal and mechanical design decisions to uptime SLA performance and site efficiency (PUE water usage).
  • Clear written and verbal communication skills including the ability to document complex cooling concepts and present analyses to engineering and operations stakeholders.

Preferred

  • Experience in hyperscale or colocation data centers especially supporting high-density AI/GPU clusters and advanced liquid cooling (direct-to-chip rear-door heat exchangers in-rack manifolds).
  • Proficiency with industry-standard CFD and thermal analysis tools and familiarity with integrating results into DCIM/BMS or capacity planning workflows.
  • Familiarity with data center efficiency metrics (e.g. PUE WUE) and how cooling design decisions influence them.
  • Experience with DCIM BMS and monitoring systems for tracking and optimizing thermal performance in production environments.
  • Knowledge of relevant mechanical and building codes and standards as applied to mission-critical facilities.
  • Prior experience conducting infrastructure upgrade or expansion impact analyses in live data centers including development of MOPs and risk mitigations.

Required Traits and Skills

  • Integrity and Ethical Standards: Make safety- and reliability-focused decisions in all cooling design and operations work especially in live critical environments.
  • Effective Communication: Clearly explain complex cooling and CFD concepts tradeoffs and risks to both technical and non-technical audiences; comfortable presenting findings to engineering operations and leadership teams.
  • Operational Paranoia: Anticipate mechanical and thermal risks identify vulnerabilities in cooling topology and controls and proactively implement mechanisms to prevent and minimize disruptions and safeguard safety security availability and scale.
  • Strategic & Systems Thinking: Understand how local cooling decisions (e.g. air-to-liquid ratios fan wall octet configurations CDUs placement) affect the behavior of the entire thermal system and overall uptime SLA.
  • Critical Thinking & Analytical Ability: Use data telemetry CFD and empirical testing to drive decisions; evaluate cost risk and performance tradeoffs for alternative cooling designs and operating strategies.
  • Collaboration & Relationship Management: Build strong working relationships with capacity planners site operations facilities engineers vendors and other partners to drive aligned outcomes across power cooling and IT.
  • Leadership & Mentorship: Provide technical leadership within the mechanical engineering domain mentor peers and junior engineers and help establish Fleets best practices for high-density cooling.

Expected Salary Range

$120000 - $150000 Salary Bonus

Fleet Data Centers Employment

Fleet Data Center employees enjoy competitive compensation and comprehensive benefits including 100% employer-covered medical dental and vision insurance a 401K program standard paid holidays and unlimited PTO.

NOTE: This job description is not intended to be all-inclusive. Employees may perform other related duties as assigned to meet the organizations ongoing needs.

Fleet Data Centers is proud to be an Equal Opportunity Employer. Qualified applicants are considered for employment regardless of age race color religion sex national origin sexual orientation gender identity disability or veteran status.

If you need assistance applying for any of our open positions please contact us at



Required Experience:

Senior IC

Position Overview:We are seeking a Senior Systems Engineer Mechanical to own the detailed design validation analysis and ongoing optimization of Fleets data center cooling topology from campus level through the rack. This role requires a deep understanding of Fleet data center cooling topology incl...
View more view more

Key Skills

  • Children Activity
  • Administrative Skills
  • Data Warehousing
  • Fire And Safety Engineering
  • Fleet
  • Asic

About Company

Fleet Data Centers designs, builds and operates mega-scale data center campuses. Fleet provides its customers with flexibility and predictability to meet their upside demand forecasts, addressing a key need in the market as traditional leased models are struggling to keep pace with t ... View more

View Profile View Profile