Job Description Summary
At GE Vernova Power our Data Science and AI (DSAI) team spearheads innovation by integrating advanced data science and AI solutions to transform business operations. We are looking for a creative and detail-oriented Staff Data Scientist to be instrumental in operationalizing our Generative AI strategy with a critical focus on data.
This role serves as a crucial bridge between our complex business data and advanced AI models. You will lead teams developing statistical machine learning and AI solutions for Gas Power stakeholders. Your core mission involves deep exploratory analysis strategic curation and rigorous management of business-specific data to generate high-quality reliable input for Large Language Model (LLM) applications.
You will contribute to deploying modern machine learning operational research and semantic analysis methods to derive insights and achieve Gas Powers strategic objectives. Importantly this position focuses not on building models from scratch but on ensuring that models developed by our central AI Foundry effectively understand and address specific business challenges. This role is ideal for a data expert passionate about uncovering hidden context and enabling transformative business solutions.
Job Description
As a Staff Data Scientist you will be part of a data science or cross-disciplinary team developing innovative solutions typically involving large complex data sets to achieve business outcomes. These teams will include statisticians computer scientists software developers engineers product managers and functional addition to hands on development the Staff Data Scientist will lead extended team members from the Emerging Technology Guild and functional DT teams to develop and operationalize data science solutions are ready for scale-up.
- Perform comprehensive exploratory data analysis (EDA) on diverse and complex business datasets using statistical analysis Natural Language Processing (NLP) and unsupervised clustering techniques to uncover patterns identify quality issues and extract meaningful insights.
- Collaborate closely with business Subject Matter Experts (SMEs) to translate their deep domain knowledge into structured AI-ready datasets for use in prompt engineering Retrieval-Augmented Generation (RAG) and model fine-tuning.
- Develop and prepare golden datasets that serve as pristine examples of our business processes significantly reducing the iteration time for prompt engineering and AI development teams.
- Design create and maintain a suite of data benchmarks that represent our core business use cases. These benchmarks will be the definitive standard for evaluating the real-world performance of AI methods within our BU.
- Establish and enforce rigorous data quality standards and validation protocols ensuring the accuracy relevance and integrity of all data used in our GenAI applications.
- Proactively identify and document potential data biases working with stakeholders to develop mitigation strategies that promote responsible and fair AI outcomes.
- Serve as the primary steward for the BUs curated AI datasets defining and implementing a clear data management strategy that includes versioning access controls and a lifecycle management plan.
- Create and maintain comprehensive documentation for all curated datasets (e.g. datasheets for datasets) detailing their origin schema limitations and intended use to ensure transparency and reusability.
- Continuously survey the BUs data landscape to identify new high-value data sources and champion their integration into our Generative AI ecosystem.
- Act as the primary data liaison between the Business Unit prompt engineers and the central AI Foundry.
- Rigorously test and validate the effectiveness of generalized tools and methods provided by the AI Foundry against your BU-specific data benchmarks.
- Provide precise data-driven feedback and recommendations to the Foundry collaborating to refine and enhance central AI capabilities to ensure they meet our specific business needs.
- Communicate methods findings and hypotheses with stakeholders
Qualifications
Required Qualifications:
- Bachelors or Masters degree in a quantitative field such as Data Science Computer Science Statistics Economics or a related discipline.
- 3-5 years of professional experience as a Data Scientist Data Analyst or in a similar role with a heavy emphasis on data exploration manipulation and preparation.
- Strong proficiency in Python and core data science libraries (e.g. pandas NumPy scikit-learn spaCy NLTK).
- Demonstrated experience with a wide range of exploratory data analysis and unsupervised machine learning techniques (e.g. clustering topic modeling dimensionality reduction).
- Proven ability to work with messy unstructured and semi-structured data especially text.
- Exceptional communication and interpersonal skills with a talent for translating complex technical concepts to non-technical audiences and building strong relationships with business stakeholders.
Preferred Qualifications:
- Hands-on experience preparing data specifically for Generative AI systems (e.g. creating datasets for RAG few-shot prompting or supervised fine-tuning).
- Familiarity with the architecture of modern LLMs and the role of vector databases (e.g. Pinecone Milvus Weaviate).
- Experience in establishing data quality frameworks data governance policies or data management best practices.
- Prior experience working in a federated analytics or data science model where collaboration between central and business-embedded teams was required.
- Domain expertise relevant to our Business Unit
Additional Information
GE Vernova will only employ those who are legally authorized to work in the United States for this opening. Any offer of employment is conditioned upon the successful completion of a drug screen (as applicable).
Relocation Assistance Provided: No
#LI-Remote - This is a remote position
For candidates applying to a U.S. based position the pay range for this position is between $103900.00 and $173100.00. The Company pays a geographic differential of 110% 120% or 130% of salary in certain areas. The specific pay offered may be influenced by a variety of factors including the candidates experience education and skill set.
Bonus eligibility: discretionary annual bonus.
This posting is expected to remain open for at least seven days after it was posted on November 26 2025.
Available benefits include medical dental vision and prescription drug coverage; access to Health Coach from GE Vernova a 24/7 nurse-based resource; and access to the Employee Assistance Program providing 24/7 confidential assessment counseling and referral services. Retirement benefits include the GE Vernova Retirement Savings Plan a tax-advantaged 401(k) savings opportunity with company matching contributions and company retirement contributions as well as access to Fidelity resources and financial planning consultants. Other benefits include tuition assistance adoption assistance paid parental leave disability benefits life insurance 12 paid holidays and permissive time off.
GE Vernova Inc. or its affiliates (collectively or individually GE Vernova) sponsor certain employee benefit plans or programs GE Vernova reserves the right to terminate amend suspend replace or modify its benefit plans and programs at any time and for any reason in its sole discretion. No individual has a vested right to any benefit under a GE Vernova welfare benefit plan or program. This document does not create a contract of employment with any individual.