Job Title: Data Analyst/ Engineer
W2/FTE Only
Location: Remote
Exp: 10 Years
Visa: GC/USC/H4/GCEAD Only
Job Summary
Senior Level Data Engineer / Data Analyst technical lead with data analytics experience Databricks Pyspark and Python
This is a key role that requires senior/lead with great communication skills who is very proactive with risk & issue management.
Experience and Education Required
10 years of experience as Data Analyst / Data Engineer/Data Scientist with Databricks on AWS expertise in designing and implementing scalable secure and costefficient data solutions on AWS
Job Profile:
- Handson data analytics experience with Databricks on AWS Pyspark and Python
- Must have prior experience with migrating a data asset to the cloud using a GenAI automation option
- Experience in migrating data from onpremises to AWS
- Expertise in developing data models delivering datadriven insights for business solutions
- Experience in pretraining finetuning augmenting and optimizing large language models (LLMs)
- Experience in Designing and implementing database solutions developing PySpark applications to extract transform and aggregate data generating insights
- Data Collection & Integration: Identify gather and consolidate data from diverse sources including internal databases and spreadsheets ensuring data integrity and relevance.
- Data Cleaning & Transformation: Apply thorough data quality checks cleaning processes and transformations using Python (Pandas) and SQL to prepare datasets.
- Automation & Scalability: Develop and maintain scripts that automate repetitive data preparation tasks.
- Autonomy & Proactivity: Operate with minimal supervision demonstrating initiative in problemsolving prioritizing tasks and continuously improving the quality and impact of your work
Technical Skills:
- Minimum of 10 years of experience as a Data Analyst Data Engineer or related role ideally with a bachelors degree or higher in a relevant field.
- Strong proficiency in Python (Pandas Scikitlearn Matplotlib) and SQL with experience working across various data formats and sources.
- Proven ability to automate data workflows implement codebased best practices and maintain documentation to ensure reproducibility and scalability.
Behavioral Skills:
- Ability to manage in tight circumstances very proactive with risk & issue management
- Requirement Clarification & Communication: Interact directly with colleagues to clarify objectives challenge assumptions.
- Documentation & Best Practices: Maintain clear concise documentation of data workflows coding standards and analytical methodologies to support knowledge transfer and scalability.
- Collaboration & Stakeholder Engagement: Work closely with colleagues who provide data raising questions about data validity sharing insights and cocreating solutions that address evolving needs.
- Excellent communication skills for engaging with colleagues clarifying requirements and conveying analytical results in a meaningful nontechnical manner.
- Demonstrated critical thinking skills including the willingness to question assumptions evaluate data quality and recommend alternative approaches when necessary.
- A selfdirected resourceful problemsolver who collaborates well with others while confidently managing tasks and priorities independently.