drjobs Senior Data Engineer LLM

Senior Data Engineer LLM

Employer Active

1 Vacancy
drjobs

Job Alert

You will be updated with latest job alerts via email
Valid email field required
Send jobs
Send me jobs like this
drjobs

Job Alert

You will be updated with latest job alerts via email

Valid email field required
Send jobs
Job Location drjobs

Warsaw - Poland

Monthly Salary drjobs

Not Disclosed

drjobs

Salary Not Disclosed

Vacancy

1 Vacancy

Job Description

Description

Sunsers is a technology consultancy that empowers finance and healthcare leaders to succeed by leveraging cuttingedge software data and AI.

We combine worldclass engineering deep industry expertise and proprietary knowhow to deliver innovative highimpact solutions. Specializing in software engineering DevOps data engineering and data science we design and build AIpowered data platforms and web applications tailored to each clients unique needs.

Trusted by over 60 clients across the US UK and beyond we consistently maintain a 4.9/5 client satisfaction rating with partnerships averaging five years or more.

The project:

We are carrying out the project for our client an American private equity and investment management fund listed on the Forbes 500 list based in New York.

We support them in the area of the infrastructure and data platform and very recently we also build and experiment with Gen AI applications. The client operates very widely in the world of finance loans investments and real estate.

As a Senior Data Engineer youll design and implement core systems that enable data science and data visualization at companies that implement datadriven decision processes to create a competitive advantage.

Youll build data platform for data and business teams including internal tooling data pipeline orchestrator data warehouses and more using:

Technologies: Python Terraform SQL Pandas Shell scripts

Tools: git Docker Snowflake Pinecone Neo4j Jenkins Jupyter Notebook OpenAI API Apache Airflow / Astronomer Kubernetes Artifactory Windows with WSL Linux Gitlab

AWS: EC2 ELB IAM RDS Route53 S3 and more

Best Practices: Continuous Integration Code Reviews

The ideal candidate will be well organized eager to constantly improve and learn driven and most of all a team player!

Your responsibilities will include:

  • Developing PoCs using latest technologies experimenting with third party integrations
  • Delivering production grade applications once PoCs are validated
  • Creating solutions that enable data scientists and business analysts to be selfsufficient as much as possible.
  • Finding new ways how to leverage Gen AI applications and underlying vector and graph data storages
  • Designing datasets and schemes for consistency and easy access
  • Contributing data technology stacks including data warehouses and ETL pipelines
  • Building data flows for fetching aggregation and data modeling using batch and streaming pipelines
  • Documenting design decisions before implementation


Requirements

Whats important for us

  • At least 5 years of professional experience in datarelated role
  • Undergraduate or graduate degree in Computer Science Engineering Mathematics or similar
  • Expertise in Python and SQL languages
  • Experience with data warehouses (Snowflake)
  • Experience with different types of database technologies (RDBMS vector graphs document based etc.
  • Expertise in AWS stack and services
  • Proficiency in using Docker
  • Experience with infrastructureascode tools like Terraform
  • Great analytical skills and attention to detail asking questions and proactively searching for answers
  • Excellent command in spoken and written English at least C1
  • Creative problemsolving skills
  • Excellent technical documentation and writing skills
  • Ability to work with both Windows and Unixlike operating systems as the primary work environments

You will score extra points for:

  • Experience with integrating LLMs (OpenAI but also others maybe open source)
  • Understanding of LLMs fine tuning embedding and vector semantic searching
  • Experience with Pinecone or Neo4j
  • Familiarity with data visualization in Python using either Matplotlib Seaborn or Bokeh
  • Proficiency in statistics and machine learning as well as Python libraries like Pandas NumPy matplotlib seaborn scikitlearn etc
  • Experience in building ETL processes and data pipelines with platforms like Airflow or Luigi
  • Knowledge of any Python web framework like Django or Flask with SQLAlchemy
  • Experience in operating within a secure networking environment like a corporate proxy
  • Experience in working with repository manager for example Jfrog Artifactory


Benefits

What do we offer


Sounds like a perfect place for you Dont hesitate to click apply and submit your application today!


Required Experience:

Senior IC

Employment Type

Full Time

About Company

Report This Job
Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.