Research Scientist, Relational Foundation Models

Avra

Job Location:

São Paulo - Brazil

Monthly Salary: Not Disclosed

Posted on: 30+ days ago

Vacancies: 1 Vacancy

Job Summary

About Avra

Avra is building relational foundation models for enterprise decision-making in Brazil.

Our work focuses on graph-native models for structured high-stakes prediction problems: credit fraud growth monitoring and other decisions where entities cannot be understood in isolation. We model companies people and the relationships between them as evolving networks then adapt those representations to customer-specific prediction tasks that plug into existing decisioning systems.

We work with internationally recognized research advisors and we care about research that becomes useful in production.

The role

This is an applied scientist role with real modeling depth.

You will help evolve the thesis architecture and applications of Avras relational foundation models: how we train them how we adapt them to specific tasks and how they generalize across use cases.

Day to day youll move between papers code experiments and production constraints. The goal is not to try interesting ideas for their own sake. The goal is to find which ideas improve real downstream models under realistic deployment conditions.

We run a weekly research review. Strong papers matter; shipped models matter more.

What youll work on

New approaches for relational foundation models over heterogeneous and temporal graphs
GNNs graph transformers attention over relations relative temporal encodings and other architectures for structured entity networks
Training objectives such as reconstruction contrastive learning generative modeling supervised learning and hybrid combinations
Transfer from foundation representations to downstream tasks through fine-tuning late fusion distillation calibration and task-specific evaluation
Rigorous evaluation: temporal validation leakage checks ablations strong baselines and error analysis
Large-scale training infrastructure using Ray including sampling sharding memory layout distributed execution and throughput optimization
Performance-sensitive ML systems: data loading graph sampling memory efficiency fused kernels and training-loop bottlenecks
Turning research ideas into reliable modeling components used in production

What were looking for

5 years in applied ML research research engineering or equivalent high-level ML systems work
Deep hands-on experience with PyTorch or a similar deep learning framework
Ability to read current research identify the core idea and turn it into a controlled experiment within a week or two
Experience with graph ML recommender systems ranking time-series models representation learning or structured-data domains where strong tabular baselines are hard to beat
Strong experimental discipline: baselines ablations temporal splits leakage prevention reproducibility and honest error analysis
Comfort with large datasets distributed training and the difference between a clean benchmark run and a pipeline that has to work every week
Engineering judgment to build work that others can maintain
Clear communication around model behavior experimental results and technical tradeoffs

You stand out if

You have worked with heterogeneous or temporal graphs using PyG DGL custom graph tooling or related systems
You have used Ray for distributed training data processing or serving
You have written Rust C CUDA Triton or fused kernels or worked seriously with JAX
You have optimized graph sampling memory usage data loading training loops or distributed workloads
You have shipped models into production and monitored how they behaved after deployment
You have contributed to open-source ML infrastructure published strong applied research or built serious internal research systems
You have worked in environments where the model only matters if it improves a real business metric

Requirements

Bachelors degree in a quantitative field: Computer Science Mathematics Statistics Physics Engineering Economics or similar
Masters or PhD is a plus not a filter
Strong written English
Portuguese is useful but not required

What we offer

Competitive salary equity and open compensation bands
Direct collaboration with founders research leadership and experienced AI advisors
Research budget paper incentives and support for publishing when the work is strong and appropriate
100% remote work with a São Paulo office available when you want it
Flexible time off national health plan and extended parental leave
High ownership over research directions that can become part of Avras core platform

If you want to help build foundation models for relational decision-making not as a benchmark exercise but as infrastructure used by real enterprises on real economic networks wed like to meet you.

Required Experience:

About AvraAvra is building relational foundation models for enterprise decision-making in Brazil.Our work focuses on graph-native models for structured high-stakes prediction problems: credit fraud growth monitoring and other decisions where entities cannot be understood in isolation. We model compa...

About Avra

Avra is building relational foundation models for enterprise decision-making in Brazil.

We work with internationally recognized research advisors and we care about research that becomes useful in production.

The role

This is an applied scientist role with real modeling depth.

You will help evolve the thesis architecture and applications of Avras relational foundation models: how we train them how we adapt them to specific tasks and how they generalize across use cases.

We run a weekly research review. Strong papers matter; shipped models matter more.

What youll work on

New approaches for relational foundation models over heterogeneous and temporal graphs
GNNs graph transformers attention over relations relative temporal encodings and other architectures for structured entity networks
Training objectives such as reconstruction contrastive learning generative modeling supervised learning and hybrid combinations
Transfer from foundation representations to downstream tasks through fine-tuning late fusion distillation calibration and task-specific evaluation
Rigorous evaluation: temporal validation leakage checks ablations strong baselines and error analysis
Large-scale training infrastructure using Ray including sampling sharding memory layout distributed execution and throughput optimization
Performance-sensitive ML systems: data loading graph sampling memory efficiency fused kernels and training-loop bottlenecks
Turning research ideas into reliable modeling components used in production

What were looking for

5 years in applied ML research research engineering or equivalent high-level ML systems work
Deep hands-on experience with PyTorch or a similar deep learning framework
Ability to read current research identify the core idea and turn it into a controlled experiment within a week or two
Experience with graph ML recommender systems ranking time-series models representation learning or structured-data domains where strong tabular baselines are hard to beat
Strong experimental discipline: baselines ablations temporal splits leakage prevention reproducibility and honest error analysis
Comfort with large datasets distributed training and the difference between a clean benchmark run and a pipeline that has to work every week
Engineering judgment to build work that others can maintain
Clear communication around model behavior experimental results and technical tradeoffs

You stand out if

You have worked with heterogeneous or temporal graphs using PyG DGL custom graph tooling or related systems
You have used Ray for distributed training data processing or serving
You have written Rust C CUDA Triton or fused kernels or worked seriously with JAX
You have optimized graph sampling memory usage data loading training loops or distributed workloads
You have shipped models into production and monitored how they behaved after deployment
You have contributed to open-source ML infrastructure published strong applied research or built serious internal research systems
You have worked in environments where the model only matters if it improves a real business metric

Requirements

Bachelors degree in a quantitative field: Computer Science Mathematics Statistics Physics Engineering Economics or similar
Masters or PhD is a plus not a filter
Strong written English
Portuguese is useful but not required

What we offer

Competitive salary equity and open compensation bands
Direct collaboration with founders research leadership and experienced AI advisors
Research budget paper incentives and support for publishing when the work is strong and appropriate
100% remote work with a São Paulo office available when you want it
Flexible time off national health plan and extended parental leave
High ownership over research directions that can become part of Avras core platform

If you want to help build foundation models for relational decision-making not as a benchmark exercise but as infrastructure used by real enterprises on real economic networks wed like to meet you.

Required Experience:

Apply Now

About Company

Avra

Our foundation model helps our clients bring the right SME to the top of the funnel, hyper-personalize offers, and reduce default. Request a demo.

View Profile View Profile

AI AutoApply

Apply to 100+ jobs with one click

AI Resume Builder

Create an ATS-ready CV in minutes

AI Cover Letter

Write a personalized letter instantly

Research Scientist, Relational Foundation Models

São Paulo - Brazil

Job Summary

About Avra

The role

What youll work on

What were looking for

You stand out if

Requirements

What we offer

About Avra

The role

What youll work on

What were looking for

You stand out if

Requirements

What we offer

About Company

Related Jobs