drjobs Data Scientist

Data Scientist

Employer Active

1 Vacancy
The job posting is outdated and position may be filled
drjobs

Job Alert

You will be updated with latest job alerts via email
Valid email field required
Send jobs
Send me jobs like this
drjobs

Job Alert

You will be updated with latest job alerts via email

Valid email field required
Send jobs
Jobs by Experience drjobs

0-1years

Job Location drjobs

Tokyo - Japan

Salary drjobs

Not Disclosed

drjobs

Salary Not Disclosed

Vacancy

1 Vacancy

Job Description

:

(Adtech) :
  • RTB( DSP(DemandSide Platform)
:
  • WiFi KPI
:

:


You will first participate in one of the following projects. After that you can further develop your expertise or participate in other projects to build your career as desired based on your achievements.

Adtech Project:
  • In RTB (realtime bidding) one of the main mechanisms for internet advertising we mainly design and develop machine learning models for DSP (DemandSide Platform) that optimize the costeffectiveness of advertisers and measure their effectiveness.
App project:
  • We will use causal inference technology to measure the effectiveness of new features and measures for apps that facilitate free WiFi connection and improve the KPIs of our services by establishing a system for datadriven management decisions.
Other projects:
  • We will apply data analysis and machine learning technology to support cryptocurrency trading fraud detection etc.
Research and development work:
  • While working on projects we will all spend a certain amount of time researching cuttingedge machine learning methods and new machine learning applications
  • In addition members selected every quarter will focus on research and development


Requirements

:
:

  • Transformer GNN MLP / Gradient Boost Tree LR Random Forest ExtraTree Ada Boost XGBoost LightGBM PCA FPGrowth Word2Vec Doc2Vec HMM

  • t F MCMC
/ :
/ :
  • Python PyData numpy scipy pandas Streamlit
  • PyTorch TensorFlow LangChain Spark PySpark
/ :
  • Google Cloud GCS BigQuery VertexAI Dataflow
  • AWS S3 Athena EMR/Serverless StepFunction SageMaker Bedrock
  • MySQL MariaDB Percona Server PostgreSQL Galera Cluster Oracle Hive Hadoop/HDFS
  • ConoHa GPU
  • LLM
  • OpenAI API Llama3 LangChain HuggingFace
:
  • Atlassian Jira Confluence Trello
  • VS Code PyCharm Jupyter
  • GitHub Copilot
  • Tableau Looker Studio metabase
  • ChatGPT Gemini Claude
:
  • scrum
Technologies used:
Analysis Methods:
Machine learning:
  • Transformer series (largescale language models etc. graph neural network (GNN) multilayer perceptron (MLP) ensemble learning/gradient boosting (Gradient Boost Tree LR Random Forest ExtraTree Ada Boost XGBoost LightGBM) PCA FPGrowth Word2Vec Doc2Vec collaborative filtering Bayesian inference HMM model (hidden Markov model)
Statistical analysis:
  • ttest chisquare test Ftest binomial test KolmogorovSmirnov test ShapiroWilk test sampling (MCMC bootstrap method etc. analysis of variance causal inference (difference of differences method etc.
Development Technology/Environment:
ProgrammingFframework:
  • Python PyData (numpy scipy pandas etc. Streamlit
  • PyTorch TensorFlow LangChain Spark (PySpark)
Cloud/onpremise (Middleware):
  • Google Cloud (GCS BigQuery VertexAI Dataflow etc.
  • AWS (S3 Athena EMR/Serverless StepFunction SageMaker Bedrock etc.
  • MySQL MariaDB Percona Server PostgreSQL Galera Cluster Oracle Hive Hadoop/HDFS
  • ConoHa (GPU server)
  • Largescale language model (LLM) related
  • OpenAI API Llama3 LangChain HuggingFace
Development Tools:
  • Atlassian (Jira Confluence) Trello
  • VS Code PyCharm Jupyter
  • GitHub (Copilot)
  • Tableau Looker Studio metabase
  • ChatGPT Gemini Claude
Development Methods:
  • Agile development (scrumbased)

:

(Adtech) :
  • RTB( DSP(Demand-Side Platform)
:
  • WiFi KPI
:

:


You will first participate in one of the following projects. After that, you can further develop your expertise or participate in other projects to build your career as desired based on your achievements.

Adtech Project:
  • In RTB (real-time bidding), one of the main mechanisms for internet advertising, we mainly design and develop machine learning models for DSP (Demand-Side Platform) that optimize the cost-effectiveness of advertisers, and measure their effectiveness.
App project:
  • We will use causal inference technology to measure the effectiveness of new features and measures for apps that facilitate free WiFi connection, and improve the KPIs of our services by establishing a system for data-driven management decisions.
Other projects:
  • We will apply data analysis and machine learning technology to support cryptocurrency trading, fraud detection, etc.
[Research and development work]:
  • While working on projects, we will all spend a certain amount of time researching cutting-edge machine learning methods and new machine learning applications
  • In addition, members selected every quarter will focus on research and development


Requirements

:
:

  • Transformer GNN MLP / Gradient Boost Tree + LR, Random Forest, ExtraTree , Ada Boost, XGBoost, LightGBM PCA FP-Growth Word2Vec Doc2Vec HMM

  • t F MCMC,
/ :
/ :
  • Python PyData numpy scipy pandas Streamlit
  • PyTorch TensorFlow LangChain Spark PySpark
/ :
  • Google Cloud GCS BigQuery VertexAI Dataflow
  • AWS S3 Athena EMR/Serverless StepFunction SageMaker Bedrock
  • MySQL MariaDB Percona Server PostgreSQL Galera Cluster Oracle Hive Hadoop/HDFS
  • ConoHa GPU
  • LLM
  • OpenAI API Llama3 LangChain HuggingFace
:
  • Atlassian Jira Confluence Trello
  • VS Code PyCharm Jupyter
  • GitHub Copilot
  • Tableau Looker Studio metabase
  • ChatGPT Gemini Claude
:
  • scrum
[Technologies used]:
Analysis Methods:
Machine learning:
  • Transformer series (large-scale language models, etc., graph neural network (GNN), multi-layer perceptron (MLP), ensemble learning/gradient boosting (Gradient Boost Tree + LR, Random Forest, ExtraTree, Ada Boost, XGBoost, LightGBM), PCA, FP-Growth, Word2Vec, Doc2Vec, collaborative filtering, Bayesian inference, HMM model (hidden Markov model)
Statistical analysis:
  • t-test, chi-square test, F-test, binomial test, Kolmogorov-Smirnov test, Shapiro-Wilk test, sampling (MCMC, bootstrap method, etc., analysis of variance, causal inference (difference of differences method, etc.
Development Technology/Environment:
ProgrammingFframework:
  • Python, PyData (numpy, scipy, pandas, etc., Streamlit
  • PyTorch, TensorFlow, LangChain, Spark (PySpark)
Cloud/on-premise (Middleware):
  • Google Cloud (GCS, BigQuery, VertexAI, Dataflow, etc.
  • AWS (S3, Athena, EMR/Serverless, StepFunction, SageMaker, Bedrock, etc.
  • MySQL, MariaDB, Percona Server, PostgreSQL, Galera Cluster, Oracle, Hive, Hadoop/HDFS
  • ConoHa (GPU server)
  • Large-scale language model (LLM) related
  • OpenAI API, Llama3, LangChain, HuggingFace
Development Tools:
  • Atlassian (Jira, Confluence), Trello
  • VS Code, PyCharm, Jupyter
  • GitHub (Copilot)
  • Tableau, Looker Studio, metabase
  • ChatGPT, Gemini, Claude
Development Methods:
  • Agile development (scrum-based)

Employment Type

Full Time

Company Industry

About Company

Report This Job
Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.