Talent.com
Data Scientist (Reinforcement Learning / LLM Agent / Vision Language Model - either 1)

Data Scientist (Reinforcement Learning / LLM Agent / Vision Language Model - either 1)

BinanceAuckland, Auckland, New Zealand
12 days ago
Job description

Binance is a leading global blockchain ecosystem behind the world’s largest cryptocurrency exchange by trading volume and registered users. We are trusted by over 280 million people in 100+ countries for our industry-leading security, user fund transparency, trading engine speed, deep liquidity, and an unmatched portfolio of digital-asset products. Binance offerings range from trading and finance to education, research, payments, institutional services, Web3 features, and more. We leverage the power of digital assets and blockchain to build an inclusive financial ecosystem to advance the freedom of money and improve financial access for people around the world.

About the Role

You will develop and optimize Reinforcement Learning (RL) models for enterprise-scale applications such as customer service, token reporting, compliance, and Web3 domain reasoning.

You will explore and evaluate advanced Algorithms including PPO, GRPO, DPO, RLHF, RLAIF, and Agentic RL to enhance the capabilities of LLMs, VLMs, and Agentic AI at Binance. The role requires a strong theoretical foundation in RL—covering policy optimization, reward modeling, and planning—paired with the Engineering skills to build scalable production systems.

You will take full ownership from research through deployment, driving experimentation with systematic evaluation and benchmarking. Collaboration across research, infrastructure, and application teams will be key to delivering impactful AI solutions.

Responsibilities :

  • Research and develop state-of-the-art RL algorithms, focusing on Large Model Optimization and alignment techniques.
  • Design and implement RL training pipelines, including environment simulation, data generation, and reward function design.
  • Apply RL methods to enhance LLM / VLM / Agentic AI capabilities in reasoning, planning, and autonomous decision-making.
  • Collaborate with Engineers and researchers to integrate RL solutions into enterprise AI platforms.
  • Monitor model performance in production and continuously improve through Iterative training and Fine-tuning.

Requirements :

  • Master’s Degree in Computer Science, Applied Mathematics, Machine Learning, or related fields.
  • 5+ years of hands-on experience in RL or LLM / VLM / Agentic AI optimization.
  • Strong coding skills in Python, with experience in ML frameworks and RL libraries.
  • Experience with large-scale distributed training and optimization.
  • Self-driven, ownership mindset, and strong problem-solving skills. Excellent communication skills for cross-functional collaboration.
  • Why Binance

  • Shape the future with the world’s leading blockchain ecosystem
  • Collaborate with world-class talent in a user-centric global organization with a flat structure
  • Tackle unique, fast-paced projects with autonomy in an innovative environment
  • Thrive in a results-driven workplace with opportunities for career growth and continuous learning
  • Competitive salary and company benefits
  • Work-from-home arrangement (the arrangement may vary depending on the work nature of the business team)
  • Binance is committed to being an equal opportunity employer. We believe that having a diverse workforce is fundamental to our success.

    By submitting a job application, you confirm that you have read and agree to our Candidate Privacy Notice .

    #J-18808-Ljbffr

    Create a job alert for this search

    Data Scientist • Auckland, Auckland, New Zealand

    Related jobs
    • Promoted
    Machine Learning Engineer II

    Machine Learning Engineer II

    Space TalentAuckland, Auckland, New Zealand
    Rocket Lab is an end-to-end space company delivering responsive launch services, complete spacecraft design and manufacturing, payloads, satellite components, and more – all with the goal of openin...Show moreLast updated: 7 days ago
    • Promoted
    Machine Learning Engineer

    Machine Learning Engineer

    Rocket LabAuckland, Auckland, New Zealand
    Rocket Lab is a global leader in launch and space systems.The rockets and satellites we build and launch enable some of the most ambitious and vital space missions globally, supporting scientific e...Show moreLast updated: 30+ days ago
    • Promoted
    Machine Learning Engineer - Pasture

    Machine Learning Engineer - Pasture

    black.aiAuckland, Auckland, New Zealand
    Halter’s Pasture team is dedicated to helping farmers better manage their most valuable resource—pasture—for greater productivity, profit, and sustainability. To provide the highest quality insights...Show moreLast updated: 30+ days ago
    • Promoted
    Data and AI ML Solution Architect

    Data and AI ML Solution Architect

    Accenture New ZealandAuckland, Auckland, New Zealand
    Seeking a Data & AI Solution Architect with a strong focus on Artificial Intelligence (AI), Machine Learning (ML), and Generative AI. You will be responsible for designing and implementing advanced ...Show moreLast updated: 20 days ago
    • Promoted
    Data Scientist / Machine Learning Engineer (Market Growth Lifecycle)

    Data Scientist / Machine Learning Engineer (Market Growth Lifecycle)

    BinanceAuckland, Auckland, New Zealand
    In this role, you will be responsible for developing and optimising core algorithmic models in marketing scenarios, including user behaviour prediction, profiling, ROI estimation, and traffic alloc...Show moreLast updated: 12 days ago
    • Promoted
    Data Scientist (Reinforcement Learning / Llm Agent / Vision Language Model - Either 1)

    Data Scientist (Reinforcement Learning / Llm Agent / Vision Language Model - Either 1)

    BinanceAuckland, Auckland, New Zealand
    Binance is a leading global blockchain ecosystem behind the world's largest cryptocurrency exchange by trading volume and registered users. We are trusted by over 280 million people in 100+ countrie...Show moreLast updated: 11 days ago
    • Promoted
    Machine Learning Engineer II

    Machine Learning Engineer II

    Rocket Lab USA Inc.Auckland, Auckland, New Zealand
    Be among the first 25 applicants.Rocket Lab is an end‑to‑end space company delivering responsive launch services, complete spacecraft design and manufacturing, payloads, satellite components, and m...Show moreLast updated: 2 days ago
    • Promoted
    Machine Learning Engineer

    Machine Learning Engineer

    Space TalentAuckland, Auckland, New Zealand
    Rocket Lab is an end-to-end space company delivering responsive launch services, complete spacecraft design and manufacturing, payloads, satellite components, and more – all with the goal of openin...Show moreLast updated: 30+ days ago
    • Promoted
    Data Scientist / Algorithm Engineer (LLM) – AI Safety

    Data Scientist / Algorithm Engineer (LLM) – AI Safety

    BinanceWorkFromHome, Auckland, New Zealand
    Binance is a leading global blockchain ecosystem behind the world’s largest cryptocurrency exchange by trading volume and registered users. We are trusted by over 280 million people in 100+ countrie...Show moreLast updated: 12 days ago
    • Promoted
    Binance Accelerator Program - LLM Model Training & Data Processing

    Binance Accelerator Program - LLM Model Training & Data Processing

    BinanceWorkFromHome, Auckland, New Zealand
    Binance is the global blockchain company behind the world’s largest digital asset exchange by trading volume and users, serving a greater mission to accelerate cryptocurrency adoption and increase ...Show moreLast updated: 30+ days ago
    • Promoted
    Machine Learning Engineer II

    Machine Learning Engineer II

    Rocket LabAuckland, Auckland, New Zealand
    Based on‑site at Rocket Lab's Auckland, NZ facility, the Machine Learning Engineer II is responsible for developing impactful AI solutions, managing data cleanup, annotation, and augmentation, and ...Show moreLast updated: 7 days ago
    • Promoted
    Research Scientist - LLM Foundation Models

    Research Scientist - LLM Foundation Models

    BinanceWorkFromHome, Auckland, New Zealand
    Binance is a leading global blockchain ecosystem behind the world’s largest cryptocurrency exchange by trading volume and registered users. We are trusted by over 280 million people in 100+ countrie...Show moreLast updated: 30+ days ago
    • Promoted
    Machine Learning Engineer II

    Machine Learning Engineer II

    Rocket Lab USAAuckland, Auckland, New Zealand
    Rocket Lab is an end-to-end space company delivering responsive launch services, complete spacecraft design and manufacturing, payloads, satellite components, and more – all with the goal of openin...Show moreLast updated: 2 days ago
    • Promoted
    Machine Learning Engineer

    Machine Learning Engineer

    HalterAuckland, Auckland, New Zealand
    Halter’s Pasture team is dedicated to helping farmers better manage their most valuable resource—pasture—for greater productivity, profit, and sustainability. To provide the highest quality insights...Show moreLast updated: 30+ days ago
    • Promoted
    Head of learning area science

    Head of learning area science

    Howick CollegeAuckland, Auckland, New Zealand
    Secondary (Years 7–15) / wharekura, Middle leadership.Howick College is a large co-educational school in East Auckland.We are an innovative and dynamic school committed to providing high-quality ed...Show moreLast updated: 4 days ago
    • Promoted
    Research Data Scientist, NLP & Financial Signals

    Research Data Scientist, NLP & Financial Signals

    BinanceWorkFromHome, Auckland, New Zealand
    Binance is a leading global blockchain ecosystem behind the world’s largest cryptocurrency exchange by trading volume and registered users. We are trusted by over 280 million people in 100+ countrie...Show moreLast updated: 2 days ago
    • Promoted
    Data Scientist, NLP & Trading Strategies (Quantitative)

    Data Scientist, NLP & Trading Strategies (Quantitative)

    BinanceWorkFromHome, Auckland, New Zealand
    Binance is a leading global blockchain ecosystem behind the world’s largest cryptocurrency exchange by trading volume and registered users. We are trusted by over 280 million people in 100+ countrie...Show moreLast updated: 2 days ago
    • Promoted
    Binance Accelerator Program - Data Scientist, Analytics

    Binance Accelerator Program - Data Scientist, Analytics

    BinanceWorkFromHome, Auckland, New Zealand
    Binance is a leading global blockchain ecosystem behind the world’s largest cryptocurrency exchange by trading volume and registered users. We are trusted by over 280 million people in 100+ countrie...Show moreLast updated: 30+ days ago