Talent.com
Data Scientist (Reinforcement Learning / Llm Agent / Vision Language Model - Either 1)

Data Scientist (Reinforcement Learning / Llm Agent / Vision Language Model - Either 1)

BinanceAuckland, Auckland, New Zealand
12 days ago
Job description

Binance is a leading global blockchain ecosystem behind the world's largest cryptocurrency exchange by trading volume and registered users.

We are trusted by over 280 million people in 100+ countries for our industry-leading security, user fund transparency, trading engine speed, deep liquidity, and an unmatched portfolio of digital-asset products.

Binance offerings range from trading and finance to education, research, payments, institutional services, Web3 features, and more.

We leverage the power of digital assets and blockchain to build an inclusive financial ecosystem to advance the freedom of money and improve financial access for people around the world.About the RoleYou will develop and optimize Reinforcement Learning (RL) models for enterprise-scale applications such as customer service, token reporting, compliance, and Web3 domain reasoning.You will explore and evaluate advanced Algorithms including PPO, GRPO, DPO, RLHF, RLAIF, and Agentic RL to enhance the capabilities of LLMs, VLMs, and Agentic AI at Binance.

The role requires a strong theoretical foundation in RL—covering policy optimization, reward modeling, and planning—paired with the Engineering skills to build scalable production systems.You will take full ownership from research through deployment, driving experimentation with systematic evaluation and benchmarking.

Collaboration across research, infrastructure, and application teams will be key to delivering impactful AI solutions.

Responsibilities : Research and develop state-of-the-art RL algorithms, focusing on Large Model Optimization and alignment techniques.Design and implement RL training pipelines, including environment simulation, data generation, and reward function design.Apply RL methods to enhance LLM / VLM / Agentic AI capabilities in reasoning, planning, and autonomous decision-making.Collaborate with Engineers and researchers to integrate RL solutions into enterprise AI platforms.Monitor model performance in production and continuously improve through Iterative training and Fine-tuning.Requirements : Master's Degree in Computer Science, Applied Mathematics, Machine Learning, or related fields.5+ years of hands-on experience in RL or LLM / VLM / Agentic AI optimization.Strong coding skills in Python, with experience in ML frameworks and RL libraries.Experience with large-scale distributed training and optimization.Self-driven, ownership mindset, and strong problem-solving skills.

Excellent communication skills for cross-functional collaboration.Why Binance

  • Shape the future with the world's leading blockchain ecosystem
  • Collaborate with world-class talent in a user-centric global organization with a flat structure
  • Tackle unique, fast-paced projects with autonomy in an innovative environment
  • Thrive in a results-driven workplace with opportunities for career growth and continuous learning
  • Competitive salary and company benefits
  • Work-from-home arrangement (the arrangement may vary depending on the work nature of the business team)Binance is committed to being an equal opportunity employer.

We believe that having a diverse workforce is fundamental to our success.By submitting a job application, you confirm that you have read and agree to our Candidate Privacy Notice .

#J-

  • Ljbffr
  • Create a job alert for this search

    Data Scientist • Auckland, Auckland, New Zealand

    Related jobs
    • Promoted
    Machine Learning Engineer - Pasture

    Machine Learning Engineer - Pasture

    black.aiAuckland, Auckland, New Zealand
    Halter’s Pasture team is dedicated to helping farmers better manage their most valuable resource—pasture—for greater productivity, profit, and sustainability. To provide the highest quality insights...Show moreLast updated: 30+ days ago
    • Promoted
    Vice President of Product Engineering, 2 Hour Learning (Remote) - $400,000 / year USD

    Vice President of Product Engineering, 2 Hour Learning (Remote) - $400,000 / year USD

    TrilogyWorkFromHome, Auckland, New Zealand
    Vice President of Product Engineering, 2 Hour Learning (Remote) - $400,000 / year USD.Ready to architect the future of AI-powered education? Join 2 Hour Learning as our Vice President of Product Engi...Show moreLast updated: 7 days ago
    • Promoted
    Data and AI ML Solution Architect

    Data and AI ML Solution Architect

    Accenture New ZealandAuckland, Auckland, New Zealand
    Seeking a Data & AI Solution Architect with a strong focus on Artificial Intelligence (AI), Machine Learning (ML), and Generative AI. You will be responsible for designing and implementing advanced ...Show moreLast updated: 21 days ago
    • Promoted
    Data Scientist / Machine Learning Engineer (Market Growth Lifecycle)

    Data Scientist / Machine Learning Engineer (Market Growth Lifecycle)

    BinanceAuckland, Auckland, New Zealand
    In this role, you will be responsible for developing and optimising core algorithmic models in marketing scenarios, including user behaviour prediction, profiling, ROI estimation, and traffic alloc...Show moreLast updated: 13 days ago
    • Promoted
    Data Scientist, Market Growth Lifecycle

    Data Scientist, Market Growth Lifecycle

    BinanceAuckland, Auckland, New Zealand
    About the RoleIn this role, you will be responsible for developing and optimising core algorithmic models in marketing scenarios, including user behaviour prediction, profiling, ROI estimation, and...Show moreLast updated: 30+ days ago
    • Promoted
    Chief Data and Analytics Officer

    Chief Data and Analytics Officer

    New Zealand Trade and EnterpriseWorkFromHome, Auckland, New Zealand
    At NZTE, we help New Zealand businesses grow internationally by connecting them with the right partners, insights, and opportunities. We’re proud to support exporters and investors who are shaping t...Show moreLast updated: 11 days ago
    • Promoted
    Data Scientist / Algorithm Engineer (LLM) – AI Safety

    Data Scientist / Algorithm Engineer (LLM) – AI Safety

    BinanceWorkFromHome, Auckland, New Zealand
    Binance is a leading global blockchain ecosystem behind the world’s largest cryptocurrency exchange by trading volume and registered users. We are trusted by over 280 million people in 100+ countrie...Show moreLast updated: 13 days ago
    • Promoted
    Binance Accelerator Program - LLM Model Training & Data Processing

    Binance Accelerator Program - LLM Model Training & Data Processing

    BinanceWorkFromHome, Auckland, New Zealand
    Binance is the global blockchain company behind the world’s largest digital asset exchange by trading volume and users, serving a greater mission to accelerate cryptocurrency adoption and increase ...Show moreLast updated: 30+ days ago
    • Promoted
    Machine Learning Engineer II

    Machine Learning Engineer II

    Rocket LabAuckland, Auckland, New Zealand
    Based on‑site at Rocket Lab's Auckland, NZ facility, the Machine Learning Engineer II is responsible for developing impactful AI solutions, managing data cleanup, annotation, and augmentation, and ...Show moreLast updated: 8 days ago
    • Promoted
    Engineering Senior Machine Learning Engineer New Zealand (Remote) FullTime

    Engineering Senior Machine Learning Engineer New Zealand (Remote) FullTime

    Leonardo Interactive PtyWorkFromHome, Auckland, New Zealand
    Ai is building one of the world’s highest-throughput Generative AI platforms, enabling millions of users, from beginners to professionals, to create high-quality images and videos with ease.Now par...Show moreLast updated: 30+ days ago
    • Promoted
    Research Scientist - LLM Foundation Models

    Research Scientist - LLM Foundation Models

    BinanceWorkFromHome, Auckland, New Zealand
    Binance is a leading global blockchain ecosystem behind the world’s largest cryptocurrency exchange by trading volume and registered users. We are trusted by over 280 million people in 100+ countrie...Show moreLast updated: 30+ days ago
    • Promoted
    Machine Learning Engineer

    Machine Learning Engineer

    HalterAuckland, Auckland, New Zealand
    Halter’s Pasture team is dedicated to helping farmers better manage their most valuable resource—pasture—for greater productivity, profit, and sustainability. To provide the highest quality insights...Show moreLast updated: 30+ days ago
    • Promoted
    Binance Accelerator Program - Data Scientist, Analytics

    Binance Accelerator Program - Data Scientist, Analytics

    BinanceWorkFromHome, Auckland, New Zealand
    Binance is a leading global blockchain ecosystem behind the world’s largest cryptocurrency exchange by trading volume and registered users. We are trusted by over 280 million people in 100+ countrie...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    AI Engineer – Data Generation & RLHF (Remote)

    AI Engineer – Data Generation & RLHF (Remote)

    TwineWorkFromHome, Auckland, New Zealand
    This role is ideal for a freelancer interested in supporting the development of large language models through data generation and reinforcement learning from human feedback (RLHF).The project cente...Show moreLast updated: 13 hours ago
    • Promoted
    Quality Release Lead

    Quality Release Lead

    The a2 Milk CompanyPokeno, Waikato, New Zealand
    We’re excited to share that our business has recently joined the a2 Milk Company family.Formerly known as Yashili Dairy NZ Ltd, we are now operating as a2 Nutritionals NZ Limited, based in Pokeno.T...Show moreLast updated: 22 days ago
    • Promoted
    Senior Machine Learning Research Scientist - Research Engineer

    Senior Machine Learning Research Scientist - Research Engineer

    SmarterDxWorkFromHome, Auckland, New Zealand
    Senior Machine Learning Research Scientist - Research Engineer.As a Senior Machine Learning Research Scientist, you will lead groundbreaking ML research and development at SmarterDx, collaborating ...Show moreLast updated: 29 days ago
    • Promoted
    Snowflake Data Architect

    Snowflake Data Architect

    Accenture New ZealandAuckland, Auckland, New Zealand
    Accenture is a leading global professional services company committed to inclusion, diversity, and supporting the whole person. Our core values include Stewardship, Best People, Client Value Creatio...Show moreLast updated: 30+ days ago
    • Promoted
    Engineering Director - AI Platform (ANZ Remote)

    Engineering Director - AI Platform (ANZ Remote)

    CanvaWorkFromHome, Auckland, New Zealand
    Engineering Director - AI Platform (ANZ Remote).Join the team redefining how the world experiences design.Hey, g'day, mabuhay, kia ora, 你好, hallo, vítejte! Thanks for stopping by.We know job huntin...Show moreLast updated: 14 days ago