Talent.com
This job offer is not available in your country.
Data Scientist (Reinforcement Learning / LLM Agent / Vision Language Model - either 1)

Data Scientist (Reinforcement Learning / LLM Agent / Vision Language Model - either 1)

BinanceWellington, Wellington, New Zealand
1 day ago
Job description

Binance is a leading global blockchain ecosystem behind the world’s largest cryptocurrency exchange by trading volume and registered users. We are trusted by over 280 million people in 100+ countries for our industry-leading security, user fund transparency, trading engine speed, deep liquidity, and an unmatched portfolio of digital-asset products. Binance offerings range from trading and finance to education, research, payments, institutional services, Web3 features, and more. We leverage the power of digital assets and blockchain to build an inclusive financial ecosystem to advance the freedom of money and improve financial access for people around the world.

About the Role

You will develop and optimize Reinforcement Learning (RL) models for enterprise-scale applications such as customer service, token reporting, compliance, and Web3 domain reasoning.

You will explore and evaluate advanced Algorithms including PPO, GRPO, DPO, RLHF, RLAIF, and Agentic RL to enhance the capabilities of LLMs, VLMs, and Agentic AI at Binance. The role requires a strong theoretical foundation in RL—covering policy optimization, reward modeling, and planning—paired with the Engineering skills to build scalable production systems.

You will take full ownership from research through deployment, driving experimentation with systematic evaluation and benchmarking. Collaboration across research, infrastructure, and application teams will be key to delivering impactful AI solutions.

Responsibilities :

  • Research and develop state-of-the-art RL algorithms, focusing on Large Model Optimization and alignment techniques.
  • Design and implement RL training pipelines, including environment simulation, data generation, and reward function design.
  • Apply RL methods to enhance LLM / VLM / Agentic AI capabilities in reasoning, planning, and autonomous decision-making.
  • Collaborate with Engineers and researchers to integrate RL solutions into enterprise AI platforms.
  • Monitor model performance in production and continuously improve through Iterative training and Fine-tuning.

Requirements :

  • Master’s Degree in Computer Science, Applied Mathematics, Machine Learning, or related fields.
  • 5+ years of hands-on experience in RL or LLM / VLM / Agentic AI optimization.
  • Strong coding skills in Python, with experience in ML frameworks and RL libraries.
  • Experience with large-scale distributed training and optimization.
  • Self-driven, ownership mindset, and strong problem-solving skills. Excellent communication skills for cross-functional collaboration.
  • Why Binance

  • Shape the future with the world’s leading blockchain ecosystem
  • Collaborate with world-class talent in a user-centric global organization with a flat structure
  • Tackle unique, fast-paced projects with autonomy in an innovative environment
  • Thrive in a results-driven workplace with opportunities for career growth and continuous learning
  • Competitive salary and company benefits
  • Work-from-home arrangement (the arrangement may vary depending on the work nature of the business team)
  • Binance is committed to being an equal opportunity employer. We believe that having a diverse workforce is fundamental to our success.

    By submitting a job application, you confirm that you have read and agree to our Candidate Privacy Notice .

    #J-18808-Ljbffr

    Create a job alert for this search

    Data Scientist • Wellington, Wellington, New Zealand

    Related jobs
    • Promoted
    Data Scientist (LLM), Multi-Agent Systems

    Data Scientist (LLM), Multi-Agent Systems

    BinanceWorkFromHome, Wellington, New Zealand
    Binance is a leading global blockchain ecosystem behind the world’s largest cryptocurrency exchange by trading volume and registered users. We are trusted by over 280 million people in 100+ countrie...Show moreLast updated: 17 days ago
    • Promoted
    Research Scientist - LLM Foundation Models

    Research Scientist - LLM Foundation Models

    BinanceWorkFromHome, Wellington, New Zealand
    Binance is a leading global blockchain ecosystem behind the world’s largest cryptocurrency exchange by trading volume and registered users. We are trusted by over 280 million people in 100+ countrie...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Machine Learning Engineer - Canva AI

    Senior Machine Learning Engineer - Canva AI

    CanvaWorkFromHome, Wellington, New Zealand
    Canva Wellington, Wellington, New Zealand.Join or sign in to find your next job.Senior Machine Learning Engineer - Canva AI. Canva Wellington, Wellington, New Zealand.Be among the first 25 applicant...Show moreLast updated: 30+ days ago
    • Promoted
    Data Scientist (Recommendation Systems)

    Data Scientist (Recommendation Systems)

    BinanceWorkFromHome, Wellington, New Zealand
    Binance is a leading global blockchain ecosystem behind the world’s largest cryptocurrency exchange by trading volume and registered users. We are trusted by over 280 million people in 100+ countrie...Show moreLast updated: 2 days ago
    • Promoted
    Remarkable AI Expert

    Remarkable AI Expert

    Remarkable AIWorkFromHome, Wellington, New Zealand
    Become a Customer Support Expert for brands you love.Chatdesk (DBA Remarkable AI) Experts are freelance customer support agents who help companies provide the best support for their customers throu...Show moreLast updated: 17 days ago
    • Promoted
    Engagement Manager (Sales), Senior Manager, Technology Consulting, Data & Analytics

    Engagement Manager (Sales), Senior Manager, Technology Consulting, Data & Analytics

    Ernst & Young Advisory Services Sdn BhdNew Zealand
    Location : Other locations : Primary Location Only.At EY, we develop you with future-focused skills and equip you with world-class experiences. We empower you in a flexible environment, and fuel you a...Show moreLast updated: 30+ days ago
    • Promoted
    Binance Accelerator Program - LLM Model Training & Data Processing

    Binance Accelerator Program - LLM Model Training & Data Processing

    BinanceWorkFromHome, Wellington, New Zealand
    Binance is the global blockchain company behind the world’s largest digital asset exchange by trading volume and users, serving a greater mission to accelerate cryptocurrency adoption and increase ...Show moreLast updated: 30+ days ago
    • Promoted
    Division CFO, Trilogy (Remote) - $400,000 / year USD

    Division CFO, Trilogy (Remote) - $400,000 / year USD

    TrilogyWorkFromHome, Wellington, New Zealand
    Division CFO, Trilogy (Remote) - $400,000 / year USD.Division CFO, Trilogy (Remote) - $400,000 / year USD.This range is provided by Trilogy. Your actual pay will be based on your skills and experience —...Show moreLast updated: 6 days ago
    • Promoted
    Lead Research Scientist - Generative AI

    Lead Research Scientist - Generative AI

    CanvaWorkFromHome, Wellington, New Zealand
    Lead Research Scientist - Generative AI.Be among the first 25 applicants.Lead Research Scientist - Generative AI.Join the team redefining how the world experiences design.Hey, g'day, mabuhay, kia o...Show moreLast updated: 30+ days ago
    • Promoted
    Data Scientist, Market Growth Lifecycle

    Data Scientist, Market Growth Lifecycle

    BinanceWellington, New Zealand
    Binance is a leading global blockchain ecosystem behind the world's largest cryptocurrency exchange by trading volume and registered users. We are trusted by over 280 million people in 100+ countrie...Show moreLast updated: 30+ days ago
    • Promoted
    Engineering Senior Machine Learning Engineer New Zealand (Remote) FullTime

    Engineering Senior Machine Learning Engineer New Zealand (Remote) FullTime

    Leonardo Interactive PtyWorkFromHome, Wellington, New Zealand
    Ai is building one of the world’s highest-throughput Generative AI platforms, enabling millions of users, from beginners to professionals, to create high-quality images and videos with ease.Now par...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Machine Learning Research Scientist - Research Engineer

    Senior Machine Learning Research Scientist - Research Engineer

    SmarterDxWorkFromHome, Wellington, New Zealand
    Senior Machine Learning Research Scientist - Research Engineer.As a Senior Machine Learning Research Scientist, you will lead groundbreaking ML research and development at SmarterDx, collaborating ...Show moreLast updated: 17 days ago
    • Promoted
    • New!
    Data Scientist (Reinforcement Learning / Llm Agent / Vision Language Model - Either 1)

    Data Scientist (Reinforcement Learning / Llm Agent / Vision Language Model - Either 1)

    BinanceWellington, New Zealand
    Binance is a leading global blockchain ecosystem behind the world's largest cryptocurrency exchange by trading volume and registered users. We are trusted by over 280 million people in 100+ countrie...Show moreLast updated: 13 hours ago
    • Promoted
    Principal Machine Learning Engineer

    Principal Machine Learning Engineer

    Vital.ioWorkFromHome, New Zealand
    We are looking for a Principal Machine Learning Engineer to join our remote-first team, in a New Zealand & Australia friendly timezone. You will be leading from the front on our ML efforts, ensuring...Show moreLast updated: 30+ days ago
    • Promoted
    Data Scientist / Algorithm Engineer (LLM) – AI Safety

    Data Scientist / Algorithm Engineer (LLM) – AI Safety

    BinanceWorkFromHome, Wellington, New Zealand
    Binance is a leading global blockchain ecosystem behind the world’s largest cryptocurrency exchange by trading volume and registered users. We are trusted by over 280 million people in 100+ countrie...Show moreLast updated: 1 day ago
    • Promoted
    Mathematics Expert (Remote)

    Mathematics Expert (Remote)

    TaskifyWorkFromHome, Wellington, New Zealand
    Join to apply for the Mathematics Expert (Remote) role at Taskify.Get AI-powered advice on this job and more exclusive features. As a Mathematics Domain Expert, you will leverage your advanced subje...Show moreLast updated: 2 days ago
    • Promoted
    Data Scientist / Machine Learning Engineer (Market Growth Lifecycle)

    Data Scientist / Machine Learning Engineer (Market Growth Lifecycle)

    BinanceWellington, Wellington, New Zealand
    Binance is a leading global blockchain ecosystem behind the world’s largest cryptocurrency exchange by trading volume and registered users. We are trusted by over 280 million people in 100+ countrie...Show moreLast updated: 1 day ago
    • Promoted
    Real Estate Agent (Hutt Valley)

    Real Estate Agent (Hutt Valley)

    REAP RecruitmentLower Hutt, Wellington, New Zealand
    Opportunities for Existing & New Real Estate Salespeople | Multiple Offices & Brands | Apply Today to See Them AllREAP Recruitment specialise in (and are the largest advertiser of) recruitment oppo...Show moreLast updated: 30+ days ago