Talent.com
This job offer is not available in your country.
Data Scientist (Reinforcement Learning / LLM Agent / Vision Language Model - either 1)

Data Scientist (Reinforcement Learning / LLM Agent / Vision Language Model - either 1)

BinanceWorkFromHome, Taranaki, New Zealand
1 day ago
Job description

Binance is a leading global blockchain ecosystem behind the world’s largest cryptocurrency exchange by trading volume and registered users. We are trusted by over 280 million people in 100+ countries for our industry-leading security, user fund transparency, trading engine speed, deep liquidity, and an unmatched portfolio of digital-asset products. Binance offerings range from trading and finance to education, research, payments, institutional services, Web3 features, and more. We leverage the power of digital assets and blockchain to build an inclusive financial ecosystem to advance the freedom of money and improve financial access for people around the world.

About the Role

You will develop and optimize Reinforcement Learning (RL) models for enterprise-scale applications such as customer service, token reporting, compliance, and Web3 domain reasoning.

You will explore and evaluate advanced Algorithms including PPO, GRPO, DPO, RLHF, RLAIF, and Agentic RL to enhance the capabilities of LLMs, VLMs, and Agentic AI at Binance. The role requires a strong theoretical foundation in RL—covering policy optimization, reward modeling, and planning—paired with the Engineering skills to build scalable production systems.

You will take full ownership from research through deployment, driving experimentation with systematic evaluation and benchmarking. Collaboration across research, infrastructure, and application teams will be key to delivering impactful AI solutions.

Responsibilities :

  • Research and develop state-of-the-art RL algorithms, focusing on Large Model Optimization and alignment techniques.
  • Design and implement RL training pipelines, including environment simulation, data generation, and reward function design.
  • Apply RL methods to enhance LLM / VLM / Agentic AI capabilities in reasoning, planning, and autonomous decision-making.
  • Collaborate with Engineers and researchers to integrate RL solutions into enterprise AI platforms.
  • Monitor model performance in production and continuously improve through Iterative training and Fine-tuning.

Requirements :

  • Master’s Degree in Computer Science, Applied Mathematics, Machine Learning, or related fields.
  • 5+ years of hands-on experience in RL or LLM / VLM / Agentic AI optimization.
  • Strong coding skills in Python, with experience in ML frameworks and RL libraries.
  • Experience with large-scale distributed training and optimization.
  • Self-driven, ownership mindset, and strong problem-solving skills. Excellent communication skills for cross-functional collaboration.
  • Why Binance

  • Shape the future with the world’s leading blockchain ecosystem
  • Collaborate with world-class talent in a user-centric global organization with a flat structure
  • Tackle unique, fast-paced projects with autonomy in an innovative environment
  • Thrive in a results-driven workplace with opportunities for career growth and continuous learning
  • Competitive salary and company benefits
  • Work-from-home arrangement (the arrangement may vary depending on the work nature of the business team)
  • Binance is committed to being an equal opportunity employer. We believe that having a diverse workforce is fundamental to our success.

    By submitting a job application, you confirm that you have read and agree to our Candidate Privacy Notice .

    #J-18808-Ljbffr

    Create a job alert for this search

    Data Scientist • WorkFromHome, Taranaki, New Zealand

    Related jobs
    • Promoted
    SENIOR DATA SCIENTIST - (CONTRACT)

    SENIOR DATA SCIENTIST - (CONTRACT)

    Randstad New ZealandWorkFromHome, Taranaki, New Zealand
    Immediate start for a Senior Data Scientist to lead high-impact AI / ML projects focusing on price elasticity and demand forecasting. We are hiring for a premier retail technology client known for dri...Show moreLast updated: 1 day ago
    • Promoted
    Data Ops Engineer

    Data Ops Engineer

    One New ZealandWorkFromHome, Taranaki, New Zealand
    Data Ops Engineer – Data & AI Platforms at One New Zealand.Define SLIs / SLOs for data freshness, completeness, timeliness, and pipeline success. Build dashboards, alerts, and traces, and ensure every...Show moreLast updated: 3 days ago
    • Promoted
    Software Engineer - AI Avatar Project

    Software Engineer - AI Avatar Project

    TwineWorkFromHome, Taranaki, New Zealand
    This role involves developing an AI avatar feature for a news website, focusing on enhancing user engagement and interactivity. The AI avatar will greet users, read headlines, and offer to search fo...Show moreLast updated: 4 days ago
    • Promoted
    Senior Asset Data Analyst

    Senior Asset Data Analyst

    AdeccoNew Plymouth, Taranaki, New Zealand
    The ideal candidate will be responsible for full-stack project asset data governance from ingestion (field capture) through to CAPEX posting, enterprise registration, and ownership transfer.Advance...Show moreLast updated: 1 day ago
    • Promoted
    SENIOR DATA SCIENTIST - (CONTRACT)

    SENIOR DATA SCIENTIST - (CONTRACT)

    RandstadWorkFromHome, Taranaki, New Zealand
    SENIOR DATA SCIENTIST - (CONTRACT).We are seeking a highly analytical Senior Data Scientist to champion Mathematical Optimization for retail assortment planning and space allocation.This is a high-...Show moreLast updated: 3 days ago
    • Promoted
    Software Engineer – AI Search

    Software Engineer – AI Search

    TwineWorkFromHome, Taranaki, New Zealand
    This part-time, remote contract role focuses on developing a cutting-edge hybrid search platform that bridges complex PDFs and intricate Excel documents. You will architect and implement solutions e...Show moreLast updated: 1 day ago
    • Promoted
    Technical Feed Partner - Taranaki

    Technical Feed Partner - Taranaki

    GrainCorpNew Plymouth, Taranaki, New Zealand
    GrainCorp Animal Nutrition is a national animal feed business, based in Taranaki NZ, dealing directly with corporate and farming customers across New Zealand. GrainCorp Feeds offers a comprehensive ...Show moreLast updated: 3 days ago
    • Promoted
    Data Analytics Specialist (Remote)

    Data Analytics Specialist (Remote)

    Firefly Digital LimitedWorkFromHome, Taranaki, New Zealand
    In a few short years, Firefly Digital has grown to become one of New Zealand's top digital marketing agencies.We are a full-service digital performance agency that is growing FAST! And we've won a ...Show moreLast updated: 4 days ago
    • Promoted
    Data Collection Specialist Regional | Matanga Kohi Raraunga - (Christchurch) Part-time

    Data Collection Specialist Regional | Matanga Kohi Raraunga - (Christchurch) Part-time

    StatsNZWorkFromHome, Taranaki, New Zealand
    Matanga Kohi Raraunga | Data Collection Specialist Regional | Part-time — Christchurch.Collaborate with us in improving life in Aotearoa. Play a part in creating a better future for New Zealanders.S...Show moreLast updated: 4 days ago
    • Promoted
    Web Designer – AI Branding

    Web Designer – AI Branding

    TwineWorkFromHome, Taranaki, New Zealand
    Join a project focused on transforming a public sector business website into a modern, AI-driven brand using Figma Sites. You will take an existing Figma prototype and elevate it to a polished, prod...Show moreLast updated: 4 days ago
    • Promoted
    Data Developer - Integrations

    Data Developer - Integrations

    Taranaki Regional CouncilStratford, Taranaki, New Zealand
    Our people love working here because we are an enthusiastic team, with a diverse range of skills and experience, who are all rowing in the same direction. We are all passionate about making a positi...Show moreLast updated: 1 day ago
    • Promoted
    Gis / Asset Data Technician

    Gis / Asset Data Technician

    PowercoNew Plymouth, New Zealand
    We're looking for a team player who brings a positive can-do attitude to help keep our systems up to standard with accurate and available information. This is how we contribute to keeping our custom...Show moreLast updated: 1 day ago
    • Promoted
    Software Developer (Python) | Upto $90 / hr Remote

    Software Developer (Python) | Upto $90 / hr Remote

    Crossing HurdlesWorkFromHome, Taranaki, New Zealand
    Software Developer (Python) | Upto $90 / hr Remote.Join to apply for the Software Developer (Python) | Upto $90 / hr Remote role at Crossing Hurdles. Direct message the job poster from Crossing Hurdles....Show moreLast updated: 1 day ago
    • Promoted
    Data Collection Specialist Regional | Matanga Kohi Raraunga - (Wellington / Upper Hutt) Part-time

    Data Collection Specialist Regional | Matanga Kohi Raraunga - (Wellington / Upper Hutt) Part-time

    Statistics New ZealandWorkFromHome, Taranaki, New Zealand
    Collaboration in improving life in Aotearoa.Play a part in creating a better future for New Zealanders.Stats NZ | Tatauranga Aotearoa is a Central Government employer of 1200+ people across Aotearo...Show moreLast updated: 3 days ago
    • Promoted
    Freelance Electrical Engineer - AI Trainer

    Freelance Electrical Engineer - AI Trainer

    MindriftWorkFromHome, Taranaki, New Zealand
    Freelance Electrical Engineer - AI Trainer.Mindrift invites experts to contribute to AI projects through the platform that connects specialists with Generative AI initiatives from major tech innova...Show moreLast updated: 1 day ago
    • Promoted
    • New!
    Python Developer | Upto $90 / hr Remote

    Python Developer | Upto $90 / hr Remote

    Crossing HurdlesWorkFromHome, Taranaki, New Zealand
    Python Developer | Upto $90 / hr Remote.Minimum 2 weeks, with potential for extension.Flexible, 10–20 hours / week with possible increase to 40 hours. Codex, Claude code) to perform realistic coding tas...Show moreLast updated: 19 hours ago
    • Promoted
    UGC Content Creator

    UGC Content Creator

    TwineWorkFromHome, Taranaki, New Zealand
    Be among the first 25 applicants.Get AI-powered advice on this job and more exclusive features.This role is ideal for a creative individual who enjoys being on camera and producing engaging short-f...Show moreLast updated: 4 days ago
    • Promoted
    Data Collection Specialist Regional | Matanga Kohi Raraunga - (Christchurch) Part-time

    Data Collection Specialist Regional | Matanga Kohi Raraunga - (Christchurch) Part-time

    Statistics New ZealandWorkFromHome, Taranaki, New Zealand
    Collaborate with us in improving life in Aotearoa.Play a part in creating a better future for New Zealanders.Stats NZ | Tatauranga Aotearoa is a Central Government employer of 1200+ people across A...Show moreLast updated: 3 days ago