Talent.com
This job offer is not available in your country.
Data Scientist (Reinforcement Learning / LLM Agent / Vision Language Model - either 1)

Data Scientist (Reinforcement Learning / LLM Agent / Vision Language Model - either 1)

BinanceWorkFromHome, Canterbury, New Zealand
23 hours ago
Job description

Binance is a leading global blockchain ecosystem behind the world’s largest cryptocurrency exchange by trading volume and registered users. We are trusted by over 280 million people in 100+ countries for our industry-leading security, user fund transparency, trading engine speed, deep liquidity, and an unmatched portfolio of digital-asset products. Binance offerings range from trading and finance to education, research, payments, institutional services, Web3 features, and more. We leverage the power of digital assets and blockchain to build an inclusive financial ecosystem to advance the freedom of money and improve financial access for people around the world.

About the Role

You will develop and optimize Reinforcement Learning (RL) models for enterprise-scale applications such as customer service, token reporting, compliance, and Web3 domain reasoning.

You will explore and evaluate advanced Algorithms including PPO, GRPO, DPO, RLHF, RLAIF, and Agentic RL to enhance the capabilities of LLMs, VLMs, and Agentic AI at Binance. The role requires a strong theoretical foundation in RL—covering policy optimization, reward modeling, and planning—paired with the Engineering skills to build scalable production systems.

You will take full ownership from research through deployment, driving experimentation with systematic evaluation and benchmarking. Collaboration across research, infrastructure, and application teams will be key to delivering impactful AI solutions.

Responsibilities :

  • Research and develop state-of-the-art RL algorithms, focusing on Large Model Optimization and alignment techniques.
  • Design and implement RL training pipelines, including environment simulation, data generation, and reward function design.
  • Apply RL methods to enhance LLM / VLM / Agentic AI capabilities in reasoning, planning, and autonomous decision-making.
  • Collaborate with Engineers and researchers to integrate RL solutions into enterprise AI platforms.
  • Monitor model performance in production and continuously improve through Iterative training and Fine-tuning.

Requirements :

  • Master’s Degree in Computer Science, Applied Mathematics, Machine Learning, or related fields.
  • 5+ years of hands-on experience in RL or LLM / VLM / Agentic AI optimization.
  • Strong coding skills in Python, with experience in ML frameworks and RL libraries.
  • Experience with large-scale distributed training and optimization.
  • Self-driven, ownership mindset, and strong problem-solving skills. Excellent communication skills for cross-functional collaboration.
  • Why Binance

  • Shape the future with the world’s leading blockchain ecosystem
  • Collaborate with world-class talent in a user-centric global organization with a flat structure
  • Tackle unique, fast-paced projects with autonomy in an innovative environment
  • Thrive in a results-driven workplace with opportunities for career growth and continuous learning
  • Competitive salary and company benefits
  • Work-from-home arrangement (the arrangement may vary depending on the work nature of the business team)
  • Binance is committed to being an equal opportunity employer. We believe that having a diverse workforce is fundamental to our success.

    By submitting a job application, you confirm that you have read and agree to our Candidate Privacy Notice .

    #J-18808-Ljbffr

    Create a job alert for this search

    Data Scientist • WorkFromHome, Canterbury, New Zealand

    Related jobs
    • Promoted
    Web Designer – AI Branding

    Web Designer – AI Branding

    TwineWorkFromHome, Canterbury, New Zealand
    Join a project focused on transforming a public sector business website into a modern, AI-driven brand using Figma Sites. You will take an existing Figma prototype and elevate it to a polished, prod...Show moreLast updated: 4 days ago
    • Promoted
    Division CFO, Trilogy (Remote) - $400,000 / year USD

    Division CFO, Trilogy (Remote) - $400,000 / year USD

    TrilogyWorkFromHome, Canterbury, New Zealand
    Division CFO, Trilogy (Remote) - $400,000 / year USD.Division CFO, Trilogy (Remote) - $400,000 / year USD.This range is provided by Trilogy. Your actual pay will be based on your skills and experience —...Show moreLast updated: 23 hours ago
    • Promoted
    SENIOR DATA SCIENTIST - (CONTRACT)

    SENIOR DATA SCIENTIST - (CONTRACT)

    Randstad New ZealandWorkFromHome, Canterbury, New Zealand
    Immediate start for a Senior Data Scientist to lead high-impact AI / ML projects focusing on price elasticity and demand forecasting. We are hiring for a premier retail technology client known for dri...Show moreLast updated: 23 hours ago
    • Promoted
    Engineering Specialist | Upto $75 / hr Remote

    Engineering Specialist | Upto $75 / hr Remote

    Crossing HurdlesWorkFromHome, Canterbury, New Zealand
    Engineering Specialist | Upto $75 / hr Remote.Crossing Hurdles is a recruitment firm that refers top candidates to partners working with the world’s leading AI research labs to help build and train c...Show moreLast updated: 23 hours ago
    • Promoted
    Junior Digital Learning Development Specialist

    Junior Digital Learning Development Specialist

    University of WestminsterCavendish, Canterbury, New Zealand
    The salary stated above is the pro-rata amount, the full-time salary would be £35,856 - £40,088 Per Annum (Incl.We are looking for a recent graduate from the University of Westminster who can act a...Show moreLast updated: 23 hours ago
    • Promoted
    Cobalt Core Pentester - US Remote-Only

    Cobalt Core Pentester - US Remote-Only

    CobaltWorkFromHome, Canterbury, New Zealand
    The Cobalt Core is a community of highly skilled security pentesters who are passionate about what they do and strive to deliver quality work. This curated community is made up of security professio...Show moreLast updated: 23 hours ago
    • Promoted
    Remarkable AI Expert

    Remarkable AI Expert

    Remarkable AIWorkFromHome, Canterbury, New Zealand
    Become a Customer Support Expert for brands you love.Chatdesk (DBA Remarkable AI) Experts are freelance customer support agents who help companies provide the best support for their customers throu...Show moreLast updated: 12 days ago
    • Promoted
    Principal Clinical Scientist, Radiology Physics

    Principal Clinical Scientist, Radiology Physics

    East Kent Hospitals University NHS Foundation TrustCanterbury, New Zealand
    Go back East Kent Hospitals University NHS Foundation Trust.The closing date is 30 October 2025.Radiology Physics is part of the expanding Medical Physics and Clinical Engineering Department at EKH...Show moreLast updated: 2 days ago
    • Promoted
    Software Engineer – AI Search

    Software Engineer – AI Search

    TwineWorkFromHome, Canterbury, New Zealand
    This part-time, remote contract role focuses on developing a cutting-edge hybrid search platform that bridges complex PDFs and intricate Excel documents. You will architect and implement solutions e...Show moreLast updated: 23 hours ago
    • Promoted
    Data Analytics Specialist (Remote)

    Data Analytics Specialist (Remote)

    Firefly Digital LimitedWorkFromHome, Canterbury, New Zealand
    In a few short years, Firefly Digital has grown to become one of New Zealand's top digital marketing agencies.We are a full-service digital performance agency that is growing FAST! And we've won a ...Show moreLast updated: 4 days ago
    • Promoted
    Freelance Civil Engineering Expert - AI Trainer

    Freelance Civil Engineering Expert - AI Trainer

    MindriftWorkFromHome, Canterbury, New Zealand
    Freelance Civil Engineering Expert - AI Trainer.Be among the first 25 applicants.At Mindrift, innovation meets opportunity. We believe in using the power of collective intelligence to ethically shap...Show moreLast updated: 23 hours ago
    • Promoted
    Data & AI Platform Engineering Lead

    Data & AI Platform Engineering Lead

    One New ZealandWorkFromHome, Canterbury, New Zealand
    Data & AI Platform Engineering Lead.Through our AI School, access to world-class learning platforms, or career pathways that evolve with you, we create an environment where your curiosity thrives, ...Show moreLast updated: 4 days ago
    • Promoted
    Designer | Upto $70 / hr | Remote

    Designer | Upto $70 / hr | Remote

    Crossing HurdlesWorkFromHome, Canterbury, New Zealand
    Designer | Upto $70 / hr | Remote.Direct message the job poster from Crossing Hurdles.We refer top candidates to our partners working with the world’s leading AI research labs to help build and train...Show moreLast updated: 20 days ago
    • Promoted
    Science teacher–physics

    Science teacher–physics

    Te Puna Wai o Waipapa - Hagley CollegeCanterbury, New Zealand
    Science teacher – physics, full time, fixed term, Start Date : 28 / 01 / 2026, End Date : 27 / 01 / 2027 (Parental leave cover). Secondary (Years 7–15) / wharekura, Certificated teacher.Fixed term, full-time ...Show moreLast updated: 2 days ago
    • Promoted
    Team leader Y7–8

    Team leader Y7–8

    Bluestone SchoolCanterbury, New Zealand
    Primary and intermediate (Years 1–8) / kura tuatahi, Syndicate leader.Subjects offered : English, Health and physical education, Learning languages, Mathematics and statistics, Science, Social scien...Show moreLast updated: 23 hours ago
    • Promoted
    Python and Kubernetes Software Engineer - Data, AI / ML & Analytics

    Python and Kubernetes Software Engineer - Data, AI / ML & Analytics

    CanonicalWorkFromHome, Canterbury, New Zealand
    Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is widely used in breakthrough enterprise initiat...Show moreLast updated: 30+ days ago
    • Promoted
    Data Ops Engineer

    Data Ops Engineer

    One New ZealandWorkFromHome, Canterbury, New Zealand
    Data Ops Engineer – Data & AI Platforms at One New Zealand.Define SLIs / SLOs for data freshness, completeness, timeliness, and pipeline success. Build dashboards, alerts, and traces, and ensure every...Show moreLast updated: 2 days ago
    • Promoted
    Data Scientist (Recommendation Systems), Binance Square

    Data Scientist (Recommendation Systems), Binance Square

    BinanceWorkFromHome, Canterbury, New Zealand
    Binance is a leading global blockchain ecosystem behind the world’s largest cryptocurrency exchange by trading volume and registered users. We are trusted by over 280 million people in 100+ countrie...Show moreLast updated: 27 days ago