Talent.com
Data Scientist (Reinforcement Learning / LLM Agent / Vision Language Model - either 1)

Binance • WorkFromHome, Manawatū-Whanganui, New Zealand
6 hours ago
Job description

Binance is a leading global blockchain ecosystem behind the world’s largest cryptocurrency exchange by trading volume and registered users. We are trusted by over 280 million people in 100+ countries for our industry-leading security, user fund transparency, trading engine speed, deep liquidity, and an unmatched portfolio of digital-asset products. Binance offerings range from trading and finance to education, research, payments, institutional services, Web3 features, and more. We leverage the power of digital assets and blockchain to build an inclusive financial ecosystem to advance the freedom of money and improve financial access for people around the world.

About the Role

You will develop and optimize Reinforcement Learning (RL) models for enterprise-scale applications such as customer service, token reporting, compliance, and Web3 domain reasoning.

You will explore and evaluate advanced algorithms, including PPO, GRPO, DPO, RLHF, RLAIF, and Agentic RL, to enhance the capabilities of LLMs, VLMs, and Agentic AI at Binance. The role requires a strong theoretical foundation in RL (covering policy optimization, reward modeling, and planning) paired with the engineering skills to build scalable production systems.

You will take full ownership from research through deployment, driving experimentation with systematic evaluation and benchmarking. Collaboration across research, infrastructure, and application teams will be key to delivering impactful AI solutions.

Responsibilities:

  • Research and develop state-of-the-art RL algorithms, focusing on large-model optimization and alignment techniques.
  • Design and implement RL training pipelines, including environment simulation, data generation, and reward function design.
  • Apply RL methods to enhance LLM / VLM / Agentic AI capabilities in reasoning, planning, and autonomous decision-making.
  • Collaborate with engineers and researchers to integrate RL solutions into enterprise AI platforms.
  • Monitor model performance in production and continuously improve it through iterative training and fine-tuning.

Requirements:

  • Master’s degree in Computer Science, Applied Mathematics, Machine Learning, or a related field.
  • 5+ years of hands-on experience in RL or LLM / VLM / Agentic AI optimization.
  • Strong coding skills in Python, with experience in ML frameworks and RL libraries.
  • Experience with large-scale distributed training and optimization.
  • Self-driven, with an ownership mindset and strong problem-solving skills. Excellent communication skills for cross-functional collaboration.

Why Binance

  • Shape the future with the world’s leading blockchain ecosystem
  • Collaborate with world-class talent in a user-centric global organization with a flat structure
  • Tackle unique, fast-paced projects with autonomy in an innovative environment
  • Thrive in a results-driven workplace with opportunities for career growth and continuous learning
  • Competitive salary and company benefits
  • Work-from-home arrangement (the arrangement may vary depending on the work nature of the business team)

Binance is committed to being an equal opportunity employer. We believe that having a diverse workforce is fundamental to our success.

By submitting a job application, you confirm that you have read and agree to our Candidate Privacy Notice.

