Talent.com
Data Scientist (Reinforcement Learning / LLM Agent / Vision Language Model - either 1)

Binance • WorkFromHome, Southland, New Zealand
12 hours ago
Job description

Binance is a leading global blockchain ecosystem behind the world’s largest cryptocurrency exchange by trading volume and registered users. We are trusted by over 280 million people in 100+ countries for our industry-leading security, user fund transparency, trading engine speed, deep liquidity, and an unmatched portfolio of digital-asset products. Binance offerings range from trading and finance to education, research, payments, institutional services, Web3 features, and more. We leverage the power of digital assets and blockchain to build an inclusive financial ecosystem to advance the freedom of money and improve financial access for people around the world.

About the Role

You will develop and optimize Reinforcement Learning (RL) models for enterprise-scale applications such as customer service, token reporting, compliance, and Web3 domain reasoning.

You will explore and evaluate advanced algorithms, including PPO, GRPO, DPO, RLHF, RLAIF, and agentic RL, to enhance the capabilities of LLMs, VLMs, and agentic AI at Binance. The role requires a strong theoretical foundation in RL (covering policy optimization, reward modeling, and planning) paired with the engineering skills to build scalable production systems.

You will take full ownership from research through deployment, driving experimentation with systematic evaluation and benchmarking. Collaboration across research, infrastructure, and application teams will be key to delivering impactful AI solutions.

Responsibilities:

  • Research and develop state-of-the-art RL algorithms, focusing on large-model optimization and alignment techniques.
  • Design and implement RL training pipelines, including environment simulation, data generation, and reward function design.
  • Apply RL methods to enhance LLM / VLM / agentic AI capabilities in reasoning, planning, and autonomous decision-making.
  • Collaborate with engineers and researchers to integrate RL solutions into enterprise AI platforms.
  • Monitor model performance in production and continuously improve it through iterative training and fine-tuning.

Requirements:

  • Master’s degree in Computer Science, Applied Mathematics, Machine Learning, or a related field.
  • 5+ years of hands-on experience in RL or LLM / VLM / Agentic AI optimization.
  • Strong coding skills in Python, with experience in ML frameworks and RL libraries.
  • Experience with large-scale distributed training and optimization.
  • Self-driven, with an ownership mindset and strong problem-solving skills. Excellent communication skills for cross-functional collaboration.

Why Binance

  • Shape the future with the world’s leading blockchain ecosystem
  • Collaborate with world-class talent in a user-centric global organization with a flat structure
  • Tackle unique, fast-paced projects with autonomy in an innovative environment
  • Thrive in a results-driven workplace with opportunities for career growth and continuous learning
  • Competitive salary and company benefits
  • Work-from-home arrangement (the arrangement may vary depending on the work nature of the business team)

Binance is committed to being an equal opportunity employer. We believe that having a diverse workforce is fundamental to our success.

By submitting a job application, you confirm that you have read and agree to our Candidate Privacy Notice.
