Talent.com
No longer accepting applications
Data Scientist (Reinforcement Learning / LLM Agent / Vision Language Model - either 1)

Binance • WorkFromHome, Hawke's Bay, New Zealand
19 days ago
Job description

Binance is a leading global blockchain ecosystem behind the world’s largest cryptocurrency exchange by trading volume and registered users. We are trusted by over 280 million people in 100+ countries for our industry-leading security, user fund transparency, trading engine speed, deep liquidity, and an unmatched portfolio of digital-asset products. Binance offerings range from trading and finance to education, research, payments, institutional services, Web3 features, and more. We leverage the power of digital assets and blockchain to build an inclusive financial ecosystem to advance the freedom of money and improve financial access for people around the world.

About the Role

You will develop and optimize Reinforcement Learning (RL) models for enterprise-scale applications such as customer service, token reporting, compliance, and Web3 domain reasoning.

You will explore and evaluate advanced algorithms including PPO, GRPO, DPO, RLHF, RLAIF, and agentic RL to enhance the capabilities of LLMs, VLMs, and Agentic AI at Binance. The role requires a strong theoretical foundation in RL (covering policy optimization, reward modeling, and planning), paired with the engineering skills to build scalable production systems.

You will take full ownership from research through deployment, driving experimentation with systematic evaluation and benchmarking. Collaboration across research, infrastructure, and application teams will be key to delivering impactful AI solutions.

Responsibilities:

  • Research and develop state-of-the-art RL algorithms, focusing on large-model optimization and alignment techniques.
  • Design and implement RL training pipelines, including environment simulation, data generation, and reward function design.
  • Apply RL methods to enhance LLM / VLM / Agentic AI capabilities in reasoning, planning, and autonomous decision-making.
  • Collaborate with engineers and researchers to integrate RL solutions into enterprise AI platforms.
  • Monitor model performance in production and continuously improve it through iterative training and fine-tuning.

Requirements:

  • Master’s degree in Computer Science, Applied Mathematics, Machine Learning, or a related field.
  • 5+ years of hands-on experience in RL or LLM / VLM / Agentic AI optimization.
  • Strong coding skills in Python, with experience in ML frameworks and RL libraries.
  • Experience with large-scale distributed training and optimization.
  • Self-driven, ownership mindset, and strong problem-solving skills. Excellent communication skills for cross-functional collaboration.

Why Binance

  • Shape the future with the world’s leading blockchain ecosystem
  • Collaborate with world-class talent in a user-centric global organization with a flat structure
  • Tackle unique, fast-paced projects with autonomy in an innovative environment
  • Thrive in a results-driven workplace with opportunities for career growth and continuous learning
  • Competitive salary and company benefits
  • Work-from-home arrangement (the arrangement may vary depending on the work nature of the business team)

Binance is committed to being an equal opportunity employer. We believe that having a diverse workforce is fundamental to our success.

By submitting a job application, you confirm that you have read and agree to our Candidate Privacy Notice.
