TopGenAIJobs

TopGenAIJobs

A Gen AI & Agentic AI Jobs Platform to discover high-quality Gen AI and Agentic AI opportunities from top companies worldwide.

topgenaijobs.com

Quick Links

  • Home
  • Browse Jobs
  • Browse by Category
  • Companies
  • Post a Job
  • Career Resources
  • About Us

Resources

  • Blog
  • Career Guide
  • Resume Tips
  • Interview Prep
  • Salary Guide
  • Skill Demand Index

Top Gen AI Roles

  • Gen AI Engineer Jobs
  • Agentic AI Engineer Jobs
  • Prompt Engineer Jobs
  • LLM Engineer Jobs
  • RAG Engineer Jobs
  • MLOps Engineer Jobs
  • Remote AI Jobs
  • Entry Level AI Jobs
  • Senior AI Jobs

Legal

  • Privacy Policy
  • Terms of Service
  • Cookie Policy
  • Contact

© 2026 TopGenAIJobs (Gen AI & Agentic AI Jobs Platform). All rights reserved.

Made with ❤ by TopGenAIJobs Team

    Home/Jobs/RL AI Research Scientist

    RL AI Research Scientist

    Pokee

    Singapore
    3-5 years
    Today
    ₹13–25 LPA
    Full-time
    Remote

    Skills Required

    LLM
    Reinforcement Learning
    Machine Learning
    Python
    PyTorch
    Deep Learning
    Policy Gradient Methods
    Value-based Methods
    Model-based RL
    Multi-agent RL
    RLHF/RLAIF

    Description

    Join Pokee AI as a Research Scientist and work on designing, implementing, and scaling novel reinforcement learning algorithms for their AI agent platform.

    Company: Pokee AI

    Role: RL AI Research Scientist

    Location: Remote (US/Singapore Preferred)

    Experience:

    • PhD (or equivalent research experience) in Reinforcement Learning, Machine Learning, or a closely related field
    • Strong publication record at top venues (NeurIPS, ICML, ICLR, AAAI, or equivalent)

    Key Skills:

    • Reinforcement Learning
    • Machine Learning
    • Python
    • Deep Learning
    • PyTorch

    Qualification:

    • PhD (or equivalent research experience) in Reinforcement Learning, Machine Learning, or a closely related field

    Role Focus:

    • Design and implement novel RL algorithms for training AI agents on complex, multi-step enterprise workflows
    • Develop and refine reward modeling, context selection, and policy optimization techniques that improve agent accuracy over extended task horizons
    • Run large-scale experiments, analyze results rigorously, and translate research findings into production-ready components
    • Collaborate closely with infrastructure engineers to ensure research prototypes scale efficiently on both cloud and on-device hardware
    • Contribute to the company’s intellectual property through publications, patents, and open-source contributions

    Nice to have:

    • Experience with on-device or edge inference optimization (quantization, distillation, MoE architectures)
    • Familiarity with enterprise software deployment, compliance, or regulated industries
    • Track record of open-source contributions in RL or LLM ecosystems
    • Experience with distributed training at scale (FSDP, DeepSpeed, Megatron)
    • Experience training and fine-tuning large language models

    Other:

    • Opportunity to shape the future of enterprise AI from the ground up
    • Access to cutting-edge research
    • Direct impact on the product

    Prepare for this role

    Recommended resources to build the skills for this position. Sponsored.

    Python for Everybody Specialization

    Coursera

    Learn Python from scratch — variables, data structures, web scraping, and databases.

    Generative AI with Large Language Models

    Coursera

    Comprehensive LLM course covering transformer architecture, fine-tuning, RLHF, and deployment.

    Python 3 Programming Specialization

    Coursera

    Intermediate Python covering classes, inheritance, APIs, and data processing.

    More LLM jobs

    AI Architect

    QISG

    Houston

    Today

    Generative AI Integration Engineer

    Anduril Industries

    Costa Mesa

    Today

    AI Engineer

    OneMagnify

    Remote

    Today

    AI Platform Engineer

    Bloomerang

    United States

    Today

    AI Platform Engineer

    Bloomerang

    United States

    Today

    AI Agent Architect, Customer Experience

    Airtable

    United States

    1 day ago