
Staff Machine Learning Engineer - Agentic AI

Remote, USA Full-time Posted 2026-06-25

Job Description

Team: AI Agents | Location: Melbourne / Sydney  

What we have built

We run production AI agents that autonomously resolve customer service tickets across 100,000+ Zendesk accounts. Each agent takes a customer issue, decomposes it into a multi-step plan, executes real actions (refunds, order modifications, escalations) through live APIs, and closes the ticket without a human in the loop.

The core uses a proprietary iterative architecture: goals decompose into plans, reusable skills are pulled from a registry, execution is evaluated, and the result feeds the next attempt. Successful resolution patterns are synthesised into new skills and written back into the registry, so the system learns from its own execution history.

On GAIA-class multi-step tool-use benchmarks, our agents match the best published results. Internally, 158+ scenario-based evals run continuously against real Zendesk tickets, scored through Braintrust with regression detection on every deploy.

What you will own

  • Architecture: The iterative planner works. What we have not solved: plan decomposition under ambiguous goals, memory-tier interference across concurrent sessions, over-eager skill acquisition, and multi-agent delegation via A2A. These are yours to take on.

  • Domain-specialised training: We are building toward RL-trained models specialised for customer service resolution. The data pipeline is instrumented. The next step (reward curricula, rollout systems, feedback loops) is a 6–12 month build. You own both the science and the systems.

  • Evaluation infrastructure: 158+ evals run continuously, but multi-turn evaluation and automated trajectory analysis are early. You will build the quality gates that block deploys when performance drops, integrated into CI from the start.

  • Guardrails at scale: Tool misuse, cascading action chains, prompt injection, hallucination loops: the threat surface for autonomous agents at enterprise scale is real. You will design the multi-layered defences (supervisor patterns, capability-based access control, output validation) that work across thousands of concurrent sessions without adding latency.

What we are looking for

  • 5+ years building production ML/AI systems. You have shipped agent architectures that handle planning, tool dispatch, memory, and failure recovery. If your experience is LangChain tutorials, this is not the right fit.

  • You have built internal evals because you know why public benchmarks lie, and you have the scars to prove it.

  • Python and PyTorch fluency, plus at least one agent framework and the judgment to know when to throw it out and build custom.

  • Bonus: genuine depth in RL for language models: reward shaping, online/offline tradeoffs, reward hacking as a diagnostic. We are building toward domain-specialised training and need someone who can lead that work.

The intelligent heart of customer experience

Zendesk software was built to bring a sense of calm to the chaotic world of customer service. Today we power billions of conversations with brands you know and love.

Zendesk believes in offering our people a fulfilling and inclusive experience. Our hybrid way of working enables us to purposefully come together in person, at one of our many Zendesk offices around the world, to connect, collaborate and learn, whilst also giving our people the flexibility to work remotely for part of the week.

As part of our commitment to fairness and transparency, we inform all applicants that artificial intelligence (AI) or automated decision systems may be used to screen or evaluate applications for this position, in accordance with Company guidelines and applicable law.

Zendesk is an equal opportunity employer, and we’re proud of our ongoing efforts to foster global diversity, equity, & inclusion in the workplace. Individuals seeking employment and employees at Zendesk are considered without regard to race, color, religion, national origin, age, sex, gender, gender identity, gender expression, sexual orientation, marital status, medical condition, ancestry, disability, military or veteran status, or any other characteristic protected by applicable law. We are an AA/EEO/Veterans/Disabled employer. If you are based in the United States and would like more information about your EEO rights under the law, please click here.

Zendesk endeavors to make reasonable accommodations for applicants with disabilities and disabled veterans pursuant to applicable federal and state law. If you are an individual with a disability and require a reasonable accommodation to submit this application, complete any pre-employment testing, or otherwise participate in the employee selection process, please send an e-mail to [email protected] with your specific accommodation request.
