AI Engineer - Responsible AI

Remote, USA Full-time Posted 2026-06-25

Role Overview

Build the Future of Safe and Responsible AI

Are you an experienced AI engineer advancing the frontiers of AI safety, LLM jailbreak detection and defense, and agentic AI, with publications and production deployments to show for it? Join us to translate pioneering research into robust, scalable security systems and trustworthy LLM platforms that resist adversarial and behavioral exploits at enterprise scale.

The Mission

We're tackling cutting-edge AI safety across adversarial robustness, jailbreak defense, agentic workflows, and human-in-the-loop risk modeling. As an AI Engineer, you'll own high-impact projects from research conception through production deployment, directly contributing to our platform's security guarantees while building scalable, maintainable infrastructure.

What You'll Do

  • Advance AI Safety: Design, implement, and evaluate attack and defense strategies for LLM jailbreaks (prompt injection, obfuscation, narrative red teaming) and deploy them as production-grade services.
  • Build Scalable Safety Infrastructure: Architect and deploy distributed safety evaluation pipelines handling millions of requests, with real-time monitoring, alerting, and incident response capabilities.
  • Large-Scale Data Engineering: Design ETL pipelines for processing terabytes of safety-related data (attack patterns, behavioral logs, model outputs); build data lakes and feature stores for safety ML systems.
  • Evaluate AI Behavior: Analyze and simulate human-AI interaction patterns at scale to uncover behavioral vulnerabilities, social engineering risks, and over-defensive vs. permissive response tradeoffs.
  • Agentic AI Security: Build production workflows for multi-agent safety (agent self-checks, regulatory compliance, defense chains) spanning perception, reasoning, and action.
  • MLOps & Model Deployment: Deploy safety models to production using containerized microservices, implement CI/CD pipelines for model updates, and manage model versioning and A/B testing infrastructure.
  • Benchmark & Harden LLMs: Create reproducible, automated evaluation protocols for safety, over-defensiveness, and adversarial resilience across diverse models with continuous integration.

Example Problems You Might Tackle

  • Production Red-Teaming Platform: Build and operate an automated red-teaming infrastructure that continuously probes advanced LLMs (GPT-4o, GPT-5, LLaMA, Mistral, Gemma) at scale, with dashboards and alerting.
  • Real-Time Defense Systems: Implement context-aware, multi-turn attack detection and guardrail mechanisms as low-latency services handling 10K+ requests per second.
  • Agent Self-Regulation at Scale: Develop agentic architectures for autonomous self-check and self-correct with distributed orchestration and fault tolerance.
  • Safety Data Platform: Design and build data infrastructure for collecting, storing, and analyzing petabyte-scale safety telemetry with streaming analytics.

Minimum Qualifications

  • Master's degree in CS/EE/ML/Security or related field (Ph.D. preferred)
  • 2+ years of industry experience in applied ML/AI research or ML engineering
  • Track record of publications in AI Safety, NLP robustness, or adversarial ML (ACL, NeurIPS, ICML, EMNLP, IEEE S&P, etc.) or equivalent applied research impact
  • Strong Python and PyTorch/JAX skills with experience deploying ML models to production
  • Demonstrated experience in at least one of: LLM jailbreak attacks/defense, agentic AI safety, adversarial ML, or human-AI interaction vulnerabilities
  • Experience with containerization (Docker, Kubernetes) and cloud platforms (AWS, GCP, or Azure)
  • Proven ability to take research from concept to code to production deployment with rigorous testing and monitoring

Preferred Qualifications

  • Experience in adversarial prompt engineering and jailbreak detection (narrative, obfuscated, and sequential attacks)
  • Prior work on multi-agent architectures or robust defense strategies for LLMs in production environments
  • Experience with large-scale data processing frameworks (Spark, Flink, Kafka) and data warehousing
  • MLOps expertise: model serving (Triton, TensorRT, vLLM), experiment tracking (W&B, MLflow), and CI/CD for ML
  • Infrastructure as Code experience (Terraform, Pulumi) and DevOps best practices
  • Experience with distributed computing frameworks (Ray, Dask) for scalable training and evaluation
  • Familiarity with observability stacks (Prometheus, Grafana, Datadog) and incident management
  • First-author publications, strong GitHub profile, or significant open-source contributions

Our Stack

  • Modeling: PyTorch/JAX, Hugging Face, vLLM, Mistral, LLaMA, OpenAI APIs
  • Safety: Red-teaming frameworks, LLM benchmarking (SODE, ART, HarmBench), human behavior simulation
  • Infrastructure: Kubernetes, Docker, Terraform, AWS/GCP, Ray, Spark
  • MLOps: Triton Inference Server, Weights & Biases, MLflow, Airflow, ArgoCD
  • Data: PostgreSQL, Redis, Kafka, Snowflake/BigQuery, dbt
  • Observability: Prometheus, Grafana, DataDog, PagerDu
