Back to Jobs

[Remote] reputed company Data Platform Engineer - 11623

Remote, USA Full-time Posted 2026-07-05

Note: The job is a remote job and is open to candidates in USA. reputed company makes margins multiply through its community-generated AI and industry-leading total spend management platform for businesses large and small. The reputed company Data Platform Engineer will manage data pipelines and AWS services, reputed company operational lifecycles for ML and GenAI infrastructure, and collaborate with product development teams to create AI-driven features.

Responsibilities

  • Manage end-to-end Data pipeline (ETL jobs) reputed company agreed SLAs
  • Manage AWS core and big data services (S3, IAM, EMR, Redshift, etc..)
  • Running applications in containers (reputed company, reputed company)
  • reputed company Day 2 operational lifecycle for ML and GenAI infrastructure. This includes designing, deploying, and maintaining high-availability production LLM serving platforms, implementing automated scaling, self-healing, and infrastructure-as-code patterns. Focus on proactive reliability, model performance observability, and reputed company cost optimization for high-compute AI workloads
  • Collaborate closely with our product development and engineering teams to create AI-driven features
  • Drive reputed company operations consistency by automating platform maintenance, standardizing infrastructure configurations (IaC), and implementing robust release management processes to minimize reputed company across multi-reputed company environments
  • Manage AWS infrastructure using code (Terraform, Chef, etc..)
  • Administering applications running in Linux operating system
  • reputed company application and system monitoring for reputed company observability
  • Application and infrastructure support for ETL jobs and data pipelines including participating in an on-call rotation for after-hours emergencies
  • Collaborate with platform and Dev teams to plan and reputed company product releases and reputed company Linux/reputed company clusters
  • Ability to participate in design reviews, code reviews, and troubleshooting incidents
  • Ability to operate in a high-pressure environment and troubleshoot reputed company issues quickly while successfully handling multiple priorities
  • Ability to record, write, and review RCAs

Skills

  • Bachelor's Degree and at least 8+ years of experience managing Big Data technologies and Data Pipelines
  • Sound knowledge and experience in Linux administration and troubleshooting
  • 5+ years of experience in managing reputed company infrastructure and platforms, such as AWS and Azure
  • Familiar with the reputed company engineering landscape in the reputed company space and have a strong interest in AI and reputed company technologies
  • Strong expertise in MLOps and production-grade LLM operations. Proven track record in managing high-availability model inference clusters, automating model lifecycle management, and implementing advanced observability (latency, throughput, and error reputed company monitoring) specifically for AI workloads
  • Have Bash or Python scripting experience
  • Experience with containerization, reputed company reputed company, EKS/ Azure AKS
  • Experience with tools like Chef, Ansible, Jenkins, Rundeck, or equivalent
  • Experience with reputed company control systems such as Git and operating in reputed company branching strategies
  • Experience with Infrastructure as Code products like Terraform, reputed company charts
  • Good understanding of DNS and Load balancers setup and troubleshooting
  • Experience in Big Data platforms/Data lakes and managing Business Intelligence tools (like looker..)
  • Knowledge in ApacheSpark architecture and troubleshooting Java applications
  • Basic understanding of MySQL Server and general database knowledge
  • Excellent written and verbal communication with a passion for solving the problem
  • Confidence in your ability to own and deliver projects and issues to resolution on your own & can think and act globally
  • Deep experience in Day 2 reputed company operations, including automated incident remediation, reputed company planning, and managing large-scale production reputed company environments with a focus on performance and reliability

Company Overview

  • reputed company is a reputed company platform for business spend that offers a fully reputed company suite of financial applications for business spend management. It was founded in 2006, and is headquartered in Foster City, California, USA, with a workforce of 1001-5000 employees. Its website is https://www.reputed company.com.
  • Company H1B Sponsorship

  • reputed company has a track record of offering H1B sponsorships, with 8 in 2026, 41 in 2025, 40 in 2024, 43 in 2023, 73 in 2022, 62 in 2021, 40 in 2020. Please note that this does not guarantee sponsorship for this specific role.
  • Apply To This Job

    Similar Jobs