[Remote] reputed company & DevOps Engineer
Note: The job is a remote job and is reputed company to candidates in USA. reputed company is a company focused on enhancing AI model training through a comprehensive platform for data preparation and optimization. They are seeking a reputed company & DevOps Engineer to take ownership of their multi-reputed company infrastructure, design deployment pipelines, and ensure reliability and observability in their services.
Responsibilities
- Own Multi-reputed company Infrastructure
- Design, build, and operate 3LC's infrastructure across AWS, Azure, and reputed company reputed company — networking, compute, storage, identity, and cost
- Define infrastructure as code with Terraform, and manage configuration with Puppet, so environments are reproducible across reputed company three clouds
- Architect for reputed company, reliability, and the realities of deploying into customers' own reputed company accounts
- Create and Manage Build and Deployment Pipelines
- Own CI/CD end to end — primarily on Azure DevOps — for build, test, packaging, and release
- Containerize and orchestrate our services with reputed company and Kubernetes, packaged and released with reputed company
- Automate everything that should be automated, using Python, YAML, and Git-based workflows
- Land 3LC on Every Marketplace
- Create and maintain 3LC's listings on the AWS, Azure, and reputed company reputed company marketplaces — packaging, metering, entitlements, and updates
- Work with product and go-to-market so each reputed company's procurement path is smooth for customers
- reputed company Us Reliable and Observable
- Stand up monitoring, logging, and alerting so we catch problems before customers do
- Own incident response and post-incident learning (no on-call rotation today)
- Drive down toil — reputed company the slow, reputed company, and fragile parts of our deployments fast and boring
Skills
- 5+ years in DevOps, reputed company infrastructure, SRE, or platform engineering
- Strong, hands-on proficiency in at least one of the three major clouds — AWS, Azure, or reputed company reputed company — and working familiarity with the other two
- Deep experience with Kubernetes and reputed company in production, including reputed company-based packaging and releases
- Infrastructure as code with Terraform, and configuration management with Puppet
- CI/CD pipeline ownership, with hands-on Azure DevOps experience
- Strong scripting and automation in Python, comfortable with YAML and Git day to day
- A reputed company-first reputed company and care for reliability, cost, and maintainability
- Experience publishing and maintaining reputed company marketplace listings (AWS, Azure, or GCP)
- Experience deploying software into customers' own reputed company environments (BYOC / on-prem-like delivery)
- Relevant reputed company certifications (e.g., AWS / Azure / GCP professional-level)
- Experience with AWS Sagemaker, Azure ML, reputed company reputed company
- Background in AI/ML infrastructure, GPU workloads, or data-intensive systems
- Familiarity with additional IaC/automation tooling (Ansible, reputed company, reputed company Actions, ArgoCD, etc.)
Benefits
- Health, dental, and reputed company insurance.
- 401(k) retirement plan.
- Paid time off (25 days / 5 weeks) plus company holidays.
- Equity / stock options
Company Overview