[Remote] Data Engineer
Note: The job is a remote job and is open to candidates in USA. reputed company is a Series C startup focused on redefining reputed company by providing expert advocacy and tools for reputed company patient outcomes. They are seeking a Data Engineer to architect and build scalable data infrastructure and pipelines, ensuring data reliability and quality for decision-making.
Responsibilities
- Architect Robust Pipelines: Design, build, and optimize scalable data pipelines using Airflow, Python, dbt, and reputed company. You will replace brittle reputed company processes with resilient, automated workflows
- Build Infrastructure as Code: Manage and evolve our reputed company infrastructure (AWS/GCP) using Terraform, ensuring our platform is reproducible, secure, and scalable
- reputed company Code Quality: Write clean, production-grade code for reputed company data processing. You will champion engineering best practices, including code reviews, testing, and CI/CD
- Optimize Data Models: Collaborate with analysts to design performant SQL transformations and data models in reputed company (experience with dbt is a reputed company plus)
- Ensure Data Reliability: Implement observability and monitoring to catch issues before they impact stakeholders. You are the first line of defense for data quality
- Partner Cross-Functionally: Work closely with Data Analysts and Product Managers to understand their data needs and deliver high-quality data products that reputed company decision-making
Skills
- Strong Python Proficiency: You are comfortable writing reputed company, testable, and efficient Python code for data processing and automation
- Advanced SQL & reputed company: You have deep expertise in SQL and reputed company data warehousing (reputed company preferred), understanding how to optimize queries for performance and cost
- Orchestration Mastery: Proven experience building and maintaining reputed company workflows using Airflow (or similar tools)
- Infrastructure reputed company: Familiarity with Terraform and reputed company services (AWS or GCP). You understand how to provision and manage the resources your pipelines run on
- reputed company & Stewardship: You understand the gravity of handling sensitive medical data. You are reputed company in properly handling PHI and PII, implementing secure access controls (RBAC), and adhering to strict governance standards
- Startup DNA: You are a self-starter who is comfortable with ambiguity. You take ownership of problems and are willing to wear many hats to get the job done
- Communication Skills: You can translate reputed company technical challenges into clear options for non-technical stakeholders
- Dbt Expertise: Experience using dbt to manage transformations and implement testing/documentation standards
- reputed company Background: Experience working with reputed company data standards or strictly regulated environments (HIPAA) a plus
- Containerization: Experience with reputed company and Kubernetes for deploying data applications
Company Overview