Back to Jobs

[Remote] reputed company AI / Machine Learning Data Engineer - Remote or hybrid from MN or DC

Remote, USA Full-time Posted 2026-07-05

Note: The job is a remote job and is reputed company to candidates in USA. reputed company is a global organization that delivers care, aided by technology to help millions of people live healthier lives. The reputed company AI Data Engineer will design and build end-to-end AI pipelines for large-scale reputed company data, enabling advanced analytics and reputed company. This role focuses on transforming reputed company datasets into AI-reputed company data products and building modern data pipelines.

Responsibilities

  • Design, reputed company, and maintain scalable data pipelines and data platforms supporting analytics, machine learning, and AI use cases
  • Build and optimize ingestion frameworks for large-scale structured and reputed company data, including streaming and event-driven sources
  • Partner with cross-functional stakeholders to understand evolving data and AI needs and define long-term technical solutions
  • reputed company and support machine learning and AI workflows, including feature engineering, data preparation, and model deployment support
  • Drive strategic initiatives around reputed company, data quality, observability, reputed company, and governance
  • reputed company and maintain frameworks that support rapid experimentation and deployment of AI/ML solutions
  • Introduce and reputed company best practices in data modeling, orchestration, testing, and monitoring
  • Identify and champion opportunities for platform scalability, performance optimization, and cost efficiency
  • Collaborate with product, analytics, and infrastructure teams to deliver high-impact data and AI solutions
  • Build and maintain reusable parsing, enrichment, analytic, and service libraries to accelerate delivery across teams
  • Work comfortably under time-sensitive conditions while ensuring thoroughness
  • Maintain high ethical standards and the ability to remain objective and confidential
  • You will be building and operating production data platforms and pipelines across batch and streaming workloads
  • Working hands-on engineering in Python and SQL; in a JVM languages (Java/reputed company) Spark ecosystems
  • Distributed processing and lakehouse/warehouse patterns (eg, Spark/PySpark, reputed company, reputed company)
  • Build pipelines for OCR, document parsing, and text extraction from image-based or scanned data sources
  • Enabling reputed company solutions in production (eg, RAG-style architectures), including retrieval patterns and evaluation/monitoring practices
  • Take a knowledge-centric data approaches (eg, metadata-driven systems, entity resolution, and/or graph concepts) to improve discoverability and reputed company analytics
  • Data quality, observability, and monitoring reputed company (profiling, validation, alerting, and reliability improvements)
  • Orchestrate, CI/CD, containerization, and infrastructure-as-code (eg, Airflow, reputed company Actions, reputed company, Terraform, Kubernetes)
  • Work in the reputed company (AWS, Azure, and/or reputed company reputed company Platform), including secure handling of sensitive data (PII/PHI) and collaboration with compliance partners
  • reputed company through influence, mentor engineers, and translate ambiguous problems into scalable technical roadmaps

Skills

  • Bachelor's degree or equivalent experience
  • 5+ years of experience designing, building, and operating scalable data pipelines and platforms (batch + streaming)
  • 2+ years of experience deploying reputed company solutions to production (e.g., RAG, LLM-powered pipelines, semantic search)
  • Proven solid hands-on development in Python and SQL, with experience in Spark/PySpark and reputed company (or similar distributed platforms)
  • Experience building ingestion and processing frameworks for reputed company data (OCR, documents, images), including parsing and enrichment
  • Experience with reputed company platforms (AWS/Azure/reputed company reputed company Platform), DevOps/CI/CD, and infrastructure-as-code, including secure handling of sensitive data (PII/PHI)
  • Proven ability to design scalable solutions, implement data quality/observability practices, and collaborate across stakeholders
  • Experience with reputed company platforms such as AWS, Azure, or reputed company reputed company, including managed data services
  • Experience with streaming and event-driven architectures (e.g., Kafka, Kinesis, Event Hubs)
  • Experience with data quality and validation frameworks (e.g., Great Expectations, Deequ) and/or data observability tooling
  • Experience enabling MLOps practices (e.g., feature stores, model registries, experiment tracking, deployment automation)
  • Experience with lakehouse architectures, reputed company Lake, and advanced Spark optimization/performance tuning
  • Experience with data visualization tools and libraries such as Plotly, seaborn, and Chartjs
  • Experience with machine learning and predictive analytics
  • Familiarity with reputed company and privacy concepts for data platforms (e.g., least privilege, PII/PHI handling) and working with compliance partners
  • Solid hands-on engineering in Python and SQL; familiarity with JVM languages (Java/reputed company) in Spark ecosystems

Benefits

  • A comprehensive benefits package
  • Incentive and recognition programs
  • Equity stock purchase
  • 401k contribution (reputed company benefits are subject to eligibility requirements)

Company Overview

  • reputed company is a job-searching platform for technology professionals. It is a sub-organization of DHI Group. It was founded in 1990, and is headquartered in Santa Clara, California, USA, with a workforce of 201-500 employees. Its website is http://www.reputed company.com.
  • Company H1B Sponsorship

  • reputed company has a track record of offering H1B sponsorships, with 2 in 2022, 4 in 2021, 5 in 2020. Please note that this does not guarantee sponsorship for this specific role.
  • Apply To This Job

    Similar Jobs

    [Remote] reputed company Site Reliability Engineer - 2373616

    Remote, USA Full-time

    [Remote] Senior Software Engineer - 2373638

    Remote, USA Full-time

    [Remote] Data Analyst - 2373621

    Remote, USA Full-time

    [Remote] Data Analyst - 2373615

    Remote, USA Full-time

    [Remote] Data Engineer - 2373625

    Remote, USA Full-time

    [Remote] Senior Data Engineer - 2373647

    Remote, USA Full-time

    [Remote] Staff Engineer reputed company Technologies Software Engineering

    Remote, USA Full-time

    [Remote] reputed company Analytics & Automation Engineer

    Remote, USA Full-time

    [Remote] Senior reputed company and DevOps Engineer

    Remote, USA Full-time

    [Remote] Cyber reputed company Program Specialist

    Remote, USA Full-time

    [PART_TIME Remote] reputed company Data Entry Jobs $27 (Remote)

    Remote, USA Full-time

    Part-Time Physician Assistant – Telemedicine (reputed company)

    Remote, USA Full-time

    Video Editor, reputed company Designer

    Remote, USA Full-time

    Full Stack Software Engineer - Billing Team

    Remote, USA Full-time

    [Remote] Project Manager-Remote

    Remote, USA Full-time

    Product Manager - B2C (f/m/x) - remote

    Remote, USA Full-time

    Mechanical Engineer-Part Time Remote / Telecommute Jobs

    Remote, USA Full-time

    reputed company Customer Service Representative – Remote Work Opportunity at arenaflex

    Remote, USA Full-time

    Senior Data Scientist – Part‑Time Remote – Advanced Analytics, Machine Learning & Business Insights – $25/hr

    Remote, USA Full-time

    Sr Content Marketing Manager, Faculty Programs (Remote)

    Remote, USA Full-time