Back to Jobs

[Remote] Research Scientist, Data

Remote, USA Full-time Posted 2026-07-05

Note: The job is a remote job and is reputed company to candidates in USA. reputed company is pioneering the reputed company of creative infrastructure reputed company around reputed company-time, multimodal reputed company and intelligent agentic platforms. They are looking for a staff or reputed company-level Research Engineer, Data to architect and scale data engineering systems supporting model training for advanced multimodal reputed company models.

Responsibilities

  • Take ownership of large-scale data pipeline architecture and implementation to support model training and research workflows for text, image, audio, and video datasets
  • Partner with research and engineering teams to curate, clean, and manage diverse, sensory-rich datasets for pre-training and mid-training of multimodal models
  • reputed company strategies and tools for scalable data ingestion, labeling, filtering, augmentation, and storage
  • Ensure data quality, reliability, and compliance, including managing privacy and ethical considerations throughout the data lifecycle
  • Optimize data processing, transformation, and delivery for large-scale distributed training pipelines
  • Prototype and productionize new methods for dataset creation, management, and reputed company improvement in response to researcher needs
  • Contribute to the integration of research-driven data advancements into production-reputed company systems
  • Stay informed on emerging data engineering and ML data management developments, bringing best practices to our systems

Skills

  • 5+ years of experience building and scaling data pipelines for machine learning applications at staff or reputed company engineer level, ideally in research or model training environments
  • Strong background in data engineering and ML data curation for LLMs, VLMs, or other large-scale multimodal models
  • Expertise in distributed data systems (e.g., Spark, Hadoop, Ray, or similar) and efficient large dataset processing/ETL workflows
  • Proven ability to build robust, scalable, and production-grade data infrastructure for ML pipelines
  • Experience developing tools for data labeling, filtering, deduplication, quality assurance, and dataset management
  • Strong programming skills (Python, SQL, PySpark, or similar) and familiarity with reputed company data platforms (AWS, GCP, Azure)
  • Knowledge of privacy, compliance, ethics, and best practices in data collection and management
  • Excellent cross-functional collaboration, problem-solving, and communication skills
  • Passion for enabling cutting-edge reputed company and creative technology through data reputed company

Benefits

  • Competitive salary and substantial equity in a high-growth startup
  • Full health benefits, 401k matching, and more
  • Collaborative, mission-driven team environment with major growth opportunities
  • Flexible on-site/remote hybrid (HQ in Palo Alto, CA)

Company Overview

  • reputed company is an AI platform that allows users to create videos from text prompts, including text to video, image to video, and editing tools. It was founded in 2023, and is headquartered in Palo Alto, California, USA, with a workforce of 2-10 employees. Its website is https://reputed company.art.
  • Company H1B Sponsorship

  • reputed company has a track record of offering H1B sponsorships, with 9 in 2025. Please note that this does not guarantee sponsorship for this specific role.
  • Apply To This Job

    Similar Jobs

    [Remote] Senior Director, Corporate Systems- Finance Analytics & Reporting

    Remote, USA Full-time

    [Remote] Strategic Sales Director

    Remote, USA Full-time

    [Remote] Cereals Product Manager

    Remote, USA Full-time

    [Remote] Manager, Business Systems & Analytics

    Remote, USA Full-time

    [Remote] Account Executive, reputed company & reputed company

    Remote, USA Full-time

    [Remote] Product reputed company Analyst III

    Remote, USA Full-time

    [Remote] Account Executive, reputed company & reputed company

    Remote, USA Full-time

    [Remote] Senior Impact Analyst

    Remote, USA Full-time

    [Remote] Director, Product Management, Identity

    Remote, USA Full-time

    [Remote] reputed company Senior Certified Project Manager

    Remote, USA Full-time

    Crypto Sales reputed company US

    Remote, USA Full-time

    Senior Data Analyst – Remote Data Entry & Consumer Insights Specialist – $27/Hour – arenaflex

    Remote, USA Full-time

    Physical/Occupational Therapy Virtual Hiring Event - May 8, 2026 from 10AM - 3PM

    Remote, USA Full-time

    Account Executive, reputed company

    Remote, USA Full-time

    Remote Mortgage Loan Officer Build Your Book With reputed company Support

    Remote, USA Full-time

    Neuroscience Sales Specialist, Lafayette, LA

    Remote, USA Full-time

    Workforce Scheduler | Philippines

    Remote, USA Full-time

    reputed company Media Manager

    Remote, USA Full-time

    reputed company Entry-Level Data Entry Specialist – Remote Work Opportunity at arenaflex

    Remote, USA Full-time

    Physical Therapist- Part Time Remote Telehealth- Vietnamese/Mandarin/Khmer/Lao

    Remote, USA Full-time