Back to Jobs

Network Reliability Engineer

Remote, USA Full-time Posted 2026-06-11

#HPC #AI #GPU #CLUSTERS YOUR DAILY ROUTINE - Build a large AI infrastructure with monitoring, diagnosis, and remediation of production incidents- Troubleshoot high-impact production issues in collaboration with other engineering teams - Participate in an on-call rotation to handle incidents and ensure service continuity - Implement and maintain observability solutions to monitor AI infrastructure and application health - Contribute to AI infrastructure lifecycle management across different environments and countries - Promote and apply best practices in terms of stability, resiliency, scalability, and security - Maintain clear technical documentation for tools and procedures - Contribute to system and tool evolution based on production feedback - Collaborate closely with development teams to ensure infrastructure readiness- Participate in team rituals and knowledge-sharing initiatives ABOUT YOU 🎯 SOFTSKILLS : - Proactive and solution-oriented mindset - Passion for automation and continuous improvement - Strong collaboration and communication skills - Ability to work independently and in a team - Willingness to mentor and share knowledge 💻 HARDSKILLS : - Experience with Go or Python - Strong scripting skills (Bash, Python) - Hands-on experience with Linux systems (Ubuntu/Debian) - Preferred hands-on experience with GPU & HPC infrastructure - Knowledge of networking (TCP/IP, DNS, BGP, load-balancing, IPv6, etc.) - Familiarity with monitoring and logging tools (Prometheus, Grafana, Elastic, etc.) - Comfortable with Infrastructure-as-Code (Ansible, Salt, AWX, etc.) - Experience managing relational databases (MariaDB) - Understanding of CI/CD pipelines (GitLab) - Comfortable with English (written and spoken) \n \n200 zł - 250 zł an hour \n Apply To This Job

Similar Jobs

Albanian SEO Specialist

Remote, USA Full-time

Entry-level: Business Development Representative (BDR) – Sales Development (w/m/d)

Remote, USA Full-time

Consumer Safety Executive-PLM

Remote, USA Full-time

Team Lead - Financial Services Operations

Remote, USA Full-time

Specialist - Financial Services Operations

Remote, USA Full-time

Staff Accountant - India Exchange

Remote, USA Full-time

Senior Associate- Financial Services Operations

Remote, USA Full-time

Senior Regulatory Affairs Associate- Clinical Trial Application

Remote, USA Full-time

Communications Consultant

Remote, USA Full-time

GTM & Agentic Operations

Remote, USA Full-time

Experienced Customer Service Representative – Delivering Exceptional Financial Services Experience at arenaflex

Remote, USA Full-time

[REMOTE] Medicare Sales – $200-$350/App + 100% Inbound Leads (WE RETAIN YOUR CLIENTS FOR YOU)

Remote, USA Full-time

WORK FROM HOME/HOME BASED INSURANCE AGENT

Remote, USA Full-time

Coding Quality Associate Analyst

Remote, USA Full-time

Director of Decentralized Clinical Trials – Remote Part‑Time Leadership Role in Innovative Clinical Research at arenaflex ($26/hr)

Remote, USA Full-time

Remote Customer Service Lead Engineer – Voice Infrastructure & Open‑Source Telephony (Hybrid/Flex) – $25/hr – CA – arenaflex

Remote, USA Full-time

Manager, Manufacturing Engineering

Remote, USA Full-time

Experienced Data Entry and Content Writer for arenaflex Outdoor Guidebooks

Remote, USA Full-time

Manager, GOP Services

Remote, USA Full-time

[Remote] Financial Analyst I OCR

Remote, USA Full-time