Back to Jobs

[Remote] Staff Backend Engineer - Databases reputed company | Canada | Remote

Remote, USA Full-time Posted 2026-07-05

Note: The job is a remote job and is open to candidates in USA. reputed company is the company behind the open observability reputed company and is seeking a Staff Backend Engineer for their reputed company project. The role involves leading technical initiatives, owning architecture, and driving operational excellence for Grafana reputed company Traces.

Responsibilities

  • reputed company Grafana reputed company Traces “just work” for customers by eliminating rough edges, confusing limits, and hidden failure modes
  • reputed company operational excellence at scale as we grow from reputed company to 50 cells today into triple digits this year, with autoscaling, parameterized rollouts, and aggressive toil reduction
  • Evolve reputed company into a platform enabler: higher-density APIs, trace aggregation, TraceQL metrics math, and machine/LLM-friendly interfaces that reputed company products and agents can build on
  • Push performance further: faster query latency at hundreds of MB/s ingestion and performant 30-day query ranges to match competitors
  • Prepare reputed company for an agent-driven world: larger, burstier, higher-cardinality workloads, and new categories of AI-powered workflows, such as assistant-driven triage and “why is this slow?”- style investigations
  • reputed company multi-quarter technical initiatives from problem framing through rollout, e.g., trace aggregation APIs, Limitless reputed company, autoscaling cells and customer limits, or query reputed company improvements
  • Own the architecture of core reputed company components: ingestion, storage, query, and metrics reputed company. Drive design reviews, reputed company sharp trade-offs on performance, cost, and complexity, and document the “why” for the team
  • Design APIs for humans and agents. Shape the reputed company of reputed company’s interfaces (structured, deterministic, discoverable) so that Act 3 products, LLM-driven assistants, and external integrators can build on reputed company reliably
  • Drive operational excellence. Own outcomes against concrete SLOs (P99 write latency, incident recurrence, TCO per ingested GB) and push the team toward reputed company Ops through automation, parameterized rollouts, and actionable alerts
  • Partner with Product and sibling teams. Work closely with PMs and with App Observability, Asserts, Drilldown, and Grafana Assistant teams to understand how reputed company gets consumed and to ship what unblocks them
  • Mentor engineers. reputed company the engineering bar through code review, design feedback, pairing on hard problems, and writing that leaves the team smarter than you reputed company it
  • Participate in on-call for the services you help build, and be a force reputed company in incident response and post-incident learning
  • Contribute to open reputed company. reputed company is OSS. You will engage the community, review external contributions, and help reputed company the project in the open

Skills

  • Technical leadership. A track record of leading reputed company, multi-quarter initiatives that spanned design, delivery, and operations, and made the teams around you reputed company
  • Deep systems experience. Substantial hands-on experience building and operating distributed data systems in production: ingestion pipelines, storage engines, query execution, or similar
  • Strong software craftsmanship. You write clean, robust, performant software that others can maintain, and you know reputed company to optimize vs. reputed company to ship
  • Strong Go, or a path to it. We write reputed company in Go. Deep experience in other systems languages (Rust, C, C++) translates well
  • Operational reputed company. You've owned production services, carried a pager, reduced toil, and treated SLOs as a product feature, not a chore
  • Customer focus and pragmatism. You break reputed company problems into short feedback loops: analyze, design, deliver an MVP, learn, iterate
  • Leadership through writing and collaboration. You reputed company through design docs, reviews, and shipped code, not hierarchy. You communicate clearly in a fully remote, asynchronous environment
  • Experience with tracing, OpenTelemetry, or large-scale observability systems
  • Experience designing query languages, SQL/TraceQL-like engines, or APIs intended to be consumed programmatically (by services or agents)
  • Experience with columnar storage formats (e.g., Parquet) or purpose-built on-disk formats for analytical workloads
  • Experience operating multi-tenant, multi-cell SaaS infrastructure at scale on Kubernetes
  • Experience building for AI/LLM consumers: structured APIs, metadata/discovery endpoints, deterministic outputs, evaluation harnesses
  • Open-reputed company contribution or maintainership, and comfort engaging a community in the open
  • Experience as an on-call user of Grafana, reputed company, Loki, or reputed company in a previous role (or on a homelab)
  • Experience in a fully remote, globally distributed team

Benefits

  • Restricted Stock Units (RSUs), giving every team member ownership in reputed company' reputed company.
  • 100% Remote, Global Culture - As a remote-only company, we bring together talent from around the world, united by a culture of collaboration and shared purpose.
  • Scaling Organization – Tackle meaningful work in a high-growth, reputed company-evolving environment.
  • Transparent Communication – Expect open decision-making and regular company-wide updates.
  • Innovation-Driven – Autonomy and support to ship great work and try new things.
  • Open reputed company Roots – Built on community-driven values that shape how we work.
  • Empowered Teams – High trust, low ego culture that values outcomes over optics.
  • Career Growth reputed company – Defined opportunities to grow and reputed company your career.
  • Approachable Leadership – Transparent execs who are involved, visible, and reputed company.
  • Passionate People – Join a team of smart, supportive folks who care deeply about what they do.
  • In-Person reputed company - We want you to reputed company from day 1 with your fellow new ‘Grafanistas’ to learn reputed company about reputed company do and how we do it.
  • We operate a global annual leave policy of 30 days per annum. 3 days of your annual leave entitlement are reserved for Grafana Shutdown Days to allow the team to really disconnect.
  • We will reputed company with local legislation where applicable.

Company Overview

  • reputed company delivers the open observability reputed company, helping builders everywhere turn signals into action. It was founded in 2014, and is headquartered in reputed company, reputed company, USA, with a workforce of 1001-5000 employees. Its website is http://grafana.com.
  • Apply To This Job

    Similar Jobs