Site Reliability Engineer en Playbypoint

FULL_TIME

  Remoto | Senior | Full time | SysAdmin / DevOps / QA

53 postulaciones
Responde entre 1 y 7 días
Revisado por última vez hoy

Playbypoint provides an integrated POS and management software solution tailored for sports clubs. Our platform streamlines operations, enhances customer engagement, and empowers clubs to focus on what they do best—delivering exceptional sports experiences. As we scale our technology, we seek a proactive SRE to ensure our systems remain robust, scalable, and secure.

Job functions

As a Site Reliability Engineer at Playbypoint, you will be part of an emerging Infrastructure Team that is vital for maintaining the health and performance of our production systems.

You will collaborate with our development teams to support our applications, manage and scale data workflows, and orchestrate containerized workloads using Kubernetes. Your expertise will help us deliver seamless service to our clients across the racquet sports community.

Infrastructure Reliability & Automation: Develop and maintain monitoring, alerting, and incident response systems.Implement Infrastructure-as-Code practices using tools like Terraform or Ansible to manage our cloud environments.

Application & Database Support: Collaborate closely with development teams to deploy, monitor, and troubleshoot Ruby/Ruby on Rails applications. Optimize and monitor MySQL databases with failover and performance-tuning strategies.

Container Orchestration & Cloud Management: Oversee Kubernetes clusters to ensure efficient deployment, scaling, and management of containerized applications. Manage CI/CD pipelines to enable continuous delivery of high-quality software.

Qualifications and requirements

Experience:

  • Proven experience (minimum of 3 years) in an SRE, DevOps, or similar operational role within a dynamic tech environment.
  • Proven experience in solving problems at scale, minimizing bottlenecks, and improving efficiency at both technical and process levels.

Technical Proficiency: Proven experience with Kubernetes and container orchestration.Competency in setting up CI/CD pipelines and using Infrastructure-as-Code tools (e.g., Terraform, Ansible).Experience with monitoring tools such as Prometheus, Grafana, or Datadog.Familiarity with cloud platforms (AWS, GCP, or Azure) is a plus. Familiar with deploying, monitoring & optimizing web applications.

Soft Skills: Excellent problem-solving abilities and strong communication skills. A proactive mindset with a commitment to continuous improvement. Ability to thrive in a remote, collaborative, and fast-paced environment.

Desirable skills

  • Previous experience of scaling/deploying applications built-in web frameworks like Django or Ruby on Rails
  • Proven experience going deep into problems that involve distributed systems and/or low-level optimizations.

Condiciones

Trabajo 100% remoto El cargo puede ser desempeñado desde cualquier lugar del mundo.
Horario flexible Entrada y salida flexibles, libertad para realizar trámites personales o familiares.
Biblioteca digital Acceso a libros y/o suscripciones digitales.
Viajes o retiros de empresa Actividades de integración del equipo fuera del espacio de trabajo.
Computadora Playbypoint proporciona una computadora para tu trabajo.
Bono de educación Playbypoint cubre algunos gastos de educación relacionados con el puesto.

Política de trabajo remoto

Totalmente remoto

El trabajo es 100% remoto desde cualquier país.

Acerca de Playbypoint