Job Description

About the Role
We are seeking a skilled and versatile Senior Platform Engineer / SRE with strong backend development experience and a passion for system reliability and operational excellence. This role combines responsibilities across backend engineering, infrastructure reliability, release management, and production support.
The ideal candidate will be responsible for ensuring the stability and scalability of cloud-native backend systems while coordinating effectively with multiple product teams to manage releases and troubleshoot production issues. Strong communication and multitasking skills are essential, along with a solid grasp of backend development principles – preferably in Golang, our core language.

Experience: 6+ Years

Core Responsibilities
Backend & Platform Engineering
Design, build, and maintain scalable microservices and cloud-native systems (primarily in Golang; Java & Ruby experience a plus).
Participate in backend development and review code changes across multiple projects.
Build and manage event-driven architectures (AWS SNS/SQS, Lambda).
Develop and maintain data pipelines using AWS Glue, S3, and Athena.
Collaborate with DevOps and development teams to improve infrastructure reliability and delivery speed.
Implement and optimize CI/CD pipelines (AWS CodePipeline, GitHub Actions, etc.).
Ensure observability through tools like OpenTelemetry, Sentry, and custom metrics.
Production Support & System Reliability
Own production support activities, focusing on system uptime, performance, and reliability.
Troubleshoot complex issues using monitoring/logging tools; identify root causes, correlations, and performance bottlenecks.
Perform backend-level debugging and optimize database performance (PostgreSQL, Redis).
Apply SRE best practices to drive operational improvements, incident response, and postmortems.
Release Management & Coordination
Lead and manage release cycles, coordinating with multiple product teams.
Ensure adherence to release timelines and handle change requests efficiently.
Use project management tools (e.g., Jira) to track progress, manage sprints, and facilitate cross-team collaboration.
Communicate clearly with technical and non-technical stakeholders.

Must-Have Skills
Backend Development: Strong experience with microservices; expertise in Golang, or in Java and Ruby.
Cloud Platforms: Deep hands-on experience with AWS services like ECS, Lambda, S3, Glue, SNS/SQS.
DevOps & Infra: Terraform, Docker, CI/CD tools (AWS CodePipeline, Jenkins, GitHub Actions).
Monitoring & Testing: Experience with observability tools such as Sentry and OpenTelemetry, and with testing frameworks such as Pytest.
Databases: Proficiency in PostgreSQL and Redis.
Project/Release Management: Experience using Jira or equivalent; ability to lead releases and collaborate across teams.
Debugging & Performance Tuning: Strong troubleshooting skills at backend and database layers.

Preferred Skills
Proven experience in Site Reliability Engineering (SRE) or Platform Engineering roles.
Familiarity with FastAPI, Django, or similar frameworks.
Exposure to HIPAA-compliant or other regulated environments.
Understanding of infrastructure security best practices.
Familiarity with LLM-based systems and vector databases is a plus.

Drop your resume at deepthi@hamon.in