Site Reliability Engineer

Role info
Consultant
Full Time
Bengaluru
Competitive

The role

We are looking for an experienced Backup & Disaster Recovery (DR) Engineer to ensure all virtual machines, on-premises infrastructure, and cloud platforms are fully protected, recoverable, and compliant with enterprise resiliency standards. This role requires strong hands-on expertise in cloud backup technologies, including Azure Backup and Recovery Services Vaults, AWS backup services, and Commvault, along with operational knowledge of disaster recovery solutions such as Azure Site Recovery (ASR) and equivalent AWS services. The engineer will work closely with Customer SOC and Resiliency teams to ensure core applications and infrastructure are DR-ready, tested, and compliant.


Responsibilities

Key Responsibilities

Backup Management & Operations (Core Responsibility)

· Design, implement, and manage backup strategies for:

o Virtual Machines (Windows & Linux)

o On-premises infrastructure

o Cloud workloads and platform services

· Ensure all systems are:

o Properly backed up

o Meeting recovery point objective (RPO) and recovery time objective (RTO) requirements

o Monitored for backup health and failures

· Perform regular backup verification and reporting

· Act as the technical escalation point for backup-related incidents

---

Cloud Backup Technologies (Mandatory Focus)

· Strong hands-on experience with:

o Azure Backup

o Azure Recovery Services Vaults

o Equivalent AWS backup services

· Configure and manage:

o Backup policies

o Retention policies

o Cross-region backup where applicable

· Ensure cloud workloads are aligned with enterprise backup standards

---

Commvault Backup & Restore

· Hands-on experience with Commvault for:

o Backup configuration

o Restore operations

o Troubleshooting backup failures

· Perform restore testing and support recovery during incidents

· Ensure Commvault environments are maintained, patched, and optimized

---

Disaster Recovery & Resiliency (Mandatory)

· Design and manage Disaster Recovery (DR) solutions using:

o Azure Site Recovery (ASR)

o Equivalent AWS DR services

· Plan and execute:

o DR drills

o Failover and failback activities

o Documentation and validation of recovery procedures

· Work closely with Resiliency teams to ensure:

o Core applications are DR-enabled

o Infrastructure meets resiliency and compliance requirements

· Maintain DR runbooks and recovery documentation

---

Patching, Compliance & VM Health

· Ensure VMs are:

o Patched regularly

o Secure and compliant with policies

· Validate that patching does not impact backup or DR configurations

· Work with platform and operations teams to remediate compliance gaps

---

Security & SOC Collaboration

· Work closely with Customer SOC teams to:

o Ensure backup and DR logs are available for audits and investigations

o Support security incidents involving data recovery

· Align backup and recovery processes with security and compliance controls

· Participate in audits, assessments, and risk mitigation activities


The candidate

Required Skills & Experience

Mandatory Skills

· 10–12 years of experience in backup, recovery, and infrastructure resiliency

· Strong hands-on expertise with:

o Azure Backup

o Azure Recovery Services Vaults

o Azure Site Recovery

· Experience with AWS backup and DR services

· Proven hands-on experience with Commvault backup and restore

· Solid understanding of:

o Backup architectures

o DR strategies and best practices

o RPO/RTO requirements

· Experience managing:

o VM patching

o Compliance validation

---

Nice-to-Have / Added Advantage

· Experience with additional backup tools such as Veeam

· Automation or scripting exposure (PowerShell, CLI)

· Multi-cloud DR strategy experience

· Experience in large enterprise or regulated environments

---

Personal Attributes

· Strong ownership and reliability mindset

· Excellent troubleshooting and analytical skills

· Ability to work cross-functionally with SOC, Resiliency, and Platform teams

· Strong documentation and communication skills

· Proactive, risk-aware approach to infrastructure protection