About the role
Job Summary
We are seeking a highly skilled DevOps Engineer with strong experience in AWS cloud, CI/CD automation, containerization, and Infrastructure as Code (IaC). The ideal candidate will design, implement, and manage scalable, secure, and highly available cloud infrastructure while enabling continuous integration and continuous deployment pipelines. The role involves close collaboration with development, QA, and operations teams to ensure seamless deployments, improved reliability, and optimized cloud performance for applications and services.

Cloud and Infrastructure Management
The candidate will architect and manage highly available and scalable infrastructure on AWS, including services such as EC2, S3, RDS, VPC, IAM, EKS, ELB, and Route53. They will implement Infrastructure as Code using tools such as Terraform, CloudFormation, and Ansible. Responsibilities include managing auto-scaling and load balancing for web and backend applications, standardizing AWS environments across regions to reduce configuration drift, and implementing AWS security best practices including IAM policies, KMS encryption, and security group configurations.

CI/CD and Automation
The role requires designing and implementing CI/CD pipelines using tools such as Jenkins, GitLab CI, and AWS CodePipeline. The engineer will automate build and deployment processes using scripting languages such as Groovy and Python. They will integrate tools like Maven, SonarQube, and Nexus or JFrog into the pipelines and manage release cycles across development, QA, UAT, and production environments. The role also includes configuring Jenkins master-slave architecture and implementing role-based access control.

Containerization and Orchestration
The engineer will deploy and manage containerized applications using Docker and Kubernetes, particularly on Amazon EKS. They will create and maintain Dockerfiles and Docker Compose configurations and manage container registries. Responsibilities include managing Kubernetes resources such as pods, services, ingress, ConfigMaps, secrets, and namespaces, and implementing networking solutions using tools like Flannel, Calico, or WeaveNet.

Monitoring and Reliability Engineering
The candidate will implement monitoring solutions using tools such as Prometheus, Grafana, CloudWatch, Datadog, and Splunk. They will configure alerts and dashboards to maintain high application availability. The role includes conducting root cause analysis, leading post-incident reviews, reducing mean time to recovery through proactive monitoring and automation, and maintaining detailed runbooks and incident response documentation.

Build and Release Management
The engineer will manage build artifacts using tools such as Maven and Ant. They will perform branching, merging, tagging, and release activities using version control systems such as Git, SVN, and Bitbucket. The role involves supporting multiple parallel development streams and handling production deployments and validation processes.

Database and Application Support
Responsibilities include supporting Oracle and MySQL databases, deploying applications on Apache Tomcat and WebLogic, and providing L1 and L2 support for Java and PHP applications. The engineer will troubleshoot logs and coordinate with development teams for issue resolution.