Manage and maintain AWS-based server infrastructure for the website, including EC2, S3, RDS, Lambda, and networking components.
Ensure high availability, reliability, and uptime (target 99.99%) of web services and infrastructure.
Automate server deployment, configuration, backups, and monitoring using infrastructure as code tools such as Terraform, CloudFormation, or Ansible.
Implement and manage CI/CD pipelines for rapid and safe code deployment, using tools like Jenkins, GitLab CI, or AWS CodePipeline.
Monitor system performance; troubleshoot and resolve crashes, downtime, or reliability issues swiftly.
Ensure security and compliance for servers and cloud resources by managing IAM policies, encryption, and regular vulnerability assessments.
Collaborate with development teams to optimize deployment workflows, automate processes, and support smooth releases.
Design and implement effective backup and disaster recovery strategies.
Stay updated with new AWS features/services to improve performance, reliability, and cost-effectiveness of cloud infrastructure.
Bachelor’s degree in Computer Science, Information Technology, or related field.
1+ years of experience with DevOps practices, server administration, and automation in an AWS environment.
Hands-on experience with Linux/Unix server management.
Proficiency in AWS cloud platform, including EC2, S3, RDS, Lambda, VPC, CloudFormation, and IAM.
Experience with automation/configuration management tools like Terraform, Ansible, Chef, or Puppet.
Solid scripting skills with Bash, Python, or similar languages.
Understanding of networking fundamentals, load balancers, and DNS in AWS cloud setup.
Familiar with monitoring tools such as CloudWatch, Prometheus, Nagios, or the ELK stack.
Good troubleshooting, analytical, and communication skills.
Linux/Unix administration
AWS cloud services (EC2, S3, RDS, Lambda, VPC)
Docker and Kubernetes
CI/CD tools and practices
Scripting (Bash, Python)
Infrastructure as Code (Terraform, CloudFormation)
Monitoring and alerting tools