Site Reliability Engineer
Summary
Site Reliability Engineer at Hamravesh
12/2021 - Present
Created, managed, and scaled high-load Kubernetes clusters with over 100 nodes in different zones and with different
configurations.
Designed and built a feature-rich system that regularly takes consistent backups of all stateful applications running on multiple
Kubernetes clusters in different zones, capable of making tens of thousands of stateful applications on k8s fault-tolerant.
Designed and built an automation solution for fault-tolerant Database Replication and Data Migration between different zones
using sustainable workflows.
Implemented Logging, Monitoring, and Alerting and developed SLOs and SLIs for created and/or maintained projects and services.
Worked closely with web and mobile developers on the implementation of the UI and UX of platform and infrastructure products.
Provided documentation in two languages for both public and internal products, targeting customers and co-workers.
Took part in developing a PaaS on top of pure Kubernetes serving about 2000 users.
Expectations
Sophisticated cloud-based stack and professional co-workers working in a cloud-native environment using the latest technologies based on Kubernetes
Employment Preferences
Relocation destinations:
- Netherlands
- Sweden
- Germany
- Australia
Expected Base Salary
**,000 USD
Academic Degree
Experience
Total Professional Experience
Startup Experience
Big-Tech Companies
Enterprise Experience
Skills
- Relational Databases
- PostgreSQL
- MySQL
- NoSQL Databases
- MongoDB
- Async Programming
- Software Architecture
- Python
- Django
- Django REST Framework
- Java
- TypeScript
- gRPC
- Workflow Management
- Cadence
- Temporal
- Network Programming
- Version Control
- Git
- Test-Driven Development
- Algorithms
- Data Structures
- Linux Systems Administration [LPIC-1, LPIC-2, LPIC-3 Security]
- Networking
- Containers
- CRI-O
- containerd
- Docker
- Podman
- LXC
- Container Orchestration
- Kubernetes
- Helm
- [CKA, CKAD, CKS Equal Knowledge]
- Virtualization
- KVM
- ESXi
- Configuration Management
- Ansible
- Infrastructure As Code
- Terraform
- Microservice Architecture
- CI/CD
- GitHub Actions
- GitLab
- Jenkins
- GitOps
- ArgoCD
- Scripting
- Bash
- Web Servers
- Proxies
- Nginx
- Caddy
- HAProxy
- Traefik
- Monitoring
- Alerting
- Prometheus
- VictoriaMetrics
- Grafana
- Logging
- ELK Stack
- Distributed Tracing
- OpenTelemetry
- SigNoz
- Web Security
- OWASP Framework
- Cryptography
- On-Premise Cloud
- OpenStack [COA Equal Knowledge]
- AWS
- EC2
- EKS
- ELB
- S3
- ECS
- RDS
- DynamoDB
- CloudFront
- GCP
- GCE
- GKE
- Software-Defined Storage
- Ceph
