Lebenslauf

Experienced Site Reliability Engineer with a strong background in software engineering and networking, and over 7 years of experience supporting multinational IT organizations. Passionate about building container and cloud-based solutions and applying expertise to data-oriented infrastructure challenges.

Senior DevOps Engineer

Sep 2023 — Present

Caruso Dataplace · Remote

  • Led the migration to a modern, scalable infrastructure using Terraform, Terragrunt, and ArgoCD, establishing repeatable and auditable deployment workflows
  • Used Terraform and Terragrunt to provision both AWS cloud resources (EKS, MSK, RDS Aurora) and production-ready platform tooling (SonarQube, SigNoz, ArgoCD) as fully codified, repeatable deployments
  • Worked closely with software developers and architects to implement and optimize microservices architecture
  • Collaborated with the security team to conduct internal penetration testing and proactively mitigate vulnerabilities
  • Developed and maintained Kubernetes Operators in Go to streamline and automate Day-2 operations across the platform

DevOps Engineer

Mar 2023 — Sep 2023

Cocus AG · Berlin, Remote

  • Contributed to the design and implementation of the Hydra Project, participating in architectural discussions and delivering infrastructure components
  • Implemented cloud infrastructure using CloudFormation, enabling scalable and fully automated deployments
  • Contributed to the lifecycle management of backend services powered by Go
  • Managed and responded to Prisma Cloud security alerts, ensuring timely investigation and resolution
  • Designed and implemented AWS resource architectures to support evolving project requirements

System Development Engineer

Nov 2022 — Feb 2023

Amazon · Germany, Remote

  • Contributed to the design and implementation of the LTT product, taking part in technical planning and delivery discussions
  • Developed end-to-end solutions for LTT project tasks, from initial concept through to production deployment
  • Tested and iterated on code before and after production releases to ensure system reliability and stability
  • Built working relationships with peers, new teammates, and colleagues across the business to drive project success

DevOps Engineer

Jul 2021 — Aug 2023

Cocus AG · Remote

  • Enhanced data filtering and processing of real-time streaming workloads using the NiFi and Kafka platform
  • Used Ansible to provision packages, build Amazon Machine Images, and deploy non-destructive changes using dynamic inventories. Leveraged Terraform to provision and maintain infrastructure as code
  • Supervised the deployment of core application stack software within the DevOps toolchain, improving overall system reliability
  • Improved key performance indicators for incident and change management, leading to measurable gains in customer satisfaction

System Engineer

Feb 2019 — May 2021

Unbelievable Machine GmbH (*um) · Berlin

  • Leveraged the Ambari platform to automate Hadoop and Kafka cluster deployments and monitor their operational status
  • Used Ansible and Puppet frameworks to automate cloud deployments through custom scripts and workflows
  • Supervised and deployed core application stack software for load balancer systems, ensuring high availability and reliability
  • Improved KPIs for incident and change management, contributing to an increase in customer satisfaction scores

System Administrator

Apr 2016 — Oct 2016

PersianGig · Tehran

  • Used the OpenStack platform to automate the deployment of virtual private servers and monitor their operational status
  • Leveraged Puppet frameworks to automate cloud deployments through custom scripts and workflows
  • Supervised and deployed core application stack software for database and web server systems
  • Provided technical support to resolve customer issues and served as their primary point of contact for billing and general inquiries
  • Improved incident management KPIs, contributing to higher customer satisfaction scores

Network Engineer

Jan 2015 — Mar 2016

Homa Telecom · Tehran

  • Collaborated with a cross-functional team of engineers while maintaining close engagement with leadership and business stakeholders to drive innovation and growth
  • Configured and launched a comprehensive networking stack, including routers, load balancers, and firewalls
  • Optimized wireless networking equipment to achieve maximum throughput in a highly congested radio frequency environment
  • Designed and deployed a comprehensive network and hardware monitoring system using the Zabbix platform, capable of tracking and reporting a wide range of operational metrics
  • Improved network connectivity stability and backbone redundancy, reducing downtime and increasing resilience

Languages

Python Go JavaScript SQL Bash Java

Frameworks

Django Flask LangChain Serverless Framework

Tools

Kubernetes ArgoCD Terraform Spark Kafka GitLab SonarQube Galera PostgreSQL CloudFormation Ansible NiFi

Platforms

Linux AWS Azure IBM Cloud Web Arduino Raspberry Pi

Soft Skills

Event Management Technical Writing Public Speaking Time Management

Bachelor of Software Engineering

2005 – 2010

Azad University of Shiraz · Shiraz, Iran

Courses: Operating Systems, Data Structures, Algorithms, Programming Models, Networking, Databases

  • Developing on AWS (AWS, 2022)
  • Google Certified Associate Cloud Engineer (Google, 2021)
  • Certified Kubernetes Administrator (CNCF, 2020)
  • DevOps Tools Engineer LPI-701 (LPI, 2020)

Streamzilla

2021–2022

Porsche AG

A one-stop platform for all data streaming needs within Porsche AG. An internally managed service enabling engineering product teams to build and run applications leveraging the low latency, high throughput, and fault tolerance of Apache Kafka and Apache NiFi. Designed with a cloud-agnostic approach to be highly scalable and deployable across cloud, hybrid cloud, and on-premise environments.

Legacy Database Migration

2022

Fidor Bank

Migrated a legacy RDBMS cluster processing millions of daily transactions to a modern data warehouse. The new platform used an Infrastructure as Code approach with Ansible, supporting deployment across public and private cloud providers. Data consistency was ensured using Galera, while ProxySQL separated read and write nodes to prevent split-brain scenarios.

UMCP

2020–2021

Unbelievable Machine

A managed private cloud platform for B2B customer services. Enabled engineering product teams to build and run applications leveraging the scalability and high availability of Red Hat OpenShift and Knative for microservices workloads. Fully automated using the Ansible framework to support configurable, large-scale management.

CNBDP

2019–2021

BMW AG

A managed big data solution for data processing within BMW AG. Enabled engineering product teams to build and run applications for high-throughput data workloads. Hundreds of petabytes of data were stored on HDFS, and hundreds of gigabytes were ingested and streamed daily using Hadoop ecosystem tools including Spark, Hive, HBase, and Oozie. Adopted an Infrastructure as Code approach using Ansible and Ambari to ensure scalability across any infrastructure.

  • 1st Runner-Up at OpenStack Hackathon, Berlin — 2019
  • 3rd Runner-Up at DCI Hackathon, Berlin — 2018

Brave Ambassador

Jan 2020 – Present

Brave · Global, Online

Organized events, conducted workshops, and delivered technical sessions reaching over 1,000 developers.

Tutor

Jan 2019 – Present

ReDI School · Berlin, Germany

Delivered online and in-person technical and soft-skills training to over 200 students.

English Fluent
German Fluent
Persian Native