Mar 2025 — Present
Sr. DevOps / SRE Engineer
· Which?
Outside IR35
London
Media & Publishing
Team of 7
B2C & B2B
StackAWS (EKS/ECS, EC2, VPC, ALB, RDS/Aurora, IAM, CloudWatch, Lambda, OpenSearch) · Terraform · CloudFormation · Kubernetes · ArgoCD (GitOps) · Prometheus · Grafana · Bitbucket · Jenkins · Bash / Python / Go · Snowflake · Databricks.
Responsibilities
- Greenfield setup of scalable, secure AWS infrastructure using IaC (Terraform / CloudFormation).
- Implemented observability with Prometheus, Grafana and CloudWatch — actionable runbooks and SLOs.
- Drove automation and GitOps (ArgoCD / Kubernetes); standardised CI/CD with GitHub.
- Supported incident response and escalation; collaborated with AI / platform teams on reliability and security hardening.
- Databricks Lakehouse to build agentic AI systems reasoning over real-time and historical data.
- Implemented Snowflake to unify structured consumer, pricing and behavioural datasets for governed BI / AI insights.
Achievements
- Built an agentic DevOps copilot (Claude + tools) that triages incidents from logs / alerts, proposes fixes and opens validated PRs — cutting MTTR by 20%.
- Defined and delivered Golden Path dev → prod environments as code, enabling repeatable, self-service provisioning.
- Embedded Golden Path observability standards: monitoring, alerting, ownership and runbooks to accelerate detection and resolution.
- Streamlined the release flow by minimising manual steps via GitOps automation.
- Production-grade platform for B2C / B2B workloads with built-in security baselines (least-privilege IAM, network policies, secrets management).
Mar 2024 — Nov 2024
DevOps / SRE Engineer
· Ticketmaster
London
Entertainment
Team of 4
B2C Retailer
StackAWS (RDS PostgreSQL, Aurora, Redis, ECR, EC2, VPC, ELB, IAM, EKS, ECS, CloudWatch, CloudFormation, SSM, OpenSearch, Lambda, Kinesis, EventBridge) · Terraform · GitLab · Nginx · Bash · YAML · Angular (TypeScript) · Java (Spring Boot) · Python · Go · Fastly · Prometheus · Grafana · Snowflake · Databricks.
Responsibilities
- Continuously increased monitoring, alerting and logging coverage; standardised environments and pipelines.
- Managed Prometheus alerts, supported Grafana dashboards, wrote step-by-step runbooks for every alert.
- Led DevOps team development on GitOps tooling (ArgoCD + Kubernetes); ran grooming, stand-ups and PR reviews.
- Conducted ongoing CI/CD pipeline performance analysis to find bottlenecks.
- Supported Snowflake usage for large-scale customer / transaction analytics — focus on performance and governance.
- Implemented Databricks workflows for data engineering and downstream AI / ML experimentation.
Achievements
- Standardised environments and pipelines across 200 projects — 50% reduction in configuration errors.
- Reduced CI/CD build and deployment times by 40%, improving delivery speed.
- Aligned with Kubernetes migration team to transition critical projects to modern infrastructure with improved scalability.
May 2023 — Mar 2024
DevOps Engineer
· MindGym
London
EdTech
Team of 29
B2B SaaS
StackAWS (RDS, Aurora, Redis, ECR, EC2, VPC, ELB, IAM, EKS, ECS, CloudWatch, Route 53, CodeDeploy / Pipeline / Build, CloudFormation, SSM, OpenSearch, Lambda, Kinesis, EventBridge) · Terraform · GitHub · Bash · Python · Jupyter · Mailchimp · Angular · Java · Go · HubSpot · Backstage · Prometheus · Grafana · PowerBI · Metabase.
Responsibilities
- Increased monitoring / alerting / logging coverage; managed Prometheus alerts and Grafana dashboards with step-by-step runbooks.
- Led DevOps team for GitOps tooling (ArgoCD, Kubernetes), facilitated grooming and stand-ups, owned budget and platform cost optimisation.
- Conducted performance analysis of Go, Node.js and Python apps — identified concurrency anti-patterns and blocking I/O.
Achievements
- Reduced MTTR by 40% through monitoring tools, dashboards and DR / IR processes.
- Improved system reliability with Ansible + modular IaC — 60% decrease in system issues.
- Optimised AWS cloud costs by 30% across 25+ AWS accounts and environments.
Oct 2022 — Mar 2023
Sr. DevOps / Platform Engineer
· PetLab Co.
London
E-Commerce
Team of 15
B2C Online Retail
StackAzure · AWS (Aurora, RDS, Redis, ECR, EC2, ECS, Elastic Beanstalk, SAM Serverless, VPC, ELB, CloudWatch, Route 53, CodePipeline / Build / Deploy, CloudFormation, IAM, SSM, OpenSearch, Lambda, Kinesis, EventBridge) · Terraform (reusable) · Bitbucket · Nginx · Bash · PHP (Symfony) · Python · Klaviyo · Mailchimp · Angular · Stripe · Stitch.
Oct 2020 — Sep 2022
Sr. DevOps / SRE Engineer
· SeedLegals
London
LegalTech / FinTech
Team of 20
B2B SaaS
StackAWS (RDS PostgreSQL, Redis, CodePipeline, ECR, EC2, ECS, Elastic Beanstalk, SAM, VPC, ELB, CloudWatch, Route 53, CloudFormation, IAM, SSM, OpenSearch, Lambda, EventBridge) · Terraform · CDK (TypeScript) · GitHub · CircleCI · CloudFlare · GoDaddy · Tableau · Nginx · Python · Mailchimp · Angular · Spring Boot · WPengine · Intercom · HubSpot · FusionAuth · Stripe · Stitch.
Responsibilities
- Spearheaded confidential security initiatives — vulnerability assessments and countermeasures.
- Implemented Terraform across demo and multi-environment infrastructure for scale and consistency.
- Led the DevOps team with autonomy; enhanced CI/CD pipelines to slash deployment time.
- Built monitoring systems to track performance, find bottlenecks and proactively address issues.
Achievements
- Performance analysis of Java apps — fixed concurrency anti-patterns and blocking I/O — 2× to 10× performance gains.
- CI/CD pipeline for IaC delivered 40% reduction in infrastructure problems.
- Implemented security best practices ensuring compliance with industry standards.
Nov 2018 — Oct 2020
Sr. DevOps Consultant
· NowaSys
London / Lahore
Software
Team of 10
B2B Analytics Consultancy
StackGCP · Azure · AWS (EMR Spark, Kafka, Keyspaces, Kinesis, RDS, ElastiCache Redis, ECR, Linux2, CloudTrail, VPC, ELB, CloudWatch, Route 53, CodePipeline, CloudFormation, IAM, ECS, EC2, ElasticSearch) · GitHub · Jenkins · Nginx · Bash · Python · Docker · SonarQube · AngularJS · ReactJS · VueJS · Spring Boot · Django · Laravel · ActiveMQ · Terraform.
Responsibilities
- Managed a diverse multi-cloud environment, ensuring interoperability across services and platforms.
- Collaborated with development teams using GitHub for version control and stakeholder workflows.
- Supported diverse stacks — frontend (Angular / React / Vue) and backend (Spring Boot, Django, Laravel).
Achievements
- Introduced Docker — 25% reduction in deployment-related issues.
- Established monitoring / logging — 15% reduction in system downtime.
- SonarQube quality gates enforced via pipelines — failing builds when conditions weren't met.
- Assisted 30+ clients in deploying analytics systems with a 100% success rate.
Dec 2015 — Nov 2018
Head of DevOps
· Hybytes
London / Lahore
Software
Team of 25
B2B IT Consultancy
StackGCP · Azure · AWS · Terraform · Kubernetes · Docker · GitHub · JIRA · Confluence · Jenkins · PaloAlto · Ansible · Kafka · Cassandra · Chef Solo · Puppet · Bash · Python · PHP · Nagios · Zabbix · NewRelic · Datadog · Kibana.
Responsibilities
- Communicated DevOps strategy, achievements and challenges to key stakeholders.
- Identified training needs and ran skill-development programmes for the DevOps team.
- Evaluated and managed external vendor relationships for DevOps services and tools.
- Owned budgets for infrastructure, tools and team — optimised costs without compromising quality.
Achievements
- Presented technical solutions effectively to both technical and non-technical audiences.
- Improved company performance through capacity and resource planning.
- Built a highly motivated team and department supporting business growth.
Oct 2014 — Nov 2015
Team Lead — Cloud
· AdMaxim
Lahore
AdTech
Team of 15
B2B Online Marketing
StackRackSpace · AWS · Bash · Local DC (ESXi, pfSense, Windows Server 2008, NFS) · Dell hardware · Hadoop · RTB · Kafka · Druid · Tracker clusters · MySQL · MongoDB · Nagios · Grafana.
Responsibilities
- Led migration from PHP monolith to microservices.
- Managed platform budgets and optimised resource utilisation.
- Guided the dev team toward DevOps practice and streamlined workflows.
- Managed cross-functional team of devs, testers and sysadmins.
- Maintained comprehensive technical documentation and architecture diagrams.
Achievements
- Redesigned deployment architecture — 30% reduction in downtime.
- Improved team productivity by 25% through training on new tech and best practices.
- Reduced deployment time by 50% through automation.
- Implemented automated monitoring for 400 servers using Nagios and Bash.
- Migrated from Rackspace to AWS with spot instances — costs went from $135K → $35K (72% reduction).
Jan 2010 — Oct 2014
IT Operations Manager
· Hybytes
Lahore
Software
Team of 10
B2B IT Consultancy
StackDigital Ocean · AWS · Bash · Local Data Center (ESXi, pfSense, Windows Server, NFS) · Nagios · Cisco routers / switches · HP ProLiant DL360 · DNS · DHCP · Active Directory · Samba 4 · NFS · WAN/LAN.