whoami — platform & ai engineer, london

Rahul Kumar

Building infrastructure that thinks.
Lead Platform Engineer · 4× GCP Certified · CKA · GenAI Leader —
turning complex cloud systems into elegant, scalable platforms.

10+ Years Experience
4× GCP Certified
$15k Monthly Saved
95% Vuln Catch Rate
about

Who I am

I'm a Lead Platform Engineer based in London, currently at Rackspace Technology, with 10+ years building and scaling cloud infrastructure across GCP and AWS.

I lead the Internal Developer Platform (IDP) strategy — abstracting Kubernetes complexity for 6+ product teams, driving SRE adoption with SLOs and error budgets, and delivering real business impact: 25% cloud compute cost reduction ($15k/month) and environment provisioning slashed from 3 days to 15 minutes.

I hold 4× Google Cloud Professional certifications (Architect, DevOps, Security, Developer), a GenAI Leader badge, and CKA. Currently deep into AI/ML engineering: building production agent systems with Google ADK, Gemini 2.5 Flash, Vertex AI, and MCP Toolbox.

Open to senior Platform Engineering, DevOps, and AI/ML Engineer roles. Remote or hybrid, London-based.

rahul@cloud-box: ~
$ cat profile.json
{
  "name": "Rahul Kumar",
  "role": "Lead Platform Engineer",
  "company": "Rackspace Technology",
  "location": "London, UK 🇬🇧",
  "experience": "10+ years",
  "certifications": "4x GCP + CKA + GenAI",
  "focus": [
    "Platform Engineering",
    "AI/ML Engineering",
    "Cloud Infrastructure"
  ],
  "impact": {
    "cost_reduction": "$15k/mo",
    "provisioning": "3 days → 15 min",
    "vuln_catch_rate": "95%"
  },
  "status": "open_to_opportunities"
}
skills

Tech Stack

☁️
Google Cloud Platform
GKE · Vertex AI · BigQuery · Cloud Run · Pub/Sub · Dataflow · Cloud Build · Artifact Registry · VPC
⚙️
DevOps & CI/CD
Terraform · Helm · ArgoCD · GitHub Actions · Jenkins · Ansible · Packer · GitLab CI
🐳
Platform & Containers
Kubernetes · Docker · Istio · Kustomize · Prometheus · Grafana · OpenTelemetry · Fluentd
🤖
AI/ML Engineering
Vertex AI · Gemini 2.5 Flash · Google ADK · BigQuery ML · MLflow · MCP Toolbox · LangChain · Python
🛡️
Security & Networking
Cloud SCC · IAM · Terrascan · VPC SC · Zero Trust · RBAC · mTLS · Cloudflare
💻
Languages & Scripting
Python · Bash · Go · HCL · YAML · SQL · Jinja2
experience

Work History

Lead Platform Engineer
@ Rackspace Technology
Jul 2022 — Present · London, UK
  • Architected the Internal Developer Platform (IDP), abstracting Kubernetes complexity and providing self-service infrastructure primitives for 6+ product teams.
  • Drove SRE adoption across engineering — defined SLOs and error budgets for critical transactional flows, significantly reducing unplanned downtime.
  • Cut cloud compute costs 25% ($15k/month) by optimising Kubernetes resource requests and limits across 50+ microservices.
  • Reduced environment provisioning from 3 days to 15 minutes with a modular Terraform library for standardised VPC and GKE provisioning.
  • Achieved 95% critical vulnerability catch rate pre-deployment by integrating Cloud SCC and Terrascan into CI/CD pipelines.
  • Designed automated data pipelines for Staging/Dev environment hydration with PII masking for GDPR compliance.
  • Implemented end-to-end APISIX API gateway for microservices traffic management, routing, and observability.
  • Building AI agent systems using Google ADK, Gemini 2.5 Flash, Vertex AI, BigQuery, and MCP Toolbox.
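The SLO and error-budget bullet above rests on simple arithmetic, sketched here in Python. This is an illustrative example only: the 99.9% target, the 30-day window, and the function names are assumptions, not figures or code from the work described.

```python
def error_budget_minutes(slo_target: float, window_days: int = 30) -> float:
    """Allowed downtime, in minutes, for an availability SLO over a window.

    slo_target is a fraction such as 0.999 (an assumed example target).
    """
    if not 0.0 < slo_target < 1.0:
        raise ValueError("slo_target must be strictly between 0 and 1")
    total_minutes = window_days * 24 * 60
    return total_minutes * (1.0 - slo_target)


def budget_remaining(slo_target: float, downtime_minutes: float,
                     window_days: int = 30) -> float:
    """Fraction of the error budget still unspent (negative if overspent)."""
    budget = error_budget_minutes(slo_target, window_days)
    return 1.0 - downtime_minutes / budget


if __name__ == "__main__":
    # A 99.9% SLO over 30 days allows roughly 43 minutes of downtime.
    print(round(error_budget_minutes(0.999), 1))
    # After 10.8 minutes of downtime, about three quarters of the budget is left.
    print(round(budget_remaining(0.999, 10.8), 2))
```

A team would typically slow or pause risky releases as the remaining fraction approaches zero; the exact policy is a choice for each organisation.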
Senior Cloud Engineer
@ Rackspace Technology
Apr 2020 — Jun 2022 · Gurgaon, India
  • Managed GCP environments for multiple enterprise customers, maintaining uptime and performance SLAs.
  • Built HTTP/HTTPS/TCP load balancers with backend services, SSL termination, and health checks.
  • Configured IAM roles, permissions, and security policies enforcing least-privilege access across multi-tenant environments.
  • Provisioned on-demand VMs, GKE clusters, Cloud SQL instances, and storage per customer requirements.
  • Set up VPN tunnels connecting on-premises and multi-cloud environments to GCP; maintained firewall rules and VPC configurations.
Project Engineer — DevOps
@ Wipro Ltd
Apr 2017 — Apr 2020 · Pune, India
  • Led transition from traditional SDLC to DevOps practices, improving deployment frequency and release reliability.
  • Configured VPC networks, subnets, firewalls, and load balancers on Google Cloud Platform.
  • Designed and maintained Hadoop infrastructure and data processing pipelines.
  • Deployed applications on App Engine for zero-downtime releases.
projects

Featured Work

🤖
AI Agent System — Google ADK
Multi-agent orchestration system built with Google Agent Development Kit, Gemini 2.5 Flash, BigQuery, and MCP Toolbox. Addresses the token-efficiency challenges that arise when agents carry large tool sets in production environments.
Google ADK · Gemini 2.5 Flash · BigQuery · MCP Toolbox · Python
🧠
Personal Knowledge Engine
AI-powered "second brain" over saved content from Instagram, LinkedIn, and Gmail. Enables semantic search and intelligent retrieval across personal knowledge sources using RAG and Vertex AI embeddings.
RAG · Vertex AI Embeddings · Python · Gmail API
Internal Developer Platform
Self-service IDP for engineering teams — standardised golden paths for service deployment, automated environment provisioning, and integrated observability on GKE with ArgoCD and Backstage.
GKE · ArgoCD · Backstage · Helm · Terraform
🏗️
Cloudflare IaC Recovery System
Terraform-based infrastructure recovery and provisioning system for Cloudflare resources. Migrated to v5 provider schema with Jinja2 template automation for configuration generation at scale.
Terraform · Cloudflare · Jinja2 · Python · HCL
📊
MLOps Pipeline — Vertex AI
End-to-end ML lifecycle pipeline on Vertex AI covering custom training, hyperparameter tuning, model evaluation, registry, endpoint deployment, and continuous monitoring with drift detection.
Vertex AI · Kubeflow Pipelines · BigQuery · Cloud Build
📧
Automated AI Email Digest
Automated workflow using Make.com and the Anthropic API to generate intelligent daily email digests, summarising key content and insights from multiple sources with contextual prioritisation.
Make.com · Anthropic API · Claude · Automation
certifications

Credentials

🏅
Professional Cloud Architect
Google Cloud Platform
🏅
Professional Cloud DevOps Engineer
Google Cloud Platform
🏅
Professional Cloud Security Engineer
Google Cloud Platform
🏅
Professional Cloud Developer
Google Cloud Platform
🤖
Certified GenAI Leader
Google Cloud Platform
☸️
Certified Kubernetes Administrator (CKA)
Cloud Native Computing Foundation
🎯
Professional ML Engineer
Google Cloud Platform
In Progress
blog

Writing

GCP · Cost · Apr 2026 · 8 min read
How I Cut $15k/Month in GCP Costs Without Killing Performance
Real numbers, real techniques. After inheriting 50+ microservices with bloated resource requests, I systematically profiled, right-sized, and restructured our GKE workloads — without a single production incident.
AI/ML · Agents · Mar 2026 · 10 min read
Google ADK + Gemini 2.5 Flash: Building Production AI Agents That Actually Work
Most AI agent demos look impressive and fall apart in production. Here's the architecture, the failure modes I hit, and the patterns that held up under real load — with MCP Toolbox and BigQuery in the mix.
Platform Eng · Feb 2026 · 12 min read
Building an IDP on GKE: The Decisions I Wish Someone Had Warned Me About
We built an Internal Developer Platform from scratch for 6 product teams. The tech was the easy part — the org design, the golden path debates, and the resistance from senior engineers nearly derailed it.
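For the right-sizing work summarised in the GCP cost post above, one common approach is to derive each container's CPU request from a high percentile of observed usage plus some headroom. The sketch below illustrates that idea only; the 95th percentile, the 20% headroom, the rounding step, and the function names are all assumptions, and it is not necessarily the method used in the post.

```python
import math
from typing import Sequence


def percentile(samples: Sequence[float], pct: float) -> float:
    """Nearest-rank percentile of a non-empty list of samples."""
    if not samples:
        raise ValueError("samples must not be empty")
    ordered = sorted(samples)
    rank = math.ceil(pct * len(ordered) / 100)
    return ordered[max(rank, 1) - 1]


def recommend_cpu_request(usage_millicores: Sequence[float],
                          pct: float = 95,
                          headroom: float = 1.2,
                          step: int = 10) -> int:
    """Suggest a CPU request in millicores.

    Takes the chosen percentile of observed usage, multiplies it by a
    headroom factor, and rounds up to the next multiple of `step`.
    """
    target = percentile(usage_millicores, pct) * headroom
    return int(math.ceil(target / step) * step)


if __name__ == "__main__":
    # Hypothetical usage samples (millicores) for a service that
    # currently requests 500m.
    observed = [120, 135, 150, 140, 180, 160, 145, 130, 155, 170]
    print(f"suggested request: {recommend_cpu_request(observed)}m")
```

In practice the usage samples would come from a metrics backend such as Prometheus, and memory requests usually need a more conservative policy because exceeding a memory limit kills the container rather than throttling it.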
contact

Get in Touch

Let's build something great.

Open to senior Platform Engineering, DevOps, and AI/ML Engineer roles — as well as consulting engagements and interesting collaborations. Based in London, available remote or hybrid.
