Aashish Bajpai — Senior Performance Engineer
Senior Performance Engineer

SRE · DevPerfOps · Bengaluru, India

Aashish Bajpai

9+ years turning test signals into SLO-grade confidence — at VMware, EY, Oracle & beyond.

My Story

Full-stack performance and reliability engineer with ~9 years of expertise in performance optimisation, software development & site reliability engineering. Problem solver, research oriented & builder by attitude — able to own projects end-to-end and translate business requirements into mature solutions.

At PerfSage, I build open-source tools and patterns that make performance engineering faster to execute and easier to understand. I also write at thinkbits.org, sharing practical insights and battle-tested lessons from the field.

By the Numbers

0
Years XP
0
Companies
0
Certifications
0
Incidents Learned

Technical Skills

Apache JMeter95%
API & Load Performance Testing93%
SRE / SLO / SLI Engineering90%
Python & Automation (Locust, Terraform)88%
AWS & Cloud Infrastructure87%
Kubernetes & Container Orchestration85%
Chaos Engineering & Fault Injection82%
Observability (Datadog, Grafana, Prometheus)80%

Experience Timeline

VMware Current
Jun 2021 – Present · Bengaluru

Senior Member of Technical Staff — SRE & Dev

  • Led CDN migration from Imperva ➔ AWS (CloudFront + WAF + Shield) — 30% cost saving, 99.99% availability
  • Performance & chaos testing for a Kubernetes multi-tenant identity platform serving SSO across all VMware cloud
  • Automated 200+ performance tests as-a-service; built Terraform scripts reducing manual infra ops by 80%
Ernst & Young (EY) Dec 2020 – Jun 2021 · Bengaluru

Site Reliability Engineer

  • Onboarded & optimised performance for 15+ EY cloud-native microservices as SRE-as-a-Service
  • Defined SLO/SLI & error budgets; resolved 15+ production incidents as sole on-call engineer
  • Built “Synthetic Tests as a Service” tool — reduced manual testing effort by 70%
Oracle May 2020 – Dec 2020 · Bengaluru

Performance Engineer — OCI

  • Owned production performance & reliability incidents for multiple Oracle Cloud Infrastructure services
  • Leveraged data analytics to predict architecture/app issues from historical incident patterns
  • Capacity planning for “Chat Bot as a Service” and blameless post-mortem process rollout
Infor Aug 2018 – May 2020 · Ahmedabad

Performance Engineer

  • Slashed BI dashboard load time from >50 s to ~5 s through major architectural changes
  • Built Ruby/JMeter framework for automated script generation; monitoring with Grafana, Datadog & CloudWatch
  • Reduced false positives in performance strategy, directly improving ROI and production predictability
Cavisson Systems Mar 2016 – Aug 2018 · Noida

Performance Engineer

  • Full-spectrum benchmarking: microservices, databases, messaging queues, app containers (JBoss, Tomcat)
  • JVM tuning reduced average CPU by >10%; UI performance optimisation for Macy's production (Akamai mPulse, IBM TeaLeaf)
  • Resilience testing on AWS: Auto Scaling, Load Balancing, CloudFront — earned AWS SA Certification

Tools I Use

JMeter
Load Testing
Python
Locust / Scripts
k6
Load Testing
Kubernetes
Orchestration
Docker
Containers
PostgreSQL
Databases
Datadog
Observability
Grafana
Dashboards
Prometheus
Metrics
Terraform
IaC
eBPF
Kernel Tracing
Linux
Systems