Pushpander Singh — Backend Engineer

SHIPPED ✓

SELECTED PROJECTS

// stuff that actually shipped to production

LIVE · SOLO BUILD

Pitstop

Road trip planner for Indian highways that breaks long drives into smart, vehicle-aware checkpoints. Solo-built backend, frontend, design, and infra.

▸Hash-based route caching with ~95% hit rate (~Rs.0.20 per cached trip)
▸3-layer cost defense to keep Google Maps spend flat at scale
▸PostGIS spatial layer filtering 1,300+ checkpoints within 5km of route polyline in sub-100ms
▸Vehicle range + fuel level aware planning logic

NestJSPostgreSQL + PostGISRedisNext.jsGoogle MapsMapbox

LIVE ↗

LIVE · SOLO BUILD

Burrow

Corporate-verified flatmate finder for Gurgaon. Trust-first model: verify your work email, then list a room or browse listings. Solving the 'is this person actually who they say they are' problem in flat-hunting.

▸Email-domain verification gates the platform for listing and messaging
▸Dual listing model: people with flats + people seeking flatmates
▸Progressive data collection to reduce signup friction
▸Built on Neon Postgres + Upstash Redis + Cloudflare R2 for cost-flat scaling

NestJSNext.js 14PostgreSQL (Neon)Redis (Upstash)R2Google MapsGCP

LIVE ↗

LIVE · SOLO BUILD

SafeDate

Date planning + safety app for Indian metros. Discover verified date venues, share live check-ins with trusted contacts, and get nudged to confirm safety mid-date.

▸Zero-cost venue pipeline: OpenStreetMap + Foursquare + Unsplash
▸Auto check-in nudges over SMS/push at user-defined intervals
▸1,000+ pre-seeded venues across Delhi NCR, Mumbai, Bangalore at launch
▸LinkedIn outreach drove first 200 users without a marketing budget

Next.jsPostgreSQLSocket.ioNestJSRedisOSM APIFoursquare API

WORK HISTORY

// professionally caffeinated since 2021

Alt Mobility

Software Engineer

Feb 2024 – Present · Delhi, India

▸Architected event-driven inventory module for in-house GMS using RabbitMQ for bidirectional sync with third-party IMS. Used Postgres row versioning with optimistic locking plus Strategy/Factory adapters for multi-tenant orchestration. Cut operational cycle from 4 days to 1.5 days.
▸Built real-time telemetry pipeline processing 1M+ events/day from 1,000+ EVs using Node.js, RabbitMQ, and Postgres. Added retry + dead-letter flows to reduce data loss from 15% to under 2%.
▸Designed PCI-DSS compliant payment system handling 2,500+ monthly transactions via Easebuzz webhooks. Automated reconciliation reduced payment failures from 12% to 3%.
▸Stack: NestJS · TypeScript · Postgres · Redis · RabbitMQ · Kafka

Prodo Technologies

Software Engineer

Apr 2023 – Jan 2024 · Gurugram, India

▸Built push notification system with Node.js, NestJS, and OneSignal for segmented campaigns to 10K+ users, improving CTR by 30% vs broadcast baseline.
▸Implemented bidirectional sync between production DB and Zoho CRM via Kafka, processing 50K+ records/day with exactly-once delivery semantics.
▸Added multi-currency support to payment flows using Currency Beacon API in Spring Boot, enabling 3 new international markets.
▸Stack: NestJS · Spring Boot · Node.js · Kafka · MongoDB

Zapit.io

Backend Developer (Intern)

Mar 2023 – Apr 2023 · Remote

▸Built RESTful APIs for UPI liquidity provider dashboard using Node.js, Express, and MongoDB to support real-time monitoring for 10+ financial partners.
▸Implemented gRPC pub/sub for inter-service communication, reducing event delivery latency by 60% vs HTTP baseline.
▸Stack: Node.js · Express · MongoDB · gRPC

Prodo Technologies

Backend Developer (Intern)

Dec 2022 – Feb 2023 · Gurugram, India

▸Built internal analytics dashboard APIs with NestJS + MongoDB, integrating Zoho CRM data for sales workflows.
▸Added Redis caching with TTL invalidation, reducing API response times by 50%.
▸Created S3 image optimization pipeline (PNG to base64 with ~70% compression), reducing bandwidth costs proportionally.
▸Stack: NestJS · MongoDB · Redis · AWS S3

STRESS RELIEF

// when production catches fire, stay calm. unlike michael.

🔥

“How I handle incidents, post-mortems, and 3 AM PagerDuty calls.”

🔥

URGENT

SEV1INC-001TTD: 8mTTR: 42m

STRESS

9/10

Payment reconciliation pipeline silently dropped 3% of Easebuzz webhook events

WHAT HAPPENED

Our reconciliation job stopped marking ~3% of successful Easebuzz transactions as settled. Payments completed on gateway, but our system kept them as pending and users received duplicate reminders.

IMPACT

~75 transactions/day were misclassified for 3 days before alerts. Around Rs.4.2L looked unreconciled, support tickets rose by ~40%, and user trust took a hit even though no money was actually lost.

ROOT CAUSE

Easebuzz changed webhook field casing in a minor SDK update (`paymentStatus` to `payment_status`). Our parser lacked fallback mapping; failures were logged but not alerted, so signal got buried.

WHAT I DID

Cut off auto-retry reminders to stop spamming users.
Ran a one-off backfill script over raw webhook payloads from S3 archive and reconciled against Postgres.
Added PagerDuty schema-validation alert if parse failures exceed 0.5% in any 5-minute window.
Published customer communication through support within 2 hours of resolution.

WHAT WE LEARNED

Parse-failure logs without alerting create fake confidence. Every external integration now has a parse-failure SLI with a hard threshold.

SEV2INC-002TTD: 2mTTR: 18m

STRESS

6/10

Webhook retry storm DDOSed our own service during a partner outage

WHAT HAPPENED

A downstream partner API went down for 11 minutes. Our unbounded retry logic stacked retries and then burst 12K queued attempts in 90 seconds when service recovered.

IMPACT

Partner-dependent flows saw 18 minutes of effective downtime. RabbitMQ queue depth spiked 40x; there was no data loss but recovery extended due to self-inflicted throttling.

ROOT CAUSE

Retry strategy had exponential backoff but no jitter, no max attempts, and no circuit breaker. Failures kept amplifying traffic while dependency remained down.

WHAT I DID

Paused webhook consumers in RabbitMQ to stop storm expansion.
Drained dead-letter queue to a holding table for controlled replay.
Restored traffic gradually (10% to 50% to 100% over 15 minutes).
Shipped circuit breaker with bulkhead + jitter; capped retries at 5.

WHAT WE LEARNED

Retries without circuit breakers amplify dependency outages. Bulkheads and bounded retries are now defaults for partner integrations.

URGENT

SEV1INC-003TTD: 31mTTR: 47m

STRESS

10/10

Multi-tenant adapter leaked partial inventory data across tenants for 47 minutes

WHAT HAPPENED

A routing refactor in our GMS inventory adapter caused some `getInventoryByLocation` reads to resolve against the wrong tenant schema. Tenant A could briefly see line items from Tenant B.

IMPACT

6 tenants were affected and around 200-300 line items were exposed read-only. No write corruption occurred, but disclosures were required for enterprise clients plus a security review.

ROOT CAUSE

Tenant context moved from request scope to a singleton cache of last seen tenant for perceived performance gains. Under concurrency, stale tenant IDs leaked across requests on the same Node worker.

WHAT I DID

Reverted the offending commit immediately after pattern confirmation in logs.
Extracted affected request IDs and mapped exposure per tenant.
Partnered with compliance to draft disclosures for impacted enterprise tenants.
Replaced singleton helper with AsyncLocalStorage request scoping.
Added integration test: 200 concurrent requests across 5 tenants with zero cross-contamination assertion.

WHAT WE LEARNED

Tenant context is security-critical state and must remain request-scoped. Any performance optimization touching tenant scoping now goes through security-sensitive review.

SEV3INC-004TTD: 12mTTR: 90m

STRESS

4/10

RabbitMQ queue grew unbounded during a traffic spike from fleet sync

WHAT HAPPENED

A scheduled bulk fleet sync started during a manual backfill. Producers outran consumers and telemetry queue depth climbed from ~500 to ~480K in about 25 minutes.

IMPACT

No data loss due to durable RabbitMQ queues, but consumer lag reached 22 minutes and broker disk usage rose from 18% to 71%. Real-time dashboards lagged for roughly 90 minutes.

ROOT CAUSE

Two heavy workloads collided against consumers sized for steady-state traffic. Producer-side backpressure and queue-depth autoscaling were missing.

WHAT I DID

Scaled consumer deployment 3x to process backlog faster.
Paused backfill job until spike passed.
Monitored broker disk closely and prepared failover if usage crossed 85%.
Added producer-side rate limit for bulk sync and queue-depth autoscaler for consumers.

WHAT WE LEARNED

Steady-state capacity planning is not enough for bursty systems. Backfills now avoid peak fleet-write windows and rate limits protect queue health.

INCIDENT RESPONSE
PLAYBOOK

✓Acknowledge alert within 5 min
✓Communicate in #incidents Slack
✓Mitigate before finding root cause
✓Page in help early, not late
✓Blameless post-mortem within 48h
✓Track action items to completion

WHAT FOLKS SAY

// kind words from people I've worked with

“I have been here for 18 years. He's good. Now leave me alone.”
— Stanley

“

I've worked with Pushpander across two companies now and the pattern's the same: he picks up ambiguous, half-spec'd backend problems and ships them clean. The inventory module he architected at Alt is the kind of thing most engineers would over-engineer or under-think - he somehow does neither. Also, he asks the right questions in design reviews. The annoying right questions.

Mehul

SDE-3 @ Alt Mobility

“

Working alongside Pushpander on the GMS rollout was the calmest cross-team launch I've been part of. He sweats the unsexy stuff - schema versioning, retry semantics, dead-letter handling - which means by the time you're on call, there's nothing weird to figure out at 2 AM. Solid teammate, faster than he gives himself credit for.

Shailesh Kumar

SDE-2 @ Alt Mobility

“

Pushpander and I built Verinet - a corporate-verified social platform - together over a few months, and I learned more about backend system design from him in that window than from any course. He thinks in trade-offs, not absolutes. Want a Postgres-vs-Redis call broken down with actual reasoning? He'll give it to you in five minutes flat.

Abhinav Ashish

DAA @ ZS

“

Knew Pushpander since college and watched him grow through Prodo and Alt Mobility. What's stayed consistent: he's curious about systems other people consider boring - payment reconciliation, queue ordering guarantees, telemetry pipelines. That's the rarest quality in a backend engineer. The boring infra is where the real money and real failures live.

Prateek Jha

Software Engineer @ Zomato

HEY, I'MPUSHPANDER.

SELECTED PROJECTS

Pitstop

Burrow

SafeDate

WORK HISTORY

Alt Mobility

Prodo Technologies

Zapit.io

Prodo Technologies

STRESS RELIEF

Payment reconciliation pipeline silently dropped 3% of Easebuzz webhook events

WHAT HAPPENED

IMPACT

ROOT CAUSE

WHAT I DID

WHAT WE LEARNED

Webhook retry storm DDOSed our own service during a partner outage

WHAT HAPPENED

IMPACT

ROOT CAUSE

WHAT I DID

WHAT WE LEARNED

Multi-tenant adapter leaked partial inventory data across tenants for 47 minutes

WHAT HAPPENED

IMPACT

ROOT CAUSE

WHAT I DID

WHAT WE LEARNED

RabbitMQ queue grew unbounded during a traffic spike from fleet sync

WHAT HAPPENED

IMPACT

ROOT CAUSE

WHAT I DID

WHAT WE LEARNED

INCIDENT RESPONSEPLAYBOOK

WHAT FOLKS SAY

TECH STACK

LANGUAGES

BACKEND

DATA

MESSAGING

INFRA

OBSERVABILITY

HIRE ME

FREELANCE? FULL-TIME?JUST A CHAT?

WRITING

Why I went from Spring Boot to NestJS - and stayed

Reconciling 2,500 payments/month without losing your mind

Building a 1M event/day telemetry pipeline on a single RabbitMQ cluster

95% cache hit rate on Google Maps without breaking the budget

HEY, I'M
PUSHPANDER.

INCIDENT RESPONSE
PLAYBOOK

FREELANCE? FULL-TIME?
JUST A CHAT?