AI Engineer Portfolio — Prince Singh | LLM, RAG, Full Stack, Cloud, DevOps

AI/ML Engineer | LLM Engineer | RAG Developer | Full-Stack Engineer | Founding Engineer | Cloud & DevOps Engineer

Prince Singh is an AI Engineer, Full-Stack Developer, and Founding Engineer with expertise in modern AI systems, Large Language Models (LLMs), Retrieval-Augmented Generation (RAG), LangChain, vector databases, multi-agent systems, cloud computing, DevOps, scalable architectures, and end-to-end product engineering. His portfolio represents real-world engineering experience across AI, ML, full-stack development, and high-performance web applications.

Large Language Model (LLM) Engineering

Prince builds advanced LLM workflows including custom prompts, embeddings, hybrid search, token optimization, context building, and production-grade inference systems. He works with OpenAI, GPT models, LangChain, and vector stores to create intelligent and scalable AI applications.

RAG Pipeline Engineering

Expertise includes document chunking, embeddings generation, semantic search, ChromaDB, Pinecone, context ranking, vector search optimization, and end-to-end RAG pipelines used in production environments with low latency and high accuracy.
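The core retrieval step behind such a pipeline can be sketched in a few lines of TypeScript. This is an illustrative sketch under stated assumptions, not production code: the `retrieve` helper, its top-k and 0.25-threshold defaults, and the toy two-dimensional embeddings are invented for the example; real embeddings would come from a model such as text-embedding-ada-002, with a vector store like ChromaDB or Pinecone doing the indexing.

```typescript
// Minimal semantic-search core: rank document chunks by cosine similarity
// against a query embedding, keep only chunks above a quality threshold.

function cosineSimilarity(a: number[], b: number[]): number {
  let dot = 0, normA = 0, normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

interface Chunk { id: string; embedding: number[] }

// Return the top-k chunks above the similarity threshold, mirroring the
// quality-gating step of a RAG retriever.
function retrieve(query: number[], chunks: Chunk[], k = 3, threshold = 0.25): Chunk[] {
  return chunks
    .map(c => ({ chunk: c, score: cosineSimilarity(query, c.embedding) }))
    .filter(r => r.score >= threshold)
    .sort((x, y) => y.score - x.score)
    .slice(0, k)
    .map(r => r.chunk);
}
```

The threshold drops off-topic chunks before they ever reach the prompt, which is the cheapest place to fight hallucination.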

Agentic AI & Multi-Agent Systems

Designs autonomous agents capable of tool calling, reasoning, planning, workflow execution, code generation, debugging, research automation, and contextual problem solving powered by multi-step LLM reasoning and memory components.

AI Product Engineering

Prince has built AI-powered platforms like RoadmapAI, AskAI, CodeLLM, contextual AI code editors, peer-to-peer AI tools, and intelligent coding assistants that combine full-stack engineering with advanced LLM capabilities.

Full-Stack Engineering (React, Next.js, Node.js, TypeScript)

Skilled in Next.js, React.js, Node.js, Express.js, TypeScript, MongoDB, PostgreSQL, Redis, REST APIs, GraphQL, WebSockets, authentication systems, SSR/ISR, and building responsive and scalable frontend and backend applications.

Cloud Engineering & DevOps

Hands-on experience with AWS, Docker, Kubernetes, CI/CD pipelines, GitHub Actions, EC2, S3, load balancing, scaling APIs, containerization, microservices, observability, and high-performance deployments optimized for millions of requests.

System Design & Architecture

Expertise in designing scalable distributed systems, real-time architectures, caching layers, pub/sub messaging, event-driven systems, serverless functions, and fault-tolerant engineering used in modern SaaS products.

Competitive Programming & DSA

Solved 5000+ DSA problems across LeetCode, GFG, CodeStudio, InterviewBit, and HackerEarth. Strong foundation in algorithms, data structures, problem solving, and coding interviews. Ranked in top competitive programming brackets with global achievements.

Founding Engineer Experience

Experienced as a Founding Engineer owning end-to-end product development, architecture, feature planning, user-facing engineering, backend optimization, LLM integrations, cloud deployments, reliability engineering, and building products at startup speed.

Remote AI Engineer | Global Collaboration

Proven track record working with international teams, remote-first startups, and cross-timezone engineering environments. Experienced in delivering scalable and clean engineering solutions in distributed teams.

Software Engineer Portfolio

This portfolio reflects expertise in frontend engineering, backend API development, AI systems, cloud pipelines, microservices, scalable infrastructures, and high-quality modern applications designed with user-first engineering.

Developer Portfolio

Explore work spanning AI engineering, machine learning projects, full-stack applications, intelligent tools, design systems, SaaS products, open source contributions, and real-world production code used by thousands of users.

Hello 👋

Prince Singh | Founding Engineer

Looking for a Switch
Interview Ready

Founding Engineer & AI Architect @ProPeers | Ex-SDE @CloudConduction | 2 YOE | Mentor @ProPeers & @Topmate.io | Building Agentic AI & LLM Systems | MERN + DevOps + Scalable Infra | System Design & Vector Search | LeetCode Knight 👑 | GFG Inst. Rank 1 🥇 | InterviewBit Global 13 🥇 | CodeStudio Specialist 🌞

Cracked 4 national and international remote jobs as a fresher (4x Remote SDE)

About

I'm a Founding Engineer & AI Architect with 2 years of hands-on experience building large-scale AI systems, distributed backend infrastructures, and production-grade full-stack platforms. At ProPeers, I own and engineer the core systems that power 80%+ of total platform traffic, including Roadmaps, RoadmapAI, AskAI, CodeLLM, and the Contextual AI Code Editor.
I design high-scale backend architectures, real-time data pipelines, aggregation engines for 100K+ users, Redis-backed caching layers, search-validation systems, role-based access flows, rate-limiting frameworks, and CI/CD deployment automation that cut release time by 34% and improved reliability across 150+ microservices. Through SSR, dynamic imports and hybrid rendering patterns, I’ve reduced key user journey response times from 1.1s → 200ms, delivering a noticeably smoother product experience.
As an AI Architect, I build Agentic AI pipelines, RAG retrieval systems, MCP protocol layers, and multi-model inference workflows using Azure OpenAI, Azure Databricks, GPT models, and Llama 3.x OSS models. My work spans token optimization, context-window compression, semantic chunking, and adaptive prompt engineering to deliver intelligent experiences at <1s latency under real production traffic.
I’ve engineered RoadmapAI with a self-learning RAG pipeline (text-embedding-ada-002, ChromaDB, semantic filters, vector enrichment), achieving ~99% roadmap accuracy and lifting roadmap ratings from the early 12% baseline. I built CodeLLM, a production AI judge featuring multi-language detection, dual-layer JSON parsing, COMPILATION/RUNTIME/VALIDATION error classification, and deterministic verdict synthesis for educational code evaluation.
I developed AskAI with MCP-layered prompts, resource-type detection (roadmap/article/practice), O1/O3 model routing, token metering, and auto-structured responses, improving resolution speed and engagement. I also built the AI Code Editor with ~40ms inference, inline reasoning, multi-language execution, and deep integration with RoadmapAI and CodeLLM, significantly boosting editor retention.
Beyond AI flows, I’ve implemented token-based tiered access systems (one-time/monthly/yearly) on top of these capabilities, and engineered self-optimizing RAG pipelines and distributed multi-model inference workflows that balance accuracy, cost and latency under real-world traffic.
On the product and platform side, I’ve delivered Individual Roadmap Communities, scalable live-stream pipelines, error-resilient API layers, multi-step onboarding flows, connected roadmap progress engines, and search validation systems ensuring hallucination-free retrieval across Roadmaps and RoadmapAI.
At the infrastructure layer, I’ve reduced downtime by 90% (4 hours → 45 mins/month), stabilized Azure VM workloads, eliminated Bastion and high-cost D8 VM footprints, fixed bandwidth cost spikes, and built high-availability fallback layers with cache-first routing and distributed failover.
Day-to-day, I work across MERN + TypeScript, Node.js microservices, Docker/Kubernetes, Azure Cloud, Databricks, CI/CD automation, Prometheus/Grafana observability, and async caching pipelines powering 100K+ monthly active operations.
Outside core engineering, I’m a Problem-Solving & DSA Enthusiast with 5000+ problems solved, a 1400+ day coding streak, and top 0.1% global rankings across platforms. As a mentor to 40,000+ learners, I help engineers master DSA, System Design, Development, DevOps, and Remote Job Preparation, guiding them from theory to real-world success.
I love building scalable systems, intelligent architectures, and next-generation AI-first engineering experiences that blend reliability, performance, and deep technical innovation.

Experience


ProPeers

Founding Engineer

July 2025 – Present · Delhi, India · Remote

  • Architected the full AI ecosystem powering RoadmapAI, CodeLLM, AskAI, and the AI Code Editor, building Agentic AI pipelines, RAG systems, MCP server architecture, and LLM orchestration that now drive 80%+ of total platform traffic.
  • Engineered RoadmapAI end-to-end with a self-learning RAG pipeline (text-embedding-ada-002, ChromaDB, semantic filtering, adaptive difficulty) and MCP-layered prompts, achieving sub-second inference and large-scale personalization.
  • Delivered ~99% personalized roadmap accuracy using Agentic flows, structured prompt masks, multi-model routing, and RAG optimization, directly improving RoadmapAI user ratings from the early 12% baseline.
  • Built CodeLLM, an AI judge with multi-language detection, dual-layer JSON parsing, context-aware error classification (COMPILATION/RUNTIME/VALIDATION), semantic retrieval and deterministic verdict synthesis.
  • Developed AskAI, an agentic programming assistant using MCP-based prompt pipelines, resource-aware context analysis, dynamic O3Mini/O1 routing, token metering, and automated formatting, boosting engagement 3× and answer-resolution speed 2×.
  • Shipped the AI Code Editor with real-time AI review (<40ms), inline reasoning, multi-language execution, and deep RoadmapAI/CodeLLM integration, raising editor retention by 40%.
  • Scaled Roadmap features to 120K+ organic users and improved MAU by 46% through rapid iteration, tight user-feedback loops and stable AI feature launches.
  • Delivered Individual Roadmap Communities enabling peer-matching, shared progress tracking and roadmap-level micro-communities.
  • Optimized CI/CD and deployment systems, cutting deployment time by 34%, automating multi-service rollouts, and enabling safer high-frequency releases.
  • Reduced platform downtime by 90% (4 hrs to 45 mins/month) via infra hardening, progressive fallbacks, cache-first routing, real-time health checks and load-aware autoscaling.
  • Implemented complete analytics & aggregation pipelines for 100K+ users with Redis caching, chunked batch aggregation, API acceleration and advanced rate-limit enforcement.
  • Developed full search-validation engines (Roadmaps + RoadmapAI), ensuring context-safe retrieval, hallucination-resistance and consistent multi-node semantic validation.
  • Performed Azure cost and infra optimization: right-sized VMs, eliminated Bastion, stabilized Redis/Entra costs, contained Cognitive Services spikes, and resolved large bandwidth egress surges.

SDE - 1

July 2024 – July 2025 · Delhi, India · Remote

  • Built and scaled the flagship "Roadmaps" feature, delivering 100+ curated learning paths across DSA, Development, and System Design used by 100K+ users. Improved personalization and relevance, while reducing API response time from 2.1s to < 300ms, resulting in a 7x faster experience and 40% higher user engagement.
  • Optimized complex APIs to reduce processing time and improved the tab-switching experience for smoother navigation.
  • Developed and integrated the "AskAI + Discussion Forum", an intelligent peer-programming assistant where users interact with AI to solve DSA/Dev doubts and collaborate with others, enabling on-demand doubt resolution and community learning.
  • Engineered a Session Recording Bot using Python, Selenium, and headless Azure VMs with deep-link automation, automating session joining and recording, eliminating manual effort and improving reliability.
  • Optimized 150+ APIs by implementing advanced caching layers, async processing, and API pipelines, reducing backend latency by up to 70% and improving system throughput.
  • Reduced Core Web Vitals (TBT, LCP, FCP) from 4.4s to 990ms through advanced frontend optimizations (SSR, dynamic imports, lazy-loading APIs), significantly boosting UX for 15K+ monthly active users.
  • Led the end-to-end performance overhaul of the platform, focusing on smoother tab-switching experiences, minimal downtime, and blazing-fast navigation across the app.
  • Migrated MongoDB from Atlas to self-hosted replica sets, wrote automated backup & recovery scripts, set up VMs, and integrated cron-based backups to Azure Blob, ensuring data durability and cost-efficiency.
  • Set up real-time monitoring and alerting with Prometheus and Grafana, ensuring system health, proactive issue resolution, and enhanced DevOps visibility.
  • Deployed scalable CI/CD pipelines using Azure, GitLab, and Vercel, ensuring zero-downtime deployments and faster iteration cycles across teams.
  • Handled end-to-end production deployment and scaling for a system serving 15K+ users, maintaining high availability, fault tolerance, and robust performance at scale.

Cloud Conduction

Junior Software Engineer

Jan 2024 – June 2024 · USA · Remote

  • Built an AI-powered chat application from the ground up using React and .NET, improving frontend efficiency by 60% and backend performance by 30%, delivering a highly responsive user experience.
  • Integrated and optimized AI model responses, reducing latency from 1.86s to 1.2s (35% faster) through strategic API design, caching, and performance tuning.
  • Designed scalable cloud architecture on Microsoft Azure for AI workloads, improving system throughput by 10% while significantly reducing infrastructure costs via autoscaling and resource optimization.
  • Developed modern, responsive UI components in React that improved user engagement metrics by 25%, including better retention and interaction rates.
  • Implemented secure, scalable API gateways in .NET Core, capable of handling 500+ concurrent requests with 99.9% uptime, supporting production-level reliability.
  • Led the implementation of new features using the MERN stack, cutting down development time by 40%, and accelerating product iteration cycles.
  • Established CI/CD pipelines (Azure DevOps & GitHub Actions), reducing deployment failures by 75% and enabling faster, automated releases.
  • Conducted in-depth code reviews and optimization, reducing technical debt by 30%, standardizing best practices across teams, and improving maintainability.
  • Owned and managed the complete project lifecycle, from initial system design and dev planning to production deployment, server setup, and post-launch support.

INDIVIDUAL CONTRIBUTOR

I’ve engineered the core of our AI ecosystem (RoadmapAI, CodeLLM, AskAI, and the AI Code Editor), building Agentic AI pipelines, RAG-driven personalization, MCP-layered orchestration, and multi-model LLM systems that deliver real-time learning guidance, deterministic code evaluation, and context-aware programming assistance at scale. My work spans LLM system design, tokenization and reasoning flows, Azure OpenAI-backed inference, Azure Databricks-aligned data pipelines, vectorized context retrieval, and high-availability AI microservices powering the majority of our platform’s intelligence layer. I’ve also strengthened the platform’s foundation by optimizing 150+ critical APIs for latency, reliability, throughput, and large-scale fault tolerance.
  • Architected an end-to-end RAG-powered AI learning platform serving 100K+ users with sub-second inference latency, leveraging Azure OpenAI embeddings (text-embedding-ada-002), ChromaDB vector indexing, and semantic retrieval with dynamic topic-aware filtering gated at a 0.25 similarity threshold
  • Engineered a self-evolving knowledge graph where every AI-generated artifact (roadmaps, articles, practice questions) is automatically embedded, vectorized, and reintegrated into ChromaDB, creating a continuously learning retrieval layer that improves semantic accuracy with each user interaction
  • Built an intelligent RAG pipeline with multi-stage context optimization combining semantic vector similarity search, domain-specific keyword enforcement, exclusion-based noise filtering, and quality-threshold gating (0.25 cutoff) to deliver hallucination-resistant contextual augmentation
  • Designed a production-grade MCP-compliant prompt orchestration system with structured message arrays (system/user roles), dynamic context injection based on user proficiency levels (1-5 scale), adaptive difficulty mapping (Beginner/Intermediate/Advanced), and goal-oriented content generation across 3 formats
  • Implemented a real-time intent classification engine with confidence-weighted pattern matching across 4 transformation operators (NEW_SUBROADMAP, ADD_TOPICS, PROJECT_CREATION, REGENERATE_PIPELINE) using 20+ keyword signatures per intent and hierarchical fallback resolution for ambiguous requests
  • Developed a conflict-safe progress-preserving merge algorithm that maintains atomic user state (isDone flags, bookmarks, annotations, code links) during AI-driven content expansions through differential patching, duplicate detection, and rollback-capable database transactions
  • Created a multi-layer security validation framework with lexical abuse detection (violent/illegal/inappropriate patterns), technical relevance scoring across 15+ engineering domains, injection-attack guards, and AI-powered verification with a 0.6 confidence threshold for edge cases
  • Architected a scalable token-governance system with tiered allocation models (8 free tokens + purchased pools), operation-based cost accounting (Creation: 2 tokens, Customization: 4 tokens), atomic transaction handling via MongoDB optimistic locking, and graceful quota degradation
  • Optimized database performance through strategic indexing with compound indices on (userId, sessionId, isDeleted), aggregation pipeline optimization for history queries, session-based data isolation, soft-delete mechanisms, and pagination limiting to 50 records per fetch
  • Implemented a multi-model AI orchestration layer supporting dynamic routing between o3-mini (8K context window) for complex generation and gpt-3.5-turbo (4K context) for standard operations, with consistent MCP interface abstraction and model-specific parameter tuning
  • Built a resilient fallback architecture ensuring 100% availability with RAG-miss graceful degradation, sparse-query fallback prompts, cache-bypass recovery paths, multi-tier error handling, structured security-event logging, and health-check monitoring across all AI subsystems
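As a rough illustration of the intent-classification idea above, a confidence-weighted keyword matcher with a fallback default might look like the sketch below. The four operator names come from the description; the keyword signatures, the scoring rule, and the `classifyIntent` helper are illustrative stand-ins (the production system uses 20+ signatures per intent and hierarchical fallback resolution).

```typescript
// Confidence-weighted intent classification over four transformation operators.
type Intent = "NEW_SUBROADMAP" | "ADD_TOPICS" | "PROJECT_CREATION" | "REGENERATE_PIPELINE";

// Illustrative keyword signatures; the real system has far richer lists.
const SIGNATURES: Record<Intent, string[]> = {
  NEW_SUBROADMAP: ["branch", "subroadmap", "deep dive"],
  ADD_TOPICS: ["add", "include", "more topics"],
  PROJECT_CREATION: ["project", "build", "hands-on"],
  REGENERATE_PIPELINE: ["regenerate", "redo", "start over"],
};

function classifyIntent(request: string, fallback: Intent = "ADD_TOPICS"): Intent {
  const text = request.toLowerCase();
  let best: Intent | null = null;
  let bestScore = 0;
  for (const intent of Object.keys(SIGNATURES) as Intent[]) {
    // Confidence = fraction of the intent's signatures present in the request.
    const hits = SIGNATURES[intent].filter(kw => text.includes(kw)).length;
    const score = hits / SIGNATURES[intent].length;
    if (score > bestScore) { bestScore = score; best = intent; }
  }
  // Zero-signal (ambiguous) requests resolve to a safe default intent.
  return bestScore > 0 ? (best as Intent) : fallback;
}
```

Scoring every intent and keeping the maximum, rather than returning on first match, is what lets ambiguous requests fall through to a deliberate default.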

System Architecture & Details


  • Architected an end-to-end AI-powered code evaluation system replacing traditional compilers with RAG-enhanced logical judgment, leveraging semantic retrieval, model-context engineering, and multi-model orchestration to achieve 99% evaluation accuracy across Python, Java, C++, and JavaScript.
  • Built a multi-stage language detection engine using regex patterns, anti-pattern suppression, syntax heuristics, and confidence-based classification to prevent cross-language submissions and ensure evaluation integrity for every code block.
  • Implemented a production-grade MCP-compliant prompt pipeline generating strictly structured system/user message arrays, including judge instructions, evaluation rules, test-case schemas, complexity requirements, and JSON-first verdict formatting.
  • Designed a dual-layer response parsing system with JSON block extraction, Markdown fallback resolution, regex-based error isolation, and verdict normalization to guarantee consistent outputs even with noisy AI responses.
  • Engineered a multi-model AI orchestration layer dynamically routing requests between o3-mini (accuracy), o1 (reasoning), and gpt-3.5-turbo (performance) with token-window optimization and context-aware selection.
  • Integrated a RAG pipeline with ChromaDB using text-embedding-ada-002 to retrieve reference solutions, constraints, edge cases, and complexity hints, enabling AI to perform context-enriched evaluation rather than plain code matching.
  • Created a modular progress-tracking engine mapping submissions to TodoItems, Topics, and Subroadmaps, automatically updating isDone status and learning milestones through real-time backend sync and user completion logic.
  • Developed a robust validation and error-classification layer with strict checks for payload integrity, language mismatches, test-case correctness, sanitized code inspection, and COMPILATION_ERROR / RUNTIME_ERROR / VALIDATION_ERROR generation.
  • Implemented a structured verdict generator delivering human-like educational feedback including passed/failed test-case breakdowns, root-cause explanations, error localization, corrected code suggestions, and time/space complexity analysis.
  • Optimized backend infrastructure using MongoDB submission architecture with collections for Submission, TodoItem, Topic, UserTodoItemMapping, ensuring analytics-ready storage, high-throughput writes, and environment-aware routing for dev/prod deployments.
  • Achieved scalable, real-time evaluation flows combining JWT-secured endpoints, load-balanced AI calls, semantic retrieval augmentation, multi-model fail-safes, and a high-availability fallback pipeline for uninterrupted code judging.
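The dual-layer response parsing described for CodeLLM can be sketched as follows. The `parseVerdict` helper and its field names (`verdict`, `errorType`) are assumptions made for illustration; only the extract-then-fallback shape (fenced JSON first, raw brace-delimited span second, verdict normalization last) is taken from the description.

```typescript
// Dual-layer parsing of model verdicts from possibly noisy AI output.
interface Verdict { verdict: string; errorType?: string }

const FENCE = "`".repeat(3); // triple backtick, built up to keep this sample readable
const FENCED_JSON = new RegExp(FENCE + "json\\s*([\\s\\S]*?)" + FENCE, "i");

function parseVerdict(raw: string): Verdict | null {
  const candidates: string[] = [];
  const fenced = raw.match(FENCED_JSON);
  if (fenced) candidates.push(fenced[1]);     // layer 1: the instructed fenced format
  const braces = raw.match(/\{[\s\S]*\}/);
  if (braces) candidates.push(braces[0]);     // layer 2: Markdown fallback on raw braces
  for (const candidate of candidates) {
    try {
      const obj = JSON.parse(candidate);
      if (typeof obj.verdict === "string") {
        // Verdict normalization: uppercase so downstream checks are exact.
        return { verdict: obj.verdict.toUpperCase(), errorType: obj.errorType };
      }
    } catch {
      // Malformed candidate: fall through to the next layer.
    }
  }
  return null;
}
```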

System Architecture & Details


  • Architected and developed a production-grade AI programming assistant handling 100+ RPS with 99.9% uptime across learning platform resources.
  • Engineered sophisticated multi-model AI orchestration routing questions between O3Mini, O1, GPT-3.5 Turbo, and Llama 3.3 based on question complexity and resource type.
  • Built comprehensive token management system with dual-token architecture (9 free + purchased), atomic MongoDB operations, and fair usage enforcement preventing system abuse.
  • Implemented MCP (Model Context Protocol) prompt engineering with three specialized generators, eliminating RAG infrastructure while maintaining response quality.
  • Designed intelligent model selection algorithm routing Practice Questions to O1, complex DSA to O1, articles to GPT-3.5, and general questions to O3Mini for optimal performance.
  • Developed advanced response processing pipeline with autoWrapCode (10+ language detection), formatAIResponse (markdown fixing), and removeConversationalEndings (AI fluff removal).
  • Created scalable session management with three MongoDB schemas (generic, roadmap-specific, content creation), soft deletion, voting system, and optimized query patterns.
  • Built complete API security layer with JWT authentication, rate limiting, input sanitization, HTTPS enforcement, and comprehensive error handling across 6+ endpoints.
  • Implemented production monitoring system with response time tracking, token usage analytics, structured logging, and health checks for continuous optimization.
  • Achieved 3x user engagement and 2x resolution speed through intelligent model selection, clean response formatting, and context-aware interactions.
  • Engineered no-RAG architecture using sophisticated prompt engineering instead of vector databases, reducing infrastructure costs by 60%.
  • Added content caching optimization with RoadmapAskAIContentCreation schema and duplicate request prevention for article improvements.
  • Implemented question classification system using GPT-3.5 Turbo to categorize questions into 7 types (DSA, System Design, Development, etc.) for better routing.
  • Designed circuit breaker pattern and fallback chains (O1 → O3Mini → GPT-3.5 → Llama) for API failure resilience and graceful degradation.
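The fallback chain (O1 → O3Mini → GPT-3.5 → Llama) can be sketched as a try-in-order loop. `askWithFallback` and the `ModelCall` signature are hypothetical stand-ins for the real Azure OpenAI and Llama clients; only the model ordering and graceful-degradation intent come from the bullet above.

```typescript
// Try models in priority order, degrading gracefully on failure.
type ModelCall = (model: string, prompt: string) => Promise<string>;

const FALLBACK_CHAIN = ["o1", "o3-mini", "gpt-3.5-turbo", "llama-3.3"];

async function askWithFallback(
  prompt: string,
  call: ModelCall
): Promise<{ model: string; answer: string }> {
  let lastError: unknown;
  for (const model of FALLBACK_CHAIN) {
    try {
      return { model, answer: await call(model, prompt) };
    } catch (err) {
      lastError = err; // record and fall through to the next model
    }
  }
  throw new Error(`all models failed: ${String(lastError)}`);
}
```

A production version would add per-model circuit-breaker state so a model that keeps failing is skipped without paying its timeout on every request.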

System Architecture & Details


  • Engineered an AI-integrated code editor using Monaco, seamlessly tied into CodeLLM and AskAI pipelines.
  • Supported live verdicts, multi-language (C++, Java, Python) switching, and dynamic prompts based on user activity.
  • Embedded AI-based feedback inline within the editor via backend event sync and code stream capture.
  • Delivered interactive IDE-like experience with <40ms event lag, boosting engagement and retention by 40%.
  • Tight integration with RoadmapAI and CodeLLM for contextual assistance
  • Real-time code validation and suggestions during typing
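One common way to keep typing-time feedback cheap is to debounce the validation call so the AI review fires only once typing pauses. The sketch below shows the generic pattern, not the editor's actual event wiring; `validateCode` and the 40ms window are illustrative.

```typescript
// Debounce: collapse a burst of calls into one call after the burst ends.
function debounce<A extends unknown[]>(
  fn: (...args: A) => void,
  waitMs: number
): (...args: A) => void {
  let timer: ReturnType<typeof setTimeout> | undefined;
  return (...args: A) => {
    if (timer !== undefined) clearTimeout(timer); // restart the wait on every keystroke
    timer = setTimeout(() => fn(...args), waitMs);
  };
}

// Usage: wire a debounced validator to editor change events.
// validateCode is a stand-in for the real CodeLLM-backed review call.
const validateCode = (source: string) => console.log(`validating ${source.length} chars`);
const onType = debounce(validateCode, 40);
```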

System Architecture & Details


  • Refactored and optimized over 150 core APIs (Editor, Roadmap, AskAI, Profile) for high-throughput performance.
  • Reduced average response latency from 2.2s → 300ms through async queues, parallel batches, and Redis caching.
  • Introduced pagination layers, ElasticSearch indexing, and horizontal load balancing to maintain SLA under scale.
  • Achieved 70% backend performance boost and improved Core Web Vitals (TTFB, LCP, FCP) across all pages.
  • Load tested to 10K RPM; 99.95% uptime sustained with zero cold starts using warmed cloud functions.
  • Implemented advanced caching strategies and async processing
  • Enhanced frontend performance through SSR, dynamic imports, and lazy-loading

System Architecture & Details


Technical Skills


AI/ML

LLMs · RAG · AIOps · MCP · LangChain · OpenAI API · PyTorch · Vector Databases · Prompt Engineering · scikit-learn

Frontend Development

Next.js · React · TailwindCSS · Redux · React Query · CSS · HTML · SSR · CSR · Hybrid Rendering · Bootstrap

Backend Development

Node.js · FastAPI · Express · Django · .NET

Cloud & DevOps

Docker · Kubernetes · AWS · Azure · Terraform · CI/CD · GitHub Actions · GitLab CI · Jenkins · Grafana · Prometheus

Databases

MongoDB · PostgreSQL · MySQL · Redis · Firebase · ChromaDB · Vector Databases · Vector Search

Programming Languages

Python · TypeScript · JavaScript · SQL · Java · C++ · Bash

Tools

Git · GitHub · GitHub Copilot · VS Code · PyCharm · Linux · IntelliJ IDEA · Postman · Figma · Selenium · Scrapy

Education

Sage University Indore

B.Tech in Computer Science

2020 – 2024 · MP, India

CGPA: 8.5/10

GitHub (Contributions Overview)



© 2026 Prince Singh. All rights reserved.

Updated December 2025
👀 6,800+ Visitors
Profile

Prince Singh

Founding Engineer & AI Architect @ProPeers | Portfolio aka WebResume


2 Years Experience
600K+ Users Impact
AI Products Builder

Hello! 👋

Prince Singh | AI Architect

Your AI assistant. Ask anything about my work & expertise.

My Expertise:

AI Architecture · Agentic AI · RAG & MCP · Prompt Engineering · LLM Models · Fine-Tuning
