Hands-on System Architect leading a 40-person engineering team that designs and ships systems at massive scale. As the architect of MSN.com, I lead backend services design, partner with Data Scientists to productionize ML models, and personally code key infrastructure at a scale of 400K requests/sec, 2 billion users, $1.3B in annual revenue, and 2PB of data daily.
As the architect of MSN.com, I lead the engineering team that designs the backend services, feature store, user profile, and ranking infrastructure end to end. I partner closely with Data Scientists to productionize User Understanding, Document Understanding, and ranking models on the production platform, which powers 400K requests per second, 2 billion users, $1.3B in annual revenue, and 2PB of data daily. I also personally developed key features for Microsoft Copilot: Daily Briefing, Discover, podcast integration, and personalized news. On top of that, I hand-coded a C# workflow engine and a multi-agent AI platform on MAF + GPT-5.4. Architecting at this breadth, leading teams, partnering with Data Scientists, and still writing production code, from large-scale services to GenAI agents, is a rare combination.
Every system personally architected, designed, and coded.
As the architect of MSN.com, I lead the engineering team that designs the full backend: serving layer, feature store, user profile, data pipelines (2PB/day), ranking infrastructure, and telemetry. I partner with Data Scientists to productionize User Understanding, Document Understanding, and ranking models on the production platform. The platform serves 400K requests/sec at peak, 2 billion global users, and $1.3B in annual revenue.
400K QPS · $1.3B REVENUE · 2B USERS · 2PB/DAY

Personally developed Daily Briefing and Discover for Microsoft Copilot. Brought podcast functionality and personalized news to the Copilot platform, integrating MSN's content intelligence and ranking infrastructure into Microsoft's flagship AI product.
PERSONALLY DEVELOPED

Hand-built on the Microsoft Agent Framework (MAF) + GPT-5.4. Pluggable Skills system, hierarchical SubAgent orchestration, and autonomous multi-step task execution with human-in-the-loop oversight. Driving the next generation of Microsoft's AI experiences.
PERSONALLY CODED

Production-grade C# execution engine inspired by n8n. Data Scientists design workflows visually, and the engine runs them at scale with no hand-written code. 10x faster experiment-to-production velocity, with no scalability bottlenecks.
PERSONALLY CODED

End-to-end multimodal GenAI platform producing 100K+ pieces of content daily. AI-generated content now surpasses traditional curated feeds in impressions, CTR, and revenue, a paradigm shift for Microsoft's content surfaces.
OUTPERFORMS STATIC FEEDS

CLIP-based Image2Text embeddings + ANN indexing + two-stage semantic ranking. Optimized to P99 < 20ms latency, powering real-time visual search across Microsoft's content surfaces.
P99 < 20MS

Architect and engineer of Project Florida, Microsoft Research's cross-device federated learning platform. Designed the click-to-deploy orchestration infrastructure and device SDKs that enable privacy-preserving ML training across millions of devices for Ads targeting, without raw user data ever leaving the device.
ARCHITECT & ENGINEER · MS RESEARCH

A 20+ year journey through the entire stack.
Not just keywords: technologies I've personally shipped in production.
Open to Principal Architect, Technical Fellow, and Staff+ Engineering roles.