Currently accepting new clients — Q3 2025 Let's talk →
Services Work About Get in touch →
Shillong, Northeast India

We build AI
that actually
ships.

ex-CTO & Founder BusRestro · $5M valuation
Google DSC Lead 1 of 180 in India
Staff MLOps Eng Quillbot · 100K+ MAU

LLMGuys is an elite AI and MLOps agency from the hills of Shillong. We design, train, and deploy production-grade AI systems — not prototypes that die in staging.

Start a project →
5M+
Daily active users served at peak scale
$500K
Monthly cloud savings generated for a single client
150%
GPU utilization improvement via MPS multiplexing
8+
Years building production systems at scale

End-to-end AI engineering.
No hand-holding required.

01

AI Product Engineering

From fine-tuning LLMs to shipping neural TTS and multimodal search — we design and launch AI products that users actually want to use. Real models. Real pipelines. Real users.

  • LLM finetuning & RAG systems
  • AI-powered SaaS product development
  • Multimodal AI — vision, voice, video
  • Custom model training & ONNX optimization
02

MLOps & Infrastructure

Production ML pipelines that don't fall over at 3am. Cost-optimized inference, cross-cloud autoscaling, and deployment automation built to outlast the hype cycle.

  • ML training & deployment pipelines
  • Inference optimization (1.5× throughput)
  • Cross-cloud autoscaling & cost control
  • MLFlow, KServe, RabbitMQ
03

Platform Engineering

We migrate, scale, and harden cloud infrastructure with zero downtime. Full observability baked in from day one — because you shouldn't find out about outages from your users.

  • Kubernetes & Istio deployments
  • Observability stack (OTel, Grafana, Loki)
  • Zero-downtime cloud migrations
  • Incident response & alerting (sub-2min MTTD)
04

Full Stack Development

Scalable web applications architected for real growth — from MVP to millions of users. We've built it, scaled it, migrated it, and kept it running. Ask us about the 5M DAU platform.

  • Next.js, Nest.js, React
  • MongoDB, microservices architecture
  • Real-time collaborative systems (CRDT)
  • Payment systems & analytics

Projects that moved
the needle.

01 2023–Now

Quillbot AI Suite

Single-handedly designed and launched 5+ production AI products at one of the world's leading writing tools — including neural TTS with custom voice synthesis and a Gemini-powered AI search engine. Achieved 1.5× inference throughput via ONNX quantization and FasterTransformer, and built a cross-cloud autoscaler that cut costs by 20% across AWS, GCP, Coreweave, and Hetzner. Boosted GPU utilization by 150% through MPS multiplexing on T4, A100, and L40s.

MLOps LLM Neural TTS KServe ONNX 100K+ MAU
02 2022–2023

GKE Migration & Observability Overhaul

Executed a zero-downtime migration from App Engine to GKE with Istio service mesh, generating $500K+ in monthly savings. Simultaneously deployed a complete observability stack for under $1K/month using OpenTelemetry, Grafana, Tempo, Loki, and Thanos — with sub-2-minute MTTD via event-driven PagerDuty and Prometheus alerting.

GKE Kubernetes Istio OpenTelemetry $500K/mo Saved
03 2020–2022

Real-Time Collaborative Editor at 5M DAU

Engineered a real-time collaborative editing platform serving 100K+ MAU using MongoDB sharding and YJS CRDT algorithms. Built the foundational architecture to handle 5M daily active users with Next.js SSR, Nest.js microservices, and MongoDB clustering. Reduced payment disputes by 75% (0.4% → 0.1%) via ML-based fraud detection with Stripe Radar. Executed a zero-downtime migration from Firebase Firestore to MongoDB Atlas with full consistency validation.

Next.js MongoDB CRDT 5M DAU ML Fraud Detection
04 2023–Now

Tamanna — AI Fashion & Video Platform

Built a cutting-edge AI-powered fashion and virtual photoshoot platform. Trained and optimized Wan2.2 Face LoRA for Image-to-Video generation with high-fidelity identity retention. Integrated Qwen Image Edit for controllable inpainting, relighting, and pose adaptation. Engineered cross-domain CLIP-based style transfer pipelines replicating Pinterest-grade aesthetics, plus real-time neural face reenactment for live video streams.

LoRA Diffusion CLIP I2V Wan2.2 Computer Vision
05 Consulting

BusRestro — IoT Fleet Management

Led ops and tech teams to deliver a $300K IoT fleet management contract for the Uttarakhand Transport Corporation. Then translated that technical execution into investor confidence — securing $53K in seed funding and a $5M Series A valuation from Pioneer Publicity Corporation and prop.in. The rare kind of project that proves technical depth and business acumen aren't mutually exclusive.

IoT Fleet Management $300K Gov Contract $5M Series A

Aashis
Kumar

Founder · ex-CTO · ex-COO · Principal Engineer
ex-Google DSC Lead ex-Founder MBA + BTech

Based in the cloud-kissed hills of Shillong, Aashis Kumar is among India's sharpest AI engineers — currently serving as Staff MLOps Engineer at Quillbot, where he's single-handedly shipped 5+ production AI products and built infrastructure that serves millions of users daily.

He's been in the founder seat too. As CTO and COO of BusRestro, he owned the entire technology stack, secured seed funding from Pioneer Publicity Corporation, scaled the company to a $5M Series A valuation, and personally oversaw the hardware installation of GPS trackers and cameras across interstate bus corridors for the Uttarakhand Transport Corporation. Before that, he architected the full-stack platform at HeyHomie.me — powering WhatsApp-native storefronts, plug-and-play payments, Meta Business API integrations, and AI-driven customer retargeting for SMBs across India.

He was selected as one of just 180 Google Developer Student Club Leads across all of India — hosting events, mentoring projects, and giving talks on everything from Git and React to research paper writing. That same instinct to teach and build in public defines how LLMGuys operates: no black boxes, no bloated teams, just precise technical execution and clear thinking.

LLMGuys isn't a typical agency. It's a one-person force multiplier for founders and engineering teams who need AI done properly — with production-grade infrastructure, not slide decks and demos.

Current
Quillbot
Staff MLOps Engineer
5+ AI products shipped. 100K+ MAU.
ex-CTO & ex-COO
BusRestro
$5M Series A · $300K Gov Contract
GPS fleet deployed statewide for UTC
ex-Architect
HeyHomie.me
WhatsApp-native storefronts
Meta Business API · AI retargeting for SMBs
ex-Community Lead
Google DSC
1 of 180 DSC Leads in India
Hosted events, gave talks, mentored projects
Education
Liverpool John Moores
MBA — AI thesis on Learneo products
Quillbot, Coursehero & Symbolab
Education
Manipal University
BTech, GPA 8.5/10
Awarded by Director of Alumni Relations
Ready to build?

Let's ship something
remarkable.

Whether you need an AI product built from scratch, an MLOps pipeline that actually scales, or a technical co-founder who gets things done — this is the call to make.

📍 Shillong, Meghalaya — Northeast India