Shillong, Northeast India

We build AI
that actually
ships.

ex-CTO & FounderBusRestro · $5M valuation

Google DSC Lead1 of 180 in India

Staff MLOps EngQuillbot · 100K+ MAU

LLMGuys is an elite AI and MLOps agency from the hills of Shillong. We design, train, and deploy production-grade AI systems — not prototypes that die in staging.

Start a project →

↓

5M+

Daily active users served at peak scale

$500K

Monthly cloud savings generated for a single client

150%

GPU utilization improvement via MPS multiplexing

Years building production systems at scale

What we do

End-to-end AI engineering.
No hand-holding required.

AI Product Engineering

From fine-tuning LLMs to shipping neural TTS and multimodal search — we design and launch AI products that users actually want to use. Real models. Real pipelines. Real users.

LLM finetuning & RAG systems
AI-powered SaaS product development
Multimodal AI — vision, voice, video
Custom model training & ONNX optimization

MLOps & Infrastructure

Production ML pipelines that don't fall over at 3am. Cost-optimized inference, cross-cloud autoscaling, and deployment automation built to outlast the hype cycle.

ML training & deployment pipelines
Inference optimization (1.5× throughput)
Cross-cloud autoscaling & cost control
MLFlow, KServe, RabbitMQ

Platform Engineering

We migrate, scale, and harden cloud infrastructure with zero downtime. Full observability baked in from day one — because you shouldn't find out about outages from your users.

Kubernetes & Istio deployments
Observability stack (OTel, Grafana, Loki)
Zero-downtime cloud migrations
Incident response & alerting (sub-2min MTTD)

Full Stack Development

Scalable web applications architected for real growth — from MVP to millions of users. We've built it, scaled it, migrated it, and kept it running. Ask us about the 5M DAU platform.

Next.js, Nest.js, React
MongoDB, microservices architecture
Real-time collaborative systems (CRDT)
Payment systems & analytics

Selected work

Projects that moved
the needle.

012023–Now

Quillbot AI Suite

Single-handedly designed and launched 5+ production AI products at one of the world's leading writing tools — including neural TTS with custom voice synthesis and a Gemini-powered AI search engine. Achieved 1.5× inference throughput via ONNX quantization and FasterTransformer, and built a cross-cloud autoscaler that cut costs by 20% across AWS, GCP, Coreweave, and Hetzner. Boosted GPU utilization by 150% through MPS multiplexing on T4, A100, and L40s.

MLOpsLLMNeural TTSKServeONNX100K+ MAU

→

022022–2023

GKE Migration & Observability Overhaul

Executed a zero-downtime migration from App Engine to GKE with Istio service mesh, generating $500K+ in monthly savings. Simultaneously deployed a complete observability stack for under $1K/month using OpenTelemetry, Grafana, Tempo, Loki, and Thanos — with sub-2-minute MTTD via event-driven PagerDuty and Prometheus alerting.

GKEKubernetesIstioOpenTelemetry$500K/mo Saved

→

032020–2022

Real-Time Collaborative Editor at 5M DAU

Engineered a real-time collaborative editing platform serving 100K+ MAU using MongoDB sharding and YJS CRDT algorithms. Built the foundational architecture to handle 5M daily active users with Next.js SSR, Nest.js microservices, and MongoDB clustering. Reduced payment disputes by 75% (0.4% → 0.1%) via ML-based fraud detection with Stripe Radar. Executed a zero-downtime migration from Firebase Firestore to MongoDB Atlas with full consistency validation.

Next.jsMongoDBCRDT5M DAUML Fraud Detection

→

042023–Now

Tamanna — AI Fashion & Video Platform

Built a cutting-edge AI-powered fashion and virtual photoshoot platform. Trained and optimized Wan2.2 Face LoRA for Image-to-Video generation with high-fidelity identity retention. Integrated Qwen Image Edit for controllable inpainting, relighting, and pose adaptation. Engineered cross-domain CLIP-based style transfer pipelines replicating Pinterest-grade aesthetics, plus real-time neural face reenactment for live video streams.

LoRADiffusionCLIPI2VWan2.2Computer Vision

→

05Consulting

BusRestro — IoT Fleet Management

Led ops and tech teams to deliver a $300K IoT fleet management contract for the Uttarakhand Transport Corporation. Then translated that technical execution into investor confidence — securing $53K in seed funding and a $5M Series A valuation from Pioneer Publicity Corporation and prop.in. The rare kind of project that proves technical depth and business acumen aren't mutually exclusive.

IoTFleet Management$300K Gov Contract$5M Series A

→

The person behind it

Aashis
Kumar

Founder · ex-CTO · ex-COO · Principal Engineer

ex-Google DSC Leadex-FounderMBA + BTech

Based in the cloud-kissed hills of Shillong, Aashis Kumar is among India's sharpest AI engineers — currently serving as Staff MLOps Engineer at Quillbot, where he's single-handedly shipped 5+ production AI products and built infrastructure that serves millions of users daily.

He's been in the founder seat too. As CTO and COO of BusRestro, he owned the entire technology stack, secured seed funding from Pioneer Publicity Corporation, scaled the company to a $5M Series A valuation, and personally oversaw the hardware installation of GPS trackers and cameras across interstate bus corridors for the Uttarakhand Transport Corporation.

He was selected as one of just 180 Google Developer Student Club Leads across all of India — hosting events, mentoring projects, and giving talks on everything from Git and React to research paper writing. That same instinct to teach and build in public defines how LLMGuys operates: no black boxes, no bloated teams, just precise technical execution and clear thinking.

LLMGuys isn't a typical agency. It's a one-person force multiplier for founders and engineering teams who need AI done properly — with production-grade infrastructure, not slide decks and demos.

Call →LinkedIn →GitHub →

Current

Quillbot

Staff MLOps Engineer 5+ AI products shipped. 100K+ MAU.

ex-CTO & ex-COO

BusRestro

$5M Series A · $300K Gov Contract GPS fleet deployed statewide for UTC

ex-Architect

HeyHomie.me

WhatsApp-native storefronts Meta Business API · AI retargeting for SMBs

ex-Community Lead

Google DSC

1 of 180 DSC Leads in India Hosted events, gave talks, mentored projects

Education

Liverpool John Moores

MBA — AI thesis on Learneo products Quillbot, Coursehero & Symbolab

Education

Manipal University

BTech, GPA 8.5/10 Awarded by Director of Alumni Relations

Ready to build?

Let's ship something
remarkable.

Whether you need an AI product built from scratch, an MLOps pipeline that actually scales, or a technical co-founder who gets things done — this is the call to make.

Call us directly +91 96606 90531

📍 Shillong, Meghalaya — Northeast India

We build AIthat actuallyships.

End-to-end AI engineering.No hand-holding required.