N
Available for hire · London, UK · No sponsorship needed

Nikesh Jagdish Malik

AI Solutions Engineer · London

I turn messy business problems into shipped AI systems — at a fraction of the bill you’d expect.

AWS, GCP, Azure when needed. Hetzner + Coolify when smarter. Built end-to-end, by me.

96%
LLM API cost reduction
92.5%
OSS vs premium accuracy
70%
Infrastructure savings
40+
Docker containers in prod

Most production work sits in private repos under client / company NDA.

Open to roles

AI Automation ExpertAI ConsultantLLM EngineerAI Solutions EngineerAI GeneralistMagnetic DeveloperTechnical Project ManagerTechnical ConsultantImplementation ConsultantCustomer Success Engineer

Twelve verticals. One operator.

From construction procurement to wealth management — applied AI delivered in real-world contexts across two continents.

Construction

Construction

CMS Desk · Spencer AI · Blueprint

Procurement

Procurement

AutoQuote · Rebate Mgmt · POD Linking

Logistics

Logistics

Way2Packers · 80+ cities

Tax & Compliance

Tax & Compliance

Tax2Solutions · UK + India + USA

HR Tech

HR Tech

PixelEMS · 7 AI agents

EdTech

EdTech

Way2Class · 25+ countries

Wealth Management

Wealth Management

Wealth1 · India's first PMS / AIF marketplace

Marketing

Marketing

SocialPixelHub · 12+ verticals

Document AI

Document AI

AutoQuote · 12 agents · 95+ fields

Government

Government

TransKai · multi-language YouTube review

Accessibility

Accessibility

ALT-Kai · WCAG 2.2 + BenTech

Solar / Renewables

Solar / Renewables

Manav Solar Solutions · 5,000+ families · 12 MW+

Construction
Procurement
Logistics
Tax & Compliance
HR Tech
EdTech
Wealth Management
Marketing
Document AI
Government
Accessibility
Solar / Renewables
Construction
Procurement
Logistics
Tax & Compliance
HR Tech
EdTech
Wealth Management
Marketing
Document AI
Government
Accessibility
Solar / Renewables

Numbers that matter.

Every metric traced back to a shipped system, a real client, a live dashboard. No vanity numbers.

96%

LLM API cost reduction

£800/mo → £30/mo at CMS Desk

92.5%

OSS vs premium accuracy

MSc dissertation finding

70%

Infrastructure savings

Hetzner + Coolify over AWS/GCP

40+

Docker containers in prod

across CMS Desk workloads

12

Specialised AI agents

in the AutoQuote pipeline

5+

Years applied AI

2021 — present

100K+

Monthly LinkedIn impressions

News in Pixel · organic

25+

Articles published

Tutorialspoint · Codedamn (+ ghostwriting)

About

I turn business problems into shipped systems.

Less hardcore engineer — more AI generalist and delivery lead. I pick the right model, the right tool, the right shortcut, and get the thing into production. Five years of exactly that across 12 industries.

Nikesh Jagdish Malik

Plate I.

Nikesh Malik

London, United Kingdom

MSc

Birmingham

est. 2020

I’m the person you bring in when a messy business problem needs to become a working system in production — fast.

Not the algorithm purist. Not the deepest researcher in the room. I’m the operator who knows which model to pick, which tool to reach for, and how to get the thing live this week — without breaking what’s already shipped.

At CMS Desk that means Spencer AI, AutoQuote (12-agent doc pipeline), Blueprint Analysis, and the multi-model router that cut LLM spend 96%. Through PenPixel LLP, AI products shipped across logistics, tax, HR, marketing, edtech, wealth management, and energy.

I won’t sell you an AWS bill you don’t need. I know AWS, GCP, and Azure inside-out — but most of my production work runs on Hetzner VPS with Coolify, self-built, self-orchestrated, end-to-end by me. Cost-effective architecture isn’t a side-effect — it’s the choice.

“Get the smallest useful version into production. Measure it. Make it better. Repeat.”

— How I work

BasedLondon, UK
StatusAvailable immediately
VisaUK Graduate Visa · 2028
ModeOn-site · Hybrid · Remote
EducationMSc Birmingham (Merit)
LanguagesEnglish · Hindi

Cost Engineering

Cost is not a side-effect of architecture. It is the architecture.

Three production wins that compound. The compounding effect across 40+ containers and dozens of chains is what gets you to 96%.

LLM API spend

96%
cut

Before

£800/mo

After

£30/mo

Multi-model LLM router

Routed across OpenRouter, AWS Bedrock, Azure OpenAI, GCP Vertex, and self-hosted Hetzner OSS. Simple tasks → Llama 3 / Mistral. Medium → Gemini Flash. Premium reserved for high-stakes reasoning.

Infrastructure

70%
saved

Before

AWS · GCP

After

Hetzner + Coolify

Infrastructure migration

40+ Docker containers moved to Hetzner orchestrated by Coolify — production reliability maintained, monthly bill collapsed.

Research finding

92.5%
OSS accuracy

Before

Premium models

After

Open-source · zero cost

MSc dissertation evidence

12 LLMs benchmarked across 1,144 verified bugs and 4,320 evaluations. Open-source models achieve 92.5% of premium accuracy — the empirical justification for everything above.

Every workflow gets routed to the cheapest model that can do the job correctly. Every host runs on the cheapest infrastructure that meets the SLA. Compound that across 40+ containers — and you get to 96%.

Work

18 projects. Six worth a deep dive.

Featured case studies, products and platforms, plus notable engagements. Most production code sits in private repos under company policy.

Featured case studies

TransKai · YT Transcribe → Translate → Watch
2025PRIVATE

TransKai · YT Transcribe → Translate → Watch

Multi-Lingual Video Intelligence · Brand & On-Screen Detection

AI agent that downloads any YouTube video, auto-detects the language, transcribes with Whisper, translates with Gemini, AND simultaneously watches the video to detect on-screen text, brand mentions, sponsor placements, and speaker turns. Outputs a multi-column spreadsheet for downstream review.

Any language → EnglishWhisper + GeminiOn-screen brand detectionXLSX / PDF export
Government · Content ReviewView →

Products & platforms

More work

Experience

Built, shipped, & delivered.

Five roles, five years of learning — AI engineer, founder, ops manager, and freelance technical author across two of India's largest developer-education platforms.

Jun 2025 — Present

CMS Desk

AI Solutions Engineer — Delivery & Implementation

Designing and delivering applied AI for construction and procurement — scoping with stakeholders, architecting solutions, managing end-to-end delivery. 40+ Docker containers in production on Hetzner with custom Prometheus/Grafana observability across every chain.

  • Spencer AI — agentic NL-to-SQL chatbot · 26+ tables real-time synced (every 2s) · 10+ SQL injection patterns blocked · 6 attack patterns refused at prompt level · multi-tenant RBAC.
  • AutoQuote — multi-tenant document AI with 12 specialised agents (invoice, POD, quotation, credit note, receipt, rebate, bank reconciliation, etc.) · 20 clients in parallel · Wasabi storage · SSE real-time progress.
  • Blueprint Analysis — automated material take-off from 2D/3D construction blueprints · room-level entity extraction · procurement-ready output.
  • Multi-Model LLM Routing across OpenRouter, AWS Bedrock, Azure OpenAI, GCP Vertex, and self-hosted Hetzner OSS models — 96% API cost reduction (£800 → £30/mo).
  • Custom observability platform — Prometheus + Grafana dashboards tracking cost per provider, P50/P95/P99 latency, error rates, fallback triggers, and token usage per chain.
  • Selected Hetzner + Coolify over AWS/GCP — 70%+ infrastructure cost reduction with production reliability.
LangChainLangGraphpgvectorQdrantAWS BedrockAzure OpenAIGCP VertexPythonFastAPIHetznerCoolify
Sep 2023 — Present

PenPixel LLP

Founder — Technical Delivery & Product Management

Founded and lead a digital solutions company delivering AI-powered products, websites, and automation. End-to-end delivery from requirements through launch across logistics, tax, HR, marketing, edtech, and wealth management.

  • Way2Packers — 400+ page logistics platform · AI chatbot (Gemini) · 13 API endpoints · 80+ city pages · 50+ neighborhood pages · 30-min quote system.
  • PixelEMS — multi-tenant Agentic OS for UK SMEs · 7 specialised AI agents · UK HMRC RTI / Right-to-Work / NEST integration · 41% faster approvals · 62% onboarding time saved.
  • SocialPixelHub — AI marketing automation · 60s brand learning · multi-model content (Gemini/GPT/Claude/FLUX/Imagen 4) · 6 platforms · 12+ industries · ₹5,999–18,999/mo.
  • News in Pixel — automated short-form news platform · grown organically to 100K+ monthly LinkedIn impressions in 6 months.
  • Client services across logistics, tax, education, marketing, and wealth management sectors.
  • Managed cross-functional team of up to 15 staff · workflow processes that cut delivery times 30% · trained and onboarded 10+ team members.
Next.jsReactTypeScriptLangChainMongoDBPostgreSQLn8nVercel
Sep 2022 — May 2025

Way2Class

Technical Operations Manager

Managed technical operations and cross-functional delivery — coordinating frontend, backend, marketing, and SEO teams across the organisation.

  • Supervised up to 15 staff — hiring, onboarding, delegation, performance management.
  • Oversaw development of internal employee management platform (Next.js, Node.js, Firebase, GCP) used across multiple companies — attendance, payroll, leave.
  • Led hiring — sourcing, screening, onboarding technical and non-technical members.
  • Drove migration from legacy systems to modern architecture — 40% infrastructure cost reduction.
Next.jsNode.jsFirebaseGCPTeam Leadership
Dec 2022 — Jul 2024

Tutorialspoint & Codedamn

Technical Content Writer · Freelance

Wrote tutorials, code walkthroughs, and developer-facing documentation across SQL, Python, JavaScript, React, Node.js, database management, and full-stack web development for two major developer-learning platforms.

Technical WritingSQLPythonWeb Dev

Stack

Tools of the trade.

Production-tested across five years of shipping AI systems.

AI · LLMs · Frameworks

OpenAI
Anthropic
Google Gemini
Meta Llama
Mistral
DeepSeek
LangChain
LangGraph
Qdrant
OpenRouter
Hugging Face
Hermes
OpenAI
Anthropic
Google Gemini
Meta Llama
Mistral
DeepSeek
LangChain
LangGraph
Qdrant
OpenRouter
Hugging Face
Hermes

Languages · Frameworks

Python
TypeScript
JavaScript
Next.js
React
Node.js
FastAPI
Tailwind CSS
Python
TypeScript
JavaScript
Next.js
React
Node.js
FastAPI
Tailwind CSS

Cloud · Infrastructure

AWS Bedrock
GCP Vertex
Azure OpenAI
Hetzner
Coolify
Docker
Vercel
n8n
AWS Bedrock
GCP Vertex
Azure OpenAI
Hetzner
Coolify
Docker
Vercel
n8n

Databases · Observability

PostgreSQL
pgvector
MongoDB
Redis
MySQL
Firebase
Prometheus
Grafana
PostgreSQL
pgvector
MongoDB
Redis
MySQL
Firebase
Prometheus
Grafana

AI-Assisted Coding

The agent stack behind the work.

Every line of production code at CMS Desk and PenPixel ships with AI agents in the loop. Daily-driver tools — not just on the CV, but in the terminal.

Claude Code

Claude Code

Primary terminal-native coding agent

Anthropic's official CLI — full repo context, multi-file edits, agent SDK

OpenAI Codex

OpenAI Codex

Async background tasks

Cloud agent for delegated long-running work, PR review

OpenCode

OpenCode

Open-source coding agent

Self-hosted alternative for sensitive client work

GitHub CLI + AI

GitHub CLI + AI

Git workflow automation

PR drafts, commit messages, code review at the terminal

Pi (Inflection)

Pi (Inflection)

Conversational reasoning

Interactive thinking partner for architecture decisions

Hermes

Hermes

Agentic routing layer

Open-weights model used in the multi-model router fallback chain

Ollama

Ollama

Local LLM serving

Self-hosted Llama / Mistral / DeepSeek runtime on Hetzner

[ The point ]

Magnetic developer means knowing which agent to reach for — terminal-native for repo work, async for delegated tasks, self-hosted when the data is sensitive. The right tool for the right cut of work.

Voice

Recent thinking on production AI.

From recent LinkedIn posts — unfiltered, written from production experience at CMS Desk.

Education

Schooled and self-sharpened.

MSc Birmingham (Merit), B.Tech Mumbai (First Class CGPI 9.01), four professional certifications from LangChain Academy and Qdrant, plus two peer-reviewed IRJET papers.

Education

University of Birmingham

Sep 2024 — Sep 2025

MSc Advanced Computer Science

2:1 Merit · Distinction in Dissertation (75/100)

Dissertation: "Open vs Closed LLMs vs Traditional Linters: A Comprehensive Empirical Analysis Across Four Critical Dimensions" — supervised by Prof. Rami Bahsoon. Benchmarked 12 LLMs and 3 static analysis tools across 1,144 verified bugs using a novel four-dimensional framework. 4,320 primary evaluations with full statistical validation (ANOVA, Welch's t-test, Cohen's d). Key finding: open-source LLMs achieve 92.5% of premium model accuracy at zero cost.

Selected modules

Intelligent Software Engineering70
Advanced Networking67
Mobile & Ubiquitous Computing66
Human-Computer Interaction58
Designing & Managing Secure Systems57
Algorithms & Complexity56

Pillai College of Engineering, Mumbai

Jan 2021 — May 2024

B.Tech Computer Engineering

CGPI 9.01/10 · UK First Class · US GPA 3.6/4.0

Specialisation in Blockchain & Cybersecurity. Smart India Hackathon participant 2022–2024. 2nd place, Inter-College Business & Startup Competition. Final semester SGPI 9.14/10 (Project: Grade O, Industry Internship: Grade O).

Certifications

Qdrant Essentials

Qdrant Academy · 100% Score (20/20)

ID: QDRANT-1E3F62CC

Mar 2026

Agent Observability & Evaluations

LangChain Academy · LangSmith tracing, eval metrics, debugging production agents

ID: zbwgw7xzv5

Mar 2026

LangChain (Python)

LangChain Academy · Production patterns

Mar 2026

LangGraph

LangChain Academy · Stateful multi-agent workflows

Mar 2026

Working philosophy

Certificates prove curiosity. Shipped systems prove competence. I aim to do both — keep learning, keep delivering.

Contact

Let’s build
something useful.

Open to permanent roles, contract engagements, and advisory work in applied AI, AI engineering, and technical delivery. Based in London, available immediately.

— Nikesh.