Notes from the founder's desk.
Long-form writing, field notes, and benchmarks from Kamalakar Devaki — on edge AI, sovereign intelligence, the cloud-tax problem, and what it actually takes to build a full-stack AI company from the transistor up.
Fresh from the desk.
EdgeMatrix v0.0.5 — Throughput and Efficiency Gains
EdgeMatrix v0.0.5 ships throughput and efficiency improvements across multiple LLMs on NVIDIA L40S GPUs versus vLLM — with FlashAttention 4, context parallelism, and 190+ model families now supported.
All articles are published on LinkedIn — read, comment, and share them there.
Filter by topic.
50 pieces across edge AI, model architecture, silicon, runtime, and strategy. Use the filters to narrow by topic.
Enterprise AI Has a Token Leakage Problem
Enterprise AI bills aren't high because models are expensive — they're high because tokens leak. Hallucinations, poor orchestration, context overload. A full-stack approach (model strategy + runtime + monitoring) cuts token consumption 30–40%.
Why the Next 100× AI Returns May Not Be in Models
The inference inflection point has arrived. Competitive advantage is shifting from model size to deployment efficiency — cost per inference, energy, latency.
Shakti Architecture: Designing Language & Vision Models for the Real World
How the Shakti family is designed for real-world enterprise constraints — architecture choices that make compact models punch above their weight.
Shakti-4B — A Production-Grade Vision-Language Model
Shakti-4B as a production-ready VLM — engineered for the document-intelligence workloads enterprises actually run, not for leaderboard hill-climbing.
The Edge AI Systems Problem — Here's How We're Solving It
The edge isn't a smaller cloud — it's a different problem. SandLogic's approach to silicon, runtime, and models as a co-designed system.
Shakti-4B + OCR — Beating DeepSeek
Shakti-4B benchmarked against DeepSeek on OCR — quick field note on where the comparison lands.
Raising the Bar on On-Device AI — ExSLerate v2 Benchmarks
New benchmarks for ExSLerate v2 — what the numbers mean for on-device AI deployment and how they compare to incumbent options.
Benchmarking LMCache vs EdgeMatrix — Why Caching Alone Isn't Enough
Prefix caching is necessary but not sufficient. Why hybrid KV-cache reuse beats prefix-only approaches in multi-tenant inference workloads.
Core/Edge Energy Performance in AI Chips
How to think about energy-per-inference at the core/edge boundary — the metric that actually matters at scale.
Redefining LLM Inference — How EdgeMatrix Outperforms vLLM
EdgeMatrix vs vLLM head-to-head on enterprise SLMs — the architectural choices behind a 73% throughput lift on the L40S.
Engineering Scalable Edge AI — The Semiconductor Stack
What the semiconductor stack for scalable edge AI actually looks like — silicon, compiler, runtime, and how they need to be designed together.
Building the Full-Stack AI Future — Chip, Runtime, Models
The thesis statement for SandLogic — why full-stack vertical integration is the right shape for an AI company in 2025, not a luxury.
Why AI Chip Makers Need In-House Research, Now More Than Ever
Chip companies that outsource research are running an open-loop strategy. The case for the closed-loop alternative — and what it changes.
Lexicons, Nexons, Shakti — A Continuum of Intelligence
Introducing the SandLogic model continuum: Lexicons (curated open-source, quantized), Nexons (open foundations refined with our datasets), Shakti (in-house, ground-up).
Shakti LLM Series — Post 2: Built or Borrowed?
When do you build a sovereign language model and when do you start from open weights? Trade-offs from the SandLogic decision tree.
Shakti LLM Series — Post 1: Why We Built a Sovereign Language Model
The founding rationale for Shakti — why sovereignty, language coverage, and edge deployment forced the in-house path.
GenAI · Edge AI · Multi-Modal LLMs — Field Note
Short field-note on multi-modal LLMs running on the edge — what works, what doesn't yet.
Escape the Cloud Tax
Companion post for the "Escape the Cloud Tax" series — making the case for on-prem inference economics.
Escape the Cloud Tax — Post 5: Serve Faster, Spend Smarter, Scale
Final post in the "Escape the Cloud Tax" series — how to design inference for speed, cost, and scale simultaneously.
LLM Inference Acceleration — Field Note
Short post on the practical mechanics of LLM inference acceleration on the edge.
LLM MLOps on the Edge — Field Note
Operating LLMs at the edge — what an MLOps stack looks like when there is no cloud safety net.
LLM Inference MLOps — Notes
Quick notes on inference MLOps — observability, model swapping, drift detection at production scale.
EdgeMatrix vs the Cloud Tax — Field Note
How EdgeMatrix maps to the "escape the cloud tax" thesis — the economics in one chart.
ExSLerate — On-Chip AI for the Edge
Introducing the ExSLerate IP family — what makes an AI accelerator chip "edge-native" rather than a shrunk-down data-center part.
EdgeMatrix — Scaling 70B-Parameter Models for Enterprise AI
How EdgeMatrix scales to 70B-parameter LLMs without the cloud sticker shock — the engineering trade-offs explained.
Shakti-4B's OCR Capabilities — Comprehensive Evaluation
Comprehensive evaluation of Shakti-4B's OCR performance — datasets, methodology, and head-to-head benchmarks.
How EdgeMatrix Is Redefining Enterprise AI — More for Less Cost
Concrete enterprise economics — what "more for less" looks like when the inference layer is engineered for the workload.
Shakti-4B — Multi-Modal AI Model Powering Intelligence
Shakti-4B as a multi-modal foundation — what it can do today across vision and language.
Shakti-1B — Vision-Language Model Built for Enterprise
Shakti-1B as the right size for many enterprise document workflows — fast, accurate, and edge-deployable.
LingoForge — Revolutionizing How Enterprises Harness AI
LingoForge as the agent-orchestration layer enterprises actually need — and what "actually need" means in regulated industries.
Revolutionizing ASR with Samba-ASR
Samba-ASR — the Mamba-based architecture under Sruthi-S that beats Whisper-large-v3 on average WER. Linear complexity, frontier accuracy.
Speech Recognition Innovation — Field Note
Quick note on speech-recognition innovation — what changed and where it goes next.
Shakti LLM · Generative AI — Recognition
Recognition for Shakti LLM in generative-AI excellence rankings.
Real-World Applications of Shakti LLMs — Revolutionizing AI
How Shakti models show up in real-world enterprise deployments — concrete use cases across verticals.
Shakti LLMs Driving On-Device AI Workplace Agents
On-device workplace AI agents powered by Shakti — what becomes possible when the model lives on the device.
Precision & Power — Shakti's Blueprint for AI Excellence
The blueprint behind the Shakti family — where precision and power meet to define the model architecture.
Harnessing the Power of Shakti — LLM Series
A walk-through of the Shakti family for builders — what to deploy, where, and how to think about model selection.
From Edge to Excellence — Shakti LLM Revolution for Enterprise
Shakti from the edge to enterprise excellence — how the model line evolved from edge-first design to enterprise scale.
NASSCOM DeepTech Club — Startup Badge Awarded
SandLogic recognized by NASSCOM's DeepTech Club — a milestone moment.
Make in India · AI for Good · Enterprise AI
On building sovereign AI for India — Make in India meets AI for Good meets enterprise reality.
Shakti-2.5B — Live on Hugging Face
Announcing the Shakti-2.5B Hugging Face Space — explore the model interactively.
Shakti — A 2.5-Billion-Parameter Small Language Model
The first wide-audience announcement of Shakti-2.5B — what it is, why it's small on purpose, and what it beats.
Revolutionizing UI Localization Testing with LLMs
How LLMs change the economics of UI localization testing — a vertical use case for compact language models.
Shakti LLM · Responsible AI — Field Note
On building responsible AI guardrails into a sovereign LLM — the design choices behind HaluMon.
Introducing LexiQ — Your AI-Powered Assistant for KPI & Power BI
LexiQ — a Lexicon-built assistant for KPI and Power BI workflows. Domain-specialized AI shipped as a product, not a demo.
Optimized Llama3-Med42-8B GGUF SandLogic Lexicon
A medical-domain Lexicon — Llama3-Med42-8B optimized via SandLogic's quantization recipe. What enterprise-ready open-source looks like.
Unlocking Bilingual AI — A SandLogic Lexicon-Based Approach
Bilingual AI via curated Lexicons — the engineering and the dataset choices behind production-grade language coverage.
Turbocharge Your AI with SandLogic Lexicons
Why curated open-source — quantized, packaged, and benchmarked — beats raw model downloads for enterprise teams.
Introducing HaluMon — Ensuring Language-Model Reliability
The launch post for HaluMon — what reliable LLM deployment looks like in regulated industries, and the four-metric scoring that makes it auditable.
Eight years. One thesis.

Kamalakar Devaki · Founder & CEO
Kamalakar Devaki founded SandLogic in 2018 on the bet that intelligence belongs on the device, not rented from the cloud. Eight years later, the silicon, the runtime, the models, and the applications have all shipped under one roof.
The writing on this page is not marketing. It's the working record: engineering decisions, benchmark numbers, strategic bets, and the occasional unfiltered opinion. It all lives on LinkedIn because that's where the audience reads, comments, and shares. This page curates the archive in one place.
The thesis hasn't changed. The execution has.