Alphons Jaimon

Fine-Tuned LLM with PIE Assembly, 6M, 5.56 MB, 7 tok/sec

December 3, 2025
Embedded, LLM, Assembly

Taking our ESP32 LLM from prototype to badge: Q8_0 quantization, inline PIE assembly achieving 16 int8 MACs per cycle, three-phase training, and SAM robotic TTS. This is where the badge gets its voice.

Destructuring llama2.c and Running it on an ESP32-S3

November 15, 2025
Hardware, Embedded systems, LLM, AI, Gen AI

A deep dive into running a large language model on an ESP32-S3 microcontroller: exploring llama2.c, SIMD optimizations, and the challenges of streaming inference on embedded hardware.