Alphons Jaimon

Destructuring llama2.c and Running it on an ESP32-S3

November 15, 2025
Hardware, Embedded systems, Llm, Ai, Gen ai

Deep dive into running a Large Language Model on an ESP32-S3 microcontroller - exploring llama2.c, SIMD optimizations, and the challenges of streaming inference on embedded hardware.

Embedded ML

Destructuring llama2.c and Running it on an ESP32-S3