About Me¶

Hi! I'm Ayoub, an ML Engineer passionate about making large language models more efficient and accessible.

What I Do¶

I focus on:

LLM Inference Optimization - Making models run faster and more efficiently
Quantization Techniques - Reducing model size without sacrificing quality
ML Systems - Building scalable infrastructure for ML workloads
Technical Writing - Sharing knowledge and insights with the community

Connect With Me¶

GitHub: Github
LinkedIn: Linkedin

Why This Blog?¶

I started this blog to document my learnings, share hands-on experience, and provide practical insights about LLM inference, quantization, and other ML optimization techniques. The field is moving fast, and I believe in learning in public.

Feel free to reach out if you want to discuss anything related to ML optimization!