About Me¶
Hi! I'm Ayoub, an ML Engineer passionate about making large language models more efficient and accessible.
What I Do¶
I focus on:
- LLM Inference Optimization - Making models run faster and more efficiently
- Quantization Techniques - Reducing model size without sacrificing quality
- ML Systems - Building scalable infrastructure for ML workloads
- Technical Writing - Sharing knowledge and insights with the community
Connect With Me¶
Why This Blog?¶
I started this blog to document my learnings, share hands-on experience, and provide practical insights about LLM inference, quantization, and other ML optimization techniques. The field is moving fast, and I believe in learning in public.
Feel free to reach out if you want to discuss anything related to ML optimization!