I build high-performance systems for modern data centers, with a focus on AI and ML workloads and their efficiency at scale. I am comfortable moving across different problems, but my core experience is in systems software. You can find me at the Computer Laboratory at the University of Cambridge, where I am visiting Andrew Moore.

I spent about two years at Huawei working on Compute Express Link (CXL) across QEMU and the Linux kernel, exploring how memory systems should evolve for modern data centers. More recently, I have been focusing on AI and ML workloads and the systems challenges they introduce. I am interested in optimizing these workloads at scale across networking, memory, and runtime layers, with a growing emphasis on energy efficiency alongside performance.

My interests broadly center on computer systems for data centers, especially where real systems meet emerging workloads. I enjoy technical discussions and early-stage ideas, and I am always open to exchanging thoughts.


I received my PhD from Queen Mary University of London, advised by Gianni Antichi and Brent Stephens (Utah / Google); see my PhD dissertation. I was more than fortunate to work with both Gianni and Brent; beyond their technical depth, they each have distinct styles that strongly shaped how I think about systems and research.

Before starting my PhD, I was first an intern and later a research assistant at NUS, advised by Djordje Jevdjic, working on memory management for data-center applications. I also spent three months at EPFL, advised by Jean-Yves Le Boudec, studying the feasibility of using virtualization to improve the availability of electrical grid controllers. Btw, Lausanne is beautiful; make sure you pay a visit.

I completed my undergraduate and master’s studies at IUST (Iran), working with Mohsen Sharifi, where I spent two years as a student in his laboratory.


Selected Publications (all pubs)

Enabling Fast Networking in the Public Cloud
A. Sanaee, V. Jabrayilov, I. Marinos, F. Shahinfar, D. Saxena, G. Antichi, K. Kaffes
ACM ASPLOS 26, Code, Paper

IPA: Inference Pipeline Adaptation to Achieve High Accuracy and Cost-Efficiency
S. Ghafouri, K. Razavi, M. Salmani, A. Sanaee, T. Botran, L. Wang, J. Doyle, P. Jamshidi
ICPE 25

Scalable and Effective Page-table and TLB Management on NUMA Systems
B. Gao, Q. Kang, H. Tee, K. Chu, A. Sanaee, D. Jevdjic
USENIX ATC 24, Paper, Talk, Podcast

Backdraft: a Lossless Virtual Switch that Prevents the Slow Receiver Problem
A. Sanaee, F. Shahinfar, G. Antichi, B. Stephens
USENIX NSDI 22, Paper, Talk, Code, Podcast

Morpheus: Domain-Specific Run Time Optimization for Software Data Planes
S. Miano, A. Sanaee, G. Rétvári, G. Antichi
ACM ASPLOS 22, Paper, Talk, Code, Podcast


Workshops

Reconciling High Accuracy, Cost-Efficiency, and Low Latency of Inference Serving
M. Salmani, S. Ghafouri, A. Sanaee, K. Razavi, M. Mühlhäuser, J. Doyle, P. Jamshidi, M. Sharifi
MLSys Workshop @ EuroSys 23, Paper, Talk


Teaching

I enjoy mentoring students and working closely on open-ended projects.


Service


Previous Mentees


Some thoughts

Occasional notes, opinions, and unfinished ideas:


Useful Graduate School Stuff


Linux kernel stuff


Random Stuff


Company interview rejections (since 2018)

Listed mostly as a reminder that this is normal: