Ruokai is currently a final year Ph.D. student in the Department of Electrical Engineering at Yale University, where he is advised by Prof. Priyadarshini Panda.
His research focuses on designing energy-efficient computer architectures, systems, and algorithms for AI workloads, particularly those involving asymmetric operand precision or sparsity. He is also interested in neuromorphic computing, as enablers for bio-plausible and energy-efficient deep learning (spiking neural networks).
Prior to joining Yale, he earned his B.S. from the University of Wisconsin-Madison, majored in in Electrical Engineering, Computer Science, and Mathematics. During his undergraduate, he worked with Prof. Joshua San Miguel on designing computer architectures for stochastic computing.
Autonomous sharding planning in large scale distributed LLM inference system.
Paper: https://arxiv.org/abs/2509.00217 (Accepted to Workshop on ML for Systems, NeurIPS 2025)
Architecture design and modeling for Cerebras’s next-generation wafer-scale engine.
Working on projects that improving the energy efficiency of neural networks, in particular, spiking neural networks.
Worked on projects that applying unary computing to the deep neural networks. Construted a PyTorch-basede library for unary computing.
Instructor: Prof. Priya Panda. Course Description.
Instructor: Prof. Rajit Manohar. Course Description.