Ruskaruma

inference engineering ยท agent systems ยท compilers

I focus on how models and agents actually run, building high-performance AI runtimes: inference engines, CUDA kernels, and agent execution layers.

Alongside deep work in compiler design, I'm an active contributor to open-source AI systems and infrastructure.

GitHub โ†— ๐• โ†—