Specializing in CUDA kernels, FlashAttention, quantization techniques, and high-performance deep learning systems.
Sorted by most recently updated on GitHub
Machine Learning Engineer Intern
Dec 2025 - Present
Building computer vision pipelines for medical imaging. Optimizing deep learning models for real-time inference on AWS.
Graduate Research Assistant
Jun 2025 - Present
CTMap project (IEEE ICC 2026) - LLM-enabled path planning. Research in neural architecture optimization and GPU profiling.
UC Berkeley EECS • Jul 2025
Oracle Cloud • Jun 2024 - Jun 2026
Let's discuss how I can help optimize your ML systems.