Rohan Shetty

Software Engineering @ University of Waterloo

Daily Codex/GitHub Actions experiment: one scoped site change per day.

Education

University of Waterloo

Bachelor of Software Engineering  ·  2021 – 2026  ·  Waterloo, Canada

Experience

HeyGen

May 2025 – Aug 2025
Software Engineer Intern · AI Infrastructure · Toronto, Canada
  • Optimized AI video generation model by integrating sage attention into the PyTorch pipeline; utilized outlier smoothing and Int4 quantization to decrease end-to-end latency by 10%.
  • Engineered logging and profiling utilities to expose metrics on memory usage and inference bottlenecks.
  • Architected a storage migration from AWS S3 to Cloudflare R2 for model checkpoints, reducing storage costs by 35%.

Huawei

Jan 2025 – Apr 2025
AI Systems Research Intern · Markham, Canada
  • Benchmarked Microsoft's BitNet against llama.cpp and llama3.c, conducting top-down microarchitecture analysis to isolate compute/memory bottlenecks.
  • Designed a custom C++ data prefetcher with page boundary awareness, resulting in a 6% speedup in mobile inference benchmarks.
  • Proposed a specialized kernel for 1-bit LLMs supporting fine-grained structured sparsity, reducing MatMul overhead by 20%.

Tactic Studios

May 2024 – Aug 2024
Game Programmer Intern · London, Canada
  • Developed 20+ responsive, data-driven UI modules in Java for RPG title "Killer Inn" published by Square Enix.
  • Engineered "Expression Resources," a core engine tool enabling dynamic object referencing, decoupling game logic from asset data and accelerating iteration cycles.

Besty AI

Sep 2023 – Dec 2023
Software Engineer Intern · New York, USA
  • Integrated GPT-4 into automated upselling workflows for rental property hosts, generating over $300 in additional weekly revenue per user post-launch.
  • Built a real-time product analytics dashboard in React.js visualizing 500+ daily interactions using LLAMA-2 for guest intent classification.
  • Optimized backend performance by deploying Node.js workers for SQL preprocessing, capping API response time at 100ms.

Behaviour Interactive

Jan 2023 – Apr 2023
Game Programmer Intern · Toronto, Canada
  • Spearheaded gameplay development of the "Dead by Daylight" 7th Anniversary update in C++, collaborating with 20+ engineers.
  • Created an object highlighting system supporting numerous shader properties and events on game objects.
  • Leveraged Unreal Engine's network replication system for stability under 200ms latency and 2% packet loss.

Projects