Hani Mir

Previously leading model finetuning at Character.ai
Scaling LLMs to 20k+ QPS, reducing inference costs by 75%

Hani Mir
10+
Years Experience
70B
Parameter Models Scaled
75%
Inference Cost Reduction
20k+
QPS at Scale

LLM Infrastructure

Scaling 70B parameter models to 20k+ QPS, implementing parameter-efficient fine-tuning, and building real-time inference pipelines.

Distributed Systems

Building resilient infrastructure managing O(100k) securities, achieving 4 nines reliability, and scaling from startup to enterprise.

ML Optimization

Driving 3x GPU utilization improvements, 75% cost reductions, and implementing safety alignment with measurable impact.