I'm a Senior Director leading the inference team at Together AI. My contributions to AI infrastructure have been recognized by the U.S. government with O-1A and EB-1A extraordinary ability classifications. More about my work and background is on my LinkedIn profile.
I was an SGLang Core Maintainer from July 2024 through January 2026. I initiated and led the end-to-end DeepSeek V3/R1 effort on SGLang, from day-0 support and performance optimization to large-scale EP deployment and GB200 NVL72 integration, driving roadmap, coordination, and execution across community collaborations that pushed the frontier of open-source inference engines.
“Most of the team graduated from the top universities in China,” said Yineng Zhang, a lead software engineer at Baseten in San Francisco who works on SGLang, a project not part of DeepSeek that helps people build on top of DeepSeek’s system. “They are very smart and very young.”
While employees at big Chinese technology companies are limited to collaborating with colleagues, “if you work on open source, you work with talent around the world,” said Yineng Zhang, lead software engineer at Baseten in San Francisco who works on the open source SGLang project. He helps other people and companies build products using DeepSeek’s system.