I'm a Principal AI Researcher at Together AI and an SGLang Core Maintainer.
I initiated and led the end-to-end DeepSeek V3/R1 effort on SGLang, from day-0 support and performance optimization to large-scale EP deployment and GB200 NVL72 integration, driving roadmap, coordination, and execution across community collaborations that pushed the frontier of open-source inference engines.
My contributions to AI infrastructure have been recognized by the U.S. government with O-1A and EB-1A extraordinary ability classifications.
More about my work and background is on my LinkedIn profile.
“Most of the team graduated from the top universities in China,” said Yineng Zhang, a lead software engineer at Baseten in San Francisco who works on SGLang, a project not part of DeepSeek that helps people build on top of DeepSeek’s system. “They are very smart and very young.”
While employees at big Chinese technology companies are limited to collaborating with colleagues, “if you work on open source, you work with talent around the world,” said Yineng Zhang, a lead software engineer at Baseten in San Francisco who works on the open-source SGLang project. He helps other people and companies build products using DeepSeek’s system.