I’m a Senior Director leading the inference team at Together AI. I also serve on the governing board of the LightSeek Foundation, where I co-created TokenSpeed, a speed-of-light LLM inference engine. My work in AI infrastructure has been recognized through O-1A and EB-1A extraordinary ability classifications in the United States. More about my work and background is available on my LinkedIn.
“Most of the team graduated from the top universities in China,” said Yineng Zhang, a lead software engineer at Baseten in San Francisco who works on SGLang, a project not part of DeepSeek that helps people build on top of DeepSeek’s system. “They are very smart and very young.”
While employees at big Chinese technology companies are limited to collaborating with colleagues, “if you work on open source, you work with talent around the world,” said Yineng Zhang, a lead software engineer at Baseten in San Francisco who works on the open-source SGLang project. He helps other people and companies build products using DeepSeek’s system.