Triton-distributed: Programming Overlapping Kernels on Distributed AI Systems with the Triton Compiler
  
  
    
  
  
  
  
  
    
  
      Size Zheng, 
      Wenlei Bao, 
      Qi Hou, 
      Xuegui Zheng, 
      Jin Fang, 
      Chenhui Huang, 
      Tianqi Li, 
      Haojie Duanmu, 
      Renze Chen, 
      Ruifan Xu, 
      Yifan Guo, 
      Ningxin Zheng, 
      Ziheng Jiang, 
      Xinyi Di, 
      Dongyang Wang, 
      Jianxi Ye, 
      Haibin Lin, 
      Li-Wen Chang, 
      Liqiang Lu, 
      Yun Liang, 
      Jidong Zhai, 
      Xin Liu
   
  
  
  
  
    
    
      
    
    January, 2025