Hongpeng Guo

Hongpeng Guo

Senior Research Scientist

ByteDance Seed

Biography

I am a Senior Research Scientist at ByteDance Seed, where I work on large-scale reinforcement learning training infrastructure for foundation models (LLMs and VLMs). My work focuses on machine learning systems and distributed systems for efficient and stable post-training.

I received my Ph.D. in Computer Science from the University of Illinois Urbana-Champaign and my B.Eng. in Computer Engineering from the University of Hong Kong.

Previously, I worked at Anyscale, Jane Street, Meta, and Google.

Interests
  • Reinforcement Learning Systems
  • Distributed Systems
  • Large-Scale Model Training
  • Machine Learning Infrastructure
Education
  • Ph.D. in Computer Science, 2024

    University of Illinois, Urbana Champaign

  • B.Eng. in Computer Engineering, 2018

    University of Hong Kong

Experience

 
 
 
 
 
ByteDance Seed
Senior Research Scientist
Mar 2025 – Present San Jose, CA
 
 
 
 
 
Anyscale
Software Engineer
Apr 2024 – Mar 2025 San Francisco, CA
 
 
 
 
 
Jane Street Capital
Quantitative Trading Intern
Jun 2023 – Aug 2023 NYC, NY
 
 
 
 
 
Meta
Software Engineer Intern, Machine Learning
May 2022 – Aug 2022 Menlo Park, CA
 
 
 
 
 
Google
Software Engineer Intern
May 2021 – Aug 2021 Remote, US

Featured Publications

(2023). ScaleFlow: Efficient Deep Vision Pipeline with Closed-Loop Scale-Adaptive Inference. In ACM MM 2023.

Cite

(2022). BoFL: Bayesian Optimized Local Training Pace Control for Energy Efficient Federated Learning. In Middleware 2022.

PDF Cite Code Poster Slides DOI

(2022). Multi-View Scheduling of Onboard Live Video Analytics to Minimize Frame Processing Latency. In ICDCS 2022.

PDF Cite DOI

(2021). CrossRoI: Cross-camera Region of Interest Optimization for Efficient Real Time Video Analytics at Scale. In MMsys 2021.

PDF Cite Code Slides DOI

(2021). DeepRT: A Soft Real Time Scheduler for Computer Vision Applications on the Edge. In SEC 2021.

PDF Cite DOI

(2020). Secure Broadcast Protocol for Unmanned Aerial Vehicle Swarms. In ICCCN 2020.

PDF Cite Slides DOI

(2019). Thanos: Incentive Mechanism with Quality Awareness for Mobile Crowd Sensing. In TMC 2019.

PDF Cite DOI

(2019). Dynamic Task Pricing in Multi-Requester Mobile Crowd Sensing with Markov Correlated Equilibrium. In INFOCOM 2019.

PDF Cite DOI

Contact