Zhengxu Yan

About

Publications

Home

About

Publications

Home

Zhengxu Yan

Zhengxu Yan

About Me

I'm a Computer Science M.S. student at Stanford University and a recent UC Berkeley graduate in Computer Science and Data Science. My work spans verifiable code generation, diffusion transformer systems, and AI for social impact, and I love building collaborative teams that turn research into real-world impact.

Stanford, CA

CV (Jan, 2025)

Publications

Under Review

  • Ye, Z., Yan, Z., He, J., Kasriel, T., Yang, K., & Song, D. VERINA: Benchmarking Verifiable Code Generation. Under review at ICLR 2026. https://arxiv.org/abs/2505.23135
  • Luo, M., Hao, A., Yan, Z., Cao, C., & Nguyen, Q. L. N. DiT-Serve: An Efficient Serving Engine for Diffusion Transformers. Under review at ICLR 2026.

Published

  • Yan, Z., Dube, V., Heselton, J., Johnson, K., Yan, C., Jones, V., Blaskewicz Boron, J., & Shade, M. (2024). Understanding older people’s voice interactions with smart voice assistants: a new modified rule-based natural language processing model with human input. Frontiers in Digital Health, 6, 1329910. https://doi.org/10.3389/fdgth.2024.1329910
  • Jones, V. K., Yan, C., Shade, M. Y., Boron, J. B., Yan, Z., Heselton, H. J., Johnson, K., & Dube, V. (2024). Reducing Loneliness and Improving Social Support among Older Adults through Different Modalities of Personal Voice Assistants. Geriatrics (Basel, Switzerland), 9(2), 22. https://doi.org/10.3390/geriatrics9020022

Working Papers

VeriAgentBench: Benchmarking Project-Level Verifiable Code Agents in the Wild

  • Introduced a benchmark of 50 real-world repositories supporting repository-level verification of existing projects and verifiable generation from scratch.
  • Supervised by Dr. Dawn Song and Dr. Kaiyu Yang, I developed an automated evaluation pipeline measuring proof validity and functional correctness, revealing that state-of-the-art code agents achieve single-digit success rates on VeriAgentBench.

Adaptive Operations Management in Buildings: A Reinforcement Learning Approach for Operational Adaptability in Healthcare Facilities

  • Introduced a reinforcement learning-based framework to enable adaptive and integrated operations management in healthcare facilities, optimizing spatial, social, and operational performance through coordinated resource sharing.
  • Supervised by Dr. Yehuda Kalay and Dr. Davide Schaumann, I utilized deep reinforcement learning (RL) and simulation to develop a smart building management system for the Cardiac Catheterization Lab at St. Bernardine Medical Center, significantly enhancing facility adaptability and operational efficiency. The manuscript is in preparation for journal review.

Work Experience

Teaching Assistant for Large Language Model Agents (CS 194-196)

August 2024 - Present

Department of Electrical Engineering and Computer Sciences at UC Berkeley

  • Coordinated course logistics, managed lecture recordings, enhanced assignment materials, and provided academic support by addressing student inquiries.
  • Assisted in organizing and coordinating the LLM Agents Hackathon, hosted by Berkeley RDI in conjunction with the LLM Agents MOOC, designed to foster innovation, expand the AI agent community, and advance LLM agent technology.

Research Assistant

February 2024 - Present

UC Berkeley Sky Computing Lab

  • Competitively selected through UC Berkeley EECS Diversifying Access to Research in Engineering (DARE) and Sky Summer Undergraduate Programs to join the DiT-Serving Project as a research assistant.
  • Led a team of 5 research assistants in integrating Ring Attention and Brick Attention into Diffusion Transformer (DiT) models, pioneering scalable video generation techniques.
  • Directed continuous batching strategies to optimize system throughput, enhancing performance efficiency across video processing requests.

Research Assistant

February 2024 - Present

UC Berkeley College of Environmental Design

  • Competitively selected to join the Smart Hospital Project through UC Berkeley Undergraduate Research Apprentice Program.
  • Led a team of 3 research assistants in the development and implementation of a deep reinforcement learning-based smart building management system for the Cardiac Catheterization Lab at St. Bernardine Medical Center.

Machine Learning (ML) Engineer Lead

January 2024 - Present

UC Berkeley School of Public Health

  • Selected as lead computer scientist for Rapid Reviews: Infectious Diseases via the UC Berkeley CDSS Discovery Program.
  • Independently leveraged a fine-tuned Large Language Model (LLM) to efficiently categorize and identify preprints within the RR\ID domain.
  • Led a team of 3 UC Berkeley CS students in utilizing LLM APIs to analyze and provide insights on medical preprints.
  • Directed a team of 5 UC Berkeley CS students in engineering scripts to automate the collection of medical preprints.
  • Collaborated with the Dean of the UC Berkeley School of Public Health and researchers from UCSF to develop automated systems for the academic review of infectious disease papers, set to be published by MIT Press.

Research Assistant

April 2023 – September 2023

University of Nebraska

  • Collaborated with gerontology and nursing researchers to develop a natural language processing (NLP) model designed to automate the processing of speech-to-text data from user interactions with AI-enabled smart voice assistants.

Backend Engineer Lead

October 2022 – December 2022

Coffee Tea, Inc.

  • Led a team of 3 UC Berkeley computer science students in backend development for a social platform.
  • Directed the design and development of backend REST APIs using FastAPI, Poetry, Alembic, and PostgreSQL.

Projects

Machine Learning

Multi-Agent LLM Trading System

August 2024 – Present

Python, Pytorch, Tensorflow, AutoGen

  • Developed a multi-agent trading system integrating LLMs with specialized agents for multimodal data processing, layered memory, and Retrieval-Augmented Generation, aiming to exceed standard prediction models by 5-7% in market accuracy.

IM2SPAIN Project

January 2024 – May 2024

Python

  • Employed nearest neighbors (k-NN) to predict geographic coordinates based on CLIP embeddings of geo-tagged images from Flickr, capturing diverse landscape features across Spain.

MNIST Competition

January 2024 – May 2024

Python, Pytorch

  • Engaged in the Kaggle MNIST classification challenge, leveraging Linear and Quadratic Discriminant Analysis, Logistic Regression, Multi-layer Perceptron, Support Vector Machines, and Convolutional Neural Networks to optimize prediction accuracy.

Language Identification RNN

August 2023 – December 2023

Python, Tensorflow

  • Engineered an RNN to identify word languages, achieving over 81% test set accuracy.

Graph Partitioning for Unsupervised Learning

August 2022 – December 2022

Golang, AWS, Google Cloud

  • Developed graph partitioning algorithms using the Kernighan–Lin method to support unsupervised machine learning workflows.
  • Implemented a relational database and Golang backend deployed on AWS and Google Cloud to manage outputs and orchestrate large-scale experiments.

Systems Programming and Software Development

Pintos Operating System

January 2023 – May 2023

C, Rust

  • Led the development of a comprehensive Pintos operating system, focusing on systems programming, memory allocation, resource management, file systems, networking, and security.

Secure Client Application

January 2023 – May 2023

Golang

  • Developed a secure client Golang application incorporating cryptographic primitives for authentication and file management.

Gitlet Version Control System

January 2021 – May 2022

Java

  • Created "Gitlet," a Git-like Version Control System, to streamline tracking and management of code changes across projects.

Education

Stanford University

Expected Graduation: April 2027

Master of Science in Computer Science

University of California, Berkeley

August 2021 – May 2025

Bachelor of Arts in Computer Science and Data Science

GPA: 3.95/4.00, High Distinction

Selected Coursework:

Optimization Models (EECS 127), Probability for Data Science (Data C140), Computer Security (CS 161), Operating Systems (CS 162), Efficient Algorithm (CS 170), Computer Vision (CS 180), Artificial Intelligence (CS 188), Machine Learning (CS 189), LLM Agents (CS 194-196).

访客 Visitors:
浏览量 Page Views:
CC BY-NC-SA 4.0 © 2024 Zhengxu Yan
Powered by Hexo & 枫叶 Fengye
0 %