Zhoujun Cheng 承洲骏

Glad to meet you here! I am Zhoujun Cheng, a second-year Master student at Shanghai Jiao Tong University (SJTU) advised by Prof. Fan Cheng. Before that, I received the B.S. degree in computer science (IEEE class) at SJTU. Currently, I work as a research assistant in HKUNLP group advised by Prof. Tao Yu. I also worked closely with Senior Researcher Haoyu Dong in Microsoft Research Asia.

My research interest lies in NLP, especially LLM-powered language agents, code generation, semantic parsing, and structured knowledge grounding.

I am looking for a Ph.D. position starting from 2024 Fall. Please feel free to contact me if you are interested in my research.

Email  /  Google Scholar  /  Semantic Scholar  /  Twitter  /  github  /  cv

profile photo
Projects
OpenAgents (2K github stars, 5K demo users)

OpenAgents is a platform for LLM-powered Agents, serving as user-centric intelligent agents and are easily deployed locally. We developed agents targeting three real-world scenarios, including:

  • Data Agent: code interpreter augmented with data tools
  • Plugins Agent: 200+ plugins for daily life
  • Web Agent: autonomous web browsing

  • demo | code | docs | twitter
    Publications
    OpenAgents: An Open Platform for Language Agents in the Wild
    Tianbao Xie*, Fan Zhou*, Zhoujun Cheng*, Peng Shi*, Luoxuan Weng*, Yitao Liu*, Toh Jing Hua, Junning Zhao, Qian Liu, Che Liu, Leo Z. Liu, Yiheng Xu, Hongjin Su, Dongchan Shin, Caiming Xiong, Tao Yu
    Preprint 10.2023
    pdf | code | demo | docs

    An open platform for using, hosting, and building language agents.

    Binding Language Models in Symbolic Languages
    Zhoujun Cheng*, Tianbao Xie*, Peng Shi, Chengzu Li, Rahul Nadkarni, Yushi Hu, Caiming Xiong, Dragomir Radev, Mari Ostendorf, Luke Zettlemoyer, Noah A. Smith, Tao Yu
    ICLR 2023 (Spotlight)
    pdf | code | demo

    A training-free neural-symbolic framework mapping task inputs to programs.

    Batch Prompting: Efficient Inference with Large Language Model APIs
    Zhoujun Cheng, Jungo Kasai, Tao Yu
    EMNLP 2023 Industry Track
    pdf | code

    A simple prompting approach that enables the LLMs to run inference in batches to save budgets&time.

    TaCube: Pre-computing Data Cubes for Answering Numerical-Reasoning Questions over Tabular Data
    Fan Zhou, Mengkang Hu, Haoyu Dong, Zhoujun Cheng, Shi Han, Dongmei Zhang
    EMNLP 2022 (Oral)
    pdf | code

    Precomputing aggregation/arithmetic results to assist table numerical reasoning.

    HiTab: A Hierarchical Table Dataset for Question Answering and Natural Language Generation
    Zhoujun Cheng*, Haoyu Dong*, Zhiruo Wang*, Ran Jia, Jiaqi Guo, Yan Gao, Shi Han, Jian-Guang Lou, Dongmei Zhang
    ACL 2022
    pdf | code | dataset

    A hierarchical table dataset for question answering and natural language generation.

    FORTAP: Using Formulae for Numerical-Reasoning-Aware Table Pretraining
    Zhoujun Cheng*, Haoyu Dong*, Ran Jia, Pengfei Wu, Shi Han, Fan Cheng, Dongmei Zhang
    ACL 2022
    pdf | code

    Adopting spreadsheet formulas to enhance numerical reasoning skills of table modeling.

    Table Pre-training: A Survey on Model Architectures, Pretraining Objectives, and Downstream Tasks
    Haoyu Dong, Zhoujun Cheng, Xinyi He, Mengyu Zhou, Anda Zhou, Fan Zhou, Ao Liu, Shi Han, Dongmei Zhang
    IJCAI 2022 Survey Track
    pdf

    A survey on various tabular models, especially on the pretrained transformers.

    KeypointNet: A Large-scale 3D Keypoint Dataset Aggregated from Numerous Human Annotations
    Yang You, Yujing Lou*, Chengkun Li*, Zhoujun Cheng, Liangwei Li, Lizhuang Ma, Cewu Lu, Weiming Wang
    CVPR 2020
    pdf | code

    A large-scale and diverse 3D keypoint dataset.

    Human Correspondence Consensus for 3D Object Semantic Understanding
    Yujing Lou*, Yang You*, Chengkun Li*, Zhoujun Cheng, Liangwei Li, Lizhuang Ma, Weiming Wang, Cewu Lu
    ECCV 2020
    pdf | code

    Learning dense semantic correspondences on 3D objects.


    Awards&Services
  • Reviewer of ACL Rolling Review 2022/2023, NAACL SUKI Workshop 2022, NeurIPS TRL Workshop 2022/2023, ICLR DL4C Workshop 2022
  • National Scholarship (top 2%), 2018
  • Shanghai Outstanding Graduates, 2021
  • MSRA Stars of Tomorrow (Award of Excellent Intern), 2021
  • SJTU Zhiyuan Honors Scholarship (top 5%), 2018-2020

  • Beyond Academics

    I am a fan of NBA and enjoy playing basketball. I would like to highly recommend a Sina Weibo blogger @旋猫, who provides professional and insightful analysis on NBA in China.
    I also have fun with Dota2 (usually watch games rather than playing it myself, a.k.a. 云玩家)



    Last update on November 2023

    Design and source code from Jon Barron's website