Welcome :)
Hello there! I’m Ke Yang (杨可), currently a second-year Ph.D. student at UIUC advised by Professor ChengXiang Zhai. I obtained my bachelor’s degree from Tsinghua University. I interned with Professor Heng Ji’s group at UIUC in the summer of 2022, and at Amazon AWS in the summer of 2024.
I work on intelligent agents, language models, graph neural networks, and multimodal foundation models (my top interest). I’m also keen on NLP for societal benefit.
During the winter break of 2022, I teamed up with two of my undergraduate classmates to create Zempath, an online social platform that incorporates chatbots we trained to have distinctive personalities.
Selected Publications

TinyHelen's First Curriculum: Training and Evaluating Tiny Language Models in a Simpler Language Environment
Ke Yang, Volodymyr Kindratenko, ChengXiang Zhai
We train and evaluate tiny language models on a text dataset with simplified vocabulary and linguistic structures, mimicking how children first learn language in simplified environments as part of their initial curriculum.
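As a toy illustration of this kind of environment simplification (not the paper’s actual data pipeline), one could filter a corpus down to sentences covered by a small allowed vocabulary; the vocabulary list and coverage threshold below are hypothetical:

```python
# Toy sketch: keep only sentences covered by a small, child-like vocabulary.
# The vocabulary and the 95% coverage threshold are illustrative only, not
# the dataset construction used in the paper.
SIMPLE_VOCAB = {"the", "a", "cat", "dog", "runs", "sees", "big", "small", "and"}

def is_simple(sentence: str, coverage: float = 0.95) -> bool:
    words = sentence.lower().split()
    if not words:
        return False
    known = sum(w.strip(".,!?") in SIMPLE_VOCAB for w in words)
    return known / len(words) >= coverage

corpus = ["The big dog runs.", "Quantum chromodynamics is non-perturbative."]
simplified = [s for s in corpus if is_simple(s)]
print(simplified)  # ['The big dog runs.']
```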

AgentOccam: A Simple Yet Strong Baseline for LLM-Based Web Agents
Ke Yang, Yao Liu, Sapana Chaudhary, Rasool Fakoor, Pratik Chaudhari, George Karypis, Huzefa Rangwala
AgentOccam surpasses previous state-of-the-art and concurrent LLM-based web agents through observation and action space alignment. We achieve this without in-context examples, new agent roles, online feedback, or search strategies.
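To give a flavor of what aligning an agent’s action space can look like (a hypothetical sketch, not AgentOccam’s actual interface), one can restrict the agent to a few primitives the underlying LLM handles reliably and reject everything else:

```python
# Hypothetical sketch of a pared-down web-agent action space; the primitive
# names and fields are illustrative, not AgentOccam's actual definitions.
from dataclasses import dataclass

@dataclass
class Action:
    name: str         # e.g., "click", "type", "stop"
    target: str = ""  # element id from the observation, if any
    text: str = ""    # text to enter, if any

ALLOWED = {"click", "type", "scroll", "go_back", "stop"}

def parse_action(llm_output: str) -> Action:
    """Map a raw LLM completion like 'click [42]' onto an allowed primitive."""
    parts = llm_output.strip().split(maxsplit=2)
    name = parts[0].lower() if parts else "stop"
    if name not in ALLOWED:
        return Action("stop")  # reject anything outside the action space
    target = parts[1].strip("[]") if len(parts) > 1 else ""
    text = parts[2] if len(parts) > 2 else ""
    return Action(name, target, text)

print(parse_action("click [42]"))       # Action(name='click', target='42', text='')
print(parse_action("summon_wizard x"))  # Action(name='stop', target='', text='')
```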

Bias and Volatility: A Statistical Framework for Evaluating Large Language Model's Stereotypes and the Associated Generation Inconsistency
Yiran Liu*, Ke Yang*, Zehan Qi, Xiao Liu, Yang Yu, ChengXiang Zhai (* indicates equal contributions)
The Bias-Volatility Framework measures discrimination in models by considering both their consistently biased preferences and the variation of those preferences across contexts.
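In rough terms, the idea resembles a mean-variance decomposition of a model’s preferences (a sketch with notation I am introducing here, not taken from the paper):

\[
\text{Bias} = \mathbb{E}_{c \sim \mathcal{C}}\big[s(c)\big], \qquad
\text{Volatility} = \mathrm{Var}_{c \sim \mathcal{C}}\big[s(c)\big],
\]

where \(s(c)\) denotes the model’s preference score for one demographic group over another in context \(c\), drawn from a distribution of contexts \(\mathcal{C}\).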

If LLM Is the Wizard, Then Code Is the Wand: A Survey on How Code Empowers Large Language Models to Serve as Intelligent Agents
Ke Yang*, Jiateng Liu*, John Wu, Chaoqi Yang, Yi R. Fung, Sha Li, Zixuan Huang, Xu Cao, Xingyao Wang, Yiquan Wang, Heng Ji, ChengXiang Zhai (* indicates equal contributions)
The Wizard survey explores the synergy between code and large language models (LLMs), highlighting how code empowers LLMs and benefits them when they serve as intelligent agents. We emphasize code’s readability, symbolic abstraction, and graph structure, presenting it as a valuable component of LLMs’ training corpora.

Zempath
In the promotional video for Zempath, we present our motivations and core principles. We showcase the user experience of chatting, posting either anonymously or under one’s real name, conversing with our personalized chatbots, and connecting with like-minded people. Here is a snippet from the video:
Miscellaneous
I am an amateur novelist, painter, and photographer. In my spare time, I take photos of cats, my sister, my grandparents, friends, campus, and more.