FINTECH THRUST SEMINAR | Mechanism Design for Large Language Models

Name: FINTECH THRUST SEMINAR | Mechanism Design for Large Language Models
Start: 2024-03-08
End: 2024-03-08
Location: Zoom ID: 872 8458 2682 Passcode: Fintech

8 March 2024

9:00am - 10:20am

Zoom ID: 872 8458 2682 Passcode: Fintech

Supporting the below United Nations Sustainable Development Goals:支持以下聯合國可持續發展目標：支持以下联合国可持续发展目标：

Mechanism Design for Large Language Models

Abstract:

We investigate auction mechanisms to support the emerging format of AI-generated content. We in particular study how to aggregate several LLMs in an incentive compatible manner. In this problem, the preferences of each agent over stochastically generated contents are described/encoded as an LLM. A key motivation is to design an auction format for AI generated ad creatives to combine inputs from different advertisers. We argue that this problem, while generally falling under the umbrella of mechanism design, has several unique features. We propose a general formalism—the token auction model—for studying this problem. A key feature of this model is that it acts on a token-by-token basis and lets LLM agents influence generated contents through single dimensional bids.

We first explore a robust auction design approach, in which all we assume is that agent preferences entail partial orders over outcome distributions. We formulate two natural incentive properties, and show that these are equivalent to a monotonicity condition on distribution aggregation. We also show that for such aggregation functions, it is possible to design a second-price auction, despite the absence of bidder valuation functions. We then move to designing concrete aggregation functions by focusing on specific valuation forms based on KL-divergence, a commonly used loss function in LLM. The welfare-maximizing aggregation rules turn out to be the weighted (log-space) convex combination of the target distributions from all participants. We conclude with experimental results in support of the token auction.

Joint work with Paul Duetting, Vahab Mirrokni, Renato Paes Leme, Song Zuo. Paper accepted to WWW’24 as oral presentation; PDF is available here https://arxiv.org/pdf/2310.10826.pdf.

Venue Opening Hour

09:00am-10:20am

Event Format

Seminar, Lecture, Talk

Speakers / Performers:

Prof. Haifeng XU

The University of Chicago

Haifeng Xu is an assistant professor in Computer Science at the University of Chicago, an AI2050 Early Career Fellow and a (part-time) research scientist at Google. His work has focused on the economics of data and machine learning, including designing learning algorithms for multi-agent decision making and designing markets for data and ML algorithms. He has published extensively at leading machine learning and computational economics conferences, and serves as area chair or senior program committee for premier venues such as ICML, EC, AAAI, IJCA, etc. His research has been recognized by multiple awards, including the AI2050 Early Career fellow, Google Faculty Research Award, ACM SIGecom Dissertation Award (honorable mention), IFAAMAS Distinguished Dissertation Award (runner-up), and multiple best paper awards; his works have been generously supported by varied agencies including NSF, Army Research Office, Office of Naval Research, Schmidt Science, and Google Research.

Language

English

Recommended For

PG students

UG students

Organizer

Financial Technology Thrust, HKUST(GZ)

Business & Management

Engineering

Humanities & Social Science

Public Policy

Science & Technology