International Workshop on Federated Foundation Models
In Conjunction with NeurIPS 2024 (FL@FM-NeurIPS'24)


Final Submission Deadline: August 30, 2024 (23:59:59 AoE)
Notification Due: September 30, 2024 (23:59:59 AoE)
Final Version Due: October 15, 2024 (23:59:59 AoE)
Workshop Date: December 14-15, 2024
Venue: Vancouver Convention Center, Vancouver, BC, Canada

Call for Papers

Foundation models (FMs) are typically associated with large language models (LLMs), like ChatGPT, and are characterized by their scale and broad applicability. While these models provide transformative capabilities, they also introduce significant challenges, particularly concerning dis-tributed model management and related data privacy, efficiency, and scalability. The training of foundation models is data and resource intensive and the conventional methods are typically centralized; this creates significant challenges including regulatory and privacy concerns in real-world use cases. These include distributed training data, computational resources to manage distributed data repositories, and development of and alignment with regulatory guidelines (e.g., GDPR) that restrict sharing sensitive data.

Federated learning (FL) is an emerging paradigm that can mitigate these challenges by training a global but distributed model using distributed data. The extensive application of machine learning to analyze and draw insight from real-world, distributed, and sensitive data necessitates familiarity with and adoption of this relevant and timely topic within the general scientific community. As FL allows self-interested data owners to collaboratively train models, end-users can become co-creators of AI solutions. By adopting federated learning approaches, we can leverage distributed data and computing power available across different sources while respecting user privacy.

The rise of FMs amplifies the importance and relevance of FL as a crucial research direction. With FMs becoming the norm in machine learning development, the focus shifts from model architecture design to tackling the issues surrounding privacy-preserving and distributed learning. Advancements in FL methods have the potential to unlock the use of FMs, enabling efficient and scalable training while safeguarding sensitive data.

FMs such as GPT-4 encoded with vast knowledge and powerful emergent abilities have achieved remarkable success in various natural language processing and computer vision tasks. Grounding FMs by adapting them to domain-specific tasks or augmenting them with domain-specific knowledge enables us to exploit the full potential of FMs. However, grounding FMs faces several challenges, stemming primarily from constrained computing resources, data privacy, model heterogeneity, and model ownership. Federated Transfer Learning (FTL), the combination of FL and transfer learning, provides promising solutions to address these challenges. In recent years, the need for grounding FMs leveraging FTL, coined FTL-FM, has arisen strongly in both academia and industry.

With this in mind, we invite original research contributions, position papers, and work-in-progress reports on various aspects of federated learning in the era of foundation models. Since the emergence of foundation models has been a relatively recent phenomenon, their full impact on federated learning has not yet been well explored or understood. We hope to provide a platform to facilitate interaction among students, scholars, and industry professionals from around the world to discuss the latest advancements, share insights, and identify future directions in this exciting field.

This workshop aims to bring together academic researchers and industry practitioners to address open issues in this interdisciplinary research area. For industry participants, we intend to create a forum to communicate problems are practically relevant. For academic participants, we hope to make it easier to become productive in this area. The workshop will focus on the theme of combining FL with FM to open up opportunities to address new challenges. The workshop topics include but are not limited to:
Theory and algorithmic foundations:
  • Federated in-context learning
  • Federated neuro-symbolic learning
  • Impact of heterogeneity in FL of large models
  • Multi-stage model training (e.g., base model + fine tuning)
  • Optimization advances in FL (e.g., beyond first-order and local methods)
  • Prompt tuning and design in federated settings
  • Self-supervised learning in federated settings
Leveraging foundation models to improve federated learning:
  • Adaptive aggregation strategies for FL in heterogeneous environments
  • Foundation model enhanced FL knowledge distillation
  • Overcoming data interoperability challenges using foundation models
  • Personalization of FL with foundation models
Federated learning for training and tuning foundation models:
  • Fairness, bias, and interpretability challenges in FL with foundation models
  • Federated transfer learning with foundation models
  • FL-empowered multi-agent foundation model systems
  • FL techniques for training large-scale foundation models
  • Hardware for FL with foundation models
  • Optimization algorithms for federated training of foundation models
  • Privacy-preserving mechanisms in FL with foundation models
  • Resource-efficient FL with foundation models
  • Security and robustness considerations in FL with foundation models
  • Systems and infrastructure for FL with foundation models
  • Vertical federated learning with foundation models
  • Vulnerabilities of FL with foundation models

More information on previous workshops can be found here.


Submission Instructions

The main text of a submitted paper is limited to 9 content pages, including all figures and tables, following NeurIPS'24 template. Additional pages containing references don't count as content pages. An optional appendix of any length is allowed and should be put at the end of the paper (after references). Submissions are double-blind (author identity shall not be revealed to the reviewers), so the submitted PDF file should not include any identifiable information of authors.

Submissions are collected on OpenReview at the following link: https://openreview.net/group?id=NeurIPS.cc/2024/Workshop/Federated_Learning.
Accepted papers and their review comments will be posted on OpenReview in public. Due to the short timeline, we will not have a rebuttal period, but the authors are encouraged to interact and discuss with reviewers on OpenReview after the acceptance notifications are sent out. Rejected papers and their reviews will remain private and not posted in public.

For questions, please contact: han[dot]yu[at]ntu[dot]edu[dot]sg


Keynote Speakers

   

Title: Federated Large Language Models and Their Applications

Speaker: Qiang Yang, Chief AI Officer (CAIO), WeBank / Professor Emeritus, Hong Kong University of Science and Technology

Biography
Qiang Yang is the head of the AI Department at WeBank (Chief AI Officer) and Professor Emeritus at the Computer Science and Engineering (CSE) Department of the Hong Kong University of Science and Technology (HKUST), where he was a former head of CSE Department and founding director of the Big Data Institute (2015-2018). His research interests include AI, machine learning, and data mining, especially in transfer learning, automated planning, federated learning, and case-based reasoning. He is a fellow of several international societies, including ACM, AAAI, IEEE, IAPR, and AAAS. He received his Ph.D. in Computer Science in 1989 and his M.Sc. in Astrophysics in 1985, both from the University of Maryland, College Park. He obtained his B.Sc. in Astrophysics from Peking University in 1982. He had been a faculty member at the University of Waterloo (1989-1995) and Simon Fraser University (1995-2001). He was the founding Editor-in-Chief of the ACM Transactions on Intelligent Systems and Technology (ACM TIST) and IEEE Transactions on Big Data (IEEE TBD). He served as the President of International Joint Conference on AI (IJCAI, 2017-2019) and an executive council member of Association for the Advancement of AI (AAAI, 2016-2020). Qiang Yang is a recipient of several awards, including the 2004/2005 ACM KDDCUP Championship, the ACM SIGKDD Distinguished Service Award (2017), and AAAI Innovative Application Awards (2018, 2020 and 2022). He was the founding director of Huawei's Noah's Ark Lab (2012-2014) and a co-founder of 4Paradigm Corp, an AI platform company. He is an author of several books including Intelligent Planning (Springer), Crafting Your Research Future (Morgan & Claypool), and Constraint-based Design Recovery for Software Engineering (Springer).

   

Title: TBA

Speaker: Martin J. McKeown, Professor, The University of British Columbia

Biography
Dr. McKeown is the PPRI/UBC Chair in Parkinson's Research, Director at the Pacific Parkinson's Research Centre (PPRC), Professor in the Department of Medicine, and associate member in the Department of Electrical and Computer Engineering at the University of British Columbia, Canada. The PPRC is deemed an International Centre of Excellence by the (US-based) National Parkinson's Foundation. He did his Engineering Physics, Medicine and Neurology training at McMaster, University of Toronto, and University of Western Ontario, respectively. He did a 3yr research fellowship at the Computational Neurobiology Laboratory at the Salk Institute for Biological Studies in San Diego before being hired as an Assistant Professor of Medicine and Biomedical Engineering at Duke University. He was recruited to UBC in 2003. He has been responsible for a variety of peer-reviewed research projects funded through the National Institute of Health (US-NIH), the National Parkinson's Foundation (US-NPF), the Canadian Foundation for Innovation (CFI), the Natural Sciences and Engineering Research Council of Canada (NSERC), the Canadian Institutes of Health Research (CIHR), the International Association of Translational Neuroscience, and the (US) Whitaker Foundation. He was a member of the Neuroscience A (NSA) Canadian CIHR Scientific peer review committee as well as a member of the Scientific Advisory Board of the Parkinson's Society of Canada. He has authored over 180 peer-reviewed papers and book chapters. His interests include examining novel treatments for Parkinson's and exploring how Engineering methods can be used to enrich the lives of people with Parkinson's.

   

Title: TBA

Speaker: Nicholas Lane, Professor, University of Cambridge / Co-Founder and CSO, Flower Labs

Biography
Prof Lane a full Professor in the department of Computer Science and Technology at the University of Cambridge, where he leads the Cambridge Machine Learning Systems lab (CaMLSys). The mission of CaMLSys is to invent the next-generation of breakthrough ML-centric systems. He is also a Fellow of St. John's College. Alongside his academic roles, he is the co-founder and Chief Scientific Officer of Flower Labs, a venture-backed AI company (YCW23) behind the Flower federated learning framework. Flower Labs seeks to enable an AI future that is collaborative, open and distributed.

   

Title: Federated Optimization Beyond Standard Empirical Risk Minimization

Speaker: Gauri Joshi, Associate Professor, Carnegie Mellon University

Biography
Gauri Joshi is a faculty member in the ECE department at Carnegie Mellon University. Gauri completed her Ph.D. from MIT EECS, and received her B.Tech and M.Tech from the Indian Institute of Technology (IIT) Bombay. Her awards include the MIT Technology Review 35 under 35 Award, ONR Young Investigator and NSF CAREER Award, Best Paper awards at MobiHoc 2022 and SIGMETRICS 2020, and the Institute Gold Medal of IIT Bombay (2010).

   

Title: Understanding and Overcoming the Impact of System Dynamics in Learning

Speaker: Shiqiang Wang, Staff Research Scientist, IBM T. J. Watson Research Center

Biography
Shiqiang Wang is a Staff Research Scientist at IBM T. J. Watson Research Center, NY, USA. He received his Ph.D. from Imperial College London, United Kingdom, in 2015. His current research focuses on the intersection of distributed computing, machine learning, networking, and optimization, with a broad range of applications including data analytics, edge-based artificial intelligence (Edge AI), Internet of Things (IoT), and future wireless systems. He has made foundational contributions to edge computing and federated learning that generated both academic and industrial impact. Dr. Wang serves as an associate editor of the IEEE Transactions on Mobile Computing and IEEE Transactions on Parallel and Distributed Systems. He has also been actively organizing workshops at the intersection of edge computing and machine learning, and regularly participates in technical program committees (TPCs) of prominent conferences and review panels of research grants. He received the IEEE Communications Society (ComSoc) Leonard G. Abraham Prize in 2021, IEEE ComSoc Best Young Professional Award in Industry in 2021, IBM Outstanding Technical Achievement Awards (OTAA) in 2019, 2021, 2022, and 2023, multiple Invention Achievement Awards from IBM since 2016, Best Paper Finalist of the IEEE International Conference on Image Processing (ICIP) 2019, and Best Student Paper Award of the Network and Information Sciences International Technology Alliance (NIS-ITA) in 2015.


Organizers



Sai Praneeth Karimireddy
(UC Berkeley)
   

Xiaoxiao Li
(UBC)
   

Songtao Lu
(IBM)
   

Stacy Patterson
(RPI)
   

Pascal Poupart
(U Waterloo)
   

Han Yu
(NTU)
   

Organized by