“What's wrong with LLMs and what we should be building instead” - Tom Dietterich - #VSCF2023
Thomas G. Dietterich is emeritus professor of computer science at Oregon State University. He is one of the pioneers of the field of machine learning.
He served as executive editor of the journal Machine Learning (1992–98) and co-founded the Journal of Machine Learning Research.
He is one of the members of our select valgrAI Scientific Council.
Keynote: “What's wrong with LLMs and what we should be building instead”
Abstract: Large Language Models provide a pre-trained foundation for training many interesting AI systems. However, they have many shortcomings. They are expensive to train and to update, their non-linguistic knowledge is poor, they make false and self-contradictory statements, and these statements can be socially and ethically inappropriate. This talk will review these shortcomings and current efforts to address them within the existing LLM framework. It will then argue for a different, more modular architecture that decomposes the functions of existing LLMs and adds several additional components. We believe this alternative can address all of the shortcomings of LLMs. We will speculate about how this modular architecture could be built through a combination of machine learning and engineering.
Timeline:
00:00-02:00 Introduction to large language models and their capabilities
02:01-03:14 Problems with large language models: Incorrect and contradictory answers
03:15-04:28 Problems with large language models: Dangerous and socially unacceptable answers
04:29-06:40 Problems with large language models: Expensive to train and lack of updateability
06:41-12:58 Problems with large language models: Lack of attribution and poor non-linguistic knowledge
12:59-15:02 Benefits and limitations of retrieval augmentation
15:03-15:59 Challenges of attribution and data poisoning
16:00-18:00 Strategies to improve consistency in model answers
18:01-21:00 Reducing dangerous and socially inappropriate outputs
21:01-25:26 Learning and applying non-linguistic knowledge
25:27-37:35 Building modular systems to integrate reasoning and planning
37:36-39:20 Large language models have surprising capabilities but lack knowledge bases
39:21-40:47 Building modular systems that separate linguistic skill from world knowledge is important
40:48-45:47 Questions and discussion on cognitive architectures and addressing the issue of miscalibration
45:48 Overcoming flaws in large language models through prompt engineering and verification
Follow us!
LinkedIn: https://www.linkedin.com/company/valgrai/
Instagram: https://www.instagram.com/valgrai/
YouTube: https://www.youtube.com/@valgrai/
Twitter: https://twitter.com/fvalgrai
2023-10-26T13:52:57Z1280
https://www.youtube.com/embed/cEyHsMzbZBs