MLLM RSS
Max Tegmark: Language Models Understand Time and Space
Microsoft's VISUALChatGPT Takes the Industry By STORM
Visual ChatGPT: Talking, Drawing and Editing with Visual Foundation Models Chenfei Wu, Shengming Yin, Weizhen Qi, Xiaodong Wang, Zecheng Tang, Nan Duan ChatGPT is attracting a cross-field interest as it provides a language interface with remarkable conversational competency and reasoning capabilities across many domains. However, since ChatGPT is trained with languages, it is currently not capable of processing or generating images from the visual world. At the same time, Visual Foundation Models, such as Visual Transformers or Stable Diffusion, although showing great visual understanding and generation capabilities, they are only experts on specific tasks with one-round fixed inputs and outputs....
Sparks of AGI: early experiments with GPT-4
CSAIL lectures with Sebastien Bubeck (April 6,2023) The new wave of AI systems, ChatGPT and its more powerful successors, exhibit extraordinary capabilities across a broad swath of domains. In light of this, we discuss whether artificial INTELLIGENCE has arrived.
Ilya Sutskever (OpenAI Chief Scientist) - Building AGI, Alignment, Spies, Microsoft, & Enlightenment
Asked Ilya Sutskever (Chief Scientist of OpenAI) about - time to AGI - leaks and spies - what's after generative models - post AGI futures - working with MSFT and competing with Google - difficulty of aligning superhuman AI Timestamps 00:00 Time to AGI 05:57 What’s after generative models? 10:57 Data, models, and research 15:27 Alignment 20:53 Post AGI Future 26:56 New ideas are overrated 36:22 Is progress inevitable? 41:27 Future Breakthroughs
What you need to know about multimodal language models
This article is part of Demystifying AI, a series of posts that (try to) disambiguate the jargon and myths surrounding AI. OpenAI has released GPT-4, the latest edition of its flagship large language model (LLM). And though few details are available, what we do know is that it will be a “multimodal” LLM, according to a Microsoft executive who spoke at a company event last week. Basically, multimodal LLMs combine text with other kinds of information, such as images, videos, audio, and other sensory data. Multimodality can solve some of the problems of the current generation of LLMs. Multimodal language models will also unlock new...
Tags
- All
- Agentic AI
- AGI
- AI
- AI Art
- AI Ethics
- AI Girlfriends
- AI Models
- AI Risk
- ai tools
- Alan D. Thompson
- Alexandr Wang
- Andrew Huberman
- Andrew Ng
- Artificial Cognition
- Aurora Supercomputer
- Authenticity
- Autism Spectrum
- AutoGPT
- Aza Raskin
- Azure Open AI
- Azure OpenAI Service
- Bias Compensation
- Bias Therapy
- Brian Roemmele
- Chain-of-Thought Prompting
- ChatGPT
- Christopher Rufo
- climate change
- Cognition Enhancement
- Cognitive Bias
- Cognitive Content
- Cognitive Performance
- Collective Intelligence
- Collective Stupidity
- Communication
- Consciousness
- Cosmology
- Critical Race Theory
- Daniel Dennett
- Daniel Schmachtenberger
- David Shapiro
- Deep Thought
- Dennis Prager
- Digital Minds
- Digital Thoughts
- Diversity
- Dojo
- Douglas Murray
- Elon Musk
- Emad Mostaque
- Equity
- Eric Weinstein
- Ethical Community Development
- Ethics
- Everyman
- Exponential Enterprise
- Fei-Fei Li
- Foresight
- Fred Lerdahl
- Frontiers Forum
- Futurecrafting
- Futurework
- Gary Marcus
- Gemini
- Gender
- Gender Pronouns
- Generative AI
- Generative Theory of Tonal Music (GTTM)
- Geoffrey Hinton
- Geoffrey Miller
- Glenn Loury
- Governance
- GPGICs
- GPT-4
- GPT-5
- Higher Education
- Human Potential
- Humanities
- Identity
- Ilya Sutskever
- Implicit Association Tests
- Intel
- Intelligence
- James Lindsay
- Joe Rogan
- Jordan B Peterson
- Jungian Archetypes
- Konstantin Kisin
- Language
- Lex Fridman
- Libra
- Life Coaching
- Liv Boeree
- Male Loneliness
- Marcus Aurelius
- Marcus T. Anthony
- Matt Walsh
- Matthew Berman
- Max Tegmark
- MemoryGPT
- Mental Health
- metabotropic receptors (MRs)
- Metacrisis
- Michio Kaku
- Microsoft AI
- Microsoft Copilot
- Microsoft Jarvis
- Microsoft Open AI
- Microsoft Semantic Kernel
- Millennials
- Mind Reading
- Minecraft
- Mirella Lapata
- MIT
- MLLM
- Moha Bensofia
- Morality
- Multimodal Large Language Model
- Multiversal Stories
- Music
- Narcissism
- Neurodivergence
- Neuroplasticity
- Neuroscience
- Nvidia
- OpenAI
- optical computers
- Personal Development
- Peter Bannon
- Peter H. Diamandis
- Philosophy
- pinecone
- Psychology
- Ramani Durvasula
- Ray Jackendoff
- Ray Kurzweil
- Reflection
- Reid Hoffman
- Relationships
- Religion
- Richard Haier
- Robotic Process Automation (RPA)
- robotics
- Sabine Hossenfelder
- Sam Altman
- Sam Harris
- Sebastien Bubeck
- semantic search
- Seneca
- Simulation
- Singularity Ready
- Stephen Fry
- String theory
- Stupidity
- Super Alignment
- Superintelligence
- Susan Blackmore
- Synthetic Intelligence
- Synthetic Mind
- Technology
- Terence McKenna
- Tesla
- Tesla AI
- The Hero Archetype
- Theism
- Theory of Mind
- Thomas Sowell
- Thought
- Thought Experiments
- Transactivism
- transcendence
- Translation
- Tree of Thoughts
- Tristan Harris
- Turing Lectures
- Unconscious Bias Training
- Victor Davis Hanson
- Wes Roth
- Will Caster
- Woke Ideologies
- Worker Productivity
- Worker Satisfaction
- Yann LeCun
- Yuval Noah Harari