Talk:Restless Souls/Technology: Difference between revisions

m
no edit summary
mNo edit summary
mNo edit summary
Line 577: Line 577:
:: The hybrid approach together with MoE is called Jamba by AI21 and is also used by Nvidia.  
:: The hybrid approach together with MoE is called Jamba by AI21 and is also used by Nvidia.  
:: Matured mamba transformer hybrids will make agentic AI smarter and therefore way more safer to use by default.
:: Matured mamba transformer hybrids will make agentic AI smarter and therefore way more safer to use by default.
:: In 2026 Sam Altman said OpenAI's next breakthrough is expected within two years, possibly meaning such hybrid approach which either gives him a something close to a world model, if not the real thing. The knowledge gained from Sora is rumored to flow into that. Google is working on a (''native'') "general purpose world model" instead. So, Genie 3 will - when it is released - probably perform better.
* '''Physical AI''' = Physical Artificial Intelligence. Basically AI used in robots, including self-driving cars.
* '''Physical AI''' = Physical Artificial Intelligence. Basically AI used in robots, including self-driving cars.
:: The general idea is: Like humans or other real organisms, AIs benefit from having an "inner world" to improve understanding and reasoning. The use of large language models (LLMs) is optional but can be a useful design choice to assist humans in directing such systems.
:: The general idea is: Like humans or other real organisms, AIs benefit from having an "inner world" to improve understanding and reasoning. The use of large language models (LLMs) is optional but can be a useful design choice to assist humans in directing such systems.
:: Training from ''first-hand sensor input'' is obvious but real world actions can be dangerous and are - because realtime - ''slow'' in context of the computer age. Therefore, AIs are alternatively pre-trained in a simulation where the robot is represented by a digital twin. ''Real world training will be kept for fine-tuning.''' Modern physical AIs are in overall multimodal.  
:: Training from ''first-hand sensor input'' is obvious but real world actions can be dangerous and are - because realtime - ''slow'' in context of the computer age. Therefore, AIs are alternatively pre-trained in a simulation where the robot is represented by a digital twin. ''Real world training will be kept for fine-tuning.''' Modern physical AIs are in overall multimodal.  
:: Alternatively, physical AIs are trained from video. An this posses ''second-hand sensor input'' the reasoning capabilities are less potent.
:: Alternatively, physical AIs are trained from video. An this posses ''second-hand sensor input'' the reasoning capabilities are less potent.
::: In 2026 Sam Altman said OpenAI's next breakthrough is expected within two years, possibly meaning such hybrid approach which either gives him a something close to a world model, if not the real thing. The knowledge gained from Sora is rumored to flow into that. Google is working on a (''native'') "general purpose world model" instead. So, Genie 3 will - when it is released - probably perform better.
:: Training with motion capture data is quickly done - and often falls short in generating stable locomotion as additional training would be needed.<!--Embodied AI-->
:: Training with motion capture data is quickly done - and often falls short in generating stable locomotion as additional training would be needed.<!--Embodied AI-->
* '''World model''' = World models get trained on multimodal data, especially videos.<!--Will it be Gemini 4 or 5?-->
* '''World model''' = World models get trained on multimodal data, especially videos.<!--Will it be Gemini 4 or 5?-->
8,852

edits