Paradox-01 (talk | contribs)
** The actual data-holding model, including its parameters and weights. Most often this is a Large Language Model (LLM) or a Large Multimodal Model (LMM). The learned data consist of statistical patterns about text, images, or other media; most of the original raw data cannot be reconstructed from the model.
** Reinforcement learning from human feedback (RLHF), and its successor RLAIF, is another important feature: it added a reward model for higher quality and alignment.
** Other features or milestones, like chain of thought (COT), Mixture of Experts (MoE), context expansion and the use of external tools via MCP to compensate for their own shortcomings, are better described as incremental improvements in the evolution of GenAI.<!--
* GPT = Generative pre-trained transformers (Large Language Models with the actual "learning" part de facto outsourced to humans via Reinforcement learning from human feedback, RLHF). In the best case, GPTs have a ''transplanted base intelligence'', but they lack the important ability to really learn for themselves. Low-quality "synthetic data" can even worsen the models.
:: After the obvious slowdown in advancement through scaling, chain of thought (COT) was introduced. It is also known under the marketing term "reasoning [AI]". (See: Gemini 2.0 Flash Thinking and ChatGPT o3, the latter especially [https://techxplore.com/news/2024-12-ai-human-general-intelligence.html trained to cheat … erm ... score high in the ARC-AGI test].)
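As a purely illustrative sketch of the chain-of-thought idea mentioned above: CoT prompting simply adds an instruction that makes the model emit intermediate reasoning steps before its final answer. The helper function and prompt wording below are hypothetical; no specific LLM API is assumed.

```python
# Minimal illustration of chain-of-thought (CoT) prompting.
# build_prompt is a hypothetical helper; it only constructs the prompt
# string that would be sent to some LLM, it does not call any real API.

def build_prompt(question: str, chain_of_thought: bool) -> str:
    """Wrap a question either as a direct prompt or as a CoT prompt."""
    if chain_of_thought:
        # The extra instruction elicits intermediate reasoning steps,
        # which tends to help on multi-step problems.
        return (f"Q: {question}\n"
                "A: Let's think step by step, then state the final answer.")
    return f"Q: {question}\nA:"

direct = build_prompt("What is 17 * 24?", chain_of_thought=False)
cot = build_prompt("What is 17 * 24?", chain_of_thought=True)
print(cot)
```

The only difference between the two prompts is the appended "think step by step" instruction; the "reasoning" marketed for such models is, at its core, this kind of elicited intermediate text (possibly trained in rather than prompted in).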