Posts for: #Moe-Jepa

Self-Rag

**

Advancing Agentic Knowledgeable Self-Awareness: A Research Agenda Extending arXiv:2504.03553

1. Introduction

The development of artificial intelligence (AI) agents capable of complex tasks necessitates mechanisms for robust and efficient knowledge utilization. A critical aspect of this is self-awareness regarding the agent’s own knowledge state – understanding what it knows, what it doesn’t know, and when external information is required. The paper arXiv:2504.03553 introduces the concept of “agentic knowledgeable self-awareness” and proposes the “KnowSelf” method as a novel approach to instill this capability in language agents. KnowSelf utilizes special tokens and a two-stage training process to explicitly signal the agent’s perceived knowledge state and guide its information processing strategy (e.g., relying on internal parameters vs. seeking external knowledge).

[Read more]

MoE-JEPA

Research Proposal: MoE-JEPA World Models for Efficient Reinforcement Learning and Planning

Abstract

Current AI research emphasizes the development of sophisticated world models capable of understanding complex dynamics, particularly from video data, often leveraging self-supervised learning (SSL) for representation extraction. Predictive models in abstract spaces (like JEPA) are gaining prominence over generative ones. Simultaneously, Mixture of Experts (MoE) offers a way to scale neural network capacity efficiently. This proposal outlines a research approach combining these trends: developing an Action-Conditioned Mixture-of-Experts Joint-Embedding Predictive Architecture (MoE-JEPA) world model. This model will be pre-trained using self-supervision on large video datasets to learn robust visual representations and environment dynamics. The MoE structure will allow the model to efficiently capture diverse or multi-modal dynamics within an environment by routing inputs to specialized expert sub-networks. This sophisticated world model will then be integrated into a model-based Reinforcement Learning (RL) framework to enable efficient planning and decision-making for agents (e.g., robots) interacting with complex environments. We hypothesize that this approach will lead to more accurate world models, improved sample efficiency in RL, and better generalization across tasks compared to monolithic world models.

[Read more]