DECODE: Overcoming Knowledge Update Inconsistencies in MLLMs

The Challenge of Knowledge Updates in MLLMs

Multimodal Large Language Models (MLLMs) represent a significant step in the evolution of artificial intelligence, combining natural language understanding with the processing of other modalities, such as images. These models are designed to interpret and generate responses based on complex inputs, such as text queries accompanied by images. However, managing and updating their knowledge base, a process known as Knowledge Editing, presents unexpected challenges that can compromise the reliability and consistency of their responses.

An emerging problem, termed "editing decoupling failure," has been identified as a significant obstacle. This phenomenon occurs when updated information in an MLLM, obtained through multimodal inputs (e.g., a text query paired with an image), is not retained if the model is queried with separate unimodal inputs. In practice, the model might provide a correct and updated answer when receiving an image and text together, but revert to outdated or pre-edit facts if presented with only the image or only the text. This inconsistency is particularly critical for enterprise applications that demand precision and consistency in every usage scenario.

Empirical Analysis and the Root Cause

In-depth empirical analysis has revealed the underlying cause of this "editing decoupling failure." Contrary to the assumption that entity knowledge in MLLMs is stored as a unified representation, research suggests that it is instead distributed across distinct, modality-specific pathways. This means that information related to an entity does not reside in a single "location" within the model but is fragmented and handled separately for textual and visual inputs.

As a result, knowledge updates that have been optimized or "biased" towards multimodal queries fail to propagate effectively to the corresponding unimodal circuits. If a Knowledge Editing operation modifies an entity's representation for a combined text-image input, that modification might not correctly reach or influence the representation of the same entity when presented as text only or image only. This architectural disconnection creates a gap in knowledge consistency, making models less predictable and more prone to errors in real-world usage scenarios.

DECODE: A Solution for Consistency

To address this critical gap, a new approach called DECODE has been proposed. This Framework is designed to explicitly disentangle and localize modality-specific neuron groups for knowledge management. DECODE's objective is to ensure that knowledge updates are consistent and propagate correctly across all modalities, regardless of the input type.

Through a series of experiments, DECODE has demonstrated that it consistently achieves effective knowledge updates under different modality triggers. This means that the model, once updated with DECODE, is able to maintain correct knowledge whether it receives multimodal or unimodal inputs. By mitigating "editing decoupling failures," DECODE significantly enhances the reliability and robustness of MLLMs, making them more suitable for applications where response consistency is paramount.

Implications for On-Premise Deployments

For organizations evaluating the deployment of MLLMs in self-hosted or air-gapped environments, the consistency and reliability of knowledge updates are paramount. In contexts where data sovereignty and regulatory compliance are stringent constraints, predictable model behavior is essential. An MLLM that exhibits inconsistencies in its responses depending on the input modality can introduce operational and security risks, as well as complicate model validation and certification.

Solutions like DECODE, which aim to ensure the integrity and consistency of knowledge within models, are therefore crucial for DevOps leads and infrastructure architects. The ability to perform robust and predictable Knowledge Editing reduces the TCO associated with model maintenance and Fine-tuning, minimizing the need for costly full re-training or manual interventions to correct discrepancies. For those evaluating on-premise deployments, AI-RADAR offers analytical frameworks on /llm-onpremise to assess the trade-offs between control, performance, and costs, emphasizing the importance of inherently stable and consistent models.