NUS researchers' MRAgent framework reduces LLM agent memory retrieval to 118K tokens per query — vs. 3.26M for LangMem — using step-by-step reasoning.
DSpark can make decoding faster, but acceptance quality still determines how much speed the system actually realizes.