GitHub - zjunlp/MemP: MemP: Exploring Agent Procedural Memory

🌻Acknowledgement

Our code is referenced and adapted from Langchain, ETO. And Thanks to ETO provide the trajectory on train set.

🌟Overview

Large Language Models based agents excel at diverse tasks yet they suffer from brittle procedural memory that is manually engineered or entangled in static parameters. In this work we investigate strategies to endow agents with a learnable updatable and lifelong procedural memory. We propose MemP that distills past agent trajectories into both fine grained step by step instructions and higher level script like abstractions and explore the impact of different strategies for Build Retrieval and Update of procedural memory. Coupled with a dynamic regimen that continuously updates corrects and deprecates its contents this repository evolves in lockstep with new experience. Empirical evaluation on TravelPlanner and ALFWorld shows that as the memory repository is refined agents achieve steadily higher success rates and greater efficiency on analogous tasks. Moreover procedural memory built from a stronger model retains its value migrating the procedural memory to a weaker model can also yield substantial performance gains.

In MemP, we support two strategies for building procedural memory: one constructs procedural memory offline using existing trajectories, and the other adopts a self-learning approach, starting from scratch to execute agent tasks online while actively learning procedural memory.

🔧Installation

git clone https://github.com/zjunlp/MemP
cd ProceduralMem
pip install -r requirements.txt

After installed, init neccessary Environment Variables

export OPENAI_API_KEY=YOUR_API_KEY
export OPENAI_API_BASE=YOUR_API_BASE_URL
export EMBEDDING_MODEL_KEY=YOUR_EMBEDDING_MODEL_KEY
export EMBEDDING_MODEL_BASE_URL=YOUR_EMBEDDING_MODEL_BASE_URL

✏️Offline Running

python run_memp_offline.py \
    --model your_model_name \
    --split dev_or_test \
    --batch_size concurrency_num \
    --max_steps n \
    --exp_name save_name \
    --few_shot \
    --use_memory

📝Online Running

python run_memp_online.py \
    --model your_model_name \
    --split dev_or_test \
    --batch_size concurrency_num \
    --max_steps n \
    --exp_name save_name \
    --few_shot \
    --use_memory \
    --overwrite

🚩Citation

If this work is helpful, please kindly cite as: