CleanAgent: Automatically Standardize Your Data with LLM-based Agents
Introduction
CleanAgent is a tool that automates the data standardization process with LLM-based Agents. You only need to upload the table that needs to be standardized, and CleanAgent will get your hands free, standardize your data automatically and return the standardized table to you.
Our demo paper CleanAgent: Automating Data Standardization with LLM-based Agents will be published in the future.
If you think this small project is interesting or helpful to you, please considering give us a star 🌟 🌟 🌟 ! 😃
Usage
Please use CleanAgent with the following commands:
git clone https://github.com/sfu-db/CleanAgent cd CleanAgent conda create -n cleanagent python=3.10.6 conda activate cleanagent export OPENAI_API_KEY="Your_Openai_API_Key" pip install -r requirements.txt chainlit run main.py
Then you can see the web UI of CleanAgent on http://localhost:8000
Demonstration
You also can watch the following 4-mins demo to learn how to use CleanAgent:
Citation
If you think CleanAgent is useful for you, please considering to cite our CleanAgent:
@article{qi2024cleanagent, title={CleanAgent: Automating data standardization with LLM-based agents}, author={Qi, Danrui and Miao, Zhengjie and Wang, Jiannan}, journal={arXiv preprint arXiv:2403.08291}, year={2024} }
