amindadgar - Overview

Hi there šŸ˜ƒšŸ‘‹

I'm Amin — nice to meet you!

I’m an AI Engineer at TogetherCrew, where I build data pipelines and LLM-powered systems to support decentralized online communities.


šŸ’¼ What I Work On

  • Designing retrieval-augmented generation (RAG) systems using llama-index, with persistent caching, data deduplication, and relevance-based embedding logic.
  • Creating ETL pipelines that handle large-scale community data across multiple platforms using Apache Airflow and Temporal. I implement optimizations like indexing only the latest data and deduplicating via hash comparison.
  • Evaluating LLM outputs with custom RAG evaluation metrics including coverage, relevance, and confidence scoring.
  • Developing LLM agents using LangChain and CrewAI for orchestrated multi-agent tasks.
  • Continuously improving data quality and pipeline efficiency by investigating upstream community behavior data and pushing for tighter integration across microservices.

🧠 Projects

šŸ”¹ Hivemind-Bot — RAG System, LLM

Message-driven system that performs retrieval over embedded organizational data to generate LLM-based responses. Communicates with other services via broker queues.

šŸ”¹ Hivemind-ETL — ETL, Caching

ETL DAGs using Apache Airflow for data embedding and summarization. Includes caching mechanisms to avoid redundant embeddings and indexing strategies based on timestamped persistence.

šŸ”¹ Temporal Worker — Workflow Orchestration & Data Processing

Implemented scalable, fault-tolerant workflows using Temporal for orchestrating asynchronous data processing tasks. Features include data deduplication via hashing, ETL orchestration, message brokering, and seamless integration with microservices for real-time event-driven pipelines.

šŸ”¹ Violation Detection — LLM Classification

Fine-tuned a custom LLM to detect community violations in messages. Built pipelines for classification and automated reporting.

šŸ”¹ Agents Workflow — CrewAI, LangChain

Developed multi-agent LLM apps using CrewAI and LangChain, focused on community data analysis and dynamic decision-making tasks.

šŸ”¹ TC-Analyzer — Analytics Library

Python library for behavioral analytics on community members. Features graph analytics and activity-based segmentation.


🧰 Tech Stack

  • Languages: Python, LaTeX
  • Databases: Qdrant, PostgreSQL, Neo4j, MongoDB
  • Messaging & Workflow: RabbitMQ, Apache Airflow, Temporal
  • Frameworks & Tools: Docker, Flask, llama-index, LangChain, CrewAI, Git

šŸ™Œ Volunteer Work

  • Co-Founder, AI Community Group (Nov 2024 – Present) Host a weekly AI series covering topics like LLMs, RAG pipelines, agent systems, and prompt engineering. 🌐 Website

  • Co-Founder, Cassandra AI Group (Oct 2021 – Oct 2023) Ran an academic AI community with a focus on accessible ML research and student support. Organized workshops, study sessions, and a two-day conference. 🌐 Website | YouTube | GitHub


šŸ“« Get in Touch


šŸ“Š GitHub Stats

Contribution Stats


✨ A Quote to Remember

"After everything, what remains is kindness — so don't hesitate to help others." 😊