Hi there šš
I'm Amin ā nice to meet you!
Iām an AI Engineer at TogetherCrew, where I build data pipelines and LLM-powered systems to support decentralized online communities.
š¼ What I Work On
- Designing retrieval-augmented generation (RAG) systems using
llama-index, with persistent caching, data deduplication, and relevance-based embedding logic. - Creating ETL pipelines that handle large-scale community data across multiple platforms using
Apache AirflowandTemporal. I implement optimizations like indexing only the latest data and deduplicating via hash comparison. - Evaluating LLM outputs with custom RAG evaluation metrics including coverage, relevance, and confidence scoring.
- Developing LLM agents using
LangChainandCrewAIfor orchestrated multi-agent tasks. - Continuously improving data quality and pipeline efficiency by investigating upstream community behavior data and pushing for tighter integration across microservices.
š§ Projects
š¹ Hivemind-Bot ā RAG System, LLM
Message-driven system that performs retrieval over embedded organizational data to generate LLM-based responses. Communicates with other services via broker queues.
š¹ Hivemind-ETL ā ETL, Caching
ETL DAGs using Apache Airflow for data embedding and summarization. Includes caching mechanisms to avoid redundant embeddings and indexing strategies based on timestamped persistence.
š¹ Temporal Worker ā Workflow Orchestration & Data Processing
Implemented scalable, fault-tolerant workflows using Temporal for orchestrating asynchronous data processing tasks. Features include data deduplication via hashing, ETL orchestration, message brokering, and seamless integration with microservices for real-time event-driven pipelines.
š¹ Violation Detection ā LLM Classification
Fine-tuned a custom LLM to detect community violations in messages. Built pipelines for classification and automated reporting.
š¹ Agents Workflow ā CrewAI, LangChain
Developed multi-agent LLM apps using CrewAI and LangChain, focused on community data analysis and dynamic decision-making tasks.
š¹ TC-Analyzer ā Analytics Library
Python library for behavioral analytics on community members. Features graph analytics and activity-based segmentation.
š§° Tech Stack
- Languages: Python, LaTeX
- Databases: Qdrant, PostgreSQL, Neo4j, MongoDB
- Messaging & Workflow: RabbitMQ, Apache Airflow, Temporal
- Frameworks & Tools: Docker, Flask, llama-index, LangChain, CrewAI, Git
š Volunteer Work
-
Co-Founder, AI Community Group (Nov 2024 ā Present) Host a weekly AI series covering topics like LLMs, RAG pipelines, agent systems, and prompt engineering. š Website
-
Co-Founder, Cassandra AI Group (Oct 2021 ā Oct 2023) Ran an academic AI community with a focus on accessible ML research and student support. Organized workshops, study sessions, and a two-day conference. š Website | YouTube | GitHub
š« Get in Touch
- āļø Email: dadgaramin96@gmail.com
- š¼ LinkedIn: linkedin.com/in/mramin22
- š¬ Discord: mramin22#1669
- š¦ Twitter: @mramin22
š GitHub Stats
⨠A Quote to Remember
"After everything, what remains is kindness ā so don't hesitate to help others." š