njhill - Overview

Skip to content

Navigation Menu

Sign in

Appearance settings

Pinned Loading

  1. A high-throughput and memory-efficient inference and serving engine for LLMs

    Python 71.8k 13.9k

  2. IBM development fork of https://github.com/huggingface/text-generation-inference

    Python 63 36

  3. Alternative etcd3 java client

    Java 162 42

  4. Distributed Model Serving Framework

    Java 187 79

  5. Netty project - an event-driven asynchronous network application framework

    Java 34.8k 16.3k

  6. Abstracted helper classes providing consistent key-value store functionality, with zookeeper and etcd3 implementations

    Java 5 2