wrmedford - Overview

Skip to content

Navigation Menu

Sign in

Appearance settings

Pinned Loading

  1. ETHOS: Efficient Transformers via Hypernetwork-Organized Sparsity

    Jupyter Notebook 7

  2. Scaling Laws for Mixture of Experts Models

    Jupyter Notebook 15 1

  3. Second Generation of Large Language Models

    Python 21 1