Integrate Liger (Linkedin GPU Efficient Runtime) Kernel to Trainer by JasonZhu1313 · Pull Request #32860 · huggingface/transformers

@JasonZhu1313 marked this pull request as draft

August 17, 2024 00:18

ArthurZucker

@JasonZhu1313 JasonZhu1313 changed the title [WIP] Integrate Liger (Linkedin GPU Efficient Runtime) Kernel to Trainer Integrate Liger (Linkedin GPU Efficient Runtime) Kernel to Trainer

Aug 19, 2024

muellerzr

Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
Co-authored-by: Byron Hsu <byronhsu1230@gmail.com>
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
Co-authored-by: Byron Hsu <byronhsu1230@gmail.com>
Co-authored-by: Byron Hsu <byronhsu1230@gmail.com>

@shimizust

@shimizust

@shimizust

helloworld1

@JasonZhu1313

@JasonZhu1313

@JasonZhu1313

@JasonZhu1313

helloworld1

@shimizust

@JasonZhu1313

@JasonZhu1313

@JasonZhu1313

@JasonZhu1313

@shimizust

ByronHsu

BernardZach pushed a commit to BernardZach/transformers that referenced this pull request

Dec 5, 2024
…uggingface#32860)

* add liger integration

* fix syntax

* fix import issue

* add trainer.md

* Use _apply_liger_kernel()

* Fixed log message

* Update docs/source/en/trainer.md

Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>

* Update docs/source/en/trainer.md

Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>

* Update src/transformers/training_args.py

Co-authored-by: Byron Hsu <byronhsu1230@gmail.com>

* Update src/transformers/trainer.py

Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>

* Update src/transformers/training_args.py

Co-authored-by: Byron Hsu <byronhsu1230@gmail.com>

* Update docs/source/en/trainer.md

Co-authored-by: Byron Hsu <byronhsu1230@gmail.com>

* Fixed checkstyle and updated readme

* Added test

* Fixed checkstyle

* fix docstring

* rename use_liger to use_liger_kernel

* Trigger Build

* Added test

* add fix-copies

* Fixed copy inconsistencies

---------

Co-authored-by: shimizust <sshimizu@linkedin.com>
Co-authored-by: Steven Shimizu <shimizust@gmail.com>
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
Co-authored-by: Byron Hsu <byronhsu1230@gmail.com>