HPU support by IlyasMoutawwakil · Pull Request #3378 · huggingface/accelerate
All tests that don't require fp16/fp8 are passing on gaudi1.
I'm slowly reenabling some tests that are now passing on gaudi2 with Synapse 1.20+Pytorch 2.6, like pippy.
I added an explanation of every test skipped on HPU, there are mainly three reasons for skipping:
- unsupported hpu device indexing (hpu:1).
- unsupported empty_cache() op.
- missing bnb support.
one last test that fails with no explanation is test_multi_device_merge_fsdp_weights, for now all I'm seeing is # Synapse detected a device critical error that requires a restart, I can investigate it later outside of the scope of this PR.