RecursionError when using enable_sequential_cpu_offload with FP8 wrapper

Using `enable_sequential_cpu_offload()` together with the FP8 optimization wrapper leads to a RecursionError during inference.

This occurs in `videox_fun/utils/fp8_optimization.py`.

## Reproduction
In `notebook.ipynb`:

```python
import torch

# `pipe` is a pipeline that uses the FP8 optimization wrapper
pipe.enable_sequential_cpu_offload()

with torch.no_grad():
    pipe(...)
```
## Error

`RecursionError: maximum recursion depth exceeded`

## Attachments

<img width="1985" height="698" alt="Image" src="https://github.com/user-attachments/assets/0d54007a-c430-4147-b63d-938393246125" />

## Probable cause

The FP8 wrapper (in `videox_fun/utils/fp8_optimization.py`) overrides `forward` and calls `.to()` inside the forward pass.

When this is combined with the `accelerate` hooks used for sequential CPU offloading, the following appears to happen:
- `.to()` triggers a recursive module traversal
- the wrapped forward functions are re-entered
- the re-entry never terminates, leading to infinite recursion

Additionally, `original_forward` may capture an already wrapped forward function, further contributing to the recursion.
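To illustrate the second point, here is a minimal sketch in plain Python (no PyTorch or `accelerate`) of how wrapping `forward` twice can recurse forever when `original_forward` is stored on the module and looked up at call time. `TinyModule`, `wrap_forward_unsafe`, `wrap_forward_guarded`, and the `_fp8_wrapped` flag are hypothetical names made up for this example. I have not confirmed that the actual wrapper in `fp8_optimization.py` is written this way, so treat it as one possible mechanism rather than a diagnosis.

```python
class TinyModule:
    """Hypothetical stand-in for an nn.Module with a forward method."""

    def forward(self, x):
        return x + 1


def wrap_forward_unsafe(module):
    # Stores whatever `forward` currently is, which may already be a wrapper.
    module.original_forward = module.forward

    def wrapped(x):
        # Looked up at call time: after a second wrap this attribute points
        # at the first wrapper, which looks it up again, and so on forever.
        return module.original_forward(x)

    module.forward = wrapped


def wrap_forward_guarded(module):
    # Assumed fix: skip modules that were already wrapped.
    if getattr(module, "_fp8_wrapped", False):
        return
    original_forward = module.forward  # captured once in the closure

    def wrapped(x):
        return original_forward(x)

    module.forward = wrapped
    module._fp8_wrapped = True


def hits_recursion_error(module):
    # Returns True if calling forward raises RecursionError.
    try:
        module.forward(1)
    except RecursionError:
        return True
    return False


unsafe = TinyModule()
wrap_forward_unsafe(unsafe)
wrap_forward_unsafe(unsafe)  # second wrap overwrites original_forward

guarded = TinyModule()
wrap_forward_guarded(guarded)
wrap_forward_guarded(guarded)  # no-op thanks to the flag
```

With this sketch, `hits_recursion_error(unsafe)` is `True`, while `guarded.forward(1)` still returns the unwrapped result. If the real wrapper follows a similar pattern, a guard flag plus capturing the original function in a closure might be enough to avoid the double wrap.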


## Run environment
- Platform: Linux (Ubuntu)
- PyTorch: provided version [req.txt]
- accelerate: provided version [req.txt]

