Update `clipdrop-main` by clementchadebec · Pull Request #4 · initml/diffusers

clementchadebec · 2024-06-12T08:53:11Z

What does this PR do?

Update clipdrop-main branch with diffusers main branch
All tests passed

* pag_initial * pag_docs * edit_docs * custom * typo * delete_docs * whitespace * make style --------- Co-authored-by: Sayak Paul <[email protected]>

* Add `final_sigma_zero` to UniPCMultistep Effectively the same trick as DDIM's `set_alpha_to_one` and DPM's `final_sigma_type='zero'`. Currently False by default but maybe this should be True? * `final_sigma_zero: bool` -> `final_sigmas_type: str` Should 1:1 match DPM Multistep now. * Set `final_sigmas_type='sigma_min'` in UniPC UTs

* fix ip adapter support * Update sag pipelines tests, adjust sag pipeline to pass tests --------- Co-authored-by: YiYi Xu <[email protected]>

…arigold v0.1.5 (huggingface#7524) * add resample option; check denoise_step; update ckpt path * Add seeding in pipeline to increase reproducibility * fix typo * fix typo

update

* Fix SVD bug (shape of `time_context`) * Formatting code * Formatting src/diffusers/models/transformers/transformer_temporal.py by `make style && make quality` --------- Co-authored-by: kevinkhwu <[email protected]> Co-authored-by: Sayak Paul <[email protected]> Co-authored-by: Dhruv Nair <[email protected]>

fix

* add HD-Painter pipeline * style fixing * refactor, change doc, fix ruff * fix docs * used correct ruff version --------- Co-authored-by: Hayk Manukyan <[email protected]> Co-authored-by: Sayak Paul <[email protected]>

* add from_pipe --------- Co-authored-by: yiyixuxu <yixu310@gmail,com> Co-authored-by: Sayak Paul <[email protected]> Co-authored-by: Dhruv Nair <[email protected]>

…ggingface#7550) * initial-commit pipeline created * updated README.md

update

* make nightly workflow dispatchable. * add a note about running the release tests to setup.py

* remove class assignments for linear and conv. * fix: self.nn

* start printing the tensors. * print full throttle * set static slices for 7 tests. * remove printing. * flatten * disable test for controlnet * what happens when things are seeded properly? * set the right value * style./ * make pia test fail to check things * print. * fix pia. * checking for animatediff. * fix: animatediff. * video synthesis * final piece. * style. * print guess. * fix: assertion for control guess. --------- Co-authored-by: Dhruv Nair <[email protected]>

* 7529 do not disable autocast for cuda devices * Remove typecasting error check for non-mps platforms, as a correct autocast implementation makes it a non-issue * add autocast fix to other training examples * disable native_amp for dreambooth (sdxl) * disable native_amp for pix2pix (sdxl) * remove tests from remaining files * disable native_amp on huggingface accelerator for every training example that uses it * convert more usages of autocast to nullcontext, make style fixes * make style fixes * style. * Empty-Commit --------- Co-authored-by: bghira <[email protected]> Co-authored-by: Sayak Paul <[email protected]>

* add: utility to format our docs too 📜 * debugging saga * fix: message * checking * should be fixed. * revert pipeline_fixture * remove empty line * make style * fix: setup.py * style.

* UniPC UTs iterate solvers on FP16 It wasn't catching errs on order==3. Might be excessive? * UniPC Multistep fix tensor dtype/device on order=3 * UniPC UTs Add v_pred to fp16 test iter For completions sake. Probably overkill?

* UniPC Multistep add `rescale_betas_zero_snr` Same patch as DPM and Euler with the patched final alpha cumprod BF16 doesn't seem to break down, I think cause UniPC upcasts during some phases already? We could still force an upcast since it only loses ≈ 0.005 it/s for me but the difference in output is very small. A better endeavor might upcasting in step() and removing all the other upcasts elsewhere? * UniPC ZSNR UT * Re-add `rescale_betas_zsnr` doc oops

…face#7491) * refactor transformers 2d into multiple legacy variants. * fix: init. * fix recursive init. * add inits. * make transformer block creation more modular. * complete refactor. * remove forward * debug * remove legacy blocks and refactor within the module itself. * remove print * guard caption projection * remove fetcher. * reduce the number of args. * fix: norm_type * group variables that are shared. * remove _get_transformer_blocks * harmonize the init function signatures. * transformer_blocks to common * repeat .

* increase number of workers for the tests. * move to beefier runner. * improve the fast push tests too. * use a beefy machine for pytorch pipeline tests * up the number of workers further.

* Update pipeline_animatediff_video2video.py * commit with test for whether latent input can be passed into animatediffvid2vid

* Skip `test_freeu_enabled ` on MPS * Small fixes - import skip_mps correctly - disable all instances of test_freeu_enabled * Empty commit to trigger tests * Empty commit to trigger CI

* reduce block sizes for unet1d. * reduce blocks for unet_2d. * reduce block size for unet_motion * increase channels. * correctly increase channels. * reduce number of layers in unet2dconditionmodel tests. * reduce block sizes for unet2dconditionmodel tests * reduce block sizes for unet3dconditionmodel. * fix: test_feed_forward_chunking * fix: test_forward_with_norm_groups * skip spatiotemporal tests on MPS. * reduce block size in AutoencoderKL. * reduce block sizes for vqmodel. * further reduce block size. * make style. * Empty-Commit * reduce sizes for ConsistencyDecoderVAETests * further reduction. * further block reductions in AutoencoderKL and AssymetricAutoencoderKL. * massively reduce the block size in unet2dcontionmodel. * reduce sizes for unet3d * fix tests in unet3d. * reduce blocks further in motion unet. * fix: output shape * add attention_head_dim to the test configuration. * remove unexpected keyword arg * up a bit. * groups. * up again * fix

add set_begin_index for all if pipelines

* add audioldm2 tts * change gpt2 max new tokens * remove unnecessary pipeline and class * add TTS to AudioLDM2Pipeline * add TTS docs * delete unnecessary file * remove unnecessary import * add audioldm2 slow testcase * fix code quality * remove AudioLDMLearnablePositionalEmbedding * add variable check vits encoder * add use_learned_position_embedding --------- Co-authored-by: Dhruv Nair <[email protected]>

) Allow safety and feature extractor arguments to be passed to convert_from_ckpt Allows management of safety checker and feature extractor from outside of the convert ckpt class. Co-authored-by: Sayak Paul <[email protected]>

* Restore unet params back to normal from EMA when validation call is finished * empty commit --------- Co-authored-by: Sayak Paul <[email protected]>

* disable test * update --------- Co-authored-by: yiyixuxu <yixu310@gmail,com>

* Support multiimage masking --------- Co-authored-by: Sayak Paul <[email protected]> Co-authored-by: YiYi Xu <[email protected]>

* Hunyuan Team: add HunyuanDiT related updates --------- Co-authored-by: XCLiu <[email protected]> Co-authored-by: yiyixuxu <[email protected]>

* add hunyuandit doc * update hunyuandit doc * update hunyuandit 2d model * update toctree.yml for hunyuandit

* update * update * update * update

* Update transformer2d.md title For the other classes (e.g., UNet2DModel) the title of the documentation coincides with the name of the class, but that was not the case for Transformer2DModel. * Update model docs titles for consistency with class names

minor docs changes in hunyuandit

…ngface#8370) * handle norm_type of transformer2d_model safely. * log an info when old model class is being returned. * Apply suggestions from code review Co-authored-by: Dhruv Nair <[email protected]> * remove extra stuff --------- Co-authored-by: Dhruv Nair <[email protected]>

…STRING (huggingface#8401) Update code example in pipeline_stable_unclip_img2img.py Previous code caused an error when run

huggingface#8396) * feat: introduce qkv fusion for Hunyuan * fix copies

feat: support chunked ff.

) * remove legacy code from load_attn_procs. * finish first draft * fix more. * fix more * add test * add serialization support. * fix-copies * require peft backend for lora tests * style * fix test * fix loading. * empty * address benjamin's feedback.

…e#8399) * allow hunyuan dit to run under 6GB for GPU VRAM * add section in the docs/

…uggingface#8385) * fix: euledm when using the exp sigma schedule. * fix-copies * remove print. * reduce friction * yiyi's suggestioms

* add training code of gligen * fix code quality tests. --------- Co-authored-by: Sayak Paul <[email protected]>

* Fix typos * Trim trailing whitespaces * Remove a trailing whitespace * chore: Update MarigoldDepthPipeline checkpoint to prs-eth/marigold-lcm-v1-0 * Revert "chore: Update MarigoldDepthPipeline checkpoint to prs-eth/marigold-lcm-v1-0" This reverts commit fd742b3. * pokemon -> naruto * `DPMSolverMultistep` -> `DPMSolverMultistepScheduler` * Improve Markdown stylization * Improve style * Improve style * Refactor pipeline variable names for consistency * up style

…uggingface#8402) * optimizations to the hunyuan dit docs. * Apply suggestions from code review Co-authored-by: Steven Liu <[email protected]> * Update docs/source/en/api/pipelines/hunyuandit.md Co-authored-by: Steven Liu <[email protected]> --------- Co-authored-by: Steven Liu <[email protected]>

* fix: legacy model mapping * remove print

* single file usage * edit

* Refactor code to remove unnecessary calls to `to(torch_device)` * Refactor code to remove unnecessary calls to `to("cuda")` * Update pipeline_stable_diffusion_diffedit.py

* first draft * secret * tiktok * capital matters * dataset matter * don't be a prick * refact * only on main or tag * document with an example * Update destination dataset * link * allow manual trigger * better * lin --------- Co-authored-by: Sayak Paul <[email protected]>

…#7830) * feat: support saving a model in sharded checkpoints. * feat: make loading of sharded checkpoints work. * add tests * cleanse the loading logic a bit more. * more resilience while loading from the Hub. * parallelize shard downloads by using snapshot_download()/ * default to a shard size. * more fix * Empty-Commit * debug * fix * uality * more debugging * fix more * initial comments from Benjamin * move certain methods to loading_utils * add test to check if the correct number of shards are present. * add a test to check if loading of sharded checkpoints from the Hub is okay * clarify the unit when passed as an int. * use hf_hub for sharding. * remove unnecessary code * remove unnecessary function * lucain's comments. * fixes * address high-level comments. * fix test * subfolder shenanigans./ * Update src/diffusers/utils/hub_utils.py Co-authored-by: Lucain <[email protected]> * Apply suggestions from code review Co-authored-by: Lucain <[email protected]> * remove _huggingface_hub_version as not needed. * address more feedback. * add a test for local_files_only=True/ * need hf hub to be at least 0.23.2 * style * final comment. * clean up subfolder. * deal with suffixes in code. * _add_variant default. * use weights_name_pattern * remove add_suffix_keyword * clean up downloading of sharded ckpts. * don't return something special when using index.json * fix more * don't use bare except * remove comments and catch the errors better * fix a couple of things when using is_file() * empty --------- Co-authored-by: Lucain <[email protected]>

* Move away from * unused constant * Add custom error

HyoungwonCho and others added 30 commits March 30, 2024 10:52

Perturbed-Attention Guidance (huggingface#7512)

9d20ed3

* pag_initial * pag_docs * edit_docs * custom * typo * delete_docs * whitespace * make style --------- Co-authored-by: Sayak Paul <[email protected]>

Fix IP Adapter Support for SAG Pipeline (huggingface#7260)

ca61287

* fix ip adapter support * Update sag pipelines tests, adjust sag pipeline to pass tests --------- Co-authored-by: YiYi Xu <[email protected]>

[Community pipeline] Marigold depth estimation update -- align with m…

c2e8786

…arigold v0.1.5 (huggingface#7524) * add resample option; check denoise_step; update ckpt path * Add seeding in pipeline to increase reproducibility * fix typo * fix typo

Fix typo in CPU offload test (huggingface#7542)

7aa4514

update

fix the cpu offload tests (huggingface#7544)

7f724a9

fix

add HD-Painter pipeline (huggingface#7520)

5266ab7

* add HD-Painter pipeline * style fixing * refactor, change doc, fix ruff * fix docs * used correct ruff version --------- Co-authored-by: Hayk Manukyan <[email protected]> Co-authored-by: Sayak Paul <[email protected]>

add a from_pipe method to DiffusionPipeline (huggingface#7241)

7956c36

* add from_pipe --------- Co-authored-by: yiyixuxu <yixu310@gmail,com> Co-authored-by: Sayak Paul <[email protected]> Co-authored-by: Dhruv Nair <[email protected]>

[Community pipeline] SDXL Differential Diffusion Img2Img Pipeline (hu…

73ba810

…ggingface#7550) * initial-commit pipeline created * updated README.md

Fix FreeU tests (huggingface#7540)

5d21d4a

update

[Release tests] make nightly workflow dispatchable. (huggingface#7541)

5d83f50

* make nightly workflow dispatchable. * add a note about running the release tests to setup.py

[Chore] remove class assignments for linear and conv. (huggingface#7553)

000fa82

* remove class assignments for linear and conv. * fix: self.nn

add: utility to format our docs too 📜 (huggingface#7314)

4a34307

* add: utility to format our docs too 📜 * debugging saga * fix: message * checking * should be fixed. * revert pipeline_fixture * remove empty line * make style * fix: setup.py * style.

[Chore] increase number of workers for the tests. (huggingface#7558)

ad55ce6

* increase number of workers for the tests. * move to beefier runner. * improve the fast push tests too. * use a beefy machine for pytorch pipeline tests * up the number of workers further.

Update pipeline_animatediff_video2video.py (huggingface#7457)

35db2fd

* Update pipeline_animatediff_video2video.py * commit with test for whether latent input can be passed into animatediffvid2vid

Skip test_freeu_enabled on MPS (huggingface#7570)

71f49a5

* Skip `test_freeu_enabled ` on MPS * Small fixes - import skip_mps correctly - disable all instances of test_freeu_enabled * Empty commit to trigger tests * Empty commit to trigger CI

[IF| add set_begin_index for all IF pipelines (huggingface#7577)

6133d98

add set_begin_index for all if pipelines

[Docs] fix bugs in callback docs (huggingface#7594)

7e808e7

Add missing restore() EMA call in train SDXL script (huggingface#7599)

8e46d97

* Restore unet params back to normal from EMA when validation call is finished * empty commit --------- Co-authored-by: Sayak Paul <[email protected]>

disable test_conversion_when_using_device_map (huggingface#7620)

a341b53

* disable test * update --------- Co-authored-by: yiyixuxu <yixu310@gmail,com>

Multi-image masking for single IP Adapter (huggingface#7499)

a0cf607

* Support multiimage masking --------- Co-authored-by: Sayak Paul <[email protected]> Co-authored-by: YiYi Xu <[email protected]>

gnobitab and others added 28 commits June 1, 2024 12:41

Tencent Hunyuan Team: add HunyuanDiT related updates (huggingface#8240)

4136044

* Hunyuan Team: add HunyuanDiT related updates --------- Co-authored-by: XCLiu <[email protected]> Co-authored-by: yiyixuxu <[email protected]>

Tencent Hunyuan Team - Updated Doc for HunyuanDiT (huggingface#8383)

174cf86

* add hunyuandit doc * update hunyuandit doc * update hunyuandit 2d model * update toctree.yml for hunyuandit

Update slow test actions (huggingface#8381)

4d633bf

* update * update * update * update

Fix AsymmetricAutoencoderKL forward (huggingface#8378)

6be43bd

[HunyuanDiT] minor docs changes in hunyuandit (huggingface#8395)

3ff39e8

minor docs changes in hunyuandit

Update code example in pipeline_stable_unclip_img2img.py EXAMPLE_DOC_…

07cd200

…STRING (huggingface#8401) Update code example in pipeline_stable_unclip_img2img.py Previous code caused an error when run

[Hunyuan DiT] feat: enable fusing qkv projections when doing attention (

14f7b54

huggingface#8396) * feat: introduce qkv fusion for Hunyuan * fix copies

[Hunyuan] feat: support chunked ff. (huggingface#8397)

a8ad666

feat: support chunked ff.

[Hunyuan] allow Hunyuan DiT to run under 6GB for GPU VRAM (huggingfac…

2f6f426

…e#8399) * allow hunyuan dit to run under 6GB for GPU VRAM * add section in the docs/

[Scheduler] fix: EDM schedulers when using the exp sigma schedule. (h…

48207d6

…uggingface#8385) * fix: euledm when using the exp sigma schedule. * fix-copies * remove print. * reduce friction * yiyi's suggestioms

Gligen training (huggingface#7906)

d3881f3

* add training code of gligen * fix code quality tests. --------- Co-authored-by: Sayak Paul <[email protected]>

Update tailscale action to main (huggingface#8403)

7ebd359

[Core] fix: legacy model mapping (huggingface#8416)

a3faf3f

* fix: legacy model mapping * remove print

[docs] Single file usage (huggingface#8412)

151a56b

* single file usage * edit

Optimize test files by fixing CPU-offloading usage (huggingface#8409)

ec1aded

* Refactor code to remove unnecessary calls to `to(torch_device)` * Refactor code to remove unnecessary calls to `to("cuda")` * Update pipeline_stable_diffusion_diffedit.py

Fix mirror_community_pipeline.yml name (huggingface#8425)

5fd6825

Fix mirror community pipeline (huggingface#8426)

716b206

Final fix for mirror community pipeline (huggingface#8427)

b63c956

Move away from cached_download (huggingface#8419)

0d68ddf

* Move away from * unused constant * Add custom error

feat(ci): add trufflehog secrets detection (huggingface#8430)

83bc6c9

Merge branch 'main' into clement/update/merge-main

9578d7e

clementchadebec requested a review from benjaminaubin June 12, 2024 08:53

clementchadebec merged commit bf2578e into clipdrop-main Jul 3, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update `clipdrop-main`#4

Update `clipdrop-main`#4
clementchadebec merged 539 commits intoclipdrop-mainfrom
clement/update/merge-main

clementchadebec commented Jun 12, 2024 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

20 participants

Conversation

clementchadebec commented Jun 12, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

20 participants

clementchadebec commented Jun 12, 2024 •

edited

Loading