Steven Liu
|
4fc5e29ab0
|
[docs] Add pipelines to table (#9282)
update pipelines
|
2024-12-23 13:02:15 +05:30 |
|
Yoach Lacombe
|
030a134311
|
Stable Audio integration (#8716)
* WIP modeling code and pipeline
* add custom attention processor + custom activation + add to init
* correct ProjectionModel forward
* add stable audio to __init__
* add autoencoder and update pipeline and modeling code
* add half Rope
* add partial rotary v2
* add temporary modifications to scheduler
* add EDM DPM Solver
* remove TODOs
* clean GLU
* remove att.group_norm to attn processor
* revert back src/diffusers/schedulers/scheduling_dpmsolver_multistep.py
* refactor GLU -> SwiGLU
* remove redundant args
* add channel multiples in autoencoder docstrings
* changes in docstrings and copyright headers
* clean pipeline
* further cleaning
* remove peft and lora and fromoriginalmodel
* Delete src/diffusers/pipelines/stable_audio/diffusers.code-workspace
* make style
* dummy models
* fix copied from
* add fast oobleck tests
* add brownian tree
* oobleck autoencoder slow tests
* remove TODO
* fast stable audio pipeline tests
* add slow tests
* make style
* add first version of docs
* wrap is_torchsde_available to the scheduler
* fix slow test
* test with input waveform
* add input waveform
* remove some todos
* create stableaudio gaussian projection + make style
* add pipeline to toctree
* fix copied from
* make quality
* refactor timestep_features->time_proj
* refactor joint_attention_kwargs->cross_attention_kwargs
* remove forward_chunk
* move StableAudioDitModel to transformers folder
* correct convert + remove partial rotary embed
* apply suggestions from yiyixuxu -> removing attn.kv_heads
* remove temb
* remove cross_attention_kwargs
* further removal of cross_attention_kwargs
* remove text encoder autocast to fp16
* continue removing autocast
* make style
* refactor how text and audio are embedded
* add paper
* update example code
* make style
* unify projection model forward + fix device placement
* make style
* remove fuse qkv
* apply suggestions from review
* Update src/diffusers/pipelines/stable_audio/pipeline_stable_audio.py
Co-authored-by: YiYi Xu <yixu310@gmail.com>
* make style
* smaller models in fast tests
* pass sequential offloading fast tests
* add docs for vae and autoencoder
* make style and update example
* remove useless import
* add cosine scheduler
* dummy classes
* cosine scheduler docs
* better description of scheduler
---------
Co-authored-by: YiYi Xu <yixu310@gmail.com>
|
2024-12-23 13:02:14 +05:30 |
|
Steven Liu
|
de414618ba
|
[docs] Refactor image quality docs (#7758)
* refactor
* code snippets
* fix path
* fix path in guide
* code outputs
* align toctree title
* title
* fix title
|
2024-12-23 13:02:11 +05:30 |
|
Manuel Brack
|
87c83dd381
|
[Pipeline] Add LEDITS++ pipelines (#6074)
* Setup LEdits++ file structure
* Fix import
* LEditsPP Stable Diffusion pipeline
* Include variable image aspect ratios
* Implement LEDITS++ for SDXL
* clean up LEditsPPPipelineStableDiffusion
* Adjust inversion output
* Added docu, more cleanup for LEditsPPPipelineStableDiffusion
* clean up LEditsPPPipelineStableDiffusionXL
* Update documentation
* Fix documentation import
* Add skeleton IF implementation
* Fix documentation typo
* Add LEDITS docu to toctree
* Add missing title
* Finalize SD documentation
* Finalize SD-XL documentation
* Fix code style and quality
* Fix typo
* Fix return types
* added LEditsPPPipelineIF; minor changes for LEditsPPPipelineStableDiffusion and LEditsPPPipelineStableDiffusionXL
* Fix copy reference
* add documentation for IF
* Add first tests
* Fix batching for SD-XL
* Fix text encoding and perfect reconstruction for SD-XL
* Add tests for SD-XL, minor changes
* move user_mask to correct device, use cross_attention_kwargs also for inversion
* Example docstring
* Fix attention resolution for non-square images
* Refactoring for PR review
* Safely remove ledits_utils.py
* Style fixes
* Replace assertions with ValueError
* Remove LEditsPPPipelineIF
* Remove unnecessary input checks
* Refactoring of CrossAttnProcessor
* Revert unnecessary changes to scheduler
* Remove first progress-bar in inversion
* Refactor scheduler usage and reset
* Use imageprocessor instead of custom logic
* Fix scheduler init warning
* Fix error when running the pipeline in fp16
* Update documentation wrt perfect inversion
* Update tests
* Fix code quality and copy consistency
* Update LEditsPP import
* Remove enable/disable methods that are now in StableDiffusionMixin
* Change import in docs
* Revert import structure change
* Fix ledits imports
---------
Co-authored-by: Katharina Kornmeier <katharina.kornmeier@stud.tu-darmstadt.de>
|
2024-12-23 13:02:09 +05:30 |
|
Sayak Paul
|
86e62ae4c5
|
change to 2024 in the license (#6902)
change to 2024
|
2024-12-23 13:02:07 +05:30 |
|
UmerHA
|
e192ae08d3
|
Add ControlNet-XS support (#5827)
* Check in 23-10-05
* check-in 23-10-06
* check-in 23-10-07 2pm
* check-in 23-10-08
* check-in 231009T1200
* check-in 230109
* checkin 231010
* init + forward run
* checkin
* checkin
* ControlNetXSModel is now saveable+loadable
* Forward works
* checkin
* Pipeline works with `no_control=True`
* checkin
* debug: save intermediate outputs of resnet
* checkin
* Understood time error + fixed connection error
* checkin
* checkin 231106T1600
* turned off detailed debug prints
* time debug logs
* small fix
* Separated control_scale for connections/time
* simplified debug logging
* Full denoising works with control scale = 0
* aligned logs
* Added control_attention_head_dim param
* Passing n_heads instead of dim_head into ctrl unet
* Fixed ctrl midblock bug
* Cleanup
* Fixed time dtype bug
* checkin
* 1. from_unet, 2. base passed, 3. all unet params
* checkin
* Finished docstrings
* cleanup
* make style
* checkin
* more tests pass
* Fixed tests
* removed debug logs
* make style + quality
* make fix-copies
* fixed documentation
* added cnxs to doc toc
* added control start/end param
* Update controlnetxs_sdxl.md
* tried to fix copies..
* Fixed norm_num_groups in from_unet
* added sdxl-depth test
* created SD2.1 controlnet-xs pipeline
* re-added debug logs
* Adjusting group norm ; readded logs
* Added debug log statements
* removed debug logs ; started tests for sd2.1
* updated sd21 tests
* fixed tests
* fixed tests
* slightly increased error tolerance for 1 test
* make style & quality
* Added docs for CNXS-SD
* make fix-copies
* Fixed sd compile test ; fixed gradient ckpointing
* vae downs = cnxs conditioning downs; removed guess
* make style & quality
* Fixed tests
* fixed test
* Incorporated review feedback
* simplified control model surgery
* fixed tests & make style / quality
* Updated docs; deleted pip & cursor files
* Rolled back minimal change to resnet
* Update resnet.py
* Update resnet.py
* Update src/diffusers/models/controlnetxs.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
* Update src/diffusers/models/controlnetxs.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
* Incorporated review feedback
* Update docs/source/en/api/pipelines/controlnetxs_sdxl.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* Update docs/source/en/api/pipelines/controlnetxs.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* Update docs/source/en/api/pipelines/controlnetxs.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* Update docs/source/en/api/pipelines/controlnetxs.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* Update src/diffusers/models/controlnetxs.py
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* Update src/diffusers/models/controlnetxs.py
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* Update src/diffusers/pipelines/controlnet_xs/pipeline_controlnet_xs.py
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* Update docs/source/en/api/pipelines/controlnetxs.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* Update src/diffusers/pipelines/controlnet_xs/pipeline_controlnet_xs_sd_xl.py
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* Incorporated doc feedback
---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>
|
2023-12-06 23:33:47 +01:00 |
|
Steven Liu
|
87a09d66f3
|
[docs] SDXL Turbo (#6065)
api docs
|
2023-12-06 14:33:14 -08:00 |
|
Steven Liu
|
172acc98b9
|
[docs] Update pipeline list (#5952)
add to list
|
2023-11-29 14:08:39 +01:00 |
|
estelleafl
|
5ae3c3a56b
|
[ldm3d] Ldm3d upscaler to community pipeline (#5870)
---------
Co-authored-by: Aflalo <estellea@isl-gpu27.rr.intel.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: YiYi Xu <yixu310@gmail.com>
|
2023-11-28 09:00:39 -10:00 |
|
M. Tolga Cangöz
|
8092017d3f
|
[Docs] Fix typos and update files at API's Pipelines page 1 (#5744)
* Fix typos, update, add Copyright info, and trim trailing whitespace
* Update alt_diffusion.md
* Remove nonoperational demo
* Update docs/source/en/api/pipelines/consistency_models.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* Update docs/source/en/api/pipelines/latent_consistency_models.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
|
2023-11-14 10:36:20 -08:00 |
|
Steven Liu
|
7c3a75a1ce
|
[docs] General updates (#5378)
* first draft
* feedback
* feedback
|
2023-10-24 11:51:55 -07:00 |
|
Sayak Paul
|
a7508a76f0
|
add: pushtohubmixin to pipelines and schedulers docs overview. (#4607)
* add: pushtohubmixin to pipelines and schedulers docs overview.
* Apply suggestions from code review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
|
2023-08-15 22:23:17 +05:30 |
|
camenduru
|
c6ae9b7df6
|
Where did this 'x' come from, Elon? (#4277)
* why mdx?
* why mdx?
* why mdx?
* no x for kandinsky either
---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
|
2023-07-26 18:18:14 +02:00 |
|