diffusers

Author	SHA1	Message	Date
David Steinberg	d8d8e86924	Fix a dead link (#9116 ) Co-authored-by: Aryan <aryan@huggingface.co>	2024-12-23 13:02:15 +05:30
zR	dbf5d348e6	Add CogVideoX text-to-video generation model (#9082 ) * add CogVideoX --------- Co-authored-by: Aryan <aryan@huggingface.co> Co-authored-by: sayakpaul <spsayakpaul@gmail.com> Co-authored-by: Aryan <contact.aryanvs@gmail.com> Co-authored-by: yiyixuxu <yixu310@gmail.com> Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2024-12-23 13:02:15 +05:30
latentCall145	f771be1d7b	Flux fp16 inference fix (#9097 ) * clipping for fp16 * fix typo * added fp16 inference to docs * fix docs typo * include link for fp16 investigation --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-12-23 13:02:15 +05:30
Álvaro Somoza	3510d0ef5e	[Kolors] Add PAG (#8934 ) * txt2img pag added * autopipe added, fixed case * style * apply suggestions * added fast tests, added todo tests * revert dummy objects for kolors * fix pag dummies * fix test imports * update pag tests * add kolor pag to docs --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-12-23 13:02:15 +05:30
Dhruv Nair	47874e837d	[Single File] Add single file support for Flux Transformer (#9083 ) * update * update * update --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-12-23 13:02:15 +05:30
Ahn Donghoon (안동훈 / suno)	f25823781d	add PAG support for Stable Diffusion 3 (#8861 ) add pag sd3 --------- Co-authored-by: HyoungwonCho <jhw9811@korea.ac.kr> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: crepejung00 <jaewoojung00@naver.com> Co-authored-by: YiYi Xu <yixu310@gmail.com> Co-authored-by: Aryan <contact.aryanvs@gmail.com> Co-authored-by: Aryan <aryan@huggingface.co>	2024-12-23 13:02:15 +05:30
Aryan	9dbffc8c60	PAG variant for HunyuanDiT, PAG refactor (#8936 ) * copy hunyuandit pipeline * pag variant of hunyuan dit * add tests * update docs * make style * make fix-copies * Update src/diffusers/pipelines/pag/pag_utils.py * remove incorrect copied from * remove pag hunyuan attn procs to resolve conflicts * add pag attn procs again * new implementation for pag_utils * revert pag changes * add pag refactor back; update pixart sigma * update pixart pag tests * apply suggestions from review Co-Authored-By: yixu310@gmail.com * make style * update docs, fix tests * fix tests * fix test_components_function since list not accepted as valid __init__ param * apply patch to fix broken tests Co-Authored-By: Sayak Paul <spsayakpaul@gmail.com> * make style * fix hunyuan tests --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-12-23 13:02:15 +05:30
Sayak Paul	0db81141b9	[Flux] minor documentation fixes for flux. (#9048 ) * minor documentation fixes for flux. * clipskip * add gist	2024-12-23 13:02:15 +05:30
Tolga Cangöz	c6ac793955	Errata: Fix typos & `\s+$` (#9008 ) * Fix typos * chore: Fix typos * chore: Update README.md for promptdiffusion example * Trim trailing white spaces * Fix a typo * update number * chore: update number * Trim trailing white space * Update README.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update README.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2024-12-23 13:02:14 +05:30
Sayak Paul	c8a236ba5c	[Core] Add PAG support for PixArtSigma (#8921 ) * feat: add pixart sigma pag. * inits. * fixes * fix * remove print. * copy paste methods to the pixart pag mixin * fix-copies * add documentation. * add tests. * remove correction file. * remove pag_applied_layers * empty	2024-12-23 13:02:14 +05:30
Sayak Paul	7739beb740	Flux pipeline (#9043 ) add flux! Signed-off-by: Adrien <adrien@huggingface.co> Co-authored-by: Adrien <adrien.69740@gmail.com> Co-authored-by: Anatoly Belikov <abelikov@singularitynet.io> Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com> Co-authored-by: yiyixuxu <yixu310@gmail.com>	2024-12-23 13:02:14 +05:30
Aryan	e28e5373f9	PAG variant for AnimateDiff (#8789 ) * add animatediff pag pipeline * remove unnecessary print * make fix-copies * fix ip-adapter bug * update docs * add fast tests and fix bugs * update * update * address review comments * update ip adapter single test expected slice * implement test_from_pipe_consistent_config; fix expected slice values * LoraLoaderMixin->StableDiffusionLoraLoaderMixin; add latest freeinit test	2024-12-23 13:02:14 +05:30
Aryan	cf513e4205	[core] Move community AnimateDiff ControlNet to core (#8972 ) * add animatediff controlnet to core * make style; remove unused method * fix copied from comment * add tests * changes to make tests work * add utility function to load videos * update docs * update pipeline example * make style * update docs with example * address review comments * add latest freeinit test from #8969 * LoraLoaderMixin -> StableDiffusionLoraLoaderMixin * fix docs * Update src/diffusers/utils/loading_utils.py Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com> * fix: variable out of scope --------- Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2024-12-23 13:02:14 +05:30
Yoach Lacombe	030a134311	Stable Audio integration (#8716 ) * WIP modeling code and pipeline * add custom attention processor + custom activation + add to init * correct ProjectionModel forward * add stable audio to __initèè * add autoencoder and update pipeline and modeling code * add half Rope * add partial rotary v2 * add temporary modfis to scheduler * add EDM DPM Solver * remove TODOs * clean GLU * remove att.group_norm to attn processor * revert back src/diffusers/schedulers/scheduling_dpmsolver_multistep.py * refactor GLU -> SwiGLU * remove redundant args * add channel multiples in autoencoder docstrings * changes in docsrtings and copyright headers * clean pipeline * further cleaning * remove peft and lora and fromoriginalmodel * Delete src/diffusers/pipelines/stable_audio/diffusers.code-workspace * make style * dummy models * fix copied from * add fast oobleck tests * add brownian tree * oobleck autoencoder slow tests * remove TODO * fast stable audio pipeline tests * add slow tests * make style * add first version of docs * wrap is_torchsde_available to the scheduler * fix slow test * test with input waveform * add input waveform * remove some todos * create stableaudio gaussian projection + make style * add pipeline to toctree * fix copied from * make quality * refactor timestep_features->time_proj * refactor joint_attention_kwargs->cross_attention_kwargs * remove forward_chunk * move StableAudioDitModel to transformers folder * correct convert + remove partial rotary embed * apply suggestions from yiyixuxu -> removing attn.kv_heads * remove temb * remove cross_attention_kwargs * further removal of cross_attention_kwargs * remove text encoder autocast to fp16 * continue removing autocast * make style * refactor how text and audio are embedded * add paper * update example code * make style * unify projection model forward + fix device placement * make style * remove fuse qkv * apply suggestions from review * Update src/diffusers/pipelines/stable_audio/pipeline_stable_audio.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * make style * smaller models in fast tests * pass sequential offloading fast tests * add docs for vae and autoencoder * make style and update example * remove useless import * add cosine scheduler * dummy classes * cosine scheduler docs * better description of scheduler --------- Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-12-23 13:02:14 +05:30
Sayak Paul	3566f4b18a	[Docs] credit where it's due for Lumina and Latte. (#9000 ) credit where it's due for Lumina and Latte.	2024-12-23 13:02:14 +05:30
Álvaro Somoza	edddf3d417	[Kolors] Add IP Adapter (#8901 ) * initial draft * apply suggestions * fix failing test * added ipa to img2img * add docs * apply suggestions	2024-12-23 13:02:14 +05:30
Aryan	b7ddd2bb99	[core] AnimateDiff SparseCtrl (#8897 ) * initial sparse control model draft * remove unnecessary implementation * copy animatediff pipeline * remove deprecated callbacks * update * update pipeline implementation progress * make style * make fix-copies * update progress * add partially working pipeline * remove debug prints * add model docs * dummy objects * improve motion lora conversion script * fix bugs * update docstrings * remove unnecessary model params; docs * address review comment * add copied from to zero_module * copy animatediff test * add fast tests * update docs * update * update pipeline docs * fix expected slice values * fix license * remove get_down_block usage * remove temporal_double_self_attention from get_down_block * update * update docs with org and documentation images * make from_unet work in sparsecontrolnetmodel * add latest freeinit test from #8969 * make fix-copies * LoraLoaderMixin -> StableDiffsuionLoraLoaderMixin	2024-12-23 13:02:14 +05:30
Aryan	0f2c512fb6	[docs] pipeline docs for latte (#8844 ) * add pipeline docs for latte * add inference time to latte docs * apply review suggestions	2024-12-23 13:02:14 +05:30
Nguyễn Công Tú Anh	adcd3682bf	add PAG support sd15 controlnet (#8820 ) * add pag support sd15 controlnet * fix quality import * remove unecessary import * remove if state * fix tests * remove useless function * add sd1.5 controlnet pag docs --------- Co-authored-by: anhnct8 <anhnct8@fpt.com>	2024-12-23 13:02:14 +05:30
Sayak Paul	0ace726d8a	[Docs] add AuraFlow docs (#8851 ) * add pipeline documentation. * add api spec for pipeline * model documentation * model spec	2024-12-23 13:02:14 +05:30
Dhruv Nair	c166a0a90d	Add single file loading support for AnimateDiff (#8819 ) * update * update * update * update	2024-12-23 13:02:14 +05:30
Álvaro Somoza	1028de9d9d	[Core] Add Kolors (#8812 ) * initial draft	2024-12-23 13:02:14 +05:30
PommesPeter	2256ec51ff	[Alpha-VLLM Team] Add Lumina-T2X to diffusers (#8652 ) --------- Co-authored-by: zhuole1025 <zhuole1025@gmail.com> Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-12-23 13:02:14 +05:30
YiYi Xu	ace869b5ac	[doc] add a tip about using SDXL refiner with hunyuan-dit and pixart (#8735 ) * up * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2024-12-23 13:02:13 +05:30
Shauray Singh	5f10c18270	add PAG support for SD architecture (#8725 ) * add pag to sd pipelines	2024-12-23 13:02:13 +05:30
XCL	f488493082	[Tencent Hunyuan Team] Add Hunyuan-DiT ControlNet Inference (#8694 ) * add controlnet support --------- Co-authored-by: xingchaoliu <xingchaoliu@tencent.com> Co-authored-by: yiyixuxu <yixu310@gmail,com>	2024-12-23 13:02:13 +05:30
Álvaro Somoza	89a6943efc	[Docs] SD3 T5 Token limit doc (#8654 ) * doc for max_sequence_length * better position and changed note to tip * apply suggestions --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-12-23 13:02:13 +05:30
YiYi Xu	5efc438c7e	add PAG support (#7944 ) * first draft --------- Co-authored-by: yiyixuxu <yixu310@gmail,com> Co-authored-by: Junhwa Song <ethan9867@gmail.com> Co-authored-by: Ahn Donghoon (안동훈 / suno) <suno.vivid@gmail.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2024-12-23 13:02:13 +05:30
Steven Liu	473acb9579	[docs] Add note for float8 (#8685 ) add note	2024-12-23 13:02:13 +05:30
Tolga Cangöz	f6172748c6	Errata - Fix typos and improve style (#8571 ) * Fix typos * Fix typos & up style * chore: Update numbers --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-12-23 13:02:13 +05:30
Tolga Cangöz	1ced1c40d8	Discourage using deprecated `revision` parameter (#8573 ) * Discourage using `revision` * `make style && make quality` * Refactor code to use 'variant' instead of 'revision' * `revision="bf16"` -> `variant="bf16"` --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-12-23 13:02:13 +05:30
Tolga Cangöz	2c56360222	Errata - Trim trailing white space in the whole repo (#8575 ) * Trim all the trailing white space in the whole repo * Remove unnecessary empty places * make style && make quality * Trim trailing white space * trim --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-12-23 13:02:13 +05:30
王奇勋	86da4dcf8e	Support SD3 ControlNet and Multi-ControlNet. (#8566 ) * sd3 controlnet --------- Co-authored-by: haofanwang <haofanwang.ai@gmail.com>	2024-12-23 13:02:13 +05:30
Vasco Ramos	410eb1ad86	[SD3 Docs] Corrected title about loading model with T5 "without" -> "with" (#8602 ) [SD3 Docs] Corrected title about loading model with T5 Corrected the documentation title to "Loading the single file checkpoint with T5" Previously, it incorrectly stated "Loading the single file checkpoint without T5" which contradicted the code snippet showing how to load the SD3 checkpoint with the T5 model	2024-12-23 13:02:13 +05:30
Sayak Paul	db74292bb3	[Core] Add `shift_factor` to SD3 tiny autoencoder (#8618 ) * shift factor argument to tiny * remove shift factor rejigging from the sd3 docs	2024-12-23 13:02:13 +05:30
Álvaro Somoza	fcbb2ffac9	[SD3] TAESD3 docs (#8607 ) * tased3 docs * apply suggestion --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-12-23 13:02:13 +05:30
Dhruv Nair	fa640cc7d7	Expand Single File support in SD3 Pipeline (#8517 ) * update * update	2024-12-23 13:02:12 +05:30
Radamés Ajna	19b14dc11b	Fix small typo (#8498 )	2024-12-23 13:02:12 +05:30
Dhruv Nair	d0b14d0f08	Add Stable Diffusion 3 (#8483 ) * up * add sd3 * update * update * add tests * fix copies * fix docs * update * add dreambooth lora * add LoRA * update * update * update * update * import fix * update * Update src/diffusers/pipelines/stable_diffusion_3/pipeline_stable_diffusion_3.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * import fix 2 * update * Update src/diffusers/models/autoencoders/autoencoder_kl.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * Update src/diffusers/models/autoencoders/autoencoder_kl.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * Update src/diffusers/models/autoencoders/autoencoder_kl.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * Update src/diffusers/models/autoencoders/autoencoder_kl.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * Update src/diffusers/models/autoencoders/autoencoder_kl.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * Update src/diffusers/models/autoencoders/autoencoder_kl.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * Update src/diffusers/models/autoencoders/autoencoder_kl.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * Update src/diffusers/models/autoencoders/autoencoder_kl.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * Update src/diffusers/models/autoencoders/autoencoder_kl.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * Update src/diffusers/models/autoencoders/autoencoder_kl.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * Update src/diffusers/models/autoencoders/autoencoder_kl.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * update * update * update * fix ckpt id * fix more ids * update * missing doc * Update src/diffusers/schedulers/scheduling_flow_match_euler_discrete.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * Update src/diffusers/schedulers/scheduling_flow_match_euler_discrete.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * Update docs/source/en/api/pipelines/stable_diffusion/stable_diffusion_3.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Update docs/source/en/api/pipelines/stable_diffusion/stable_diffusion_3.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * update' * fix * update * Update src/diffusers/models/autoencoders/autoencoder_kl.py * Update src/diffusers/models/autoencoders/autoencoder_kl.py * note on gated access. * requirements * licensing --------- Co-authored-by: sayakpaul <spsayakpaul@gmail.com> Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-12-23 13:02:12 +05:30
Tolga Cangöz	9ed762d207	Optimize test files by fixing CPU-offloading usage (#8409 ) * Refactor code to remove unnecessary calls to `to(torch_device)` * Refactor code to remove unnecessary calls to `to("cuda")` * Update pipeline_stable_diffusion_diffedit.py	2024-12-23 13:02:12 +05:30
Sayak Paul	751ed84019	[Hunyuan] add optimization related sections to the hunyuan dit docs. (#8402 ) * optimizations to the hunyuan dit docs. * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/api/pipelines/hunyuandit.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2024-12-23 13:02:12 +05:30
Tolga Cangöz	d027cb4326	Errata (#8322 ) * Fix typos * Trim trailing whitespaces * Remove a trailing whitespace * chore: Update MarigoldDepthPipeline checkpoint to prs-eth/marigold-lcm-v1-0 * Revert "chore: Update MarigoldDepthPipeline checkpoint to prs-eth/marigold-lcm-v1-0" This reverts commit `fd742b30b4`. * pokemon -> naruto * `DPMSolverMultistep` -> `DPMSolverMultistepScheduler` * Improve Markdown stylization * Improve style * Improve style * Refactor pipeline variable names for consistency * up style	2024-12-23 13:02:12 +05:30
Sayak Paul	9873a35ae6	[Hunyuan] allow Hunyuan DiT to run under 6GB for GPU VRAM (#8399 ) * allow hunyuan dit to run under 6GB for GPU VRAM * add section in the docs/	2024-12-23 13:02:12 +05:30
Sayak Paul	9b1c118692	[HunyuanDiT] minor docs changes in hunyuandit (#8395 ) minor docs changes in hunyuandit	2024-12-23 13:02:12 +05:30
XCL	d6c7e17867	Tencent Hunyuan Team - Updated Doc for HunyuanDiT (#8383 ) * add hunyuandit doc * update hunyuandit doc * update hunyuandit 2d model * update toctree.yml for hunyuandit	2024-12-23 13:02:12 +05:30
Anton Obukhov	0be111f3d0	[Pipeline] Marigold depth and normals estimation (#7847 ) * implement marigold depth and normals pipelines in diffusers core * remove bibtex * remove deprecations * remove save_memory argument * remove validate_vae * remove config output * remove batch_size autodetection * remove presets logic move default denoising_steps and processing_resolution into the model config make default ensemble_size 1 * remove no_grad * add fp16 to the example usage * implement is_matplotlib_available use is_matplotlib_available, is_scipy_available for conditional imports in the marigold depth pipeline * move colormap, visualize_depth, and visualize_normals into export_utils.py * make the denoising loop more lucid fix the outputs to always be 4d tensors or lists of pil images support a 4d input_image case attempt to support model_cpu_offload_seq move check_inputs into a separate function change default batch_size to 1, remove any logic to make it bigger implicitly * style * rename denoising_steps into num_inference_steps * rename input_image into image * rename input_latent into latents * remove decode_image change decode_prediction to use the AutoencoderKL.decode method * move clean_latent outside of progress_bar * refactor marigold-reusable image processing bits into MarigoldImageProcessor class * clean up the usage example docstring * make ensemble functions members of the pipelines * add early checks in check_inputs rename E into ensemble_size in depth ensembling * fix vae_scale_factor computation * better compatibility with torch.compile better variable naming * move export_depth_to_png to export_utils * remove encode_prediction * improve visualize_depth and visualize_normals to accept multi-dimensional data and lists remove visualization functions from the pipelines move exporting depth as 16-bit PNGs functionality from the depth pipeline update example docstrings * do not shortcut vae.config variables * change all asserts to raise ValueError * rename output_prediction_type to output_type * better variable names clean up variable deletion code * better variable names * pass desc and leave kwargs into the diffusers progress_bar implement nested progress bar for images and steps loops * implement scale_invariant and shift_invariant flags in the ensemble_depth function add scale_invariant and shift_invariant flags readout from the model config further refactor ensemble_depth support ensembling without alignment add ensemble_depth docstring * fix generator device placement checks * move encode_empty_text body into the pipeline call * minor empty text encoding simplifications * adjust pipelines' class docstrings to explain the added construction arguments * improve the scipy failure condition add comments improve docstrings change the default use_full_z_range to True * make input image values range check configurable in the preprocessor refactor load_image_canonical in preprocessor to reject unknown types and return the image in the expected 4D format of tensor and on right device support a list of everything as inputs to the pipeline, change type to PipelineImageInput implement a check that all input list elements have the same dimensions improve docstrings of pipeline outputs remove check_input pipeline argument * remove forgotten print * add prediction_type model config * add uncertainty visualization into export utils fix NaN values in normals uncertainties * change default of output_uncertainty to False better handle the case of an attempt to export or visualize none * fix `output_uncertainty=False` * remove kwargs fix check_inputs according to the new inputs of the pipeline * rename prepare_latent into prepare_latents as in other pipelines annotate prepare_latents in normals pipeline with "Copied from" annotate encode_image in normals pipeline with "Copied from" * move nested-capable `progress_bar` method into the pipelines revert the original `progress_bar` method in pipeline_utils * minor message improvement * fix cpu offloading * move colormap, visualize_depth, export_depth_to_16bit_png, visualize_normals, visualize_uncertainty to marigold_image_processing.py update example docstrings * fix missing comma * change torch.FloatTensor to torch.Tensor * fix importing of MarigoldImageProcessor * fix vae offloading fix batched image encoding remove separate encode_image function and use vae.encode instead * implement marigold's intial tests relax generator checks in line with other pipelines implement return_dict __call__ argument in line with other pipelines * fix num_images computation * remove MarigoldImageProcessor and outputs from import structure update tests * update docstrings * update init * update * style * fix * fix * up * up * up * add simple test * up * update expected np input/output to be channel last * move expand_tensor_or_array into the MarigoldImageProcessor * rewrite tests to follow conventions - hardcoded slices instead of image artifacts write more smoke tests * add basic docs. * add anton's contribution statement * remove todos. * fix assertion values for marigold depth slow tests * fix assertion values for depth normals. * remove print * support AutoencoderTiny in the pipelines * update documentation page add Available Pipelines section add Available Checkpoints section add warning about num_inference_steps * fix missing import in docstring fix wrong value in visualize_depth docstring * [doc] add marigold to pipelines overview * [doc] add section "usage examples" * fix an issue with latents check in the pipelines * add "Frame-by-frame Video Processing with Consistency" section * grammarly * replace tables with images with css-styled images (blindly) * style * print * fix the assertions. * take from the github runner. * take the slices from action artifacts * style. * update with the slices from the runner. * remove unnecessary code blocks. * Revert "[doc] add marigold to pipelines overview" This reverts commit a505165150afd8dab23c474d1a054ea505a56a5f. * remove invitation for new modalities * split out marigold usage examples * doc cleanup --------- Co-authored-by: yiyixuxu <yixu310@gmail.com> Co-authored-by: yiyixuxu <yixu310@gmail,com> Co-authored-by: sayakpaul <spsayakpaul@gmail.com>	2024-12-23 13:02:12 +05:30
Dhaivat Bhatt	d1ddc5eb78	Add details about 1-stage implementation in I2VGen-XL docs (#8282 ) * Add details about 1-stage implementation * Add details about 1-stage implementation	2024-12-23 13:02:12 +05:30
Junsong Chen	d7bbea7a89	[docs] add doc for PixArtSigmaPipeline (#7857 ) * 1. add doc for PixArtSigmaPipeline; --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> Co-authored-by: Guillaume LEGENDRE <glegendre01@gmail.com> Co-authored-by: Álvaro Somoza <asomoza@users.noreply.github.com> Co-authored-by: Bagheera <59658056+bghira@users.noreply.github.com> Co-authored-by: bghira <bghira@users.github.com> Co-authored-by: Hyoungwon Cho <jhw9811@korea.ac.kr> Co-authored-by: yiyixuxu <yixu310@gmail.com> Co-authored-by: Tolga Cangöz <46008593+standardAI@users.noreply.github.com> Co-authored-by: Philip Pham <phillypham@google.com>	2024-12-23 13:02:12 +05:30
Aryan	f6b690301d	[Pipeline] AnimateDiff SDXL (#6721 ) * update conversion script to handle motion adapter sdxl checkpoint * add animatediff xl * handle addition_embed_type * fix output * update * add imports * make fix-copies * add decode latents * update docstrings * add animatediff sdxl to docs * remove unnecessary lines * update example * add test * revert conv_in conv_out kernel param * remove unused param addition_embed_type_num_heads * latest IPAdapter impl * make fix-copies * fix return * add IPAdapterTesterMixin to tests * fix return * revert based on suggestion * add freeinit * fix test_to_dtype test * use StableDiffusionMixin instead of different helper methods * fix progress bar iterations * apply suggestions from review * hardcode flip_sin_to_cos and freq_shift * make fix-copies * fix ip adapter implementation * fix last failing test * make style * Update docs/source/en/api/pipelines/animatediff.md Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com> * remove todo * fix doc-builder errors --------- Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2024-12-23 13:02:11 +05:30
Steven Liu	de414618ba	[docs] Refactor image quality docs (#7758 ) * refactor * code snippets * fix path * fix path in guide * code outputs * align toctree title * title * fix title	2024-12-23 13:02:11 +05:30

1 2 3 4 5 ...

309 Commits