GitHub - Drisney/Fooocus_extend_fx: Focus on prompting and generating

I would like to introduce a new fork of the popular generative neural network Fooocus - Fooocus extend. I would like to point out that this fork can be run both locally on your computer and via Google Colab. Let's look at everything in order.

Launch. If you will run it on a local machine, you can safely skip this item.

Before launching you are offered to choose the following settings Fooocus Profile - select which profile will be loaded at startup (default, anime, realistic). Fooocus Theme - select a theme - light or dark. Tunnel - select launch tunnel. When it happens that gradio stops working for some reason, you can choose cloudflared tunnel. However, the generation is a bit slower Memory patch - adds a few keys to the launch bar that allow you to optimise your graphics card if you are using the free version of Google Colab. If you have paid access, this item can be disabled GoogleDrive output - connects your GoogleDisk and save all of your generation directly to it.

The “Extensions” panel is divided into three groups:

in generation - modules working during generation
modules - generation modules
tools - auxiliary tools

“in generation” panel

Prompt Translate

Allows to translate both positive and negative prompts from any language into English, both before generation and directly during generation.

PhotoMaker (works only in the local version)

The module allows you to generate an image with a reference face Upload a photo of your face - your reference face should be uploaded here. Unlike normal generation, this 30-step generation on Nvidia 3060 12GB takes about 4 minutes.

InstantID (works only in the local version)

The module allows you to generate an image with a reference face Upload a photo of your face - your reference face should be uploaded here.

Upload a reference pose image (Optional) - a reference pose image will be uploaded here. Optional.

Pregeneration image - if this mode is enabled, then a new image will be generated as a reference pose image based on your Prompt.

If you have not specified a pose reference image, an image will be generated from your prompt based only on the face image and the same resolution.

If a pose reference image is present, the dimensions will be the same as the pose reference image.

IdentityNet strength (for fidelity) - responsible for the accuracy of face repetition.

Image adapter strength (for detail) - responsible for details

ContrloNet

canny - used to define image contours
depth - used to create a depth map of the image Schedulers - select the Schedulers that will be used when generating the image with our face. Enhance non-face region - enhance non-face parts of the image

Unlike normal generation, this 30-step generation on Nvidia 3060 12GB takes about 7 minutes.

Inswapper

This module is also intended for face replacement.

Source Image Index - index of the face in the reference image. Faces are numbered from left to right from top to bottom starting from zero. If you specify -1, the average mixed face of all available faces will be taken as the reference face

Target Image Index - index of the face in the output image. This is the index of the face to be replaced in the output image. If you specify -1, all faces will be replaced.

Source Face Image - image with face

CodeFormer

Face enhancement module with upscale capability

Pre_Face_Align - aligns the face if it is tilted

Background Enchanced - improve background quality

Face Upsample - adjust reference face to the size of the input image face

Upscale - image enlargement

Codeformer_Fidelity - signability coefficient, inversely proportional to quality

“modules” panel

Image Batch (batch image processing)

In a nutshell, this module allows you to perform group scaling of images, as well as create images based on a group of existing images using ImagePromt (ControlNet). To better understand this module, I suggest you conduct some experiments yourself. But I want to note that using it allows you to use your images as references and change their style depending on the prompt and model you choose. First, depending on the "Uplode ZIP-file" selector, you need to upload several reference images or a zip archive with them. The archive should not contain subfolders, file names should not contain non-Latin characters and spaces.

Next, select the mode of changing the image resolution

NOT scale - the source image resolution will not be taken into account during generation
to ORIGINAL - this means that immediately before generation the resolution equal to the resolution of the source image will be selected.
to OUTPUT - in this case, before generation the resolution of the source image will be changed to the generated one with preserving the proportions

Depending on what you want to do with the source images, select Action - Upscale or ImagePrompt. From the Method drop-down list, select the appropriate image processing method. If you are using ImagePrompt, you must also select the "Stop at" and "Weight" options.

Start batch - starts the process for execution.

When the process is finished, click on Output->Zip button to create an archive with all previously created images from the output folder. The archive itself will appear in the "Download a Zip file" window. You can download it from there.

Clear Output - clear the output folder. It should be noted that not only the folder for the current date is cleared, but also the whole folder.

Prompt Batch (batch processing of prompts)

This module allows you to run generation of several hints sequentially one after another. To do this, you should fill in the table. Enter a positive hint in the hint column and a negative hint in the negative hint column respectively. Clicking the New Row button will add an empty row to the end of the table. Delete Last Row deletes the last row of the table. Start batch starts the execution of the list of prompts to generate. You can also choose to add basic positive and negative hints. None - no base hints will be added. Prefix - base hints will be added before the table hints. Suffix - basic hints will be added after hints from the table.

only positive prompts - if this item is active, all prompts in the file will be considered positive, otherwise, both positive and negative prompts will be loaded Load prompts from file - allows to load the list of positive and negative prompts from a text file into the table. The file can have any extension

As an example, consider a file with the following contents

---- start of file -------

------ end of file --------

If the item ‘only positive prompts’ is active, then the table with prompts will have the following form

Otherwise

This means that in the first case all prompts in the file are treated as positive, and empty lines are ignored. In the second case, the file first contains a line with a positive prompt, followed by a line with a negative prompt. If you don't need to specify a negative hint, leave this line blank, but the positive hint line must always be there.

OneButtonPrompt - allows you to generate prompts, generate variations of your prompt and runs to generate images from them. I will not dwell on the detailed description of this module - I will only point out the main functions

In the ‘Main’ tab, you can select the preset of the prompt generation theme, as well as specify unchanging prompt prefixes and suffixes.

In the ‘Workflow assist’ tab you can generate 5 prompts, transfer each of them to the Fooocus prompt field and also to the Workflow field. If you select Workflow mode, it will generate not a new prompt, but variations of the prompt specified in the Workflow text field. The level of variation is selected by the engine located just below it

In this tab you can select the prompt syntax for different generation models, the length of the prompt, and enable the prompt generation enhancer.

This is where you can control the generation of negative prompts

In this tab you can start the image generation queue by generated samples. Before starting you need to specify the aspect ratio of the generated image (Size to generate), the number of generated prompts (Generation of prompts), and the models to use (Model to use).

X/Y/Z Plot

This extension allows you to make image grids to make it easier to see the difference between different generation settings and choose the best option. You can change the following parameters - Styles, Steps, Aspect Ratio, Seed, Sharpness, CFG (Guidance) Scale, Checkpoint, Refiner, Clip skip, Sampler, Scheduler, VAE, Refiner swap method, Softness of ControlNet, and also replace words in the prompt and change their order

Inswapper

The full analog of this module in the “in generation” panel, unlike which you need to load an additional input image

CodeFormer

The full analog of this module in the “in generation” panel, unlike which you need to load an additional input image

Remove Background

This extension is designed to add background removal, image/video processing, and blending to your projects. It provides precise background removal with support for multiple models, chroma keying, foreground adjustments, and advanced effects. Whether you’re working with images or videos, this extension provides everything you need to efficiently process visual content.

Key Features:

Multi-model background removal: Supports u2net, isnet-general-use, and other models.

Chroma keying support: Removes specific colors (green, blue, or red) from the background.

Blending modes: 10 powerful blending modes for image compositing.

Foreground adjustments: Scale, rotate, flip, and position elements precisely.

Video and image support: Easily process images and videos.

Multi-threaded processing: Efficiently process large files with streaming and GPU support. Customizable output formats: Export to PNG, JPEG, MP4, AVI, and more.

All processing results are automatically saved to the output folder without saving to History Log

“tools” panel

Civitai Helper

This extension allows you to download models for generation from the civitai website. To download a model you first need to specify your Civitai_API_key. In the Download Model section in the Civitai URL field you need to specify a link to the required model from the browser address bar and click Get Model Info by Civitai URL. After analysing the link you will be given information about the model. You will also be able to select the version of the model before downloading. This extension also allows you to find duplicates of downloaded models and check for updates. In addition, there is a group download option.

TextMask

Fast text editor with mask creation for ControlNet and Inpaint!

What this tool can do:

Adds up to 5 text blocks to any image
Supports fonts of any size up to 300px
Works with Cyrillic, hieroglyphics and other non-Latin characters
Generates black and white mask with one click
Lets you forget Photoshop for 90% of simple text tasks
Supports a sufficient number of fonts

How to use:

Upload an image
Add and edit the text as you like
Generate a mask in two clicks
Download and use in ControlNet (CPDS or PyraCanny) or as a mask for Inpaint to stylise text and blend into an image.

OpenPoseEditor

This module allows you to create skeletons for subsequent image creation using OpenPose ControlNet. You can also create a skeleton from an existing image.

Photopea - a free online analogue of Photoshop

In addition to these extensions, there are several other add-ons available

Select the resolution and aspect ratio of the generated image

This setting is located in the generation resolution selection tab. Here you can select the number of horizontal and vertical points, aspect ratio. To apply the settings, click the Set button and select this resolution from the list of proposed resolutions. Your resolution will be the second to last. You can also select a random aspect ratio for each generation from the available list of aspect ratios

Wildcard

This module allows you not to memorise existing files with wildcard words, but to select them directly from a list of dictionaries. You can also select directly the item you need from the list.

OpenPose ControlNet

Allows you to create an image based on the pose skeleton.

Recolor ControlNet

Allows you to colorize an image based on a black and white image.

Scribble ControlNet

Allows you to color an image based on a sketch.

Manga Recolor ControlNet

This is a specially trained ControlNet model designed to automatically colorize grayscale images in anime style.

The model takes grayscale anime images as input and generates a colorized version.

An anime model is required for proper generation. Works with or without a clarifying prompt.

Save Image Grid for Each Batch

Filename Prefix

This setting may be useful when working on several projects to separate one from another.

Paths and Presets

Here you can change the paths to your models if they are already in other folders on the disk. If your Сheckponts or LORAs are in different folders, then the paths to them can be specified separated by commas (,). After changing the path, it is best to restart FooocusExtend.

Also here you can create a new preset based on the existing settings, and delete any of the existing ones, except for default and initial. The preset saves the following parameters: base model, refiner, refiner_switch, loras settings, cfg scale, sharpness, CFG Mimicking from TSNR, clip_skip, sampler, scheduler, Forced Overwrite of Sampling Step, Forced Overwrite of Refiner Switch Step, performance, image number, prompt negative, styles selections, aspect ratio, vae, inpaint_engine_version

Load file of style

Allows you to upload a file (in *.json format) with custom styles

View trigger words of LoRA

If trigger words are not shown, then you need to scan LoRa, in the Civitai Helper module in the Scan Models for Civitai section

All suggestions and questions can be voiced in the Telegram-group

Change log

v9.1.4

Some bug fix

v9.1.3

Add Manga Recolor ControlNet

v9.1.2

Added selector for choosing to load individual files or zip archive in Image Batch module

v9.1.1

Add view trigger words of LoRA
Bug fix in wildcards
Add Civitai_API_key saving
Add preview files for InstantId and Photomaker styles

v9.1.0

Add PhotoMaker module
Add PhotoMaker styles
Add Random Aspect Ratio

v9.0.0

Dividing extensions into groups
Add InstantID module
Add Inswapper module
Add CodeFormer module
Add InstanID styles

v8.1.0

Some bug fix
Add TextMask - Fast text editor with mask creation for ControlNet and Inpaint

v8.0.4

User style upload
Fixed maximum height of PromptBatch
Option to load only positive prompts

v8.0.3

Some bug fix
Add load prompt from files in PromptBatch
The ImageBatch extension interface has been simplified

v8.0.2

Some bug fix

v8.0.1

Add Filename Prefix
Add Paths and Presets Settings

Save Image Grid for Each Batch
Add X/Y/Z Plot Extention
Prompt Batch is now in the extensions panel
Images Batch has become easier to manage while retaining its functionality
Images Batch is now in the extensions panel
Add support VAE and LyCoris in Civitai_Helper
The extension Remove Background has been changed
Add auto update on startup

Add OpenPoseEditor
Fix bug in Image Batch Mode
Added cell selection in Image Batch Mode
Added selection of adding base prompts in Prompt Batch Mode
Add OpenPose ControlNet
Add Recolor ControlNet
Add Scribble ControlNet

Add Prompt Batch Mode
Rename Batch Mode to Images Batch Mode
Fixed an incorrect start random number in Batch Mode
Add visual management of Wildcard and Words/phrases of wildcard
Added the ability to set any resolution for the generated image
Add OneButtonPrompt