Styling with Fooocus

Sometime in 2024

Setup

Alright, folks! Let’s dive into the exciting world of image generators. Today, we’re testing Fooocus - a software that creates beautiful images using various models and prompts. We want to see how simple it is for users and how creative its output can be. To make things reproducible and comparable, I’ll be using the same seed (123456) and resolution (1024x1024) and extreme fast setting (except in one case) for every generation.

Now, buckle up as we take a black box testing approach - enabling different styles one by one to generate a wide array of distinctive outputs. This way, we’ll have plenty of examples to share and discuss! So grab your coffee, sit back, and let the magic unfold before your eyes!

Lets compare

In this next section, we will be comparing various scenarios to see how noticeable the differences are. By examining these distinct setups, I aim to provide you with an understanding of Fooocus’s capabilities and how it handles different prompts and models. So, let’s dive into the first scenario!

Version difference

We start our journey by comparing the performance of the juggernaut_v7 model against the v6 version (which is the default model in the software right now).
My expectations were high, as I anticipated that v7 should bring a noticeable improvement in image quality and finer details.
It wasn’t deliver too many improvements. In most cases, the upgrade did not result in better visuals, some noticeable skin tone changes to make it photo realistic but mostly those.
Prompt: redhead pirate girl

Style	V6	v7
Mk Encaustic Paint
Mk Bauhaus Style
Photo Neon Noir

However, there were some styles where the difference between the two versions was hardly noticeable. But that doesn’t mean v7 isn’t worth trying out – it’s simply a testament to how well-optimized its predecessor was!

Style	V6	v7
Mk Blacklight Paint
Mk Patachitra Painting

Link to v6 Gallery
Link to v7 Gallery

Speed vs Quality

In another easy comparison, I set Fooocus to generate images with the quality setting after trying the extreme fast setting.
No surprises here either – the images with the quality settings are absolutely stunning! However, it’s worth mentioning that Fooocus does have some default settings in place when changing to quality, that can be undone or added for fast setting, but it means that it’s not just a sampling step change.
But hey, who doesn’t love a little extra eye candy while we wait?
Prompt: redhead pirate girl

Style	V7 Extreme fast	v7 Quality
Photo Hdr
MRE Undead Art
Misc Grunge

Link to v7 Extreme fast Gallery (same as in the previous paragraph)
Link to v7 Quality Gallery

V2 Style bias

Here’s the comparison I was really looking forward to – how Fooocus V2 style expansion affects the overall output.
This dynamic feature is a game-changer, allowing us to combine various styles more effectively!
However, it does come with its own set of biases. In some cases enabling Fooocus V2 seemed to tone down the original styles and steer images towards a more generic professional look.
This was quite noticeable when using very specific styles such as Quilling, Kirigami, or one-line art. Prompt: dancing dog in bow tie

Style	With Fooocus V2	Plain style
Papercraft Paper Quilling
Papercraft Kirigami
Mk One Line Art

A funny observation I made while testing this feature is that when Fooocus V2 is enabled,
the “unreal” part of the prompt seemed to be overshadowed by the bias completely.
Altho it’s mostly not present with the plain models either it’s more prevalent with the V2 on, but it can be solved with more fine-tuning on the prompt
For example, a prompt like “dog dancing” would lose that playful aspect and instead focus solely on the close up photo elements. It’s an interesting phenomenon to explore!

Style	With Fooocus V2	Plain style
SAI Comic Book
SAI Anime

Link to Fooocus v2 on Gallery
Link to Fooocus v2 off Gallery

Be Creative

Now that we’ve explored various scenarios and comparisons, let’s see how creative Fooocus can be!
Note: this test is more about the capabilities of the underlying model rather than the software itself. However, since inner optimizations are still in play, it presents an excellent opportunity to push this tool to its limits.
Prompt1: busy city street in the 1600’s
Prompt2: colorful scary toy lama creative
Prompt3: poster of a new marvel movie about a lizard wizard, creative

Style	Historic street	Toy lama	Lizard Wizard
Mk Tlingit Art
Mk Ukiyo E
Glitchcore

As mentioned earlier, one interesting observation was that the enabled Fooocus V2 (by default) seemed to have a strong bias towards 3D/photorealistic representations of the main subject.
This creative challenge allowed to test the model’s flexibility and adaptability.

Link to Historic street Gallery
Link to Toy lama Gallery
Link to Lizard WizardGallery

Summary

In conclusion, the Fooocus V2 style expansion has a noticeable impact on the overall look and feel of images generated by this software. While it may not be perfect for every scenario, having a more realistic appearance can be advantageous in certain cases. Of course, our comparisons haven’t touched upon proper prompt engineering or experimentation with different LoRA models – something that could help break free from the inherent biases.

But what truly stands out is how easy Fooocus is to set up and use while still delivering impressive results. It’s heartwarming to see such high-quality open-source tools thrive in an environment where countless paid AI options emerge daily.