<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	
	xmlns:georss="http://www.georss.org/georss"
	xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#"
	>

<channel>
	<title>Jenxi Seow</title>
	<atom:link href="https://jenxi.com/feed/" rel="self" type="application/rss+xml" />
	<link>https://jenxi.com</link>
	<description>Mindfulness, productivity, and AI</description>
	<lastBuildDate>Mon, 23 Sep 2024 04:45:37 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>
	hourly	</sy:updatePeriod>
	<sy:updateFrequency>
	1	</sy:updateFrequency>
	

<image>
	<url>https://i0.wp.com/jenxi.com/wp-content/uploads/Jenxi-round-favicon.png?fit=32%2C32&#038;ssl=1</url>
	<title>Jenxi Seow</title>
	<link>https://jenxi.com</link>
	<width>32</width>
	<height>32</height>
</image> 
<site xmlns="com-wordpress:feed-additions:1">9978634</site>	<item>
		<title>How to use Stable Diffusion (Part 1)</title>
		<link>https://jenxi.com/how-to-use-stable-diffusion-part-1/</link>
					<comments>https://jenxi.com/how-to-use-stable-diffusion-part-1/#respond</comments>
		
		<dc:creator><![CDATA[Jenxi Seow]]></dc:creator>
		<pubDate>Fri, 19 Jan 2024 14:34:56 +0000</pubDate>
				<category><![CDATA[Blog]]></category>
		<category><![CDATA[Stable Diffusion]]></category>
		<category><![CDATA[Stable Diffusion Web UI]]></category>
		<guid isPermaLink="false">https://jenxi.com/?p=15558</guid>

					<description><![CDATA[In this guide, you’ll learn how to use Stable Diffusion using the Stable Diffusion Web UI by Automatic1111.]]></description>
										<content:encoded><![CDATA[
<p>To get started with <a href="https://jenxi.com/what-is-stable-diffusion/" data-type="post" data-id="15451">Stable Diffusion</a>, you’ll first need to use a user interface to let you operate it without having to key in commands into the terminal. In this guide, you’ll learn how to use Stable Diffusion using the <a href="https://github.com/AUTOMATIC1111/stable-diffusion-webui" data-type="link" data-id="https://github.com/AUTOMATIC1111/stable-diffusion-webui" target="_blank" rel="noopener">Stable Diffusion Web UI by Automatic1111</a>. This is the easier tool available out there and I feel is perfect for beginners to get started. </p>



<p>Once you have understood the process of creating images using the Web UI, you can then explore other tools like <a href="https://github.com/comfyanonymous/ComfyUI" target="_blank" rel="noopener">ComfyUI</a> that are designed for power users.</p>



<figure class="wp-block-image size-large"><img data-recalc-dims="1" fetchpriority="high" decoding="async" width="1140" height="432" src="https://i0.wp.com/jenxi.com/wp-content/uploads/ComfyUI.webp?resize=1140%2C432&#038;ssl=1" alt="" class="wp-image-15470" srcset="https://i0.wp.com/jenxi.com/wp-content/uploads/ComfyUI.webp?resize=1140%2C432&amp;ssl=1 1140w, https://i0.wp.com/jenxi.com/wp-content/uploads/ComfyUI.webp?resize=700%2C265&amp;ssl=1 700w, https://i0.wp.com/jenxi.com/wp-content/uploads/ComfyUI.webp?resize=768%2C291&amp;ssl=1 768w, https://i0.wp.com/jenxi.com/wp-content/uploads/ComfyUI.webp?resize=1536%2C583&amp;ssl=1 1536w, https://i0.wp.com/jenxi.com/wp-content/uploads/ComfyUI.webp?w=1859&amp;ssl=1 1859w" sizes="(max-width: 1000px) 100vw, 1000px" /><figcaption class="wp-element-caption">ComfyUI workflow</figcaption></figure>



<p>This article will guide you through the process to generating your first image in Stable Diffusion. I started with the goal of helping you understand the basics in a single post, but I realised that there are a lot to digest if I do a deep dive and try to cover each parameter.</p>



<p>I’ve streamlined this guide so you understand enough to create your first AI generated image. Once you have a grasp of the process of how to use Stable Diffusion, I suggest reading the individual articles for the different steps and concepts to get a better grasp of what each step does, why I suggest doing it in a certain way, and explore how you can experiment on your own.</p>



<div class="wp-block-group has-background has-global-padding is-layout-constrained wp-container-core-group-is-layout-1 wp-block-group-is-layout-constrained" style="border-radius:8px;background-color:#f7f7f7;padding-top:var(--wp--preset--spacing--small);padding-right:clamp(30px, 4vw, 40px);padding-bottom:var(--wp--preset--spacing--small);padding-left:clamp(30px, 4vw, 40px)"><h2 class="simpletoc-title">Contents</h2>
<ul class="simpletoc-list">
<li><a href="#install-stable-diffusion-web-ui">Install Stable Diffusion Web UI</a>

</li>
<li><a href="#download-a-stable-diffusion-checkpoint">Download a Stable Diffusion checkpoint</a>

</li>
<li><a href="#generate-image-with-txt2img">Generate image with txt2img</a>


<ul><li>
<a href="#prompt-structure">Prompt structure</a>

</li>
<li><a href="#prompt-length">Prompt length</a>

</li>
<li><a href="#token-weightage">Token weightage</a>


</li>

</li>

<li><a href="#negative-prompt">Negative prompt</a>


</li>
</ul>
<li><a href="#next-steps">Next steps</a>
</li></ul></div>


<h2 class="wp-block-heading" id="install-stable-diffusion-web-ui">Install Stable Diffusion Web UI</h2>


<p>If you haven’t already, install Stable Diffusion Web UI.&nbsp;</p>



<p>I have written up a guide on <a href="https://jenxi.com/how-to-install-stable-diffusion-web-ui/">how to install Stable Diffusion Web UI</a>. I try to keep my installation guide updated, but things move fast in the Stable Diffusion world, so don’t be surprised things don’t work.&nbsp;</p>


<h2 class="wp-block-heading" id="download-a-stable-diffusion-checkpoint">Download a Stable Diffusion checkpoint</h2>


<p>When you first launch Stable Diffusion, the first option in the top left is the Stable Diffusion checkpoint option. This dropdown option lets you select the checkpoint you want to use to generate your image.</p>



<p>Check out my lists of the top Stable Diffusion checkpoints to browse the popular checkpoints. <a href="https://jenxi.com/subscribe/" data-type="page" data-id="2891">Sign up for my newsletter</a> to get the free Top 10 Realistic Checkpoints Database.</p>



<p>The <img src="https://s.w.org/images/core/emoji/15.0.3/72x72/1f504.png" alt="🔄" class="wp-smiley" style="height: 1em; max-height: 1em;" /> icon beside the dropdown lets you refresh the list if you moved a new checkpoint into the checkpoints folder. Click on the icon to update the list without having to restart the Web UI.</p>



<figure class="wp-block-image size-large"><img data-recalc-dims="1" decoding="async" width="1140" height="658" src="https://i0.wp.com/jenxi.com/wp-content/uploads/dreamshaper_8-TI.webp?resize=1140%2C658&#038;ssl=1" alt="" class="wp-image-15648" srcset="https://i0.wp.com/jenxi.com/wp-content/uploads/dreamshaper_8-TI.webp?resize=1140%2C658&amp;ssl=1 1140w, https://i0.wp.com/jenxi.com/wp-content/uploads/dreamshaper_8-TI.webp?resize=700%2C404&amp;ssl=1 700w, https://i0.wp.com/jenxi.com/wp-content/uploads/dreamshaper_8-TI.webp?resize=768%2C444&amp;ssl=1 768w, https://i0.wp.com/jenxi.com/wp-content/uploads/dreamshaper_8-TI.webp?w=1200&amp;ssl=1 1200w" sizes="(max-width: 1000px) 100vw, 1000px" /><figcaption class="wp-element-caption">X/Y plot to compare DreamShaper v8 output for different VAE and Clip Skip values.</figcaption></figure>



<p>If you have already installed Stable Diffusion Web UI using my <a href="https://jenxi.com/how-to-install-stable-diffusion-web-ui/">guide</a>, you would already have the DreamShaper checkpoint. You can skip to the next step.</p>



<p>For beginners, I recommend using DreamShaper, a good general purpose checkpoint.</p>



<ul class="wp-block-list">
<li>Download Dreamshaper v8 from <a href="https://civitai.com/models/4384/dreamshaper" target="_blank" rel="noopener">Civitai</a> or <a href="https://huggingface.co/Lykon/DreamShaper/tree/main" target="_blank" rel="noopener">Hugging Face</a>.</li>



<li>Move the checkpoint file into the checkpoints folder <code>\models\Stable-diffusion\</code>.</li>
</ul>



<p>Click on the <img src="https://s.w.org/images/core/emoji/15.0.3/72x72/1f504.png" alt="🔄" class="wp-smiley" style="height: 1em; max-height: 1em;" /> icon to update your list of checkpoints and select DreamShaper v8.</p>



<figure class="wp-block-image size-large"><img data-recalc-dims="1" decoding="async" width="1140" height="190" src="https://i0.wp.com/jenxi.com/wp-content/uploads/Stable-Diffusion-Web-UI-checkpoint.webp?resize=1140%2C190&#038;ssl=1" alt="" class="wp-image-15640" srcset="https://i0.wp.com/jenxi.com/wp-content/uploads/Stable-Diffusion-Web-UI-checkpoint.webp?resize=1140%2C190&amp;ssl=1 1140w, https://i0.wp.com/jenxi.com/wp-content/uploads/Stable-Diffusion-Web-UI-checkpoint.webp?resize=700%2C117&amp;ssl=1 700w, https://i0.wp.com/jenxi.com/wp-content/uploads/Stable-Diffusion-Web-UI-checkpoint.webp?resize=768%2C128&amp;ssl=1 768w, https://i0.wp.com/jenxi.com/wp-content/uploads/Stable-Diffusion-Web-UI-checkpoint.webp?resize=1536%2C257&amp;ssl=1 1536w, https://i0.wp.com/jenxi.com/wp-content/uploads/Stable-Diffusion-Web-UI-checkpoint.webp?w=1604&amp;ssl=1 1604w" sizes="(max-width: 1000px) 100vw, 1000px" /></figure>


<h2 class="wp-block-heading" id="generate-image-with-txt2img">Generate image with txt2img</h2>


<p>There are several tabs in your Web UI screen. My tabs might differ from yours since you are most likely on a default installation. My Web UI is very customised, so it will be different from what you see.</p>



<p>The txt2img tab is where you start your image generation if you want to create an image from a prompt.</p>


<h3 class="wp-block-heading" id="prompt-structure">Prompt structure</h3>


<p>I won’t go into the details of prompt engineering in this article, so I’ll just touch briefly on the way I write prompts.</p>



<p>There are several ways of writing prompts for Stable Diffusion. You can go with natural language and describe the scene you want to see. This method is commonly used in Midjourney and DALL-E3. You then let the AI interpret your description to create the image.</p>



<figure class="wp-block-image size-large"><img data-recalc-dims="1" height="500" width="1140" decoding="async" src="https://i0.wp.com/jenxi.com/wp-content/uploads/Stable-Diffusion-Web-UI-prompt-1-1140x500.webp?resize=1140%2C500&#038;ssl=1" alt="" class="wp-image-15642"/></figure>



<p>I personally prefer to use Danbooru tags in my prompts because that gives me more control over to the elements I want to appear. For example, you can use <code>1girl</code> or <code>1boy</code> to specify that there is just a girl or a boy in the image. If you use them both together, you will get a girl <em>and</em> a boy. If you go with <code>2girls</code> or <code>2boys</code>, you will get two girls or two boys. Use both and you get four people in the image.</p>



<p>Of course, you can also do a mix of the natural language and booru tags in your prompt. Stable Diffusion will have no problems interpreting it.</p>



<p>The <a href="https://danbooru.donmai.us/wiki_pages/howto:tag" target="_blank" rel="noopener">rule of thumb</a> for prompting with Danbooru tags is to tag what you see, not what you know. For example, don’t tag footwear if you won’t see the feet in the image.</p>



<p>However, I have a general structure for creating my prompts:</p>



<ul class="wp-block-list">
<li>Quality – These are tags that help to determine the quality and style, such as photorealistic, painting, anime, or illustrations etc. For example,&nbsp;<code>masterpiece, best quality, photorealistic</code>.</li>



<li>Composition – These are tags that describe the camera angle and lighting, such as a close-up or wide shot. You can refer to the list of camera angles and lighting types, For example, <code>cowboy shot, cinematic lighting</code>.</li>



<li>Subject – These are the tags that describe the subject. For example, <code>1girl, long hair, blue eyes</code>.</li>



<li>Supplementary – These are tags that add on to the subject, such as the outfit or pose. For example, <code>red dress, standing, looking at viewer</code>.</li>



<li>Background – These are the tags that describe the background in the image, For example, <code>Chinese village</code>.</li>



<li>Miscellaneous – Any other tags that I want to add on would be added at this part. I prefer to place my camera lens tags here if I use them. For example, <code>depth of field, canon 85mm f1.2</code>.</li>
</ul>



<p>Based on the examples above, the final prompt would be:<br><code>masterpiece, best quality, photorealistic, cowboy shot, cinematic lighting, 1girl, long hair, blue eyes, red dress, standing, looking at viewer, Chinese village, depth of field, canon 85mm f1.2</code></p>



<p>This is just a general approach to creating a prompt. Some checkpoints have a recommended way to prompting and would give better results if you follow that.</p>


<h3 class="wp-block-heading" id="prompt-length">Prompt length</h3>


<p>Stable Diffusion interprets prompts at 75 tokens per chunk. Once it goes beyond that, it will split the process into two parts. The prompt fields will show 0/75 when empty. Once you go past 75 tokens, it will show a total of 150 instead, to indicate the token count left before you use up the second part.</p>



<div class="wp-block-columns is-layout-flex wp-container-core-columns-is-layout-1 wp-block-columns-is-layout-flex">
<div class="wp-block-column is-layout-flow wp-block-column-is-layout-flow" style="flex-basis:100%">
<figure class="wp-block-image size-medium is-resized"><img data-recalc-dims="1" decoding="async" width="700" height="414" src="https://i0.wp.com/jenxi.com/wp-content/uploads/Stable-Diffusion-Web-UI-prompt-75-tokens.png?resize=700%2C414&#038;ssl=1" alt="" class="wp-image-15645" style="width:304px;height:auto" srcset="https://i0.wp.com/jenxi.com/wp-content/uploads/Stable-Diffusion-Web-UI-prompt-75-tokens.png?resize=700%2C414&amp;ssl=1 700w, https://i0.wp.com/jenxi.com/wp-content/uploads/Stable-Diffusion-Web-UI-prompt-75-tokens.png?resize=768%2C454&amp;ssl=1 768w, https://i0.wp.com/jenxi.com/wp-content/uploads/Stable-Diffusion-Web-UI-prompt-75-tokens.png?w=812&amp;ssl=1 812w" sizes="(max-width: 700px) 100vw, 700px" /><figcaption class="wp-element-caption">47 out of 75 tokens in 1 chunk</figcaption></figure>
</div>



<div class="wp-block-column is-layout-flow wp-block-column-is-layout-flow" style="flex-basis:100%">
<figure class="wp-block-image size-medium"><img data-recalc-dims="1" decoding="async" width="700" height="414" src="https://i0.wp.com/jenxi.com/wp-content/uploads/Stable-Diffusion-Web-UI-prompt-150-tokens.png?resize=700%2C414&#038;ssl=1" alt="" class="wp-image-15646" style="object-fit:cover" srcset="https://i0.wp.com/jenxi.com/wp-content/uploads/Stable-Diffusion-Web-UI-prompt-150-tokens.png?resize=700%2C414&amp;ssl=1 700w, https://i0.wp.com/jenxi.com/wp-content/uploads/Stable-Diffusion-Web-UI-prompt-150-tokens.png?resize=768%2C454&amp;ssl=1 768w, https://i0.wp.com/jenxi.com/wp-content/uploads/Stable-Diffusion-Web-UI-prompt-150-tokens.png?w=812&amp;ssl=1 812w" sizes="(max-width: 700px) 100vw, 700px" /><figcaption class="wp-element-caption">96 out of 150 tokens in 2 chunks</figcaption></figure>
</div>
</div>


<h3 class="wp-block-heading" id="token-weightage">Token weightage</h3>


<p>The tags in your prompts are weighted based on how far in front they are in your prompts. Tags with stronger weights are given more priority. If your output is not giving you the results you want, you might need to move your tags around to tweak their weightage.</p>


<h4 class="wp-block-heading" id="repeating-tags">Repeating tags</h4>


<p>If you look at other people’s prompts, something I highly recommend you to do to study how others build their prompts, you might see them repeating the same tags, or similar ones. For example, <code>1girl, solo</code>. Both tags have the same effect but repeating the similar concepts helps to give emphasis and tells the AI that this concept is important.</p>



<p>Some people repeat the same tags, for example, having <code>blue eyes</code> repeated again near the end of the earlier prompt that we had.</p>



<p><code>masterpiece, best quality, photorealistic, cowboy shot, cinematic lighting, 1girl, long hair, blue eyes, red dress, standing, looking at viewer, Chinese village, blue eyes, depth of field, canon 85mm f1.2</code></p>


<h4 class="wp-block-heading" id="moving-tokens">Moving tokens</h4>


<p>You can move your prompt elements around in Stable Diffusion Web UI using the keyboard shortcut ⌥ Option/ALT + left/right arrow keys to move them forward or backwards.</p>



<figure class="wp-block-image size-large"><img data-recalc-dims="1" decoding="async" src="https://i0.wp.com/media.cleanshot.cloud/media/77873/12ys73dGMCU7sI2ZkksoPL8BKqRAoxshfURA6kIR.gif?ssl=1" alt=""/></figure>



<p>Note that this moves comma-separated elements. Meaning that tags within two sets of commas (one at the start or end of the prompt) will be moved about.</p>


<h4 class="wp-block-heading" id="adjusting-weights">Adjusting weights</h4>


<p>You can also increase your tags weight by adding brackets and the weight using the syntax<code>(tag:weight)</code>. This allows you to adjust the weight of the prompt without having to move the tags around. Weightage can apply to not just tags, but also other prompt elements such as LoRAs and embedding.</p>



<p>You can add the weight of the token in Stable Diffusion Web UI using the keyboard shortcut ⌘ CMD/CTRL + up/down arrow keys to adjust the weight.</p>



<figure class="wp-block-image size-large"><img data-recalc-dims="1" decoding="async" src="https://i0.wp.com/media.cleanshot.cloud/media/77873/H1FZ7n59S520xXPRneLz0SSIasXRvKTYE6qRQrka.gif?ssl=1" alt=""/></figure>



<p>By default, the tags have a weight of 1 at the position in the prompt. You can increase (above 1.0) or decrease (below 1.0) the weightage to adjust its strength in your prompt. You can actually give negative values to the tags, though at this point you might want to just place the token in the negative prompt.</p>


<h3 class="wp-block-heading" id="negative-prompt">Negative prompt</h3>


<p>Those of you who have tried other AI image generators like Midjourney, Mage Space or Leonardo AI would be familiar with negative prompts. A negative prompt is basically what you want the AI to avoid in the image. Common negative tags are <code>worst quality, low quality, deformed hands, bad anatomy</code>.</p>



<p>You can also place embeddings and LoRAs in the negative prompt to reverse their effect. Bear in mind that this would only work if they support negative weights.</p>



<p>For example, placing a LoRA that adds detail in the negative prompt will actually remove details instead of adding details.</p>


<h4 class="wp-block-heading" id="negative-embeddings">Negative embeddings</h4>


<p>Instead of trying to add too many negative prompts there are negative embeddings that makes it easier to insert common negative tags. I won’t go into details of negative embeddings in this article since it is another topic on its own.</p>



<p>The DeepNegative is an example when you use the NG_DeepNegative_V1_75T embedding that includes 75 tokens, it actually applies 75 tokens to your negative prompt. Notice that the negative prompt field will show 75/75 tokens used when you input just NG_DeepNegative_V1_75T in the negative prompt.</p>



<p>Other kinds of negative embeddings are textual inversions or LoRAs trained with concepts you don’t want to, such as bad drawings or anatomy. Applying these in the negative prompt will make the AI avoid these concepts.</p>



<p>A fun, but potentially scary/scarring, experiment to try is to place a negative embedding in the positive prompt.</p>


<h2 class="wp-block-heading" id="next-steps">Next steps</h2>


<p>Now that we have our prompt ready, we move on to generating the image.</p>



<p><a href="https://jenxi.com/subscribe/" data-type="page" data-id="2891">Don’t forget to sign up for my newsletter</a> to get the free Top 10 Realistic Checkpoints Database. You can also browse my lists of <a href="http://10-best-stable-diffusion-checkpoints-sd-1-5" target="_blank">top Stable Diffusion checkpoints</a> on the blog.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://jenxi.com/how-to-use-stable-diffusion-part-1/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
		<post-id xmlns="com-wordpress:feed-additions:1">15558</post-id>	</item>
		<item>
		<title>10+ Best Stable Diffusion checkpoints (SD 1.5)</title>
		<link>https://jenxi.com/10-best-stable-diffusion-checkpoints-sd-1-5/</link>
					<comments>https://jenxi.com/10-best-stable-diffusion-checkpoints-sd-1-5/#respond</comments>
		
		<dc:creator><![CDATA[Jenxi Seow]]></dc:creator>
		<pubDate>Fri, 19 Jan 2024 14:32:26 +0000</pubDate>
				<category><![CDATA[Blog]]></category>
		<category><![CDATA[Stable Diffusion]]></category>
		<category><![CDATA[Stable Diffusion checkpoint]]></category>
		<guid isPermaLink="false">https://jenxi.com/?p=15626</guid>

					<description><![CDATA[For those who are wondering what Stable Diffusion checkpoints to use, I’ve compiled the top 10 best Stable Diffusion checkpoints for SD 1.5.]]></description>
										<content:encoded><![CDATA[
<p>Now that you know <a href="https://jenxi.com/what-is-stable-diffusion/" data-type="post" data-id="15451">what Stable Diffusion is</a>, <a href="https://jenxi.com/how-to-install-stable-diffusion-web-ui/" data-type="post" data-id="15583">how to install it</a>, and the <a href="https://jenxi.com/how-to-use-stable-diffusion-part-1/" data-type="link" data-id="jenxi.com/how-to-use-stable-diffusion-part-1/">basic usage of Stable Diffusion</a>, you must be wondering what Stable checkpoints to use. I’ve compiled the top 10 best Stable Diffusion checkpoints for SD 1.5.</p>



<p>You’ve probably downloaded Stability AI’s official <a href="https://huggingface.co/runwayml/stable-diffusion-v1-5" target="_blank" rel="noopener">Stable Diffusion 1.5 model (SD 1.5)</a> during the installation process. While it is a big improvement over the <a href="https://huggingface.co/CompVis/stable-diffusion-v-1-4-original" target="_blank" rel="noopener">Stable Diffusion 1.4 model (SD 1.4)</a>, it is still a base model for general use with the main purpose of showcasing what Stable Diffusion 1.5 is capable of.</p>



<p>If you followed <a href="https://jenxi.com/how-to-install-stable-diffusion-web-ui/" data-type="link" data-id="https://jenxi.com/how-to-install-stable-diffusion-web-ui/">my installation guide</a>, you would have installed the DreamShaper v8 checkpoint. It’s the model I recommend to people who are new to Stable Diffusion.</p>



<p>To bring your Stable Diffusion to the next level, you need to get a custom checkpoint like DreamShaper. Before we dive into the top checkpoints, let’s have a brief look at what best Stable Diffusion checkpoints are.</p>



<div class="wp-block-group has-background has-global-padding is-layout-constrained wp-container-core-group-is-layout-2 wp-block-group-is-layout-constrained" style="border-radius:8px;background-color:#f7f7f7;padding-top:var(--wp--preset--spacing--small);padding-right:clamp(30px, 4vw, 40px);padding-bottom:var(--wp--preset--spacing--small);padding-left:clamp(30px, 4vw, 40px)"><h2 class="simpletoc-title">Contents</h2>
<ul class="simpletoc-list">
<li><a href="#the-best-stable-diffusion-checkpoints-ranked">The best Stable Diffusion checkpoints ranked</a>

</li>
<li><a href="#top-10-stable-diffusion-checkpoints">Top 10 Stable Diffusion checkpoints</a>


<ul><li>
<a href="#1-realistic-vision">1. Realistic Vision</a>

</li>
<li><a href="#2-chilloutmix">2. ChilloutMix</a>

</li>
<li><a href="#3-dreamshaper">3. DreamShaper</a>

</li>
<li><a href="#4-majicmix-realistic">4. MajicMix Realistic</a>

</li>
<li><a href="#5-uber-realistic-porn-merge-urpm">5. Uber Realistic Porn Merge (URPM)</a>

</li>
<li><a href="#6-epicrealism">6. epiCRealism</a>

</li>
<li><a href="#7-rev-animated">7. ReV Animated</a>

</li>
<li><a href="#8-perfect-world">8. Perfect World</a>

</li>
<li><a href="#9-meinamix">9. MeinaMix</a>

</li>
<li><a href="#10-beautiful-realistic-asians">10. Beautiful Realistic Asians</a>

</li>
<li><a href="#11-cyberrealistic-bonus">11. CyberRealistic (Bonus)</a>

</li>
<li><a href="#12-counterfeit-bonus">12. Counterfeit (Bonus)</a>

</li>
</ul>
<li><a href="#roundup">Round-up</a>

</li>
<li><a href="#free-notion-resource">Free Notion resource</a>
</li></ul></div>


<h2 class="wp-block-heading" id="the-best-stable-diffusion-checkpoints-ranked">The best Stable Diffusion checkpoints ranked</h2>


<p>I have listed the top 10 best Stable Diffusion checkpoints based on their popularity, ranking them based on the total number of downloads they have on Civitai.</p>


<h2 class="wp-block-heading" id="top-10-stable-diffusion-checkpoints">Top 10 Stable Diffusion checkpoints</h2>


<p>These checkpoints are ranked by popularity as of writing. Note that some of these checkpoints differ by very small number of downloads, so expect the rankings to fluctuate.</p>



<p>Regardless of their standings, these checkpoints are very established and popular amongst Stable Diffusion and are great to starting points for your Stable Diffusion journey.</p>


<h3 class="wp-block-heading" id="1-realistic-vision">1. Realistic Vision</h3>


<figure data-wp-context="{&quot;imageId&quot;:&quot;672f541045994&quot;}" data-wp-interactive="core/image" class="wp-block-image size-large wp-lightbox-container"><img data-recalc-dims="1" decoding="async" width="1140" height="342" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on-async--click="actions.showLightbox" data-wp-on-async--load="callbacks.setButtonStyles" data-wp-on-async-window--resize="callbacks.setButtonStyles" src="https://i0.wp.com/jenxi.com/wp-content/uploads/RealisticVision-v41.webp?resize=1140%2C342&#038;ssl=1" alt="Realistic Vision v4.1 checkpoint" class="wp-image-15664" srcset="https://i0.wp.com/jenxi.com/wp-content/uploads/RealisticVision-v41.webp?resize=1140%2C342&amp;ssl=1 1140w, https://i0.wp.com/jenxi.com/wp-content/uploads/RealisticVision-v41.webp?resize=700%2C210&amp;ssl=1 700w, https://i0.wp.com/jenxi.com/wp-content/uploads/RealisticVision-v41.webp?resize=768%2C230&amp;ssl=1 768w, https://i0.wp.com/jenxi.com/wp-content/uploads/RealisticVision-v41.webp?resize=1536%2C461&amp;ssl=1 1536w, https://i0.wp.com/jenxi.com/wp-content/uploads/RealisticVision-v41.webp?resize=2048%2C614&amp;ssl=1 2048w" sizes="(max-width: 1000px) 100vw, 1000px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			aria-label="Enlarge image: Realistic Vision v4.1 checkpoint"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on-async--click="actions.showLightbox"
			data-wp-style--right="state.imageButtonRight"
			data-wp-style--top="state.imageButtonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button></figure>



<ul class="wp-block-list">
<li>Type: Merge</li>



<li>Usage: Photograph</li>



<li>Download: <a href="https://civitai.com/models/4201" data-type="link" data-id="https://civitai.com/models/4201" target="_blank" rel="noopener">Civitai</a></li>
</ul>



<p>It is a close fight between Realistic Vision and ChilloutMix, but Realistic Vision edges out slightly probably because it can generate a wider range of face types. It is merged from a long list of realistic checkpoints to squeeze the most realism out of them.</p>



<p>While I don’t use Realistic Vision that much, I use the inpainting checkpoint all the time. It is great for fixing photorealistic images.</p>


<h3 class="wp-block-heading" id="2-chilloutmix">2. ChilloutMix</h3>


<figure data-wp-context="{&quot;imageId&quot;:&quot;672f5410467a3&quot;}" data-wp-interactive="core/image" class="wp-block-image size-large wp-lightbox-container"><img data-recalc-dims="1" decoding="async" width="1140" height="342" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on-async--click="actions.showLightbox" data-wp-on-async--load="callbacks.setButtonStyles" data-wp-on-async-window--resize="callbacks.setButtonStyles" src="https://i0.wp.com/jenxi.com/wp-content/uploads/ChilloutMix-Ni.webp?resize=1140%2C342&#038;ssl=1" alt="ChilloutMix Ni checkpoint" class="wp-image-15663" srcset="https://i0.wp.com/jenxi.com/wp-content/uploads/ChilloutMix-Ni.webp?resize=1140%2C342&amp;ssl=1 1140w, https://i0.wp.com/jenxi.com/wp-content/uploads/ChilloutMix-Ni.webp?resize=700%2C210&amp;ssl=1 700w, https://i0.wp.com/jenxi.com/wp-content/uploads/ChilloutMix-Ni.webp?resize=768%2C230&amp;ssl=1 768w, https://i0.wp.com/jenxi.com/wp-content/uploads/ChilloutMix-Ni.webp?resize=1536%2C461&amp;ssl=1 1536w, https://i0.wp.com/jenxi.com/wp-content/uploads/ChilloutMix-Ni.webp?resize=2048%2C614&amp;ssl=1 2048w" sizes="(max-width: 1000px) 100vw, 1000px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			aria-label="Enlarge image: ChilloutMix Ni checkpoint"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on-async--click="actions.showLightbox"
			data-wp-style--right="state.imageButtonRight"
			data-wp-style--top="state.imageButtonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button></figure>



<ul class="wp-block-list">
<li>Type: Merge</li>



<li>Usage: Photograph, digital painting</li>



<li>Download: <a href="https://civitai.com/models/6424/" target="_blank" rel="noopener">Civitai</a></li>
</ul>



<p>ChilloutMix is so good at creating realistic images that it stirred up a lot of controversy around the generation of images with a real person’s face. The issue snowballed to a point where the creator had to transfer ownership of the checkpoint to Civitai and go into hiding to avoid legal repercussions. I take this as proof of the checkpoint’s capability and popularity.</p>



<p>Besides being the most popular realistic checkpoint for Asian faces, ChilloutMix is also used by to train many LoRAs and checkpoints, even non-realistic ones.</p>



<p>ChilloutMix is released under the Dreamlike License due to the checkpoint it used in the merge that is rather restrictive. Model creators seem to have bypassed this by training models using generated images instead of just merging.</p>


<h3 class="wp-block-heading" id="3-dreamshaper">3. DreamShaper</h3>


<figure data-wp-context="{&quot;imageId&quot;:&quot;672f54104748e&quot;}" data-wp-interactive="core/image" class="wp-block-image size-large is-resized wp-lightbox-container"><img data-recalc-dims="1" decoding="async" width="1140" height="342" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on-async--click="actions.showLightbox" data-wp-on-async--load="callbacks.setButtonStyles" data-wp-on-async-window--resize="callbacks.setButtonStyles" src="https://i0.wp.com/jenxi.com/wp-content/uploads/DreamShaper-v8.webp?resize=1140%2C342&#038;ssl=1" alt="DreamShaper v8 checkpoint" class="wp-image-15667" style="width:640px;height:auto" srcset="https://i0.wp.com/jenxi.com/wp-content/uploads/DreamShaper-v8.webp?resize=1140%2C342&amp;ssl=1 1140w, https://i0.wp.com/jenxi.com/wp-content/uploads/DreamShaper-v8.webp?resize=700%2C210&amp;ssl=1 700w, https://i0.wp.com/jenxi.com/wp-content/uploads/DreamShaper-v8.webp?resize=768%2C230&amp;ssl=1 768w, https://i0.wp.com/jenxi.com/wp-content/uploads/DreamShaper-v8.webp?resize=1536%2C461&amp;ssl=1 1536w, https://i0.wp.com/jenxi.com/wp-content/uploads/DreamShaper-v8.webp?resize=2048%2C614&amp;ssl=1 2048w" sizes="(max-width: 1000px) 100vw, 1000px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			aria-label="Enlarge image: DreamShaper v8 checkpoint"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on-async--click="actions.showLightbox"
			data-wp-style--right="state.imageButtonRight"
			data-wp-style--top="state.imageButtonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button></figure>



<ul class="wp-block-list">
<li>Type: Trained</li>



<li>Usage: Photograph, digital painting, anime</li>



<li>Download: <a href="https://civitai.com/models/4384/dreamshaper" target="_blank" rel="noopener">Civitai</a>, <a href="https://huggingface.co/Lykon/DreamShaper" target="_blank" rel="noopener">Hugging Face</a></li>
</ul>



<p>DreamShaper by Lyon is the checkpoint I recommend to all Stable Diffusion beginners. If you’ve followed my installation and getting started guides, you would already have DreamShaper installed.</p>



<p>It is a very flexible checkpoint and can generate a wide range of styles and realism levels.</p>


<h3 class="wp-block-heading" id="4-majicmix-realistic">4. MajicMix Realistic</h3>


<figure data-wp-context="{&quot;imageId&quot;:&quot;672f541047e09&quot;}" data-wp-interactive="core/image" class="wp-block-image size-large wp-lightbox-container"><img data-recalc-dims="1" decoding="async" width="1140" height="342" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on-async--click="actions.showLightbox" data-wp-on-async--load="callbacks.setButtonStyles" data-wp-on-async-window--resize="callbacks.setButtonStyles" src="https://i0.wp.com/jenxi.com/wp-content/uploads/MajicMix-Realistic-v7.webp?resize=1140%2C342&#038;ssl=1" alt="" class="wp-image-15669" srcset="https://i0.wp.com/jenxi.com/wp-content/uploads/MajicMix-Realistic-v7.webp?resize=1140%2C342&amp;ssl=1 1140w, https://i0.wp.com/jenxi.com/wp-content/uploads/MajicMix-Realistic-v7.webp?resize=700%2C210&amp;ssl=1 700w, https://i0.wp.com/jenxi.com/wp-content/uploads/MajicMix-Realistic-v7.webp?resize=768%2C230&amp;ssl=1 768w, https://i0.wp.com/jenxi.com/wp-content/uploads/MajicMix-Realistic-v7.webp?resize=1536%2C461&amp;ssl=1 1536w, https://i0.wp.com/jenxi.com/wp-content/uploads/MajicMix-Realistic-v7.webp?resize=2048%2C614&amp;ssl=1 2048w" sizes="(max-width: 1000px) 100vw, 1000px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			aria-label="Enlarge image"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on-async--click="actions.showLightbox"
			data-wp-style--right="state.imageButtonRight"
			data-wp-style--top="state.imageButtonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button></figure>



<p>Type: Merge<br>Usage: Photograph, digital painting<br>MajicMix Realistic has become the standard for photorealism in the later half of 2023. It is one of my most used checkpoints, though nowadays I tend to switch between one of the MajicMix Realistic derivatives out there for a more specific look.</p>



<p>How popular is it? MajicMix Realistic has a recognisable face, especially before v7. It’s so easy to spot the face and I see it a lot on Taobao. Yes, Taobao sellers are already using AI-generated images for product images in mid-2023.</p>


<h3 class="wp-block-heading" id="5-uber-realistic-porn-merge-urpm">5. Uber Realistic Porn Merge (URPM)</h3>


<figure data-wp-context="{&quot;imageId&quot;:&quot;672f54104868a&quot;}" data-wp-interactive="core/image" class="wp-block-image size-large wp-lightbox-container"><img data-recalc-dims="1" decoding="async" width="1140" height="342" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on-async--click="actions.showLightbox" data-wp-on-async--load="callbacks.setButtonStyles" data-wp-on-async-window--resize="callbacks.setButtonStyles" src="https://i0.wp.com/jenxi.com/wp-content/uploads/URPM-v13.webp?resize=1140%2C342&#038;ssl=1" alt="" class="wp-image-15672" srcset="https://i0.wp.com/jenxi.com/wp-content/uploads/URPM-v13.webp?resize=1140%2C342&amp;ssl=1 1140w, https://i0.wp.com/jenxi.com/wp-content/uploads/URPM-v13.webp?resize=700%2C210&amp;ssl=1 700w, https://i0.wp.com/jenxi.com/wp-content/uploads/URPM-v13.webp?resize=768%2C230&amp;ssl=1 768w, https://i0.wp.com/jenxi.com/wp-content/uploads/URPM-v13.webp?resize=1536%2C461&amp;ssl=1 1536w, https://i0.wp.com/jenxi.com/wp-content/uploads/URPM-v13.webp?resize=2048%2C614&amp;ssl=1 2048w" sizes="(max-width: 1000px) 100vw, 1000px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			aria-label="Enlarge image"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on-async--click="actions.showLightbox"
			data-wp-style--right="state.imageButtonRight"
			data-wp-style--top="state.imageButtonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button></figure>



<ul class="wp-block-list">
<li>Type: Merge</li>



<li>Usage: Photograph, digital painting</li>



<li>Download: <a href="https://civitai.com/models/2661/uber-realistic-porn-merge-urpm" target="_blank" rel="noopener">Civitai</a> (NSFW)</li>
</ul>



<p>You can tell from the name that URPM is created to generated realistic NSFW images. That explains its popularity. It is very good with anatomy, naturally, and hence is used in many checkpoint merges for realistic and non-realistic models. Realistic Vision contains URPM.</p>



<p>Realistic Vision delivers better skin and hair textures due to the other checkpoints in the merge. But if you want a high level of NSFW concepts in your output, you might want to consider URPM.</p>


<h3 class="wp-block-heading" id="6-epicrealism">6. epiCRealism</h3>


<figure data-wp-context="{&quot;imageId&quot;:&quot;672f541048f80&quot;}" data-wp-interactive="core/image" class="wp-block-image size-large wp-lightbox-container"><img data-recalc-dims="1" decoding="async" width="1140" height="342" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on-async--click="actions.showLightbox" data-wp-on-async--load="callbacks.setButtonStyles" data-wp-on-async-window--resize="callbacks.setButtonStyles" src="https://i0.wp.com/jenxi.com/wp-content/uploads/epicrealism-naturalSin.webp?resize=1140%2C342&#038;ssl=1" alt="" class="wp-image-15674" srcset="https://i0.wp.com/jenxi.com/wp-content/uploads/epicrealism-naturalSin.webp?resize=1140%2C342&amp;ssl=1 1140w, https://i0.wp.com/jenxi.com/wp-content/uploads/epicrealism-naturalSin.webp?resize=700%2C210&amp;ssl=1 700w, https://i0.wp.com/jenxi.com/wp-content/uploads/epicrealism-naturalSin.webp?resize=768%2C230&amp;ssl=1 768w, https://i0.wp.com/jenxi.com/wp-content/uploads/epicrealism-naturalSin.webp?resize=1536%2C461&amp;ssl=1 1536w, https://i0.wp.com/jenxi.com/wp-content/uploads/epicrealism-naturalSin.webp?resize=2048%2C614&amp;ssl=1 2048w" sizes="(max-width: 1000px) 100vw, 1000px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			aria-label="Enlarge image"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on-async--click="actions.showLightbox"
			data-wp-style--right="state.imageButtonRight"
			data-wp-style--top="state.imageButtonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button><figcaption class="wp-element-caption">epiCRealism Natural Sin comparison</figcaption></figure>



<ul class="wp-block-list">
<li>Type: Merge</li>



<li>Usage: Type: Photograph</li>



<li>Download: <a href="https://civitai.com/models/25694/" target="_blank" rel="noopener">Civitai</a></li>
</ul>



<p>EpiCRealism by epinikion takes realistic output to the next level with the amazing skin and hair texture it generates. On top of realism, it can create images with a photographic look if you prompt for it. I love the cinematic lighting it’s capable of. This special look that epiCRealism delivers has led to many checkpoints being merged or trained with it.</p>



<p>Realistic Vision contains epiCRealism, but I feel that it lost some of the latter’s magic touch in the merge. When I want the photos to look like a photograph, epiCRealism is the first checkpoint I would use. Its derivatives are also useful if you want to go for a particular look.</p>


<h3 class="wp-block-heading" id="7-rev-animated">7. ReV Animated</h3>


<figure data-wp-context="{&quot;imageId&quot;:&quot;672f541049853&quot;}" data-wp-interactive="core/image" class="wp-block-image size-large wp-lightbox-container"><img data-recalc-dims="1" decoding="async" width="1140" height="342" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on-async--click="actions.showLightbox" data-wp-on-async--load="callbacks.setButtonStyles" data-wp-on-async-window--resize="callbacks.setButtonStyles" src="https://i0.wp.com/jenxi.com/wp-content/uploads/revAnimated-v122.webp?resize=1140%2C342&#038;ssl=1" alt="" class="wp-image-15675" srcset="https://i0.wp.com/jenxi.com/wp-content/uploads/revAnimated-v122.webp?resize=1140%2C342&amp;ssl=1 1140w, https://i0.wp.com/jenxi.com/wp-content/uploads/revAnimated-v122.webp?resize=700%2C210&amp;ssl=1 700w, https://i0.wp.com/jenxi.com/wp-content/uploads/revAnimated-v122.webp?resize=768%2C230&amp;ssl=1 768w, https://i0.wp.com/jenxi.com/wp-content/uploads/revAnimated-v122.webp?resize=1536%2C461&amp;ssl=1 1536w, https://i0.wp.com/jenxi.com/wp-content/uploads/revAnimated-v122.webp?resize=2048%2C614&amp;ssl=1 2048w" sizes="(max-width: 1000px) 100vw, 1000px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			aria-label="Enlarge image"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on-async--click="actions.showLightbox"
			data-wp-style--right="state.imageButtonRight"
			data-wp-style--top="state.imageButtonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button></figure>



<ul class="wp-block-list">
<li>Type: Merge</li>



<li>Usage: Digital painting</li>



<li>Download: <a href="https://civitai.com/models/7371/" target="_blank" rel="noopener">Civitai</a></li>
</ul>



<p>If you’re looking to create artwork with intricate details, ReV Animated is the checkpoint. It is so good at generate complex details and delivers a stunning digital painting style with the 2.5D to semi-realistic look it produces.</p>



<p>Unfortunately, the creator is no longer maintaining the model. I used to start off almost all my pieces with a Rev Animated draft, but it’s age is showing and there are other checkpoints out there that handles hands and complex poses better.</p>


<h3 class="wp-block-heading" id="8-perfect-world">8. Perfect World</h3>


<figure data-wp-context="{&quot;imageId&quot;:&quot;672f54104a1c2&quot;}" data-wp-interactive="core/image" class="wp-block-image size-large wp-lightbox-container"><img data-recalc-dims="1" decoding="async" width="1140" height="342" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on-async--click="actions.showLightbox" data-wp-on-async--load="callbacks.setButtonStyles" data-wp-on-async-window--resize="callbacks.setButtonStyles" src="https://i0.wp.com/jenxi.com/wp-content/uploads/perfectWorld-v5.webp?resize=1140%2C342&#038;ssl=1" alt="" class="wp-image-15679" srcset="https://i0.wp.com/jenxi.com/wp-content/uploads/perfectWorld-v5.webp?resize=1140%2C342&amp;ssl=1 1140w, https://i0.wp.com/jenxi.com/wp-content/uploads/perfectWorld-v5.webp?resize=700%2C210&amp;ssl=1 700w, https://i0.wp.com/jenxi.com/wp-content/uploads/perfectWorld-v5.webp?resize=768%2C230&amp;ssl=1 768w, https://i0.wp.com/jenxi.com/wp-content/uploads/perfectWorld-v5.webp?resize=1536%2C461&amp;ssl=1 1536w, https://i0.wp.com/jenxi.com/wp-content/uploads/perfectWorld-v5.webp?resize=2048%2C614&amp;ssl=1 2048w" sizes="(max-width: 1000px) 100vw, 1000px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			aria-label="Enlarge image"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on-async--click="actions.showLightbox"
			data-wp-style--right="state.imageButtonRight"
			data-wp-style--top="state.imageButtonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button></figure>



<ul class="wp-block-list">
<li>Type: Merge</li>



<li>Usage: Render, digital painting</li>



<li>Download: <a href="https://civitai.com/models/8281/" data-type="link" data-id="https://civitai.com/models/8281/" target="_blank" rel="noopener">Civitai</a></li>
</ul>



<p>Perfect World specialises in a semi-realistic look inspired by artwork from the game of the same name. It exaggerates body proportions, a goal that it proudly strives for. If that’s your thing, then this checkpoint is perfect for render-like and digital painting outputs.</p>


<h3 class="wp-block-heading" id="9-meinamix">9. MeinaMix</h3>


<figure data-wp-context="{&quot;imageId&quot;:&quot;672f54104ac15&quot;}" data-wp-interactive="core/image" class="wp-block-image size-large wp-lightbox-container"><img data-recalc-dims="1" decoding="async" width="1140" height="342" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on-async--click="actions.showLightbox" data-wp-on-async--load="callbacks.setButtonStyles" data-wp-on-async-window--resize="callbacks.setButtonStyles" src="https://i0.wp.com/jenxi.com/wp-content/uploads/MeinaMix-v11.webp?resize=1140%2C342&#038;ssl=1" alt="" class="wp-image-15683" srcset="https://i0.wp.com/jenxi.com/wp-content/uploads/MeinaMix-v11.webp?resize=1140%2C342&amp;ssl=1 1140w, https://i0.wp.com/jenxi.com/wp-content/uploads/MeinaMix-v11.webp?resize=700%2C210&amp;ssl=1 700w, https://i0.wp.com/jenxi.com/wp-content/uploads/MeinaMix-v11.webp?resize=768%2C230&amp;ssl=1 768w, https://i0.wp.com/jenxi.com/wp-content/uploads/MeinaMix-v11.webp?resize=1536%2C461&amp;ssl=1 1536w, https://i0.wp.com/jenxi.com/wp-content/uploads/MeinaMix-v11.webp?resize=2048%2C614&amp;ssl=1 2048w" sizes="(max-width: 1000px) 100vw, 1000px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			aria-label="Enlarge image"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on-async--click="actions.showLightbox"
			data-wp-style--right="state.imageButtonRight"
			data-wp-style--top="state.imageButtonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button></figure>



<ul class="wp-block-list">
<li>Type: Merge</li>



<li>Usage: Anime</li>



<li>Download: <a href="https://civitai.com/models/7240/" target="_blank" rel="noopener">Civitai</a></li>
</ul>



<p>MeinaMix is hands down my favourite anime checkpoint for the typical anime look. It strikes a good balance between the character and background.</p>


<h3 class="wp-block-heading" id="10-beautiful-realistic-asians">10. Beautiful Realistic Asians</h3>


<figure data-wp-context="{&quot;imageId&quot;:&quot;672f54104b4c1&quot;}" data-wp-interactive="core/image" class="wp-block-image size-large wp-lightbox-container"><img data-recalc-dims="1" decoding="async" width="1140" height="342" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on-async--click="actions.showLightbox" data-wp-on-async--load="callbacks.setButtonStyles" data-wp-on-async-window--resize="callbacks.setButtonStyles" src="https://i0.wp.com/jenxi.com/wp-content/uploads/beautifulRealistic-v7.png?resize=1140%2C342&#038;ssl=1" alt="" class="wp-image-15680" srcset="https://i0.wp.com/jenxi.com/wp-content/uploads/beautifulRealistic-v7.png?resize=1140%2C342&amp;ssl=1 1140w, https://i0.wp.com/jenxi.com/wp-content/uploads/beautifulRealistic-v7.png?resize=700%2C210&amp;ssl=1 700w, https://i0.wp.com/jenxi.com/wp-content/uploads/beautifulRealistic-v7.png?resize=768%2C230&amp;ssl=1 768w, https://i0.wp.com/jenxi.com/wp-content/uploads/beautifulRealistic-v7.png?resize=1536%2C461&amp;ssl=1 1536w, https://i0.wp.com/jenxi.com/wp-content/uploads/beautifulRealistic-v7.png?resize=2048%2C614&amp;ssl=1 2048w" sizes="(max-width: 1000px) 100vw, 1000px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			aria-label="Enlarge image"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on-async--click="actions.showLightbox"
			data-wp-style--right="state.imageButtonRight"
			data-wp-style--top="state.imageButtonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button></figure>



<ul class="wp-block-list">
<li>Type: Merge</li>



<li>Usage: Photograph</li>



<li>Download: <a href="https://civitai.com/models/25494/" target="_blank" rel="noopener">Civitai</a></li>
</ul>



<p>Beautiful Realistic Asians (BRA) by pleasebankai is capable of generating very realistic photographs of Asian subjects. However, I find that it takes good prompt engineering to coax the best out of it, and this kind of explains why it is less popular than ChilloutMix. When used right, BRA generates better realism than ChilloutMix.</p>



<p>The photographs BRA generates have a cinematic aesthetic to them like epiCRealism but specialising in Asian features. Why the need for Asian-specific checkpoints? Well, most of the general checkpoints give Asian faces that lean towards the Western stereotypes or preferences. These Asian-specific checkpoints delivers output that with Asian aesthetics.</p>



<p>Many realistic checkpoints use BRA in their merge partly because of its quality, and partly to avoid the licensing issue with ChilloutMix.</p>


<h3 class="wp-block-heading" id="11-cyberrealistic-bonus">11. CyberRealistic (Bonus)</h3>


<figure data-wp-context="{&quot;imageId&quot;:&quot;672f54104be45&quot;}" data-wp-interactive="core/image" class="wp-block-image size-large wp-lightbox-container"><img data-recalc-dims="1" decoding="async" width="1140" height="342" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on-async--click="actions.showLightbox" data-wp-on-async--load="callbacks.setButtonStyles" data-wp-on-async-window--resize="callbacks.setButtonStyles" src="https://i0.wp.com/jenxi.com/wp-content/uploads/cyberrealistic-v41.webp?resize=1140%2C342&#038;ssl=1" alt="" class="wp-image-15681" srcset="https://i0.wp.com/jenxi.com/wp-content/uploads/cyberrealistic-v41.webp?resize=1140%2C342&amp;ssl=1 1140w, https://i0.wp.com/jenxi.com/wp-content/uploads/cyberrealistic-v41.webp?resize=700%2C210&amp;ssl=1 700w, https://i0.wp.com/jenxi.com/wp-content/uploads/cyberrealistic-v41.webp?resize=768%2C230&amp;ssl=1 768w, https://i0.wp.com/jenxi.com/wp-content/uploads/cyberrealistic-v41.webp?resize=1536%2C461&amp;ssl=1 1536w, https://i0.wp.com/jenxi.com/wp-content/uploads/cyberrealistic-v41.webp?resize=2048%2C614&amp;ssl=1 2048w" sizes="(max-width: 1000px) 100vw, 1000px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			aria-label="Enlarge image"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on-async--click="actions.showLightbox"
			data-wp-style--right="state.imageButtonRight"
			data-wp-style--top="state.imageButtonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button></figure>



<ul class="wp-block-list">
<li>Type: Merge</li>



<li>Usage: Photograph</li>



<li>Download: <a href="https://civitai.com/models/15003/" target="_blank" rel="noopener">Civitai</a>, <a href="https://huggingface.co/cyberdelia/CyberRealistic/tree/main" target="_blank" rel="noopener">Hugging Face</a></li>
</ul>



<p>To me, CyberRealistic sits between Realistic Vision and epiCRealism. I feel that it delivers output closer to what Realistic Vision is trying to achieve while retaining a more photographic aesthetic. It also works very well with textual inversions and LoRAs, though I’ll need to do some tests to determine if it is more versatile than the other two checkpoints.</p>



<p>Realistic Vision contains CyberRealistic. I suggest trying all three to see which you prefer, or just switch between them. All three are great for photorealistic output, especially if you want to avoid Asian faces.</p>


<h3 class="wp-block-heading" id="12-counterfeit-bonus">12. Counterfeit (Bonus)</h3>


<figure data-wp-context="{&quot;imageId&quot;:&quot;672f54104c78c&quot;}" data-wp-interactive="core/image" class="wp-block-image size-large wp-lightbox-container"><img data-recalc-dims="1" decoding="async" width="1140" height="342" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on-async--click="actions.showLightbox" data-wp-on-async--load="callbacks.setButtonStyles" data-wp-on-async-window--resize="callbacks.setButtonStyles" src="https://i0.wp.com/jenxi.com/wp-content/uploads/Counterfeit-v30.webp?resize=1140%2C342&#038;ssl=1" alt="" class="wp-image-15682" srcset="https://i0.wp.com/jenxi.com/wp-content/uploads/Counterfeit-v30.webp?resize=1140%2C342&amp;ssl=1 1140w, https://i0.wp.com/jenxi.com/wp-content/uploads/Counterfeit-v30.webp?resize=700%2C210&amp;ssl=1 700w, https://i0.wp.com/jenxi.com/wp-content/uploads/Counterfeit-v30.webp?resize=768%2C230&amp;ssl=1 768w, https://i0.wp.com/jenxi.com/wp-content/uploads/Counterfeit-v30.webp?resize=1536%2C461&amp;ssl=1 1536w, https://i0.wp.com/jenxi.com/wp-content/uploads/Counterfeit-v30.webp?resize=2048%2C614&amp;ssl=1 2048w" sizes="(max-width: 1000px) 100vw, 1000px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			aria-label="Enlarge image"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on-async--click="actions.showLightbox"
			data-wp-style--right="state.imageButtonRight"
			data-wp-style--top="state.imageButtonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button></figure>



<ul class="wp-block-list">
<li>Type: Merge</li>



<li>Usage: Anime</li>



<li>Download: <a href="https://civitai.com/models/4468/" target="_blank" rel="noopener">Civitai</a>, Hugging Face (<a href="https://huggingface.co/gsdf/Counterfeit-V2.0" target="_blank" rel="noopener">v2.0</a>, <a href="https://huggingface.co/gsdf/Counterfeit-V2.5" target="_blank" rel="noopener">v2.5</a>, <a href="https://huggingface.co/gsdf/Counterfeit-V3.0" target="_blank" rel="noopener">v3.0</a>)</li>
</ul>



<p>I had to add this as another bonus to the Top 10 because the majority of anime checkpoints out there can be traced back to Counterfeit. They either use Counterfeit or a derivative in their merges, or are trained with Counterfeit-generated data sets.</p>



<p>Before Counterfeit and Abyss Orange Mix came about, anime checkpoints were mostly trained on the controversial Anything V3. Anything V3 is suspected to be the leaked NovelAI checkpoint, and hence has a lot of question marks hanging over it regarding copyright infringement.</p>


<h2 class="wp-block-heading" id="roundup">Round-up</h2>


<figure class="wp-block-image size-large"><img data-recalc-dims="1" decoding="async" width="1140" height="342" src="https://i0.wp.com/jenxi.com/wp-content/uploads/Notion-Top-checkpoints.webp?resize=1140%2C342&#038;ssl=1" alt="Top 10 best Stable Diffusion checkpoints of SD 1.5" class="wp-image-15651" srcset="https://i0.wp.com/jenxi.com/wp-content/uploads/Notion-Top-checkpoints.webp?resize=1140%2C342&amp;ssl=1 1140w, https://i0.wp.com/jenxi.com/wp-content/uploads/Notion-Top-checkpoints.webp?resize=700%2C210&amp;ssl=1 700w, https://i0.wp.com/jenxi.com/wp-content/uploads/Notion-Top-checkpoints.webp?resize=768%2C230&amp;ssl=1 768w, https://i0.wp.com/jenxi.com/wp-content/uploads/Notion-Top-checkpoints.webp?resize=1536%2C461&amp;ssl=1 1536w, https://i0.wp.com/jenxi.com/wp-content/uploads/Notion-Top-checkpoints.webp?resize=2048%2C614&amp;ssl=1 2048w" sizes="(max-width: 1000px) 100vw, 1000px" /><figcaption class="wp-element-caption">Top 10 best Stable Diffusion checkpoints of SD 1.5</figcaption></figure>



<p>These top ten or twelve best Stable Diffusion checkpoints are ranked based on their total number of downloads on Civitai at the time of publishing this article. This might not be a truly accurate reflection of their popularity, because some of them have multiple versions and downloads of all the versions add up to the total downloads.</p>



<p>A checkpoint with ten versions can have someone downloading all eight out of ten versions, whereas another with one version gets only one download. Still, it is a good list to get you started if you are new to Stable Diffusion, and you’re looking for checkpoints to play with.</p>



<p>Having more versions means more effort has been put in to keep improving the checkpoints. If you’re interested to rank the checkpoints based on the actual number of downloads, check out my Ultimate Stable Diffusion Checkpoint Database for a comprehensive breakdown by version.</p>


<h2 class="wp-block-heading" id="free-notion-resource">Free Notion resource</h2>


<figure data-wp-context="{&quot;imageId&quot;:&quot;672f54104d2f7&quot;}" data-wp-interactive="core/image" class="wp-block-image size-large wp-lightbox-container"><img data-recalc-dims="1" decoding="async" width="1140" height="749" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on-async--click="actions.showLightbox" data-wp-on-async--load="callbacks.setButtonStyles" data-wp-on-async-window--resize="callbacks.setButtonStyles" src="https://i0.wp.com/jenxi.com/wp-content/uploads/Notion-Top-10-Stable-Diffusion-free-Notion-template-web.webp?resize=1140%2C749&#038;ssl=1" alt="Best Stable Diffusion checkpoints Notion resource" class="wp-image-15686" srcset="https://i0.wp.com/jenxi.com/wp-content/uploads/Notion-Top-10-Stable-Diffusion-free-Notion-template-web.webp?resize=1140%2C749&amp;ssl=1 1140w, https://i0.wp.com/jenxi.com/wp-content/uploads/Notion-Top-10-Stable-Diffusion-free-Notion-template-web.webp?resize=700%2C460&amp;ssl=1 700w, https://i0.wp.com/jenxi.com/wp-content/uploads/Notion-Top-10-Stable-Diffusion-free-Notion-template-web.webp?resize=768%2C504&amp;ssl=1 768w, https://i0.wp.com/jenxi.com/wp-content/uploads/Notion-Top-10-Stable-Diffusion-free-Notion-template-web.webp?resize=1536%2C1009&amp;ssl=1 1536w, https://i0.wp.com/jenxi.com/wp-content/uploads/Notion-Top-10-Stable-Diffusion-free-Notion-template-web.webp?w=1600&amp;ssl=1 1600w" sizes="(max-width: 1000px) 100vw, 1000px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			aria-label="Enlarge image: Best Stable Diffusion checkpoints Notion resource"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on-async--click="actions.showLightbox"
			data-wp-style--right="state.imageButtonRight"
			data-wp-style--top="state.imageButtonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button><figcaption class="wp-element-caption">Best Stable Diffusion checkpoints Notion resource</figcaption></figure>



<p>I’m giving away my personal top 10 Stable Diffusion photorealistic checkpoint list for free when you <a href="http://jenxi.com/subscribe/" data-type="link" data-id="http://jenxi.com/subscribe/">join my newsletter</a>. While it is less comprehensive than the paid Ultimate Database, this free list is a good way starting point if you are new to the world of Stable Diffusion.</p>



<p>I’ve split the list so that you can view the top 10 trained checkpoints, top 10 merged checkpoints, or the overall top 10 checkpoints.</p>



<p>What about anime or painterly checkpoints? You can find them in the Ultimate Database. There’s just an overwhelming number of anime checkpoints out there. Besides, they differ a lot by art style and that is subject to personal taste, so it doesn’t make sense for to rank them.</p>



<p>You get the full-sized comparison grids to study the effect of different variables and parameters on the checkpoints output. I create these for my own research and write down my notes in the Ultimate Database, and I’m sharing it with you.</p>



<p>Feel free to reach out if you have checkpoints to suggest or if you want me to do a more in-depth review.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://jenxi.com/10-best-stable-diffusion-checkpoints-sd-1-5/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
		<post-id xmlns="com-wordpress:feed-additions:1">15626</post-id>	</item>
		<item>
		<title>Asuka Langley Soryu AI art</title>
		<link>https://jenxi.com/asuka-langley-soryu-ai-art/</link>
					<comments>https://jenxi.com/asuka-langley-soryu-ai-art/#respond</comments>
		
		<dc:creator><![CDATA[Jenxi Seow]]></dc:creator>
		<pubDate>Thu, 18 Jan 2024 14:00:23 +0000</pubDate>
				<category><![CDATA[Art]]></category>
		<category><![CDATA[AI art]]></category>
		<category><![CDATA[Evangelion]]></category>
		<category><![CDATA[Stable Diffusion]]></category>
		<guid isPermaLink="false">https://jenxi.com/?p=15630</guid>

					<description><![CDATA[Ever since I got into anime in the late 90s, both Rei Ayanami and Asuka Langley Soryu have been two of the most prominent heroines.]]></description>
										<content:encoded><![CDATA[
<figure class="wp-block-image size-large"><img data-recalc-dims="1" decoding="async" width="760" height="1140" src="https://i0.wp.com/jenxi.com/wp-content/uploads/asuka-langley-soryu-2-by-jenxi-art.webp?resize=760%2C1140&#038;ssl=1" alt="" class="wp-image-15632" srcset="https://i0.wp.com/jenxi.com/wp-content/uploads/asuka-langley-soryu-2-by-jenxi-art.webp?resize=760%2C1140&amp;ssl=1 760w, https://i0.wp.com/jenxi.com/wp-content/uploads/asuka-langley-soryu-2-by-jenxi-art.webp?resize=467%2C700&amp;ssl=1 467w, https://i0.wp.com/jenxi.com/wp-content/uploads/asuka-langley-soryu-2-by-jenxi-art.webp?resize=768%2C1152&amp;ssl=1 768w, https://i0.wp.com/jenxi.com/wp-content/uploads/asuka-langley-soryu-2-by-jenxi-art.webp?resize=1024%2C1536&amp;ssl=1 1024w, https://i0.wp.com/jenxi.com/wp-content/uploads/asuka-langley-soryu-2-by-jenxi-art.webp?w=1280&amp;ssl=1 1280w" sizes="(max-width: 760px) 100vw, 760px" /></figure>



<p>Ever since I got into anime in the late 90s, both Rei Ayanami and Asuka Langley Soryu have been two of the most prominent heroines. This was due to the sheer popularity of Neon Genesis Evangelion and their character designs. Their posters and figurines filled the otaku shops that I frequented.</p>



<p>So it’s inevitable that I would work on Rei and Asuka pieces once I started creating realistic artwork of my favourite anime characters.</p>


<h2 class="wp-block-heading" id="creation">Creation</h2>


<p>Anime checkpoints are able to get her interface headset right, but they start turning into hair bands or accessories with realistic checkpoints.</p>



<p>Asuka is a US citizen of Japanese and German descent. Most depictions show her with more Caucasian features. I decided to try to play around with her facial features while trying to retain her red hair and blue eyes. However, the hair and eye colours were weird in some of the styles so I tweaked them to go with a more cosplay-like approach rather than a character recreation.</p>


<h2 class="wp-block-heading" id="tools-used">Tools used</h2>


<ul class="wp-block-list">
<li><a href="https://github.com/AUTOMATIC1111/stable-diffusion-webui" target="_blank" rel="noopener">Stable Diffusion Web UI</a></li>



<li>Adobe Photoshop (Beta)</li>



<li>Huion Kamvas Pro 16</li>
</ul>


<h2 class="wp-block-heading" id="downloads">Downloads</h2>


<p>The HD versions of these images available for download on Afdian. See the links below each image.</p>



<p>The 4K versions will be available to members on Patreon and DeviantArt. <a href="https://jenxi.com/subscribe/">Subscribe to my newsletter</a> to be notified when these and new content become available.</p>


<h3 class="wp-block-heading" id="asuka-langley-soryu-hello-there">Asuka Langley Soryu hello there</h3>


<figure class="wp-block-image size-large"><img data-recalc-dims="1" decoding="async" width="760" height="1140" src="https://i0.wp.com/jenxi.com/wp-content/uploads/asuka-langley-soryu-1-by-jenx-free-mark.webp?resize=760%2C1140&#038;ssl=1" alt="" class="wp-image-15631" srcset="https://i0.wp.com/jenxi.com/wp-content/uploads/asuka-langley-soryu-1-by-jenx-free-mark.webp?resize=760%2C1140&amp;ssl=1 760w, https://i0.wp.com/jenxi.com/wp-content/uploads/asuka-langley-soryu-1-by-jenx-free-mark.webp?resize=467%2C700&amp;ssl=1 467w, https://i0.wp.com/jenxi.com/wp-content/uploads/asuka-langley-soryu-1-by-jenx-free-mark.webp?resize=768%2C1152&amp;ssl=1 768w, https://i0.wp.com/jenxi.com/wp-content/uploads/asuka-langley-soryu-1-by-jenx-free-mark.webp?resize=1024%2C1536&amp;ssl=1 1024w, https://i0.wp.com/jenxi.com/wp-content/uploads/asuka-langley-soryu-1-by-jenx-free-mark.webp?w=1280&amp;ssl=1 1280w" sizes="(max-width: 760px) 100vw, 760px" /><figcaption class="wp-element-caption">Asuka greets her new pilot.</figcaption></figure>



<ul class="wp-block-list">
<li>Get it on <a href="https://ko-fi.com/i/IF1F3QOLH3" target="_blank" rel="noopener">Ko-fi</a></li>



<li>Get it on <a href="https://www.patreon.com/posts/92016172" target="_blank" rel="noopener">Patreon</a></li>



<li>Get it using Alipay or WeChat Pay on <a href="https://afdian.net/item/ad746e1c780411eeabf55254001e7c00" target="_blank" rel="noopener">Afdian 爱发电</a></li>



<li>Collect it on <a href="https://www.deviantart.com/jenxi/art/Asuka-Langley-Soryu-hello-there-991522089" target="_blank" rel="noopener">DeviantArt</a>.</li>
</ul>


<h3 class="wp-block-heading" id="asuka-langley-soryu-dash">Asuka Langley Soryu dash</h3>


<figure class="wp-block-image size-large"><img data-recalc-dims="1" decoding="async" width="760" height="1140" src="https://i0.wp.com/jenxi.com/wp-content/uploads/asuka-langley-soryu-2-by-jenxi-art-1.webp?resize=760%2C1140&#038;ssl=1" alt="" class="wp-image-15633" srcset="https://i0.wp.com/jenxi.com/wp-content/uploads/asuka-langley-soryu-2-by-jenxi-art-1.webp?resize=760%2C1140&amp;ssl=1 760w, https://i0.wp.com/jenxi.com/wp-content/uploads/asuka-langley-soryu-2-by-jenxi-art-1.webp?resize=467%2C700&amp;ssl=1 467w, https://i0.wp.com/jenxi.com/wp-content/uploads/asuka-langley-soryu-2-by-jenxi-art-1.webp?resize=768%2C1152&amp;ssl=1 768w, https://i0.wp.com/jenxi.com/wp-content/uploads/asuka-langley-soryu-2-by-jenxi-art-1.webp?resize=1024%2C1536&amp;ssl=1 1024w, https://i0.wp.com/jenxi.com/wp-content/uploads/asuka-langley-soryu-2-by-jenxi-art-1.webp?w=1280&amp;ssl=1 1280w" sizes="(max-width: 760px) 100vw, 760px" /><figcaption class="wp-element-caption">Asuka running through the battle field to get to her mecha.</figcaption></figure>



<ul class="wp-block-list">
<li>Get it on <a href="https://ko-fi.com/i/IB0B5THL9A" target="_blank" rel="noopener">Ko-fi</a></li>



<li>Get it on <a href="https://www.patreon.com/posts/96731794" target="_blank" rel="noopener">Patreon</a></li>



<li>Get it using Alipay or WeChat Pay on <a href="https://afdian.net/item/8553b03eb60a11ee837d52540025c377" target="_blank" rel="noopener">Afdian 爱发电</a></li>



<li>Collect it on <a href="https://www.deviantart.com/jenxi/art/Asuka-Langley-Soryu-hello-there-991522089" target="_blank" rel="noopener">DeviantArt</a></li>
</ul>
]]></content:encoded>
					
					<wfw:commentRss>https://jenxi.com/asuka-langley-soryu-ai-art/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
		<post-id xmlns="com-wordpress:feed-additions:1">15630</post-id>	</item>
		<item>
		<title>Chun-Li AI art</title>
		<link>https://jenxi.com/chun-li-ai-art/</link>
					<comments>https://jenxi.com/chun-li-ai-art/#respond</comments>
		
		<dc:creator><![CDATA[Jenxi Seow]]></dc:creator>
		<pubDate>Mon, 08 Jan 2024 12:19:29 +0000</pubDate>
				<category><![CDATA[Art]]></category>
		<category><![CDATA[AI art]]></category>
		<category><![CDATA[Chun-Li]]></category>
		<category><![CDATA[Stable Diffusion]]></category>
		<guid isPermaLink="false">https://jenxi.com/?p=15617</guid>

					<description><![CDATA[Chun-Li is one of the first female fighters in gaming history, and has become one of the most iconic game characters.]]></description>
										<content:encoded><![CDATA[
<p>Chun-Li is one of the first female fighters in gaming history, and has become one of the most iconic game characters. It was a pretty easy decision to pick up Chun-Li as my third fighting game character art after <a href="https://jenxi.com/mai-shiranui-ai-art/" data-type="post" data-id="15131">Mai Shiranui</a> and <a href="https://jenxi.com/sakura-kasugano-ai-art/" data-type="post" data-id="15508">Sakura Kasugano</a>.</p>



<div class="wp-block-group has-background has-global-padding is-layout-constrained wp-container-core-group-is-layout-3 wp-block-group-is-layout-constrained" style="border-radius:8px;background-color:#f7f7f7;padding-top:var(--wp--preset--spacing--small);padding-right:clamp(30px, 4vw, 40px);padding-bottom:var(--wp--preset--spacing--small);padding-left:clamp(30px, 4vw, 40px)"><h2 class="simpletoc-title">Contents</h2>
<ul class="simpletoc-list">
<li><a href="#creation">Creation</a>

</li>
<li><a href="#tools-used">Tools used</a>

</li>
<li><a href="#downloads">Downloads</a>


<ul><li>
<a href="#chunli-challenges-you">Chun-Li challenges you</a>
</li>
</ul>
</li></ul></div>


<h2 class="wp-block-heading" id="creation">Creation</h2>


<p>The challenge with Chun-Li is getting her hair bunds and spiked bracelets right. The spikes were so troublesome that I went with cloth bracelets for the first piece instead. I like how that turned out in the image.</p>



<p>Chun-Li is known for her large, muscular thighs, and that was what I depicted. However, the most feedback I’ve received so far is about how the thighs are too big to be realistic. That’s the character design, and the fun of portraying a character in photorealistic style. You get to have fun and be creative instead of just trying to make it look real.</p>


<h2 class="wp-block-heading" id="tools-used">Tools used</h2>


<ul class="wp-block-list">
<li><a href="https://github.com/AUTOMATIC1111/stable-diffusion-webui" target="_blank" rel="noopener">Stable Diffusion Web UI</a></li>



<li>Adobe Photoshop (Beta)</li>



<li>Huion Kamvas Pro 16</li>
</ul>


<h2 class="wp-block-heading" id="downloads">Downloads</h2>


<p>The HD versions of these images available for download on Afdian. See the links below each image.</p>



<p>The 4K versions will be available to members on Patreon and Pixiv. <a href="https://jenxi.com/subscribe/">Subscribe to my newsletter</a> to be notified when these and new content become available.</p>


<h3 class="wp-block-heading" id="chunli-challenges-you">Chun-Li challenges you</h3>


<figure class="wp-block-image size-large"><img data-recalc-dims="1" decoding="async" width="760" height="1140" src="https://i0.wp.com/jenxi.com/wp-content/uploads/chun-li-1-by-jenxi-free-mark.webp?resize=760%2C1140&#038;ssl=1" alt="" class="wp-image-15619" srcset="https://i0.wp.com/jenxi.com/wp-content/uploads/chun-li-1-by-jenxi-free-mark.webp?resize=760%2C1140&amp;ssl=1 760w, https://i0.wp.com/jenxi.com/wp-content/uploads/chun-li-1-by-jenxi-free-mark.webp?resize=467%2C700&amp;ssl=1 467w, https://i0.wp.com/jenxi.com/wp-content/uploads/chun-li-1-by-jenxi-free-mark.webp?resize=768%2C1152&amp;ssl=1 768w, https://i0.wp.com/jenxi.com/wp-content/uploads/chun-li-1-by-jenxi-free-mark.webp?resize=1024%2C1536&amp;ssl=1 1024w, https://i0.wp.com/jenxi.com/wp-content/uploads/chun-li-1-by-jenxi-free-mark.webp?w=1280&amp;ssl=1 1280w" sizes="(max-width: 760px) 100vw, 760px" /><figcaption class="wp-element-caption">Are you up for a fight against Chun-Li?</figcaption></figure>



<ul class="wp-block-list">
<li>Get it on <a href="https://ko-fi.com/i/IV7V1T3X87" target="_blank" rel="noopener">Ko-fi</a></li>



<li>Get it on <a href="https://www.patreon.com/posts/96089949" target="_blank" rel="noopener">Patreon</a></li>



<li>Get it using Alipay or WeChat Pay on <a href="https://afdian.net/item/1580089a9c0411eeb29852540025c377" target="_blank" rel="noopener">Afdian 爱发电</a></li>



<li>Collect it on <a href="https://www.deviantart.com/jenxi/art/Chun-Li-challenges-you-1009219424" target="_blank" rel="noopener">DeviantArt</a> and <a href="https://www.pixiv.net/en/artworks/115003759" target="_blank" rel="noopener">Pixiv</a></li>
</ul>



<p></p>
]]></content:encoded>
					
					<wfw:commentRss>https://jenxi.com/chun-li-ai-art/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
		<post-id xmlns="com-wordpress:feed-additions:1">15617</post-id>	</item>
		<item>
		<title>What is a Stable Diffusion checkpoint</title>
		<link>https://jenxi.com/what-is-a-stable-diffusion-checkpoint/</link>
					<comments>https://jenxi.com/what-is-a-stable-diffusion-checkpoint/#respond</comments>
		
		<dc:creator><![CDATA[Jenxi Seow]]></dc:creator>
		<pubDate>Sat, 25 Nov 2023 16:10:20 +0000</pubDate>
				<category><![CDATA[Blog]]></category>
		<category><![CDATA[checkpoints]]></category>
		<category><![CDATA[Stable Diffusion]]></category>
		<guid isPermaLink="false">https://jenxi.com/?p=15605</guid>

					<description><![CDATA[What is a Stable Diffusion checkpoint? Is it a model or is it something different? What are the different types of checkpoints you can use?]]></description>
										<content:encoded><![CDATA[
<p>You might have heard of checkpoints in the context of machine learning, especially in generative AI image creation. What is a <a href="https://jenxi.com/what-is-stable-diffusion/" data-type="post" data-id="15451">Stable Diffusion</a> checkpoint? Is it a model or is it something different?</p>



<div class="wp-block-group has-background has-global-padding is-layout-constrained wp-container-core-group-is-layout-4 wp-block-group-is-layout-constrained" style="border-radius:8px;background-color:#f7f7f7;padding-top:var(--wp--preset--spacing--small);padding-right:clamp(30px, 4vw, 40px);padding-bottom:var(--wp--preset--spacing--small);padding-left:clamp(30px, 4vw, 40px)"><h2 class="simpletoc-title">Contents</h2>
<ul class="simpletoc-list">
<li><a href="#what-are-checkpoints">What are checkpoints</a>

</li>
<li><a href="#types-of-checkpoints">Types of checkpoints</a>

</li>
<li><a href="#checkpoint-types-trained-knowledge">Checkpoint types – trained knowledge</a>


<ul><li>
<a href="#trained-checkpoints">Trained checkpoints</a>

</li>
<li><a href="#merged-checkpoints">Merged checkpoints</a>

</li>
</ul>
<li><a href="#checkpoint-types-image-output">Checkpoint types – image output</a>


<ul><li>
<a href="#photorealistic-checkpoints">Photorealistic checkpoints</a>

</li>
<li><a href="#digital-painting-checkpoints">Digital painting checkpoints</a>

</li>
<li><a href="#render-checkpoints">Render checkpoints</a>

</li>
<li><a href="#anime-checkpoints">Anime checkpoints</a>

</li>
<li><a href="#illustration-checkpoints">Illustration checkpoints</a>

</li>
<li><a href="#general-purpose-checkpoints">General purpose checkpoints</a>

</li>
</ul>
<li><a href="#checkpoint-types-nbsprealism">Checkpoint types –&nbsp;realism</a>


<ul><li>
<a href="#realistic-checkpoints">Realistic checkpoints</a>

</li>
<li><a href="#semirealistic-checkpoints">Semi-realistic checkpoints</a>

</li>
<li><a href="#28d-checkpoints">2.8D checkpoints</a>

</li>
<li><a href="#25d-checkpoints">2.5D checkpoints</a>

</li>
<li><a href="#2d-checkpoints">2D checkpoints</a>

</li>
</ul>
<li><a href="#other-types-of-categories">Other types of categories</a>

</li>
<li><a href="#choosing-stable-diffusion-checkpoints">Choosing Stable Diffusion checkpoints</a>
</li></ul></div>


<h2 class="wp-block-heading" id="what-are-checkpoints">What are checkpoints</h2>


<p>Checkpoints and models are fundamental concepts in machine learning that are related but distinct. It can get a bit confusing when the terms are commonly used interchangeably.</p>



<p>A model is a complex algorithm trained to make predictions based on input data. The process of model training is where the model learns patterns and information from a given training dataset.</p>



<p>A checkpoint is a snapshot during the training that captures the state of a model at a specific stage in the training process. In other words, checkpoints are a type of AI models. There are other types of Stable Diffusion models like LoRAs, LoCONs, LoHAs, LECOs and so on, but we will only be looking at checkpoints today.</p>



<p>Think of checkpoints as save points in a video game, allowing you to capture the state of your model at specific intervals during training. When you use a checkpoint, you are able to generate images using the concepts and knowledge it has learnt up to the checkpoint.</p>


<h2 class="wp-block-heading" id="types-of-checkpoints">Types of checkpoints</h2>


<p>If you know me well enough, you’re most likely aware of how fussy I am about organising things. I group my Stable Diffusion checkpoints based on the output they are able to produce. There are several ways to group checkpoints, including trained knowledge, image output, and realism.</p>



<p>I don’t group my checkpoints based on trained knowledge, but it is useful to know how they are trained to understand what they are capable of. If you are way of using checkpoints trained on copyrighted material, knowing how they are created would be key.</p>



<p>You can also group checkpoints by the level of realism they can achieve. The realism here generally refers to the proportions I find this a good starting point to identify the best checkpoint to use. Of course, some checkpoints are capable of multiple levels of realism.</p>



<p>However, I don’t group my checkpoints by realism. I prefer to sort my checkpoints based on their image output capability. There are checkpoints that can achieve different types of output, but I<br>find that the better checkpoints are generally specialised for a particular type of usage instead of being able to produce different looks.</p>



<p>Note that other than the categorisation by type, the other groupings can be subjective and are just a general way to group the checkpoints to make it easier to organise them. You’ll find that there many Stable Diffusion checkpoints fall under a few categories in actual usage.</p>



<p>All these different categorisation can be a little confusing that’s why I created my Stable Diffusion checkpoint databases to help me track what the checkpoints are capable of.</p>


<h2 class="wp-block-heading" id="checkpoint-types-trained-knowledge">Checkpoint types – trained knowledge</h2>


<p>One way of grouping Stable Diffusion checkpoints is based on how they are trained.</p>


<h3 class="wp-block-heading" id="trained-checkpoints">Trained checkpoints</h3>


<p>Models like the <a href="https://huggingface.co/CompVis/stable-diffusion-v-1-4-original" data-type="link" data-id="https://huggingface.co/CompVis/stable-diffusion-v-1-4-original" target="_blank" rel="noopener">SD 1.4</a> or <a href="https://huggingface.co/runwayml/stable-diffusion-v1-5" data-type="link" data-id="https://huggingface.co/runwayml/stable-diffusion-v1-5" target="_blank" rel="noopener">SD 1.5</a> models are models trained by <a href="https://stability.ai" data-type="link" data-id="https://stability.ai" target="_blank" rel="noopener">Stability AI</a> on a large dataset. Model creators can create similar base models by training a new model with their own dataset. These are referred to as trained checkpoints.</p>



<p>You can also fine-tune a model by using a base model as a starting point to train your dataset. This base model can be the SD 1.4 or SD 1.5 checkpoints, or another checkpoint. Fine-turning is done to adapt an existing model for a specific task or dataset, such as a particular art style, person or character.</p>



<p>Both base models and fine-tuned models are referred to as trained checkpoints.</p>


<h3 class="wp-block-heading" id="merged-checkpoints">Merged checkpoints</h3>


<p>Checkpoints can also be combined to blend the trained knowledge together, either to improve the quality or to mix different art styles together. These are called merged checkpoints, often denoted with a “Mix” in the checkpoint’s name.</p>


<h2 class="wp-block-heading" id="checkpoint-types-image-output">Checkpoint types – image output</h2>


<p>The main way I group my Stable Diffusion checkpoints is by the type of output they are able to generate.</p>



<p>So, let’s look at the types of photos you can generate. These are some of the broad looks people create:</p>



<ul class="wp-block-list">
<li>Photorealistic – hyperrealistic images that resemble photographs</li>



<li>Digital painting – concept or fantasy art images that mimics realism with artistic expression</li>



<li>Render – 3D-rendered image style</li>



<li>Anime – anime style with exaggerated proportions</li>



<li>Illustration – distinct brush strokes, including line art and sketches</li>
</ul>


<h3 class="wp-block-heading" id="photorealistic-checkpoints">Photorealistic checkpoints</h3>


<p><a href="https://en.wikipedia.org/wiki/Photorealism" data-type="link" data-id="https://en.wikipedia.org/wiki/Photorealism" target="_blank" rel="noopener">Photorealism</a> is an art style that tries to mimic realism in paintings. Photorealistic checkpoints are capable of generating hyperrealistic images that look like photographs. Do not confuse the photorealistic style with the amount of realism it generates.</p>



<p>Get my Top 10 Most Popular Realistic Checkpoints database when you <a href="https://jenxi.com/subscribe/">sign up for my newsletter</a>.</p>


<h3 class="wp-block-heading" id="digital-painting-checkpoints">Digital painting checkpoints</h3>


<p>Digital painting checkpoints generate images with realistic look, but the texture is less realistic than photorealistic checkpoints. They balance detail with artistic interpretation, allowing for greater stylistic flexibility, such as visible brush strokes or a more painterly quality, depending on the training data and model design.</p>



<p>The images they create are reminiscent of digital and traditional artwork. I use these checkpoints if I want a concept art or digital art look.</p>


<h3 class="wp-block-heading" id="render-checkpoints">Render checkpoints</h3>


<p>Render checkpoints are often trained with 3D-rendered images and mimic rendering styles, such as Disney’s Pixar style. These checkpoints produce images with render-like qualities. The images created have realistic lighting, but often with texture and details of 3D models.</p>



<p>A popular look is the 3D Niji style from Midjourney. You can find Stable Diffusion trained on 3D Niji images.</p>


<h3 class="wp-block-heading" id="anime-checkpoints">Anime checkpoints</h3>


<p>Anime checkpoints generate images with the distinctive anime style, including exaggerated proportions, expressions, and hair colours and styles. I generally group checkpoints for manga and anime fan art here, unless the lines are so loose that they fall under illustration checkpoints instead.</p>



<p>The use of generative AI to create anime-style images is immensely popular and a major driving force in the development of AI image generation. Thus, you’ll find many anime checkpoints covering different anime styles.</p>



<p>I prefer to group comic checkpoints here as well, unless they have such a high level of realism that warrants their grouping under digital painting checkpoints.</p>


<h3 class="wp-block-heading" id="illustration-checkpoints">Illustration checkpoints</h3>


<p>Illustration checkpoints produce images with distinctive brush strokes. These could range from wet to dry media, including oil painting, water colour, line art, and sketches. The checkpoints are trained to mimic the brush strokes of the particular medium.</p>


<h3 class="wp-block-heading" id="general-purpose-checkpoints">General purpose checkpoints</h3>


<p>Some checkpoints are trained to be able to produce different image styles. These are referred to as general purpose checkpoints. They are the Swiss Army knives checkpoints that lets you create a variety of styles without having to swap checkpoints.</p>


<h2 class="wp-block-heading" id="checkpoint-types-nbsprealism">Checkpoint types – realism</h2>


<p>When I look at realism, I consider both the human proportions and how three-dimensional the images look. This is more subjective than the image output because you can often alter the level of realism through prompting.</p>



<p>Nevertheless, I prefer to also group the realism to help me track what the checkpoints can achieve with these categories:</p>



<ul class="wp-block-list">
<li>Realistic –&nbsp;realistic proportions</li>



<li>Semi-realistic –&nbsp;3D look with almost realistic proportions</li>



<li>2.8D – between 2.5D and 3D look</li>



<li>2.5D – non-flat shading</li>



<li>2D –&nbsp;flat-shading</li>
</ul>


<h3 class="wp-block-heading" id="realistic-checkpoints">Realistic checkpoints</h3>


<p>Realistic checkpoints generate people with life-like proportions and details. These includes both photorealistic and digital painting checkpoints that both aim to replicate the look for real-world or high-fidelity art.</p>


<h3 class="wp-block-heading" id="semirealistic-checkpoints">Semi-realistic checkpoints</h3>


<p>Semi-realistic checkpoints create characters with a three-dimensional look but the proportions are not quite life-like. These are often anime or comic style look with some level of fantastical proportions, or render checkpoints.</p>


<h3 class="wp-block-heading" id="28d-checkpoints">2.8D checkpoints</h3>


<p>2.8D checkpoints straddle between 2.5D and 3D look, with more realism than 2.5D but not quite 3D level of realism. 2.8D is not an actual technical style, and I did not use this category initially. However, th number of checkpoints targeting this specific look has led to me adding it as a distinct category on its on.</p>



<p>These checkpoints are often anime or digital painting checkpoints with a very stylised look.</p>


<h3 class="wp-block-heading" id="25d-checkpoints">2.5D checkpoints</h3>


<p>2.5D checkpoints are have more realistic shading to give the subjects more depth and definition compared to the 2D look. Like 2.8D checkpoints, these are commonly anime or digital painting checkpoints with a stylised look.</p>


<h3 class="wp-block-heading" id="2d-checkpoints">2D checkpoints</h3>


<p>2D checkpoints have flat shading look of traditional anime style. Most anime checkpoints can produce the 2D look. However, this art style extends beyond just anime, including any sort of two-dimensional artistic styles.</p>


<h2 class="wp-block-heading" id="other-types-of-categories">Other types of categories</h2>


<p>I focus <a href="https://jenxi.com/art/" data-type="page" data-id="15158">mainly on portraits</a>, hence I only look at these few features in the checkpoints when groping them. There are other checkpoints that specialises in generating environments, icons, logos, or backgrounds.</p>



<p>Since I rarely generate these kind of images, I won’t talk much about them for now.</p>


<h2 class="wp-block-heading" id="choosing-stable-diffusion-checkpoints">Choosing Stable Diffusion checkpoints</h2>


<p>There are hundreds of Stable Diffusion checkpoints out there to choose from. You can find checkpoints on sites like <a href="https://civitai.com" data-type="link" data-id="https://civitai.com" target="_blank" rel="noopener">Civitai</a>, <a href="https://huggingface.co" data-type="link" data-id="https://huggingface.co" target="_blank" rel="noopener">Hugging Face</a>, and <a href="https://www.liblib.ai" data-type="link" data-id="https://www.liblib.ai" target="_blank" rel="noopener">LibLib AI</a>, to name a few resources. </p>



<p>How do you know which one is the best one? It depends on the type of images you are looking to generate and your preferred workflow.</p>



<p><a href="https://jenxi.com/subscribe/" data-type="page" data-id="2891">Stay tuned</a> for guides on choosing checkpoints and my review of my favourite checkpoints.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://jenxi.com/what-is-a-stable-diffusion-checkpoint/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
		<post-id xmlns="com-wordpress:feed-additions:1">15605</post-id>	</item>
		<item>
		<title>How to improve the performance of Stable Diffusion Web UI</title>
		<link>https://jenxi.com/how-to-improve-the-performance-of-stable-diffusion-web-ui/</link>
					<comments>https://jenxi.com/how-to-improve-the-performance-of-stable-diffusion-web-ui/#respond</comments>
		
		<dc:creator><![CDATA[Jenxi Seow]]></dc:creator>
		<pubDate>Sat, 18 Nov 2023 17:19:06 +0000</pubDate>
				<category><![CDATA[Blog]]></category>
		<category><![CDATA[Stable Diffusion]]></category>
		<category><![CDATA[Stable Diffusion Web UI]]></category>
		<guid isPermaLink="false">https://jenxi.com/?p=15602</guid>

					<description><![CDATA[Knowing how to improve the performance of Stable Diffusion Web UI can boost your image generation speed.]]></description>
										<content:encoded><![CDATA[
<p>When you are generating images in large sizes and batches, knowing how to improve the performance of Stable Diffusion Web UI mean a significant reduction in generation time required.</p>



<p>The minimum requirement for Stable Diffusion Web UI is 2GB VRAM, but generation will be slow and you will run out of memory once you try to create images larger than 512 x 512. Fortunately, there are several ways to optimise Stable Diffusion Web UI to speed up the image generation process.</p>



<p>From my experience, the best setup to use <a href="https://jenxi.com/what-is-stable-diffusion/" data-type="post" data-id="15451">Stable Diffusion</a> is a Windows machine with Nvidia GPU that meets the recommendation of 6GB VRAM. </p>



<p>Bear in mind that many variables affect the optimisation options, so it is best to test the different combinations to find what gives you the best performance. Test with different settings using the same checkpoint to generate 512 x 512 images with 20 steps using the Euler sampling method. Compare how fast it takes Web UI to generate an image.</p>



<div class="wp-block-group has-background has-global-padding is-layout-constrained wp-container-core-group-is-layout-5 wp-block-group-is-layout-constrained" style="border-radius:8px;background-color:#f7f7f7;padding-top:var(--wp--preset--spacing--small);padding-right:clamp(30px, 4vw, 40px);padding-bottom:var(--wp--preset--spacing--small);padding-left:clamp(30px, 4vw, 40px)"><h2 class="simpletoc-title">Contents</h2>
<ul class="simpletoc-list">
<li><a href="#crossattention-optimisation">Cross-attention optimisation</a>


<ul><li>
<a href="#setting-crossattention-optimisation">Setting cross-attention optimisation</a>

</li>
<li><a href="#doggettx">Doggettx</a>

</li>
<li><a href="#xformers">xFormers</a>

</li>
<li><a href="#scaleddotproduct-sdp-attention">Scaled-Dot-Product (sdp) Attention</a>

</li>
<li><a href="#sdp-attention-without-memoryefficient-attention-sdpnomem">SDP Attention without Memory-Efficient Attention (SDP-no-mem)</a>

</li>
<li><a href="#subquadratic-subquad-attention">Sub-Quadratic (sub-quad) Attention</a>

</li>
<li><a href="#splitattention-v1">Split-Attention v1</a>

</li>
<li><a href="#invoke-ai">Invoke AI</a>

</li>
</ul>
<li><a href="#token-merging">Token merging</a>


<ul><li>
<a href="#setting-token-merging">Setting token merging</a>

</li>
</ul>
<li><a href="#negative-guidance-minimum-sigma">Negative guidance minimum sigma</a>


<ul><li>
<a href="#setting-negative-guidance-minimum-sigma">Setting negative guidance minimum sigma</a>

</li>
</ul>
<li><a href="#command-line-arguments">Command line arguments</a>


<ul><li>
<a href="#optimisation-method-arguments">Optimisation method arguments</a>

</li>
<li><a href="#performance-options-arguments">Performance options arguments</a>

</li>
</ul>
<li><a href="#see-also">See also</a>
</li></ul></div>


<h2 class="wp-block-heading" id="crossattention-optimisation">Cross-attention optimisation</h2>


<p>One of the critical operation Stable Diffusion uses is cross-attention calculation. It involves the interaction between two sets of data or vectors: the query and the key. Cross-attention can consume significant amount of memory and time.</p>



<p>Imagine you have a box of building blocks, and you want to build a tall tower. Some blocks are important for making it tall and stable, while others are not so important. You have a pair of special glasses that make the important blocks glow when you look at them through the glasses.</p>



<p>Cross-attention is like using the special glasses to allow the model to focus on the different parts of the input data on what’s important to generate the image.</p>


<h3 class="wp-block-heading" id="setting-crossattention-optimisation">Setting cross-attention optimisation</h3>


<p>Due to the impact of cross-attention calculation, optimising its is the key to speeding up Stable Diffusion. You can set the cross-attention optimisation method in the Stable Diffusion Web UI.</p>



<ol class="wp-block-list">
<li>Launch Stable Diffusion Web UI.</li>



<li>Go to the Settings tab and select the Optimization in the sidebar.</li>



<li>Choose your preferred cross-attention optimisation from the dropdown menu. The default is set to Automatic.</li>



<li>Click Apply Settings to save the settings.</li>
</ol>


<h3 class="wp-block-heading" id="doggettx">Doggettx</h3>


<p>This is a historical improvement to cross-attention operations that offers a decent performance boost, but has been surpassed by newer options. Doggettx submitted the improvements to the original implementation in Stable Diffusion.</p>


<h3 class="wp-block-heading" id="xformers">xFormers</h3>


<p>The Meta AI team developed the xFormers, pronounced transformers. It is a transformer library that increased the attention operation’s speed while reducing memory usage through memory-efficient attention and Flash Attention techniques.</p>



<p>Transformers are a type of neural network architecture that uses self-attention to determine the importance of different parts of the input data. xFormers integrates with PyTorch and CUDA libraries. CUDA is limited to Nvidia hardware, and hence xFormers is only available if you are using an Nvidia GPU.</p>



<p>Memory-efficient attention uses an algorithm that uses less steps and memory to compute the attention operation, making it more efficient for large models and inputs.</p>



<p>Flash Attention uses tiling to compute attention one small piece at a time, reducing memory usage and speeding up calculations.</p>


<h3 class="wp-block-heading" id="scaleddotproduct-sdp-attention">Scaled-Dot-Product (sdp) Attention</h3>


<p>SDP attention is an alternative implementation of memory-efficient attention and Flash Attention native to PyTorch that is available in PyTorch 2 and newer. Depending on your hardware setup, you might get better performance with SDP attention than xFormers. Note that it uses more VRAM than xFormers, so your hardware might run into issues with it.</p>



<p>SDP attention gives non-deterministic output, meaning that the results are reproducible. This is a problem if you want to be able to reproduce the same image when you use the same parameters.</p>



<p>If you are using Stable Diffusion to create art or images for general use, you generally won’t need deterministic output in your workflow. It is only crucial in research.</p>


<h3 class="wp-block-heading" id="sdp-attention-without-memoryefficient-attention-sdpnomem">SDP Attention without Memory-Efficient Attention (SDP-no-mem)</h3>


<p>SDP-no-mem is an implementation of SDP attention without the memory-efficient attention technique. This makes it produce deterministic output, and hence allows you to reproduce the results with the same parameters.</p>



<p>The drawback of using SDP-no-mem is sacrificing the memory-efficient optimisations in exchange for deterministic output.</p>


<h3 class="wp-block-heading" id="subquadratic-subquad-attention">Sub-Quadratic (sub-quad) Attention</h3>


<p>Sub-quad attention is another implementation of memory-efficient attention. It significantly reduces the required memory, but this comes at a cost of speed.</p>



<p>This is useful if you’re unable to run xFormers or SDP. Sub-quad attention allows you to generate larger image sizes if you are on macOS.</p>


<h3 class="wp-block-heading" id="splitattention-v1">Split-Attention v1</h3>


<p>Split-attention v1 is an older implementation of memory-efficient attention that has been surpassed by the other techniques like xFormers or SDP that use memory-efficient attention.</p>



<p>You should be using xFormers or SDP where possible. Split-attention v1 uses less VRAM, so it might be a useful option if your hardware has limited memory. However, it is more limiting on the maximum image size it can generate.</p>


<h3 class="wp-block-heading" id="invoke-ai">Invoke AI</h3>


<p>The Invoke AI is an alternative GUI. Its cross-attention optimisation is useful for macOS machines without Nvidia GPUs.</p>


<h2 class="wp-block-heading" id="token-merging">Token merging</h2>


<p>Token merging (ToMe) is a new technique that accelerates Stable Diffusion by reducing the number of tokens that need processing. It does this by identifying and combining redundant tokens. Merging tokens changes the prompt processed, and hence changes the image output. This could be an issue if you are trying to reproduce the same image with the same parameters.</p>



<p>I personally find it a better habit to practice good prompt engineering and optimise your prompt length. Be mindful when creating prompts and avoid using redundant prompts.</p>



<p>You’ll find that many prompts out there are very badly structured. Instead of just copying prompts, take the time to remove redundancies. If you have a sample image to refer to, remove the tokens that don’t appear in the output that you want to generate.</p>



<p>With less tokens to process, the generation is naturally faster. However, it doesn’t seem to deliver that much improvements compared to cross-attention optimisations. I would avoid using this unless you are getting very long generation times with your setup.</p>


<h3 class="wp-block-heading" id="setting-token-merging">Setting token merging</h3>


<ol class="wp-block-list">
<li>Launch Stable Diffusion Web UI.</li>



<li>Go to the Settings tab and select the Optimization in the sidebar.</li>



<li>Choose your preferred token merging ratio by dragging the slider or keying in the ratio value.</li>



<li>Click Apply Settings to save the settings.</li>
</ol>


<h2 class="wp-block-heading" id="negative-guidance-minimum-sigma">Negative guidance minimum sigma</h2>


<p>Negative guidance minimum sigma is an optimisation that adjusts the sigma, a parameter that represents randomness in the generation process. By increasing the minimum sigma value, you are increasing the chances of the generation process skipping the negative prompt for some steps when the image is almost ready.</p>



<p>Increasing the sigma value reduces the generation time, though I find the performance boost on par with token merging. Negative guidance minimum sigma alters the image output, but to a lesser extent than token merging. If you had to choose between the two, I would suggest going with negative guidance minimum sigma.</p>



<p>Again, I would avoid using this unless you are getting very slow performance with your setup.</p>


<h3 class="wp-block-heading" id="setting-negative-guidance-minimum-sigma">Setting negative guidance minimum sigma</h3>


<ol class="wp-block-list">
<li>Launch Stable Diffusion Web UI.</li>



<li>Go to the Settings tab and select the Optimization in the sidebar.</li>



<li>Choose your preferred negative guidance minimum sigma by dragging the slider or keying in the sigma value.</li>



<li>Click Apply Settings to save the settings.</li>
</ol>


<h2 class="wp-block-heading" id="command-line-arguments">Command line arguments</h2>


<p>Since Stable Diffusion Web UI is a command-line application, you can provide command-line arguments to configure it when launching Web UI. Some of these arguments can be used in combination to improve the performance of Stable Diffusion Web UI.</p>



<p>If you launch Web UI from the terminal, you can add the arguments to the command. If you launch Web UI by double-clicking on the <code>webui-user.bat</code> or <code>run.bat</code> files, you can edit the <code>webui-user.bat</code> (Windows) or <code>webui-user.sh</code> (Mac or Linux) in a text editor and add the variables.</p>



<p>In <code>webui-user.bat</code>, add the arguments to the line <code>set COMMANDLINE_ARGS=</code>.<br>In <code>webui-user.sh</code>, add the arguments to the line <code>export COMMANDLINE_ARGS=</code>.</p>



<p>For example, <code>set COMMANDLINE_ARGS=--skip-torch-cuda-test --no-half-vae —api --opt-sdp-attention</code></p>



<p>There is a <a href="https://github.com/AUTOMATIC1111/stable-diffusion-webui/wiki/Command-Line-Arguments-and-Settings" target="_blank" rel="noopener">full list of command line arguments</a> you can use with Stable Diffusion Web UI on GitHub.</p>


<h3 class="wp-block-heading" id="optimisation-method-arguments">Optimisation method arguments</h3>


<p>These are the arguments that enable the optimisations mentioned in this article:</p>



<ul class="wp-block-list">
<li><code>--opt-sdp-attention</code> –&nbsp;Enables SDP attention optimisation</li>



<li><code>--opt-sdp-no-mem-attention</code> – Enables SDP-no-mem</li>



<li><code>--xformers</code> – Enables xFormers</li>



<li><code>--force-enable-xformers</code> – Enables xFormers regardless of whether the program thinks you can run it</li>



<li><code>--opt-split-attention</code> – Enables cross-attention layer optimisation; enabled by default for torch.cuda for both Nvidia and AMD cards</li>



<li><code>--disable-opt-split-attention</code> – Disables the cross-attention optimisation</li>



<li><code>--opt-sub-quad-attention</code> – Enables sub-quad attention optimisation</li>



<li><code>--opt-split-attention-v1</code> – Enables split attention v1</li>
</ul>


<h3 class="wp-block-heading" id="performance-options-arguments">Performance options arguments</h3>


<p>You can also add other arguments to improve the performance of Stable Diffusion Web UI:</p>



<ul class="wp-block-list">
<li><code>--medvram</code> –&nbsp;Splits the Stable Diffusion into three parts and only loads one in VRAM at all times, keeping the others in CPU RAM. It slows down generation speed but allows you to generate the image with a lower VRAM ceiling.</li>



<li><code>--medvram-sdxl</code> –&nbsp;Enables <code>--medvram</code> only for SDXL models</li>



<li><code>--lowvram</code> –&nbsp;An even more thorough optimisation that splits the third part, the unet, into many modules, and keeping only one module is kept in VRAM. Very, very slow generation.</li>



<li><code>--lowram</code> –&nbsp;Load Stable Diffusion checkpoint weights to VRAM instead of RAM for machines that have limited RAM</li>



<li><code>--upcast-sampling</code> – Improves generation speed for machines that need to run with <code>--no-half</code>. Better performance and VRAM usage than <code>--no-half</code>.</li>
</ul>


<h2 class="wp-block-heading" id="see-also">See also</h2>


<ul class="wp-block-list">
<li><a href="https://jenxi.com/what-is-stable-diffusion/">What is Stable Diffusion</a></li>



<li><a href="https://jenxi.com/how-to-install-stable-diffusion-web-ui/" data-type="post" data-id="15583">How to install Stable Diffusion Web UI</a></li>



<li><a href="https://github.com/AUTOMATIC1111/stable-diffusion-webui/wiki/Troubleshooting" target="_blank" rel="noreferrer noopener">Troubleshooting Stable Diffusion Web UI</a></li>



<li>How to use Stable Diffusion Web UI</li>
</ul>
]]></content:encoded>
					
					<wfw:commentRss>https://jenxi.com/how-to-improve-the-performance-of-stable-diffusion-web-ui/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
		<post-id xmlns="com-wordpress:feed-additions:1">15602</post-id>	</item>
		<item>
		<title>How to install Stable Diffusion Web UI</title>
		<link>https://jenxi.com/how-to-install-stable-diffusion-web-ui/</link>
					<comments>https://jenxi.com/how-to-install-stable-diffusion-web-ui/#respond</comments>
		
		<dc:creator><![CDATA[Jenxi Seow]]></dc:creator>
		<pubDate>Sat, 04 Nov 2023 04:26:40 +0000</pubDate>
				<category><![CDATA[Blog]]></category>
		<category><![CDATA[Artificial intelligence]]></category>
		<category><![CDATA[Guide]]></category>
		<category><![CDATA[Stable Diffusion]]></category>
		<guid isPermaLink="false">https://jenxi.com/?p=15583</guid>

					<description><![CDATA[Some of you mentioned that the official guide is a bit too technical, so I have written a simplified guide on how to install Stable Diffusion Web UI.]]></description>
										<content:encoded><![CDATA[
<p>Some of you mentioned that the official guide is a bit too technical, so I have written a simplified guide on how to install Stable Diffusion Web UI.</p>



<p>I will try to keep my installation guide updated, but things move fast in the Stable Diffusion world, so don’t be surprised things don’t work. </p>



<p>Check the official GitHub installation guide for the latest information. The official guide is a bit more technical, so my guide makes it a bit easier for beginners to understand.</p>



<p>The process is a bit different depending on the device you’re installing Stable Diffusion on. The official guide covers installing on <a href="https://github.com/AUTOMATIC1111/stable-diffusion-webui/wiki/Install-and-Run-on-NVidia-GPUs" target="_blank" rel="noopener">Nvidia GPUs</a>, <a href="https://github.com/AUTOMATIC1111/stable-diffusion-webui/wiki/Install-and-Run-on-AMD-GPUs" target="_blank" rel="noopener">AMD GPUs</a>, <a href="https://github.com/AUTOMATIC1111/stable-diffusion-webui/wiki/Installation-on-Apple-Silicon" target="_blank" rel="noopener">Apple Silicon</a>, and <a href="https://github.com/openvinotoolkit/stable-diffusion-webui/wiki/Installation-on-Intel-Silicon" target="_blank" rel="noopener">Intel Silicon</a>.</p>



<p>I’ve only installed Stable Diffusion Web UI on Windows 10 with Nvidia GPU and macOS on M1 Max, so I’ll only be sharing the guides for these two setups.</p>



<div class="wp-block-group has-background has-global-padding is-layout-constrained wp-container-core-group-is-layout-6 wp-block-group-is-layout-constrained" style="border-radius:8px;background-color:#f7f7f7;padding-top:var(--wp--preset--spacing--small);padding-right:clamp(30px, 4vw, 40px);padding-bottom:var(--wp--preset--spacing--small);padding-left:clamp(30px, 4vw, 40px)"><h2 class="simpletoc-title">Contents</h2>
<ul class="simpletoc-list">
<li><a href="#downloading-dreamshaper">Downloading DreamShaper</a>

</li>
<li><a href="#installing-on-windows-with-nvidia-gpu">Installing on Windows with Nvidia GPU</a>


<ul><li>
<a href="#using-the-web-ui-windows-installer">Using the Web UI Windows installer</a>

</li>
<li><a href="#manual-web-ui-windows-installation">Manual Web UI Windows installation</a>

</li>
</ul>
<li><a href="#installing-on-macos-with-apple-silicon">Installing on macOS with Apple Silicon</a>

</li>
<li><a href="#updating-stable-diffusion-web-ui">Updating Stable Diffusion Web UI</a>


<ul><li>
<a href="#updating-on-windows-with-nvidia-gpu">Updating on Windows with Nvidia GPU</a>


</li>

<li><a href="#updating-on-macos-with-apple-silicon">Updating on macOS with Apple Silicon</a>

</li>
</ul>
<li><a href="#see-also">See also</a>
</li></ul></div>


<h2 class="wp-block-heading" id="downloading-dreamshaper">Downloading DreamShaper</h2>


<p>Before we start, I recommend that you download your first Stable Diffusion checkpoint. You can learn more about what Stable Diffusion checkpoints are and the top checkpoints available out there in my other articles.</p>



<ul class="wp-block-list">
<li>Download DreamShaper v8 from <a href="https://civitai.com/models/4384/dreamshaper" target="_blank" rel="noopener">Civitai</a> or <a href="https://huggingface.co/Lykon/DreamShaper/tree/main" data-type="link" data-id="https://huggingface.co/Lykon/DreamShaper/tree/main" target="_blank" rel="noopener">Hugging Face</a>. You don’t need to do anything with it now.</li>
</ul>



<p>Why download DreamShaper? It is a good starting checkpoint for beginners compared to the default Stable Diffusion 1.5 model. Downloading the model now also makes it easier for you to track the download. </p>



<p>If the installer doesn’t detect an existing model, it will automatically download the large 4 GB file. If you are on a slow connection, you might think that the process is stuck.</p>


<h2 class="wp-block-heading" id="installing-on-windows-with-nvidia-gpu">Installing on Windows with Nvidia GPU</h2>


<p>There are two ways to install Stable Diffusion Web UI on Windows. The easy way is to use the installer package. The second method pulls the source from GitHub and requires you to have some technical knowledge to operate git.</p>



<p>The difference between the method is that it’s easier to get started with the installer, whereas using git allows you to switch between different commits. This is useful when you encounter bugs with a certain version or an extension.</p>


<h3 class="wp-block-heading" id="using-the-web-ui-windows-installer">Using the Web UI Windows installer</h3>


<ol class="wp-block-list">
<li>Download <code>sd.webui.zip</code>&nbsp;from&nbsp;<a href="https://github.com/AUTOMATIC1111/stable-diffusion-webui/releases/tag/v1.0.0-pre" target="_blank" rel="noopener">this GitHub release page</a>.</li>



<li>Extract the zip file at where you want to install Web UI. Bear in mind that the models and extensions you install will take up space, so I would recommend choosing a drive that has most space available.</li>



<li>Double click to run&nbsp;<code>update.bat</code>. This will update Web UI to the latest version. Wait until the update completes, then close the window.</li>



<li>If you have <a href="#downloading-dreamshaper" data-type="internal" data-id="#downloading-dreamshaper">downloaded DreamShaper</a>, move it into the <code>sd.webui\webui\models\Stable-diffusion\</code> folder.</li>



<li>Double click&nbsp;<code>run.bat</code>&nbsp;to launch Web UI. It will download all the required files during first launch. There are many files so it might take a while depending on your Internet connection speed.</li>



<li>When everything has been downloaded and installed successfully, you will see the message “<code>Running on local URL: http://127.0.0.1:7860</code>”.</li>



<li>Copy and paste the URL <code>http://127.0.0.1:7860</code> in your preferred browser, or click on <a href="http://127.0.0.1:7860" data-type="link" data-id="http://127.0.0.1:7860" target="_blank" rel="noreferrer noopener">this link</a> to go to the Web UI.</li>
</ol>


<h3 class="wp-block-heading" id="manual-web-ui-windows-installation">Manual Web UI Windows installation</h3>


<ol class="wp-block-list">
<li>Download and install&nbsp;<a href="https://www.python.org/ftp/python/3.10.6/python-3.10.6-amd64.exe" target="_blank" rel="noopener">Python 3.10.6</a>. Select the&nbsp;“Add to PATH” option when installing. Skip this step if you already have Python 3.10 installed.</li>



<li>Download and install&nbsp;<a href="https://github.com/git-for-windows/git/releases/download/v2.39.2.windows.1/Git-2.39.2-64-bit.exe" target="_blank" rel="noopener">git</a>. Skip this step if you already have git installed.</li>



<li>Launch Command Prompt. Navigate to the folder or drive you want to install the Web UI, and then run&nbsp;<code>git clone https://github.com/AUTOMATIC1111/stable-diffusion-webui</code>.</li>



<li>If you have <a href="#downloading-dreamshaper" data-type="internal" data-id="#downloading-dreamshaper">downloaded DreamShaper</a>, move it into the <code>stable-diffusion-webui\webui\models\Stable-diffusion\</code> folder.</li>



<li>Double click&nbsp;<code>webui-user.bat</code> to launch Web UI. It will download a large amount of the dependencies during first launch, so it might take a while depending on your Internet connection speed.</li>



<li>When everything has been downloaded and installed successfully, you will see the message “<code>Running on local URL: http://127.0.0.1:7860</code>”.</li>



<li>Copy and paste the URL <code>http://127.0.0.1:7860</code> in your preferred browser, or click on <a href="http://127.0.0.1:7860" data-type="link" data-id="http://127.0.0.1:7860" target="_blank" rel="noreferrer noopener">this link</a> to go to the Web UI.</li>
</ol>


<h2 class="wp-block-heading" id="installing-on-macos-with-apple-silicon">Installing on macOS with Apple Silicon</h2>


<p>The installation process for macOS is similar to the manual installation for Windows. The only difference is that you need to install Homebrew, if you’ve never installed it before.</p>



<ol class="wp-block-list">
<li>If you haven’t installed Homebrew, follow the installation instructions at&nbsp;<a href="https://brew.sh/" target="_blank" rel="noopener">https://brew.sh</a>. You can either install Homebrew using the script or use the <code>.pkg</code> installer. Keep the terminal window open after Homebrew finishes installing.</li>



<li>Follow the instructions under “Next steps” to add Homebrew to your PATH.</li>



<li>Open a new terminal window and run&nbsp;<code>brew install cmake protobuf rust python@3.10 git wget</code>. This will install the main dependencies and might take some time depending on your internet connection speed.</li>



<li>Clone the Web UI’s GitHub repository by running&nbsp;<code>git clone https://github.com/AUTOMATIC1111/stable-diffusion-webui</code></li>



<li>If you have <a href="#downloading-dreamshaper" data-type="internal" data-id="#downloading-dreamshaper">downloaded DreamShaper</a>, move it into the <code>stable-diffusion-webui/models/Stable-diffusion/</code> folder.</li>



<li>Run <code>cd stable-diffusion-webui</code>&nbsp;and then&nbsp;<code>./webui.sh</code>&nbsp;to launch the Web UI. A Python virtual environment will be created and activated using venv. It will automatically download and install any missing dependencies. Once again, it might take a while depending on your internet connection speed.</li>



<li>When everything has been downloaded and installed successfully, you will see the message “<code>Running on local URL: http://127.0.0.1:7860</code>”.</li>



<li>Copy and paste the URL <code>http://127.0.0.1:7860</code> in your preferred browser, or click on <a href="http://127.0.0.1:7860" data-type="link" data-id="http://127.0.0.1:7860" target="_blank" rel="noreferrer noopener">this link</a> to go to the Web UI.</li>
</ol>


<h2 class="wp-block-heading" id="updating-stable-diffusion-web-ui">Updating Stable Diffusion Web UI</h2>


<p>As all things in the world of Stable Diffusion, things move fast and break fast with the Stable Diffusion Web UI. While I recommend staying up-to-date with your software, I don’t suggest doing so with the Web UI.</p>



<p>Updating the Web UI might break compatibility with a certain extension that is critical to your image generation workflow. If things are working well enough, don’t update immediately. Give it some time for bugs and issues to be reported.</p>



<p>I’ve learnt the hard way, but I also install via git so it is easy for me to switch back to an earlier commit that works.</p>


<h3 class="wp-block-heading" id="updating-on-windows-with-nvidia-gpu">Updating on Windows with Nvidia GPU</h3>

<h4 class="wp-block-heading" id="if-you-installed-using-the-web-ui-windows-installer">If you installed using the Web UI Windows installer</h4>


<ul class="wp-block-list">
<li>Double click the&nbsp;<code>update.bat</code>&nbsp;to update web UI and wait for the process to finish before closing the window.</li>



<li>Once this is done, double click <code>run.bat</code> to launch Web UI.</li>
</ul>


<h4 class="wp-block-heading" id="if-you-installed-manually">If you installed manually</h4>


<ul class="wp-block-list">
<li>Launch Command Prompt and navigate to the <code>stable-diffusion-webui</code> folder.</li>



<li>Run&nbsp;<code>git pull</code> and wait for it to pull the latest files from GitHub.</li>



<li>Once the process is complete, double click&nbsp;<code>webui-user.bat</code> to launch Web UI or run <code>.\webui-user.bat</code> in the command prompt.</li>
</ul>


<h3 class="wp-block-heading" id="updating-on-macos-with-apple-silicon">Updating on macOS with Apple Silicon</h3>


<ul class="wp-block-list">
<li>Open Terminal, run <code>cd stable-diffusion-webui</code>&nbsp;and then <code>git pull</code> and wait for it to update to the latest files.</li>



<li>When the process is done, run <code>./webui.sh</code> to launch Web UI.</li>
</ul>


<h2 class="wp-block-heading" id="see-also">See also</h2>


<ul class="wp-block-list">
<li><a href="https://jenxi.com/what-is-stable-diffusion/" data-type="post" data-id="15451">What is Stable Diffusion</a></li>



<li><a href="https://jenxi.com/how-to-improve-the-performance-of-stable-diffusion-web-ui/" data-type="link" data-id="https://jenxi.com/how-to-improve-the-performance-of-stable-diffusion-web-ui/">How to optimise Stable Diffusion Web UI</a></li>



<li><a href="https://github.com/AUTOMATIC1111/stable-diffusion-webui/wiki/Troubleshooting" target="_blank" rel="noopener">Troubleshooting Stable Diffusion Web UI</a></li>



<li>How to use Stable Diffusion Web UI</li>
</ul>
]]></content:encoded>
					
					<wfw:commentRss>https://jenxi.com/how-to-install-stable-diffusion-web-ui/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
		<post-id xmlns="com-wordpress:feed-additions:1">15583</post-id>	</item>
		<item>
		<title>Sakura Kasugano AI art</title>
		<link>https://jenxi.com/sakura-kasugano-ai-art/</link>
					<comments>https://jenxi.com/sakura-kasugano-ai-art/#respond</comments>
		
		<dc:creator><![CDATA[Jenxi Seow]]></dc:creator>
		<pubDate>Tue, 10 Oct 2023 07:09:26 +0000</pubDate>
				<category><![CDATA[Art]]></category>
		<category><![CDATA[AI art]]></category>
		<category><![CDATA[Mai Shiranui]]></category>
		<category><![CDATA[Stable Diffusion]]></category>
		<guid isPermaLink="false">https://jenxi.com/?p=15508</guid>

					<description><![CDATA[I started working on Sakura Kasugano AI art as a tribute to my favourite Street Fighter character.]]></description>
										<content:encoded><![CDATA[
<p>I started working on Sakura Kasugano AI art as a tribute to my favourite <em><a href="https://en.wikipedia.org/wiki/Street_Fighter" target="_blank" rel="noopener">Street Fighter</a></em> character. When the <em><a href="https://en.wikipedia.org/wiki/Street_Fighter_II" target="_blank" rel="noopener">Street Fighter II</a></em> was released and became the rage back in the 90s, I was partial to Ryu and Ken but I also enjoyed using other characters because their unique moves made them interesting to play.</p>



<p>Then, Sakura burst onto the scene when <em><a href="https://en.wikipedia.org/wiki/Street_Fighter_Alpha_2" target="_blank" rel="noopener">Street Fighter Alpha 2</a></em> was released, and boy was I smitten. She&#8217;s been stealing hearts ever since and is one of the undisputed fan favourites alongside Cammy and Chun-Li. In fact, she was voted the third most popular character in Capcom&#8217;s 2002 <a href="https://web.archive.org/web/20051219091936/http://www2.geestore.com/sf15th2/sf15rank/ninkichara.html" target="_blank" rel="noopener">15th Street Fighter anniversary poll</a>, after Chun-Li and Cammy.</p>



<p>I had just gotten into my serafuku phase, so the outfit was on point. Her character backstory also appealed to Ryu fans. She is a fan of Ryu and travelled to find her idol so that she could ask him to train her as a fighter. That iconic white headband was <a href="https://screenrant.com/street-fighter-video-game-ryu-facts-trivia/#trading-headbands" target="_blank" rel="noopener">given to her</a> by Ryu, and she later <a href="https://web.archive.org/web/20181125030807/https://game.capcom.com/cfn/sfv/column/132442" target="_blank" rel="noopener">switched a red headband</a> after he started wearing the red one Ken gave to him.</p>



<p>Now, what sets Sakura apart from the other characters is that she&#8217;s entirely self-taught. She picked up moves from watching Ryu in action and even developed her own unique style. The character story resonates with Ryu fans. Her moves are also similar to Ryu and Ken’s so that adds to her appeal.</p>



<div class="wp-block-group has-background has-global-padding is-layout-constrained wp-container-core-group-is-layout-7 wp-block-group-is-layout-constrained" style="border-radius:8px;background-color:#f7f7f7;padding-top:var(--wp--preset--spacing--small);padding-right:clamp(30px, 4vw, 40px);padding-bottom:var(--wp--preset--spacing--small);padding-left:clamp(30px, 4vw, 40px)"><h2 class="simpletoc-title">Contents</h2>
<ul class="simpletoc-list">
<li><a href="#background">Background</a>

</li>
<li><a href="#creation">Creation</a>

</li>
<li><a href="#tools-used">Tools used</a>

</li>
<li><a href="#downloads">Downloads</a>


<ul><li>
<a href="#sakura-kasugano-sakura-season">Sakura Kasugano sakura season</a>

</li>
<li><a href="#sakura-kasugano-twist">Sakura Kasugano twist</a>

</li>
<li><a href="#sakura-kasugano-headband">Sakura Kasugano headband</a>

</li>
<li><a href="#sakura-kasugano-stance">Sakura Kasugano stance</a>

</li>
<li><a href="#sakura-kasugano-park">Sakura Kasugano park</a>
</li>
</ul>
</li></ul></div>


<h2 class="wp-block-heading" id="background">Background</h2>


<p>Due to their immense popularity, Marvel and DC character art was flooding the AI art scene. I didn’t want to work on yet another Supergirl or Wonder Woman. Being an otaku, the natural source for me to find inspiration was from anime and video games. I had created a commissioned art of Lucy Kushinada from <em>Cyberpunk 2077:Edgerunners</em>, so I skipped this obvious choice.</p>



<p>As much as I love the Gundam series, the characters are relatively more niche. I mean, how can any of the characters top Rei Ayanami or Asuka Langley Soryu? But I believe they also lose out to Motoko Kusanagi in global appeal. Despite all the controversies, people outside the otaku scene would still remember Scarlett Johansson’s portrayal of Motoko in the live action movie, or perhaps even because of the backlash.</p>



<p>I love the Final Fantasy franchise as much as Gundam. But everyone was doing Tifa Lockhart and Aerith Gainsborough, so I avoided them for a while. If you follow my <a href="https://www.instagram.com/jenxi.art/" target="_blank" rel="noopener">Instagram</a>, or other platforms where I post <a href="https://jenxi.com/art/">my art</a>, you’d have seen that I’ve succumbed and worked on Tifa and Aerith, and the aforementioned Motoko.</p>



<p>Studio Ghibli characters are popular worldwide. I jumped on my favourite Miyazaki character, Princess Mononoke, or, as I prefer to call her, Mononoke-hime.</p>



<p>Then, I looked at fighting games and <em>Street Fighter</em> and <em>King of Fighter</em> were at the top of my list. I wrote previously about how my creation process for <a href="https://jenxi.com/mai-shiranui-ai-art/">Mai Shiranui</a> from the King of Fighters.</p>


<h2 class="wp-block-heading" id="creation">Creation</h2>


<p>The challenging part of creating Sakura art is the gloves. Hands are notoriously difficult to produce accurately with AI. Hands wearing gloves? The AI starts getting extremely creative in how it generates the hands.</p>



<p>That said, these were done when I was just started learning <a href="https://jenxi.com/what-is-stable-diffusion/">Stable Diffusion</a>. I’ve since picked up new techniques and tricks to optimise my workflow, allowing me to create better hands with less effort. Though, fingernails still remain hit-or-miss. Luckily, we just need fists with Sakura, right?</p>


<h2 class="wp-block-heading" id="tools-used">Tools used</h2>


<ul class="wp-block-list">
<li><a href="https://github.com/AUTOMATIC1111/stable-diffusion-webui" target="_blank" rel="noopener">Stable Diffusion Web UI</a></li>



<li>Adobe Photoshop (Beta)</li>



<li>Huion Kamvas Pro 16</li>
</ul>


<h2 class="wp-block-heading" id="downloads">Downloads</h2>


<p>The HD versions of these images available for download on Afdian. See the links below each image.</p>



<p>The 4K versions will be available to members on Patreon and Pixiv. <a href="https://jenxi.com/subscribe/">Subscribe to my newsletter</a> to be notified when these and new content become available.</p>


<h3 class="wp-block-heading" id="sakura-kasugano-sakura-season">Sakura Kasugano sakura season</h3>


<figure class="wp-block-image size-large"><img data-recalc-dims="1" decoding="async" width="760" height="1140" src="https://i0.wp.com/jenxi.com/wp-content/uploads/sakura-kasugano-1-by-jenxi-free.webp?resize=760%2C1140&#038;ssl=1" alt="Sakura Kasugano sakura season AI art by Jenxi" class="wp-image-15510" srcset="https://i0.wp.com/jenxi.com/wp-content/uploads/sakura-kasugano-1-by-jenxi-free.webp?resize=760%2C1140&amp;ssl=1 760w, https://i0.wp.com/jenxi.com/wp-content/uploads/sakura-kasugano-1-by-jenxi-free.webp?resize=467%2C700&amp;ssl=1 467w, https://i0.wp.com/jenxi.com/wp-content/uploads/sakura-kasugano-1-by-jenxi-free.webp?resize=768%2C1152&amp;ssl=1 768w, https://i0.wp.com/jenxi.com/wp-content/uploads/sakura-kasugano-1-by-jenxi-free.webp?w=800&amp;ssl=1 800w" sizes="(max-width: 760px) 100vw, 760px" /><figcaption class="wp-element-caption">Sakura Kasugano walking along a street filled with sakura trees.</figcaption></figure>



<ul class="wp-block-list">
<li>Get it on <a href="https://ko-fi.com/i/IO4O8MW5D7" target="_blank" rel="noopener">Ko-fi</a></li>



<li>Get it on <a href="https://www.patreon.com/posts/85581800" target="_blank" rel="noopener">Patreon</a></li>



<li>Get it using Alipay or WeChat Pay on <a href="https://afdian.net/item/03d429fc1a8e11ee8deb5254001e7c00" target="_blank" rel="noopener">Afdian 爱发电</a></li>



<li>Collect it on <a href="https://www.deviantart.com/jenxi/art/Sakura-Saugano-sakura-season-970290317" target="_blank" rel="noopener">DeviantArt</a> and <a href="https://www.pixiv.net/en/artworks/109628813" target="_blank" rel="noopener">Pixiv</a></li>
</ul>


<h3 class="wp-block-heading" id="sakura-kasugano-twist">Sakura Kasugano twist</h3>


<figure class="wp-block-image size-large"><img data-recalc-dims="1" decoding="async" width="760" height="1140" src="https://i0.wp.com/jenxi.com/wp-content/uploads/sakura-kasugano-2-by-jenxi-free.webp?resize=760%2C1140&#038;ssl=1" alt="Sakura Kasugano twist AI art by Jenxi" class="wp-image-15511" srcset="https://i0.wp.com/jenxi.com/wp-content/uploads/sakura-kasugano-2-by-jenxi-free.webp?resize=760%2C1140&amp;ssl=1 760w, https://i0.wp.com/jenxi.com/wp-content/uploads/sakura-kasugano-2-by-jenxi-free.webp?resize=467%2C700&amp;ssl=1 467w, https://i0.wp.com/jenxi.com/wp-content/uploads/sakura-kasugano-2-by-jenxi-free.webp?resize=768%2C1152&amp;ssl=1 768w, https://i0.wp.com/jenxi.com/wp-content/uploads/sakura-kasugano-2-by-jenxi-free.webp?w=800&amp;ssl=1 800w" sizes="(max-width: 760px) 100vw, 760px" /><figcaption class="wp-element-caption">Sakura Kasugano turning to Shoryuken… or would it be a Tatsumaki?</figcaption></figure>



<ul class="wp-block-list">
<li>Get it on <a href="https://ko-fi.com/i/IO5O2N48O3" target="_blank" rel="noopener">Ko-fi</a></li>



<li>Get it on <a href="https://www.patreon.com/posts/85938381" target="_blank" rel="noopener">Patreon</a></li>



<li>Get it using Alipay or WeChat Pay on <a href="https://afdian.net/item/533b7a1e271c11eea5aa52540025c377" target="_blank" rel="noopener">Afdian 爱发电</a></li>



<li>Collect it on <a href="https://www.deviantart.com/jenxi/art/Sakura-Kasugano-twist-971433992" target="_blank" rel="noopener">DeviantArt</a> and <a href="https://www.pixiv.net/en/artworks/109838021" target="_blank" rel="noopener">Pixiv</a></li>
</ul>


<h3 class="wp-block-heading" id="sakura-kasugano-headband">Sakura Kasugano headband</h3>


<figure class="wp-block-image size-large"><img data-recalc-dims="1" decoding="async" width="760" height="1140" src="https://i0.wp.com/jenxi.com/wp-content/uploads/sakura-kasugano-3-by-jenxi-free.webp?resize=760%2C1140&#038;ssl=1" alt="Sakura Kasugano headband AI art by Jenxi" class="wp-image-15512" srcset="https://i0.wp.com/jenxi.com/wp-content/uploads/sakura-kasugano-3-by-jenxi-free.webp?resize=760%2C1140&amp;ssl=1 760w, https://i0.wp.com/jenxi.com/wp-content/uploads/sakura-kasugano-3-by-jenxi-free.webp?resize=467%2C700&amp;ssl=1 467w, https://i0.wp.com/jenxi.com/wp-content/uploads/sakura-kasugano-3-by-jenxi-free.webp?resize=768%2C1152&amp;ssl=1 768w, https://i0.wp.com/jenxi.com/wp-content/uploads/sakura-kasugano-3-by-jenxi-free.webp?w=800&amp;ssl=1 800w" sizes="(max-width: 760px) 100vw, 760px" /><figcaption class="wp-element-caption">Sakura Kasugano getting ready to kick your butt.</figcaption></figure>



<ul class="wp-block-list">
<li>Get it on <a href="https://ko-fi.com/i/IZ8Z7N77CP" target="_blank" rel="noopener">Ko-fi</a></li>



<li>Get it on <a href="https://www.patreon.com/posts/86076579" target="_blank" rel="noopener">Patreon</a></li>



<li>Get it using Alipay or WeChat Pay on <a href="https://afdian.net/item/6b401624271c11eeb41d5254001e7c00" target="_blank" rel="noopener">Afdian 爱发电</a></li>



<li>Collect it on <a href="https://www.deviantart.com/jenxi/art/Sakura-Kasugano-headband-971866079" target="_blank" rel="noopener">DeviantArt</a> and <a href="https://www.pixiv.net/en/artworks/109902756" target="_blank" rel="noopener">Pixiv</a></li>
</ul>


<h3 class="wp-block-heading" id="sakura-kasugano-stance">Sakura Kasugano stance</h3>


<figure class="wp-block-image size-large"><img data-recalc-dims="1" decoding="async" width="760" height="1140" src="https://i0.wp.com/jenxi.com/wp-content/uploads/sakura-kasugano-4-by-jenxi-free.webp?resize=760%2C1140&#038;ssl=1" alt="Sakura Kasugano stance AI art by Jenxi" class="wp-image-15513" srcset="https://i0.wp.com/jenxi.com/wp-content/uploads/sakura-kasugano-4-by-jenxi-free.webp?resize=760%2C1140&amp;ssl=1 760w, https://i0.wp.com/jenxi.com/wp-content/uploads/sakura-kasugano-4-by-jenxi-free.webp?resize=467%2C700&amp;ssl=1 467w, https://i0.wp.com/jenxi.com/wp-content/uploads/sakura-kasugano-4-by-jenxi-free.webp?resize=768%2C1152&amp;ssl=1 768w, https://i0.wp.com/jenxi.com/wp-content/uploads/sakura-kasugano-4-by-jenxi-free.webp?w=800&amp;ssl=1 800w" sizes="(max-width: 760px) 100vw, 760px" /><figcaption class="wp-element-caption">Sakura Kasugano dropping into fighting stance.</figcaption></figure>



<ul class="wp-block-list">
<li>Get it on <a href="https://ko-fi.com/i/IE1E6NEH2U" target="_blank" rel="noopener">Ko-fi</a></li>



<li>Get it on <a href="https://www.patreon.com/posts/86391543" target="_blank" rel="noopener">Patreon</a></li>



<li>Get it using Alipay or WeChat Pay on <a href="https://afdian.net/item/7d1da0f0271c11ee8ff75254001e7c00" target="_blank" rel="noopener">Afdian 爱发电</a></li>



<li>Collect it on <a href="https://www.deviantart.com/jenxi/art/Sakura-Saugano-sakura-season-970290317" target="_blank" rel="noopener">DeviantArt</a> and <a href="https://www.pixiv.net/en/artworks/109628813" target="_blank" rel="noopener">Pixiv</a></li>
</ul>


<h3 class="wp-block-heading" id="sakura-kasugano-park">Sakura Kasugano park</h3>


<figure class="wp-block-image size-large"><img data-recalc-dims="1" decoding="async" width="760" height="1140" src="https://i0.wp.com/jenxi.com/wp-content/uploads/sakura-kasugano-5-by-jenxi-free.webp?resize=760%2C1140&#038;ssl=1" alt="Sakura Kasugano park AI art by Jenxi" class="wp-image-15514" srcset="https://i0.wp.com/jenxi.com/wp-content/uploads/sakura-kasugano-5-by-jenxi-free.webp?resize=760%2C1140&amp;ssl=1 760w, https://i0.wp.com/jenxi.com/wp-content/uploads/sakura-kasugano-5-by-jenxi-free.webp?resize=467%2C700&amp;ssl=1 467w, https://i0.wp.com/jenxi.com/wp-content/uploads/sakura-kasugano-5-by-jenxi-free.webp?resize=768%2C1152&amp;ssl=1 768w, https://i0.wp.com/jenxi.com/wp-content/uploads/sakura-kasugano-5-by-jenxi-free.webp?w=800&amp;ssl=1 800w" sizes="(max-width: 760px) 100vw, 760px" /><figcaption class="wp-element-caption">Sakura Kasugano training in the park.</figcaption></figure>



<ul class="wp-block-list">
<li>Get it on <a href="https://ko-fi.com/i/IU7U5OQYDD" target="_blank" rel="noopener">Ko-fi</a></li>



<li>Get it on <a href="https://www.patreon.com/posts/88473954" target="_blank" rel="noopener">Patreon</a></li>



<li>Get it using Alipay or WeChat Pay on <a href="https://afdian.net/item/2d781644471911ee858552540025c377" target="_blank" rel="noopener">Afdian 爱发电</a></li>



<li>Collect it on <a href="https://www.deviantart.com/jenxi/art/Sakura-Saugano-sakura-season-970290317" target="_blank" rel="noopener">DeviantArt</a> and <a href="https://www.pixiv.net/en/artworks/109628813" target="_blank" rel="noopener">Pixiv</a></li>
</ul>
]]></content:encoded>
					
					<wfw:commentRss>https://jenxi.com/sakura-kasugano-ai-art/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
		<post-id xmlns="com-wordpress:feed-additions:1">15508</post-id>	</item>
		<item>
		<title>What is Stable Diffusion</title>
		<link>https://jenxi.com/what-is-stable-diffusion/</link>
					<comments>https://jenxi.com/what-is-stable-diffusion/#respond</comments>
		
		<dc:creator><![CDATA[Jenxi Seow]]></dc:creator>
		<pubDate>Mon, 25 Sep 2023 06:20:45 +0000</pubDate>
				<category><![CDATA[Blog]]></category>
		<category><![CDATA[AI art]]></category>
		<category><![CDATA[Computer vision]]></category>
		<category><![CDATA[Stable Diffusion]]></category>
		<guid isPermaLink="false">https://jenxi.com/?p=15451</guid>

					<description><![CDATA[After I shared about my AI art journey, people starting asking what is Stable Diffusion. ]]></description>
										<content:encoded><![CDATA[
<p>When I first started posting my AI art, many people reached out to me asking how I made the art pieces because they were different from those circulating the internet. After I posted about <a href="https://jenxi.com/how-i-got-started-in-generative-ai-art/">my AI art journey</a>, the questions became “What is Stable Diffusion” and, for those who have tried Stable Diffusion, “How do you get such results?”</p>



<p>Compared to images you see on social media and on the web, the key difference is due to my insistence on generating realistic images with a lot of details that are not easy to achieve with just prompting alone.</p>



<figure class="wp-block-image size-large"><img data-recalc-dims="1" decoding="async" width="1140" height="599" src="https://i0.wp.com/jenxi.com/wp-content/uploads/cyber-nvxia-1-by-jenxi-banner.webp?resize=1140%2C599&#038;ssl=1" alt="Cyber wuxia Yan by Jenxi Seow" class="wp-image-15455" srcset="https://i0.wp.com/jenxi.com/wp-content/uploads/cyber-nvxia-1-by-jenxi-banner.webp?resize=1140%2C599&amp;ssl=1 1140w, https://i0.wp.com/jenxi.com/wp-content/uploads/cyber-nvxia-1-by-jenxi-banner.webp?resize=700%2C368&amp;ssl=1 700w, https://i0.wp.com/jenxi.com/wp-content/uploads/cyber-nvxia-1-by-jenxi-banner.webp?resize=768%2C403&amp;ssl=1 768w, https://i0.wp.com/jenxi.com/wp-content/uploads/cyber-nvxia-1-by-jenxi-banner.webp?w=1200&amp;ssl=1 1200w" sizes="(max-width: 1000px) 100vw, 1000px" /><figcaption class="wp-element-caption"><a href="https://www.deviantart.com/jenxi/art/Cyber-wuxia-Yan-distracted-970925547" data-type="link" data-id="https://www.deviantart.com/jenxi/art/Cyber-wuxia-Yan-distracted-970925547" target="_blank" rel="noopener">Cyber wuxia Yan</a>  by Jenxi</figcaption></figure>



<p>This is partly due to my personal goal of trying to see how close generative AI can get to realism, and also because I wanted to see how AI would impact my business since visual content generation is a big part of it.</p>



<p>I’m aware of this obsession with realism, and the need to drop this fixation to spend more time on improving my composition, but I digress.</p>



<p>Keep your feedback coming as your voice is invaluable in shaping the content I put out. My goal is to share what I’ve learnt so you can skip past the hundreds of hours I poured into research and trial-and-error. If you find this content helpful in any way, consider <a href="https://ko-fi.com/jenxi" target="_blank" rel="noopener">buying me a coffee on Ko-fi</a> or <a href="https://www.patreon.com/jenxi" target="_blank" rel="noopener">join my Patreon</a> to get my art in glorious high resolution.</p>



<div class="wp-block-group has-background has-global-padding is-layout-constrained wp-container-core-group-is-layout-8 wp-block-group-is-layout-constrained" style="border-radius:8px;background-color:#f7f7f7;padding-top:var(--wp--preset--spacing--small);padding-right:clamp(30px, 4vw, 40px);padding-bottom:var(--wp--preset--spacing--small);padding-left:clamp(30px, 4vw, 40px)"><h2 class="simpletoc-title">Contents</h2>
<ul class="simpletoc-list">
<li><a href="#unraveling-the-mystery-what-is-stable-diffusion">Unraveling the Mystery: What is Stable Diffusion?</a>

</li>
<li><a href="#the-mechanics-of-stable-diffusion">The Mechanics of Stable Diffusion</a>


<ul><li>
<a href="#core-concept-diffusion">Core Concept: Diffusion</a>

</li>
<li><a href="#the-foundation-generative-adversarial-networks-gans">The Foundation: Generative Adversarial Networks (GANs)</a>

</li>
<li><a href="#the-algorithmic-dance-gans-in-stable-diffusion">The Algorithmic Dance: GANs in Stable Diffusion</a>

</li>
</ul>
<li><a href="#the-potential-of-stable-diffusion">The Potential of Stable Diffusion</a>


<ul><li>
<a href="#elevating-artistic-expression-with-ai">Elevating Artistic Expression with AI</a>

</li>
<li><a href="#beyond-basics-exploring-concepts">Beyond Basics: Exploring Concepts</a>

</li>
<li><a href="#bridging-the-gap-the-humanai-collaboration">Bridging the Gap: The Human-AI Collaboration</a>

</li>
<li><a href="#comparing-stable-diffusion-with-midjourney-amp-dalle">Comparing Stable Diffusion with Midjourney, &amp; DALL-E</a>


</li>

</li>

</li>
</ul>
<li><a href="#using-stable-diffusion-tools-and-resources">Using Stable Diffusion: Tools and Resources</a>


<ul><li>
<a href="#image-generation-sites">Image generation sites</a>

</li>
<li><a href="#run-an-online-instance">Run an online instance</a>

</li>
<li><a href="#local-installation">Local installation</a>

</li>
<li><a href="#training-tool">Training tool</a>

</li>
<li><a href="#resources">Resources</a>

</li>
</ul>
<li><a href="#start-your-stable-diffusion-journey">Start your Stable Diffusion journey</a>
</li></ul></div>



<p>Let’s dive into the captivating world of Stable Diffusion, where imagination and algorithms join forces to create art like never before.</p>


<h2 class="wp-block-heading" id="unraveling-the-mystery-what-is-stable-diffusion">Unraveling the Mystery: What is Stable Diffusion?</h2>


<p>Stable Diffusion, in simple terms, is a remarkable technique that uses the power of artificial intelligence to create stunning and mesmerizing images. It simulates the process of colors or patterns spreading and blending harmoniously, resulting in visually captivating transformations.</p>



<p>What’s in a name? Well, it can aptly describe what something is but confuse beginners. We learnt about diffusion in chemistry class, so how is it related to artificial intelligence and machine learning? And why is it stable? What happens if it’s unstable? (You don’t want to know what Unstable Diffusion is. Yes, it’s a thing and it’s NSFW.)</p>



<p>The term “Stable” in the name comes from the startup <a href="https://stability.ai/" target="_blank" rel="noopener">Stability AI</a> that developed Stable Diffusion model. “Diffusion” refers to it being a latent diffusion model.</p>



<p>Before I start, I’d like to make it clear that I’m not a machine learning expert. What I share here is my understanding of Stable Diffusion. While I do my best to provide accurate information and explanations, I’m also well aware that I might be completely wrong. Please feel free to correct me where I’m wrong. This is what learning in public is all about.</p>



<figure class="wp-block-image size-large"><img data-recalc-dims="1" decoding="async" width="1140" height="599" src="https://i0.wp.com/jenxi.com/wp-content/uploads/water-mage-1-by-jenxi-banner.webp?resize=1140%2C599&#038;ssl=1" alt="Water mage by Jenxi Seow" class="wp-image-15462" srcset="https://i0.wp.com/jenxi.com/wp-content/uploads/water-mage-1-by-jenxi-banner.webp?resize=1140%2C599&amp;ssl=1 1140w, https://i0.wp.com/jenxi.com/wp-content/uploads/water-mage-1-by-jenxi-banner.webp?resize=700%2C368&amp;ssl=1 700w, https://i0.wp.com/jenxi.com/wp-content/uploads/water-mage-1-by-jenxi-banner.webp?resize=768%2C403&amp;ssl=1 768w, https://i0.wp.com/jenxi.com/wp-content/uploads/water-mage-1-by-jenxi-banner.webp?w=1200&amp;ssl=1 1200w" sizes="(max-width: 1000px) 100vw, 1000px" /><figcaption class="wp-element-caption"><a href="https://www.deviantart.com/jenxi/art/Water-Mage-demure-976631438" data-type="link" data-id="https://www.deviantart.com/jenxi/art/Water-Mage-demure-976631438" target="_blank" rel="noopener">Water mage</a> by Jenxi</figcaption></figure>


<h2 class="wp-block-heading" id="the-mechanics-of-stable-diffusion">The Mechanics of Stable Diffusion</h2>


<p>So, let’s get down to demystifying Stable Diffusion. You probably remember diffusion from chemistry class. If you don’t or aren’t familiar with it, I’ll try to explain it with an analogy.&nbsp;</p>


<h3 class="wp-block-heading" id="core-concept-diffusion">Core Concept: Diffusion</h3>


<p>Imagine you&#8217;re in a room filled with coloured smoke. At first, the colours might be clustered, forming pockets of intensity. Diffusion occurs when these clusters gradually spread and blend until the entire room is a harmonious mix of hues.</p>



<p>Stable Diffusion operates on a similar principle, but with data and features instead of colours. It&#8217;s like a digital artist&#8217;s brushstroke, smoothly transitioning and merging details to create a seamless and realistic image. This process ensures that every element in the artwork harmonises, resulting in a refined and captivating visual.</p>



<p>In machine learning, diffusion models learn the latent structure of a dataset by modelling how data points diffuse through the latent space, where items that resemble each other are positioned close to each other.</p>



<p>In computer vision, the neural network is trained to denoise blurred images with Gaussian noise by learning to reverse the diffusion process.</p>



<figure class="wp-block-image size-large"><img data-recalc-dims="1" decoding="async" width="1140" height="772" src="https://i0.wp.com/jenxi.com/wp-content/uploads/Stable-Diffusion-denoising-process.webp?resize=1140%2C772&#038;ssl=1" alt="What is Stable Diffusion? Stable Diffusion denoising process" class="wp-image-15463" srcset="https://i0.wp.com/jenxi.com/wp-content/uploads/Stable-Diffusion-denoising-process.webp?resize=1140%2C772&amp;ssl=1 1140w, https://i0.wp.com/jenxi.com/wp-content/uploads/Stable-Diffusion-denoising-process.webp?resize=700%2C474&amp;ssl=1 700w, https://i0.wp.com/jenxi.com/wp-content/uploads/Stable-Diffusion-denoising-process.webp?resize=768%2C520&amp;ssl=1 768w, https://i0.wp.com/jenxi.com/wp-content/uploads/Stable-Diffusion-denoising-process.webp?w=1200&amp;ssl=1 1200w" sizes="(max-width: 1000px) 100vw, 1000px" /><figcaption class="wp-element-caption">Stable Diffusion denoising process. Source: <a href="https://en.wikipedia.org/wiki/Stable_Diffusion#/media/File:X-Y_plot_of_algorithmically-generated_AI_art_of_European-style_castle_in_Japan_demonstrating_DDIM_diffusion_steps.png" data-type="link" data-id="https://en.wikipedia.org/wiki/Stable_Diffusion#/media/File:X-Y_plot_of_algorithmically-generated_AI_art_of_European-style_castle_in_Japan_demonstrating_DDIM_diffusion_steps.png" target="_blank" rel="noopener">Wikipedia</a></figcaption></figure>


<h3 class="wp-block-heading" id="the-foundation-generative-adversarial-networks-gans">The Foundation: Generative Adversarial Networks (GANs)</h3>


<p>To understand Stable Diffusion, you need to get acquainted with Generative Adversarial Networks, or GANs for short.&nbsp;</p>



<p>Imagine there are two forces in the neural network – a generator and a discriminator. The generator’s job is to conjure up images with the aim of creating something that resembles reality. The discriminator then decides if the generated image is real or a computer-generated imitation.</p>



<p>This is where things get interesting. The generator learns from the discriminator’s feedback and is constantly improving its ability to create increasingly convincing images. Likewise, the discriminator is also getting better at determining whether the images are real. It is a never-ending dance to keep pushing towards increasingly higher performance to produce images that are very close to reality.</p>



<p>This is what makes GANs so powerful in AI image generation.</p>


<h3 class="wp-block-heading" id="the-algorithmic-dance-gans-in-stable-diffusion">The Algorithmic Dance: GANs in Stable Diffusion</h3>


<p>How does all this tie into Stable Diffusion? Imagine a Taichi master directing these two forces to achieve perfect balance and harmony. Stable Diffusion operates on the principle of probability by leveraging the inherent uncertainty and randomness in the generation process. It fine-tunes the interplay between the generator and discriminator in a GAN, enhancing its ability to generate images that are highly detailed and and realistic.</p>



<p>Through meticulous adjustments to the training process, we are able to train the algorithm to generate results with increasingly higher quality and more refined outputs.</p>


<h2 class="wp-block-heading" id="the-potential-of-stable-diffusion">The Potential of Stable Diffusion</h2>


<p>Stable Diffusion is a transformative technique that represents a significant leap forward in generative art that has taken the world by storm.</p>


<h3 class="wp-block-heading" id="elevating-artistic-expression-with-ai">Elevating Artistic Expression with AI</h3>


<p>This technique opens up a world of possibilities for artists, photographers, and creatives. Stable Diffusion acts as a catalyst, enhancing the artist&#8217;s ability to express themselves through the synergy of human creativity and artificial intelligence.&nbsp;</p>



<p>It&#8217;s a tool that empowers us to explore uncharted territories, unlocking styles and concepts that were once beyond imagination. With Stable Diffusion, artists transcend the limitations of conventional art creation, venturing into a realm where imagination knows no bounds.</p>


<h3 class="wp-block-heading" id="beyond-basics-exploring-concepts">Beyond Basics: Exploring Concepts</h3>


<p>What does that mean? You’ve probably seen some AI generated images and know that AI can generate images of a person or an object. It is able to imitate an artist’s style, painting medium, painting style. It can also reproduce photographs based to the focal length, lighting, or a specific type of look or a photographer’s style. It can even understand composition and placement of subjects. But there’s more to generative AI.</p>



<p>We can go beyond just a look or style. You can train concepts such as a pose or pattern. It is able to learn what different clothing look like. You can even train textures or materials. A stormtrooper from Star Wars wearing armour made of blue and white porcelain. Or a mage conjuring water or magic.</p>



<figure class="wp-block-image size-large"><img data-recalc-dims="1" decoding="async" width="1140" height="760" src="https://i0.wp.com/jenxi.com/wp-content/uploads/porcelain-stormtrooper-1-by-jenxi-free.webp?resize=1140%2C760&#038;ssl=1" alt="Porcelain stormtrooper by Jenxi Seow blends blue and white designs on the ceramic armour" class="wp-image-15453" srcset="https://i0.wp.com/jenxi.com/wp-content/uploads/porcelain-stormtrooper-1-by-jenxi-free.webp?resize=1140%2C760&amp;ssl=1 1140w, https://i0.wp.com/jenxi.com/wp-content/uploads/porcelain-stormtrooper-1-by-jenxi-free.webp?resize=700%2C467&amp;ssl=1 700w, https://i0.wp.com/jenxi.com/wp-content/uploads/porcelain-stormtrooper-1-by-jenxi-free.webp?resize=768%2C512&amp;ssl=1 768w, https://i0.wp.com/jenxi.com/wp-content/uploads/porcelain-stormtrooper-1-by-jenxi-free.webp?w=1200&amp;ssl=1 1200w" sizes="(max-width: 1000px) 100vw, 1000px" /><figcaption class="wp-element-caption">Porcelain stormtrooper by Jenxi blends blue and white designs on the ceramic armour</figcaption></figure>



<p>You can even create what we call world morphs. These are concepts that influence and transform everything that appears in the world, hence the name. From cyberpunk to steampunk that we are familiar with, all the way to fantasy worlds where candy, bones or mushrooms dominate the world.</p>



<p>Whatever concept you can come up with, you can attempt to train with Stable Diffusion. And this is just scratching the surface. You can merge concepts to create images generated based on concepts that are out of this world. The limit is your imagination.</p>



<p>Stable Diffusion is an open invitation to artists to break free from the confines of established styles and concepts. It&#8217;s a gateway to uncharted territories, where the exploration of diverse artistic expressions becomes not just possible, but exhilarating. Whether it&#8217;s blending genres, experimenting with new techniques, or venturing into unexplored thematic realms, Stable Diffusion empowers artists to bring their imaginations to life.</p>


<h3 class="wp-block-heading" id="bridging-the-gap-the-humanai-collaboration">Bridging the Gap: The Human-AI Collaboration</h3>


<p>If you’re an artist who is still adamant on boycotting generative AI for ethical reasons, I urge you to give it a try. You can maintain your stance while experimenting in private for research purposes. Stable Diffusion heralds a new era in artistic methodologies and I believe every artist should try to understand what a powerful tool it can be.&nbsp;</p>



<p>It redefines how artists approach their craft, introducing innovative techniques that fuse human intuition with the capabilities of generative AI. The result is a dynamic interplay that pushes the boundaries of what&#8217;s achievable, paving the way for novel creative processes and groundbreaking artistic endeavours, not dissimilar to how Photoshop and digital painting and photography has transformed the art world.</p>



<figure class="wp-block-image size-large"><img data-recalc-dims="1" decoding="async" width="1140" height="760" src="https://i0.wp.com/jenxi.com/wp-content/uploads/cyber-self-portrait-by-jenxi-web.webp?resize=1140%2C760&#038;ssl=1" alt="AI self portrait of Jenxi" class="wp-image-15464" srcset="https://i0.wp.com/jenxi.com/wp-content/uploads/cyber-self-portrait-by-jenxi-web.webp?resize=1140%2C760&amp;ssl=1 1140w, https://i0.wp.com/jenxi.com/wp-content/uploads/cyber-self-portrait-by-jenxi-web.webp?resize=700%2C467&amp;ssl=1 700w, https://i0.wp.com/jenxi.com/wp-content/uploads/cyber-self-portrait-by-jenxi-web.webp?resize=768%2C512&amp;ssl=1 768w, https://i0.wp.com/jenxi.com/wp-content/uploads/cyber-self-portrait-by-jenxi-web.webp?w=1200&amp;ssl=1 1200w" sizes="(max-width: 1000px) 100vw, 1000px" /><figcaption class="wp-element-caption">AI self portrait of Jenxi</figcaption></figure>



<p>Stable Diffusion is not just a tool. It&#8217;s a creative partner that harmonises the human touch with the precision of AI. The results are collaborative works that transcend individual capabilities. The artist and the algorithm engage in a symbiotic dance, each contributing their strengths to craft art that is a testament to the potential of human-AI collaboration. Together, they bridge the gap between traditional artistic methods and the cutting-edge world of AI-generated art.</p>



<p>Tradition meets innovation as Stable Diffusion blurs the lines between conventional and digital art forms. It challenges preconceived notions about the boundaries of artistic expression, proving that the digital realm is a canvas as versatile and expressive as any traditional medium. This paradigm shift invites artists and audiences alike to embrace the limitless possibilities offered by technology in the pursuit of creative excellence.</p>



<p>AI-generated art is at this intersection that innovation flourishes, birthing a new breed of art that resonates with the digital age.</p>


<h3 class="wp-block-heading" id="comparing-stable-diffusion-with-midjourney-amp-dalle">Comparing Stable Diffusion with Midjourney, &amp; DALL-E</h3>


<p>Stable Diffusion, Midjourney, and DALL-E represent the vanguard of AI art generation, each with its distinctive approach and its own set of strengths and limitations. I’ve listed down the key pros and cons that a beginner should consider.</p>


<h4 class="wp-block-heading has-large-font-size" id="dalle">DALL-E</h4>


<p>Pros:</p>



<ul class="wp-block-list">
<li><strong>Easy to use.</strong> DALL-E has a simple interface with minimal learning curve, making it easy for beginners to learn.</li>



<li><strong>Hosted resources.</strong> It’s an online service so you leverage on OpenAI’s servers to do the processing.</li>
</ul>



<p>Cons:&nbsp;</p>



<ul class="wp-block-list">
<li><strong>Limited creative options.</strong> DALL-E&#8217;s simplicity means you rely on text prompting to generate images.</li>



<li><strong>Pay to use.</strong> You need to <a href="https://help.openai.com/en/articles/6399305-how-dall-e-credits-work" target="_blank" rel="noopener">buy credits</a> to generate images. Each prompt generates four images and uses one credit. It costs USD 15 for 115 credits, that’s around USD 0.13 per prompt or USD 0.0325 per image. It used to give out free monthly credits but that option is gone.</li>
</ul>


<h4 class="wp-block-heading has-large-font-size" id="midjourney">Midjourney</h4>


<p>Pros:&nbsp;</p>



<ul class="wp-block-list">
<li><strong>Amazing output.</strong> Midjourney is probably the most well-known among the three, or even all computer vision tools, for the highly artistic images it can generate with simple prompts.</li>
</ul>



<ul class="wp-block-list">
<li><strong>Many resources available.</strong> There are many Midjourney prompts out there for you to refer and use to generate images.</li>
</ul>



<p>Cons:</p>



<ul class="wp-block-list">
<li><strong>Pay to use.</strong> While DALL-E uses a credit system, Midjourney charges a <a href="https://mid-journey.ai/pricing/" target="_blank" rel="noopener">monthly subscription</a> that limits the number of generation or the duration of the generation. For example, the basic plan is limited to 200 generations per month, while the standard and pro plans get unlimited relaxed generations and 15 hours and 30 hours of fast generations respectively.</li>
</ul>



<ul class="wp-block-list">
<li><strong>Account required.</strong> You need to register for a Discord account to join Midjourney’s server to use it. Not a hassle if you already have an account. There are many Discord servers for AI communities, so it’s a good idea to get an account to access them. </li>
</ul>



<ul class="wp-block-list">
<li><strong>Not as easy as DALL-E.</strong> There is a slight learning curve in learning the commands for the Discord bot to generate images.</li>
</ul>


<h4 class="wp-block-heading has-large-font-size" id="stable-diffusion">Stable Diffusion</h4>


<p>Pros:&nbsp;</p>



<ul class="wp-block-list">
<li><strong>Most powerful out of the three.</strong> With Stable Diffusion, you get better control over the output and there are tools to train custom models. Midjourney generates output that is arguably better, depending on your tastes, but this point becomes moot once you discover that you can train Stable Diffusion models to imitate the Midjourney style.</li>
</ul>



<ul class="wp-block-list">
<li><strong>You can run it locally.</strong> This let you customise how you run it and if you have a decent GPU, you can get generate outputs faster and at higher resolutions compared to the other two.</li>
</ul>



<ul class="wp-block-list">
<li><strong>Free!</strong> It’s free to use and open source.</li>
</ul>



<ul class="wp-block-list">
<li><strong>Large amount of resources.</strong> There is a large amount of custom models trained by others, ready for you to use. I current have over 2,000 models.</li>
</ul>



<p>Cons:&nbsp;</p>



<ul class="wp-block-list">
<li><strong>Most difficult out of the three.</strong> Stable Diffusion has a steeper learning curve since there are more options and tools available.</li>
</ul>



<ul class="wp-block-list">
<li><strong>Work needed to get it running.</strong> The installation process might be a hurdle, especially if you’re not used to working with Python. There are one-click installers, but things move so fast in the world of Stable Diffusion, so expect breakage and problems.</li>
</ul>



<ul class="wp-block-list">
<li><strong>Hardware requirements.</strong> You need a decent setup, including at least 6 to 8 GB of VRAM and enough storage space. How much space do you need? Models range from several MBs to 10 GBs. My 2,000 plus models take up almost 2 TBs of space.</li>
</ul>


<h2 class="wp-block-heading" id="using-stable-diffusion-tools-and-resources">Using Stable Diffusion: Tools and Resources</h2>


<p>There are several ways to use Stable Diffusion. You can use image generation sites that run on Stable Diffusion, run Stable Diffusion on a cloud service, or install it locally.</p>


<h3 class="wp-block-heading" id="image-generation-sites">Image generation sites</h3>


<p>Use an online service if you prefer not to go through the hassle of setting up a local installation of Stable Diffusion, or if your machine is unable to run Stable Diffusion. The more popular ones are <a href="https://rundiffusion.com" data-type="link" data-id="https://rundiffusion.com" target="_blank" rel="noopener">RunDiffusion</a>, <a href="https://www.mage.space" target="_blank" rel="noopener">Mage Space</a> and <a href="https://pixai.art" target="_blank" rel="noopener">PixAI</a>. I started out using <a href="https://creator.nightcafe.studio" target="_blank" rel="noopener">NightCafe</a>. And there’s also <a href="https://beta.dreamstudio.ai/generate" target="_blank" rel="noopener">Dream Studio</a> from Stability AI themselves.</p>



<p>Like Midjourney and DALL-E, these sites have to pay for the site development, hosting, and maintenance costs on top of the GPU processing cost to generate images for you. So they all require you either pay a subscription or buy credits.</p>



<p>These sites ensure that the models and different extensions work well so you can focus on your AI art generation without worrying about the technicalities.</p>


<h3 class="wp-block-heading" id="run-an-online-instance">Run an online instance</h3>


<p>If you don’t want or are unable to run Stable Diffusion locally, there’s another option available for you. You can run Stable Diffusion online by using <a href="https://colab.research.google.com" target="_blank" rel="noopener">Google Colaboratory</a>, or Colab for short. Google Colab allows you to run Python code on Google’s server using your Google Drive to store the models and images generated.</p>



<p>People were using Google Colab to run Stable Diffusion for free but Google has since changed their policies to require a Colab Pro subscription of USD 9.99 per month to run Stable Diffusion on Colab.</p>



<p>You can easily get started using the <a href="https://colab.research.google.com/github/TheLastBen/fast-stable-diffusion/blob/main/fast_stable_diffusion_AUTOMATIC1111.ipynb" target="_blank" rel="noopener">Fast Stable Diffusion Colab notebook</a> shared by TheLastBen. The instructions are in the notebook and you can get your Stable Diffusion up and running pretty quickly.</p>



<p>Alternatives include <a href="https://huggingface.co/spaces?sort=trending&amp;search=stable+diffusion" target="_blank" rel="noopener">Hugging Face spaces</a> and <a href="https://blog.runpod.io/using-the-stablediffusion-template-to-create-a-runpod-instance/" target="_blank" rel="noopener">Runpod</a>.</p>



<p>When you use an online instance, you pay based on your GPU usage. The advantage of this over image generation sites is greater control over the Stable Diffusion instance. You can run Stable Diffusion wherever you are since it is a cloud instance.</p>


<h3 class="wp-block-heading" id="local-installation">Local installation</h3>


<p>You can install Stable Diffusion locally on your computer. You need a GPU with at least 6 GB of VRAM to run Stable Diffusion 1.5 and 2.1, and at least 8 GB of VRAM to run Stable Diffusion XL.</p>



<p>Stable Diffusion is <a href="https://github.com/Stability-AI/stablediffusion" target="_blank" rel="noopener">available on GitHub</a>. However, you will need to run it using a graphical user interface if you don’t want to operate it via a command prompt.</p>



<p>The most popular GUI is <a href="https://github.com/AUTOMATIC1111/stable-diffusion-webui" target="_blank" rel="noopener">Stable Diffusion Web UI</a> by AUTOMATIC1111. It is often referred to as the Automatic1111 Web UI or A1111. This is what I use for my AI image generation. It is very well-supported and I’ve witnessed it growing by leaps and bounds from version 1.4 to the current 1.6.</p>



<figure class="wp-block-image size-large"><img data-recalc-dims="1" decoding="async" width="1140" height="781" src="https://i0.wp.com/jenxi.com/wp-content/uploads/Stable-Diffusion-Web-UI.webp?resize=1140%2C781&#038;ssl=1" alt="Stable Diffusion Web UI" class="wp-image-15466" srcset="https://i0.wp.com/jenxi.com/wp-content/uploads/Stable-Diffusion-Web-UI.webp?resize=1140%2C781&amp;ssl=1 1140w, https://i0.wp.com/jenxi.com/wp-content/uploads/Stable-Diffusion-Web-UI.webp?resize=700%2C480&amp;ssl=1 700w, https://i0.wp.com/jenxi.com/wp-content/uploads/Stable-Diffusion-Web-UI.webp?resize=768%2C526&amp;ssl=1 768w, https://i0.wp.com/jenxi.com/wp-content/uploads/Stable-Diffusion-Web-UI.webp?w=1200&amp;ssl=1 1200w" sizes="(max-width: 1000px) 100vw, 1000px" /><figcaption class="wp-element-caption">Stable Diffusion Web UI</figcaption></figure>



<p>There is a popular fork by vladmandic called <a href="https://github.com/vladmandic/automatic" target="_blank" rel="noopener">SD.Next</a> that started out adding improvements to the Automatic1111 WebUI but has since diverged so much that it is considered a standalone GUI for Stable Diffusion. It is sometimes referred to as Vlad’s Automatic.</p>



<p>Some, especially power users, swear by <a href="https://github.com/comfyanonymous/ComfyUI" target="_blank" rel="noopener">ComfyUI</a>. It is another GUI that takes a modular approach to operating Stable Diffusion, allowing you to create advanced pipelines for your workflows. The complex nature of ComfyUI means that it has a steeper learning curve compared to A1111, but once you get the hang of it, it becomes a very powerful tool.</p>



<figure class="wp-block-image size-large"><img data-recalc-dims="1" decoding="async" width="1140" height="432" src="https://i0.wp.com/jenxi.com/wp-content/uploads/ComfyUI.webp?resize=1140%2C432&#038;ssl=1" alt="" class="wp-image-15470" srcset="https://i0.wp.com/jenxi.com/wp-content/uploads/ComfyUI.webp?resize=1140%2C432&amp;ssl=1 1140w, https://i0.wp.com/jenxi.com/wp-content/uploads/ComfyUI.webp?resize=700%2C265&amp;ssl=1 700w, https://i0.wp.com/jenxi.com/wp-content/uploads/ComfyUI.webp?resize=768%2C291&amp;ssl=1 768w, https://i0.wp.com/jenxi.com/wp-content/uploads/ComfyUI.webp?resize=1536%2C583&amp;ssl=1 1536w, https://i0.wp.com/jenxi.com/wp-content/uploads/ComfyUI.webp?w=1859&amp;ssl=1 1859w" sizes="(max-width: 1000px) 100vw, 1000px" /><figcaption class="wp-element-caption">ComfyUI</figcaption></figure>


<h3 class="wp-block-heading" id="training-tool">Training tool</h3>


<p>You can train models with Automatic1111 Web UI. For better control over the training setup and parameters, there’s <a href="https://github.com/kohya-ss/sd-scripts" target="_blank" rel="noopener">Kohya’s Stable Diffusion training scripts</a>. Like the A1111, you need a GUI to make operating it easier. The GUI for Kohya’s scripts is <a href="https://github.com/bmaltais/kohya_ss" target="_blank" rel="noopener">Kohya’s GUI</a> by bmaltais, more commonly referred to as kohya_ss.</p>


<h3 class="wp-block-heading" id="resources">Resources</h3>


<p>The two largest repositories for Stable Diffusion models are <a href="https://civitai.com" target="_blank" rel="noopener">Civitai</a> and <a href="https://huggingface.co/models" target="_blank" rel="noopener">Hugging Face</a>.&nbsp;</p>



<div class="wp-block-group has-global-padding is-layout-constrained wp-block-group-is-layout-constrained">
<div class="wp-block-outermost-icon-block"><div class="icon-container" style="width:260px"><svg width="126" height="23" viewBox="0 0 126 23" fill="none" xmlns="http://www.w3.org/2000/svg"><path d="M40.3 2.09998H23.2L21 4.49998V19.5L23.3 21.8H40.4V16.6H26.2V7.39998H40.3V2.09998Z" fill="black"></path><path d="M95.6 2.09998H76.1V7.39998H83.3V21.7H88.5V7.39998H95.5L95.6 2.09998ZM42.7 2.19998V21.7H47.9V2.19998H42.7ZM50.3 2.19998V21.7H57.9L66.2 13.4V2.19998H61V10.5L55.6 16.5V2.19998H50.3ZM68.6 2.19998V21.7H73.8V2.19998H68.6Z" fill="black"></path><path d="M119.8 2.19995V21.7H125V2.19995H119.8ZM115.1 2.19995H100.3L98 4.49995V21.7H103.2V14.6H112.1V21.7H117.3V4.49995L115.1 2.19995ZM112.2 9.29995H103.3V7.39995H112.2V9.29995Z" fill="black"></path><path d="M66.2 16.6V21.7H61.1" fill="black"></path><path d="M1 7V17L9.7 22L18.4 17V7L9.7 2L1 7Z" fill="url(#paint0_linear_901_4)"></path><path d="M9.7 5.1L15.6 8.5V15.4L9.7 18.8L3.7 15.4V8.5L9.7 5.1ZM9.7 2L1 7V17L9.7 22L18.4 17V7C18.3 7 9.7 2 9.7 2Z" fill="url(#paint1_linear_901_4)"></path><path d="M11.3 12.8L9.6 13.8L7.9 12.8V10.9L9.6 9.9L11.3 10.9H13.4V9.7L9.6 7.5L5.9 9.7V14L9.7 16.2L13.5 14V12.8H11.3Z" fill="white"></path><defs><linearGradient id="paint0_linear_901_4" x1="9.656" y1="1.95" x2="9.656" y2="21.9386" gradientUnits="userSpaceOnUse"><stop stop-color="#081692"></stop><stop offset="1" stop-color="#1E043C"></stop></linearGradient><linearGradient id="paint1_linear_901_4" x1="9.656" y1="1.95" x2="9.656" y2="21.95" gradientUnits="userSpaceOnUse"><stop stop-color="#1284F7"></stop><stop offset="1" stop-color="#0A20C9"></stop></linearGradient></defs></svg></div></div>



<div class="wp-block-outermost-icon-block"><div class="icon-container" style="width:320px"><svg width="510" height="88" viewBox="0 0 510 88" fill="none" xmlns="http://www.w3.org/2000/svg"><path d="M47.21 76.5C56.4263 76.5 65.2651 72.8388 71.782 66.322C78.2988 59.8051 81.96 50.9663 81.96 41.75C81.96 32.5337 78.2988 23.6949 71.782 17.178C65.2651 10.6612 56.4263 7 47.21 7C37.9937 7 29.1549 10.6612 22.638 17.178C16.1211 23.6949 12.46 32.5337 12.46 41.75C12.46 50.9663 16.1211 59.8051 22.638 66.322C29.1549 72.8388 37.9937 76.5 47.21 76.5Z" fill="#FFD21E"></path><path d="M81.96 41.75C81.96 32.5337 78.2988 23.6949 71.782 17.178C65.2651 10.6612 56.4263 7 47.21 7C37.9937 7 29.1549 10.6612 22.638 17.178C16.1211 23.6949 12.46 32.5337 12.46 41.75C12.46 50.9663 16.1211 59.8051 22.638 66.322C29.1549 72.8388 37.9937 76.5 47.21 76.5C56.4263 76.5 65.2651 72.8388 71.782 66.322C78.2988 59.8051 81.96 50.9663 81.96 41.75ZM8.46 41.75C8.46 36.6613 9.4623 31.6224 11.4097 26.921C13.357 22.2197 16.2113 17.9479 19.8096 14.3496C23.4079 10.7513 27.6796 7.89704 32.381 5.94967C37.0824 4.0023 42.1213 3 47.21 3C52.2987 3 57.3376 4.0023 62.039 5.94967C66.7403 7.89704 71.0121 10.7513 74.6104 14.3496C78.2087 17.9479 81.063 22.2197 83.0103 26.921C84.9577 31.6224 85.96 36.6613 85.96 41.75C85.96 52.0271 81.8774 61.8834 74.6104 69.1504C67.3434 76.4174 57.4871 80.5 47.21 80.5C36.9329 80.5 27.0766 76.4174 19.8096 69.1504C12.5426 61.8834 8.46 52.0271 8.46 41.75Z" fill="#FF9D0B"></path><path d="M58.5 32.3C59.78 32.74 60.28 35.36 61.57 34.68C62.4435 34.2162 63.1599 33.5038 63.6285 32.6329C64.0971 31.762 64.2969 30.7717 64.2026 29.7872C64.1083 28.8027 63.7242 27.8683 63.0989 27.1021C62.4735 26.336 61.635 25.7725 60.6893 25.4829C59.7437 25.1933 58.7334 25.1907 57.7863 25.4754C56.8392 25.76 55.9977 26.3191 55.3684 27.082C54.739 27.8449 54.3501 28.7773 54.2507 29.7613C54.1513 30.7453 54.3459 31.7366 54.81 32.61C55.42 33.76 57.36 31.89 58.51 32.29L58.5 32.3ZM34.95 32.3C33.67 32.74 33.16 35.36 31.88 34.68C31.0065 34.2162 30.2901 33.5038 29.8215 32.6329C29.3529 31.762 29.1531 30.7717 29.2474 29.7872C29.3417 28.8027 29.7258 27.8683 30.3511 27.1021C30.9765 26.336 31.815 25.7725 32.7607 25.4829C33.7063 25.1933 34.7166 25.1907 35.6637 25.4754C36.6109 25.76 37.4523 26.3191 38.0816 27.082C38.711 27.8449 39.0999 28.7773 39.1993 29.7613C39.2987 30.7453 39.1041 31.7366 38.64 32.61C38.03 33.76 36.08 31.89 34.94 32.29L34.95 32.3Z" fill="#3A3B45"></path><path d="M46.96 56.29C56.79 56.29 59.96 47.53 59.96 43.03C59.96 40.69 58.39 41.43 55.87 42.67C53.54 43.82 50.41 45.41 46.97 45.41C39.78 45.41 33.97 38.53 33.97 43.03C33.97 47.53 37.13 56.29 46.97 56.29H46.96Z" fill="#FF323D"></path><path fill-rule="evenodd" clip-rule="evenodd" d="M39.43 54C39.9577 52.9277 40.7006 51.9756 41.6124 51.2031C42.5243 50.4306 43.5856 49.8543 44.73 49.51C45.13 49.39 45.54 50.08 45.97 50.79C46.37 51.47 46.79 52.16 47.21 52.16C47.66 52.16 48.11 51.48 48.54 50.81C48.99 50.11 49.43 49.43 49.86 49.56C52.0082 50.242 53.8034 51.7391 54.86 53.73C58.59 50.79 59.96 45.99 59.96 43.03C59.96 40.69 58.39 41.43 55.87 42.67L55.73 42.74C53.42 43.89 50.34 45.41 46.96 45.41C43.58 45.41 40.51 43.89 38.19 42.74C35.59 41.45 33.96 40.64 33.96 43.03C33.96 46.08 35.42 51.09 39.43 54Z" fill="#3A3B45"></path><path d="M70.71 37C71.572 37 72.3986 36.6576 73.0081 36.0481C73.6176 35.4386 73.96 34.612 73.96 33.75C73.96 32.888 73.6176 32.0614 73.0081 31.4519C72.3986 30.8424 71.572 30.5 70.71 30.5C69.848 30.5 69.0214 30.8424 68.4119 31.4519C67.8024 32.0614 67.46 32.888 67.46 33.75C67.46 34.612 67.8024 35.4386 68.4119 36.0481C69.0214 36.6576 69.848 37 70.71 37ZM24.21 37C25.072 37 25.8986 36.6576 26.5081 36.0481C27.1176 35.4386 27.46 34.612 27.46 33.75C27.46 32.888 27.1176 32.0614 26.5081 31.4519C25.8986 30.8424 25.072 30.5 24.21 30.5C23.348 30.5 22.5214 30.8424 21.9119 31.4519C21.3024 32.0614 20.96 32.888 20.96 33.75C20.96 34.612 21.3024 35.4386 21.9119 36.0481C22.5214 36.6576 23.348 37 24.21 37ZM17.52 48C15.9 48 14.46 48.66 13.45 49.87C12.5887 50.9338 12.1192 52.2613 12.12 53.63C11.4903 53.4407 10.8374 53.3397 10.18 53.33C8.63 53.33 7.23 53.92 6.24 54.99C5.35676 55.9072 4.79989 57.0895 4.65531 58.3546C4.51073 59.6197 4.78648 60.8972 5.44 61.99C4.55499 62.7125 3.92713 63.7016 3.65 64.81C3.41 65.71 3.17 67.61 4.45 69.55C3.97016 70.2877 3.68463 71.1348 3.61994 72.0125C3.55525 72.8901 3.71351 73.7699 4.08 74.57C5.1 76.89 7.65 78.71 12.6 80.67C15.67 81.89 18.49 82.67 18.51 82.68C22.0724 83.667 25.7442 84.2045 29.44 84.28C35.3 84.28 39.49 82.48 41.9 78.94C45.78 73.25 45.23 68.04 40.2 63.02C37.43 60.24 35.58 56.15 35.2 55.25C34.42 52.59 32.36 49.63 28.95 49.63C28.0428 49.6443 27.1522 49.875 26.3521 50.3029C25.552 50.7308 24.8656 51.3435 24.35 52.09C23.35 50.83 22.37 49.84 21.49 49.27C20.3157 48.475 18.9377 48.0342 17.52 48ZM17.52 52C18.03 52 18.66 52.22 19.34 52.65C21.48 54.01 25.59 61.08 27.1 63.83C27.6 64.75 28.47 65.14 29.24 65.14C30.79 65.14 31.99 63.61 29.39 61.66C25.47 58.73 26.84 53.94 28.71 53.65C28.79 53.63 28.88 53.63 28.95 53.63C30.65 53.63 31.4 56.56 31.4 56.56C31.4 56.56 33.6 62.08 37.38 65.86C41.15 69.63 41.35 72.66 38.6 76.69C36.72 79.44 33.13 80.27 29.44 80.27C25.63 80.27 21.71 79.37 19.52 78.81C19.41 78.78 6.07 75.01 7.76 71.81C8.04 71.27 8.51 71.05 9.1 71.05C11.48 71.05 15.8 74.59 17.67 74.59C18.08 74.59 18.37 74.42 18.5 73.99C19.29 71.14 6.44 69.94 7.52 65.82C7.72 65.09 8.23 64.8 8.96 64.8C12.1 64.8 19.16 70.33 20.64 70.33C20.75 70.33 20.84 70.3 20.88 70.23C21.62 69.03 21.21 68.19 15.98 65.03C10.77 61.87 7.1 59.97 9.18 57.7C9.42 57.44 9.76 57.32 10.18 57.32C13.35 57.32 20.84 64.14 20.84 64.14C20.84 64.14 22.86 66.24 24.09 66.24C24.37 66.24 24.61 66.14 24.77 65.86C25.63 64.4 16.71 57.64 16.21 54.85C15.87 52.95 16.45 52 17.52 52Z" fill="#FF9D0B"></path><path d="M38.6 76.69C41.35 72.65 41.15 69.62 37.38 65.85C33.6 62.08 31.4 56.55 31.4 56.55C31.4 56.55 30.58 53.35 28.71 53.65C26.84 53.95 25.47 58.73 29.39 61.66C33.3 64.59 28.61 66.58 27.1 63.83C25.6 61.08 21.48 54.01 19.34 52.65C17.21 51.3 15.71 52.05 16.21 54.85C16.71 57.64 25.64 64.4 24.77 65.85C23.9 67.32 20.84 64.14 20.84 64.14C20.84 64.14 11.27 55.43 9.18 57.7C7.1 59.97 10.77 61.87 15.98 65.03C21.21 68.19 21.62 69.03 20.88 70.23C20.13 71.43 8.6 61.7 7.52 65.83C6.44 69.94 19.29 71.13 18.5 73.98C17.7 76.83 9.44 68.6 7.76 71.8C6.06 75.01 19.41 78.78 19.52 78.81C23.82 79.93 34.77 82.3 38.6 76.69Z" fill="#FFD21E"></path><path d="M77.4 48C79.02 48 80.47 48.66 81.47 49.87C82.3312 50.9338 82.8008 52.2613 82.8 53.63C83.4329 53.4397 84.0892 53.3388 84.75 53.33C86.3 53.33 87.7 53.92 88.69 54.99C89.5732 55.9072 90.1301 57.0895 90.2747 58.3546C90.4193 59.6197 90.1435 60.8972 89.49 61.99C90.3713 62.714 90.9955 63.703 91.27 64.81C91.51 65.71 91.75 67.61 90.47 69.55C90.9498 70.2877 91.2354 71.1348 91.3 72.0125C91.3647 72.8901 91.2065 73.7699 90.84 74.57C89.82 76.89 87.27 78.71 82.33 80.67C79.25 81.89 76.43 82.67 76.41 82.68C72.8476 83.667 69.1758 84.2045 65.48 84.28C59.62 84.28 55.43 82.48 53.02 78.94C49.14 73.25 49.69 68.04 54.72 63.02C57.5 60.24 59.35 56.15 59.73 55.25C60.51 52.59 62.56 49.63 65.97 49.63C66.8772 49.6443 67.7678 49.875 68.5679 50.3029C69.368 50.7308 70.0544 51.3435 70.57 52.09C71.57 50.83 72.55 49.84 73.44 49.27C74.6115 48.4768 75.9857 48.0361 77.4 48ZM77.4 52C76.89 52 76.27 52.22 75.58 52.65C73.45 54.01 69.33 61.08 67.82 63.83C67.6162 64.2224 67.3092 64.5517 66.9322 64.7826C66.5551 65.0134 66.1221 65.137 65.68 65.14C64.14 65.14 62.93 63.61 65.54 61.66C69.45 58.73 68.08 53.94 66.21 53.65C66.1306 53.6371 66.0504 53.6304 65.97 53.63C64.27 53.63 63.52 56.56 63.52 56.56C63.52 56.56 61.32 62.08 57.55 65.86C53.77 69.63 53.57 72.66 56.33 76.69C58.2 79.44 61.8 80.27 65.48 80.27C69.3 80.27 73.21 79.37 75.41 78.81C75.51 78.78 88.86 75.01 87.17 71.81C86.88 71.27 86.42 71.05 85.83 71.05C83.45 71.05 79.12 74.59 77.26 74.59C76.84 74.59 76.55 74.42 76.43 73.99C75.63 71.14 88.48 69.94 87.4 65.82C87.21 65.09 86.7 64.8 85.96 64.8C82.82 64.8 75.76 70.33 74.28 70.33C74.18 70.33 74.09 70.3 74.05 70.23C73.31 69.03 73.71 68.19 78.93 65.03C84.16 61.87 87.83 59.97 85.73 57.7C85.5 57.44 85.16 57.32 84.75 57.32C81.57 57.32 74.08 64.14 74.08 64.14C74.08 64.14 72.06 66.24 70.84 66.24C70.7025 66.2461 70.5661 66.2138 70.4459 66.1467C70.3258 66.0796 70.2268 65.9803 70.16 65.86C69.29 64.4 78.21 57.64 78.71 54.85C79.05 52.95 78.47 52 77.4 52Z" fill="#FF9D0B"></path><path d="M56.33 76.69C53.58 72.65 53.77 69.62 57.55 65.85C61.32 62.08 63.52 56.55 63.52 56.55C63.52 56.55 64.34 53.35 66.22 53.65C68.08 53.95 69.45 58.73 65.54 61.66C61.62 64.59 66.32 66.58 67.82 63.83C69.33 61.08 73.45 54.01 75.58 52.65C77.71 51.3 79.22 52.05 78.71 54.85C78.21 57.64 69.29 64.4 70.16 65.85C71.02 67.32 74.08 64.14 74.08 64.14C74.08 64.14 83.66 55.43 85.74 57.7C87.82 59.97 84.16 61.87 78.94 65.03C73.71 68.19 73.31 69.03 74.04 70.23C74.79 71.43 86.32 61.7 87.4 65.83C88.48 69.94 75.64 71.13 76.43 73.98C77.23 76.83 85.48 68.6 87.17 71.8C88.86 75.01 75.52 78.78 75.41 78.81C71.1 79.93 60.15 82.3 56.33 76.69Z" fill="#FFD21E"></path><path d="M116.608 67V25.4H127.616V40.888H140.032V25.4H151.04V67H140.032V50.488H127.616V67H116.608ZM169.065 67.768C165.566 67.768 163.028 66.5947 161.449 64.248C159.87 61.8587 159.081 58.68 159.081 54.712V35H170.089V53.304C170.089 55.352 170.345 56.7173 170.857 57.4C171.412 58.0827 172.222 58.424 173.289 58.424C174.313 58.424 175.166 58.2107 175.849 57.784C176.532 57.3573 177.214 56.632 177.897 55.608V35H188.905V67H179.945L179.177 62.776H178.921C177.641 64.312 176.212 65.528 174.633 66.424C173.097 67.32 171.241 67.768 169.065 67.768ZM207.985 80.184C205.553 80.184 203.313 79.9067 201.265 79.352C199.217 78.84 197.574 78.008 196.337 76.856C195.142 75.704 194.545 74.168 194.545 72.248C194.545 69.7307 196.038 67.6827 199.025 66.104V65.848C198.214 65.2507 197.51 64.5253 196.913 63.672C196.358 62.776 196.081 61.6667 196.081 60.344C196.081 59.2347 196.401 58.1467 197.041 57.08C197.724 56.0133 198.598 55.096 199.665 54.328V54.072C198.513 53.304 197.489 52.216 196.593 50.808C195.74 49.3573 195.313 47.7147 195.313 45.88C195.313 43.2347 195.953 41.0587 197.233 39.352C198.556 37.6027 200.262 36.3227 202.353 35.512C204.444 34.6587 206.662 34.232 209.009 34.232C210.886 34.232 212.593 34.488 214.129 35H226.033V42.936H221.425C221.553 43.32 221.66 43.8107 221.745 44.408C221.873 45.0053 221.937 45.624 221.937 46.264C221.937 48.7813 221.361 50.8293 220.209 52.408C219.057 53.944 217.5 55.0747 215.537 55.8C213.574 56.5253 211.398 56.888 209.009 56.888C207.942 56.888 206.748 56.7387 205.425 56.44C204.913 56.9093 204.657 57.528 204.657 58.296C204.657 58.9787 204.998 59.4693 205.681 59.768C206.406 60.0667 207.558 60.216 209.137 60.216H213.937C217.948 60.216 221.041 60.8773 223.217 62.2C225.436 63.48 226.545 65.592 226.545 68.536C226.545 70.84 225.777 72.8667 224.241 74.616C222.748 76.3653 220.614 77.7307 217.841 78.712C215.068 79.6933 211.782 80.184 207.985 80.184ZM209.009 50.36C210.033 50.36 210.865 49.9973 211.505 49.272C212.145 48.5467 212.465 47.416 212.465 45.88C212.465 44.3867 212.145 43.2987 211.505 42.616C210.865 41.8907 210.033 41.528 209.009 41.528C207.985 41.528 207.153 41.8907 206.513 42.616C205.873 43.2987 205.553 44.3867 205.553 45.88C205.553 47.416 205.873 48.5467 206.513 49.272C207.153 49.9973 207.985 50.36 209.009 50.36ZM209.777 73.4C211.569 73.4 213.062 73.1227 214.257 72.568C215.452 72.056 216.049 71.352 216.049 70.456C216.049 69.6027 215.665 69.0693 214.897 68.856C214.172 68.6427 213.126 68.536 211.761 68.536H209.265C207.985 68.536 207.004 68.4933 206.321 68.408C205.681 68.3653 205.126 68.28 204.657 68.152C203.889 68.92 203.505 69.688 203.505 70.456C203.505 71.48 204.081 72.2267 205.233 72.696C206.428 73.1653 207.942 73.4 209.777 73.4ZM242.923 80.184C240.491 80.184 238.251 79.9067 236.203 79.352C234.155 78.84 232.512 78.008 231.275 76.856C230.08 75.704 229.483 74.168 229.483 72.248C229.483 69.7307 230.976 67.6827 233.963 66.104V65.848C233.152 65.2507 232.448 64.5253 231.851 63.672C231.296 62.776 231.019 61.6667 231.019 60.344C231.019 59.2347 231.339 58.1467 231.979 57.08C232.661 56.0133 233.536 55.096 234.603 54.328V54.072C233.451 53.304 232.427 52.216 231.531 50.808C230.677 49.3573 230.251 47.7147 230.251 45.88C230.251 43.2347 230.891 41.0587 232.171 39.352C233.493 37.6027 235.2 36.3227 237.291 35.512C239.381 34.6587 241.6 34.232 243.947 34.232C245.824 34.232 247.531 34.488 249.067 35H260.971V42.936H256.363C256.491 43.32 256.597 43.8107 256.683 44.408C256.811 45.0053 256.875 45.624 256.875 46.264C256.875 48.7813 256.299 50.8293 255.147 52.408C253.995 53.944 252.437 55.0747 250.475 55.8C248.512 56.5253 246.336 56.888 243.947 56.888C242.88 56.888 241.685 56.7387 240.363 56.44C239.851 56.9093 239.595 57.528 239.595 58.296C239.595 58.9787 239.936 59.4693 240.619 59.768C241.344 60.0667 242.496 60.216 244.075 60.216H248.875C252.885 60.216 255.979 60.8773 258.155 62.2C260.373 63.48 261.483 65.592 261.483 68.536C261.483 70.84 260.715 72.8667 259.179 74.616C257.685 76.3653 255.552 77.7307 252.779 78.712C250.005 79.6933 246.72 80.184 242.923 80.184ZM243.947 50.36C244.971 50.36 245.803 49.9973 246.443 49.272C247.083 48.5467 247.403 47.416 247.403 45.88C247.403 44.3867 247.083 43.2987 246.443 42.616C245.803 41.8907 244.971 41.528 243.947 41.528C242.923 41.528 242.091 41.8907 241.451 42.616C240.811 43.2987 240.491 44.3867 240.491 45.88C240.491 47.416 240.811 48.5467 241.451 49.272C242.091 49.9973 242.923 50.36 243.947 50.36ZM244.715 73.4C246.507 73.4 248 73.1227 249.195 72.568C250.389 72.056 250.987 71.352 250.987 70.456C250.987 69.6027 250.603 69.0693 249.835 68.856C249.109 68.6427 248.064 68.536 246.699 68.536H244.203C242.923 68.536 241.941 68.4933 241.259 68.408C240.619 68.3653 240.064 68.28 239.595 68.152C238.827 68.92 238.443 69.688 238.443 70.456C238.443 71.48 239.019 72.2267 240.171 72.696C241.365 73.1653 242.88 73.4 244.715 73.4ZM266.212 67V35H277.22V67H266.212ZM271.716 31.032C269.924 31.032 268.473 30.5413 267.364 29.56C266.255 28.536 265.7 27.192 265.7 25.528C265.7 23.864 266.255 22.5413 267.364 21.56C268.473 20.536 269.924 20.024 271.716 20.024C273.508 20.024 274.959 20.536 276.068 21.56C277.177 22.5413 277.732 23.864 277.732 25.528C277.732 27.192 277.177 28.536 276.068 29.56C274.959 30.5413 273.508 31.032 271.716 31.032ZM284.65 67V35H293.61L294.378 38.84H294.634C295.914 37.6027 297.364 36.536 298.986 35.64C300.65 34.7013 302.57 34.232 304.746 34.232C308.287 34.232 310.826 35.4267 312.362 37.816C313.94 40.1627 314.729 43.32 314.729 47.288V67H303.722V48.696C303.722 46.648 303.466 45.2827 302.954 44.6C302.442 43.9173 301.631 43.576 300.522 43.576C299.498 43.576 298.644 43.7893 297.962 44.216C297.279 44.6427 296.511 45.24 295.658 46.008V67H284.65ZM333.548 80.184C331.116 80.184 328.876 79.9067 326.828 79.352C324.78 78.84 323.137 78.008 321.9 76.856C320.705 75.704 320.108 74.168 320.108 72.248C320.108 69.7307 321.601 67.6827 324.588 66.104V65.848C323.777 65.2507 323.073 64.5253 322.476 63.672C321.921 62.776 321.644 61.6667 321.644 60.344C321.644 59.2347 321.964 58.1467 322.604 57.08C323.286 56.0133 324.161 55.096 325.228 54.328V54.072C324.076 53.304 323.052 52.216 322.156 50.808C321.302 49.3573 320.876 47.7147 320.876 45.88C320.876 43.2347 321.516 41.0587 322.796 39.352C324.118 37.6027 325.825 36.3227 327.916 35.512C330.006 34.6587 332.225 34.232 334.572 34.232C336.449 34.232 338.156 34.488 339.692 35H351.596V42.936H346.988C347.116 43.32 347.222 43.8107 347.308 44.408C347.436 45.0053 347.5 45.624 347.5 46.264C347.5 48.7813 346.924 50.8293 345.772 52.408C344.62 53.944 343.062 55.0747 341.1 55.8C339.137 56.5253 336.961 56.888 334.572 56.888C333.505 56.888 332.31 56.7387 330.988 56.44C330.476 56.9093 330.22 57.528 330.22 58.296C330.22 58.9787 330.561 59.4693 331.244 59.768C331.969 60.0667 333.121 60.216 334.7 60.216H339.5C343.51 60.216 346.604 60.8773 348.78 62.2C350.998 63.48 352.108 65.592 352.108 68.536C352.108 70.84 351.34 72.8667 349.804 74.616C348.31 76.3653 346.177 77.7307 343.404 78.712C340.63 79.6933 337.345 80.184 333.548 80.184ZM334.572 50.36C335.596 50.36 336.428 49.9973 337.068 49.272C337.708 48.5467 338.028 47.416 338.028 45.88C338.028 44.3867 337.708 43.2987 337.068 42.616C336.428 41.8907 335.596 41.528 334.572 41.528C333.548 41.528 332.716 41.8907 332.076 42.616C331.436 43.2987 331.116 44.3867 331.116 45.88C331.116 47.416 331.436 48.5467 332.076 49.272C332.716 49.9973 333.548 50.36 334.572 50.36ZM335.34 73.4C337.132 73.4 338.625 73.1227 339.82 72.568C341.014 72.056 341.612 71.352 341.612 70.456C341.612 69.6027 341.228 69.0693 340.46 68.856C339.734 68.6427 338.689 68.536 337.324 68.536H334.828C333.548 68.536 332.566 68.4933 331.884 68.408C331.244 68.3653 330.689 68.28 330.22 68.152C329.452 68.92 329.068 69.688 329.068 70.456C329.068 71.48 329.644 72.2267 330.796 72.696C331.99 73.1653 333.505 73.4 335.34 73.4ZM370.546 67V25.4H397.682V34.616H381.554V42.552H395.378V51.768H381.554V67H370.546ZM409.72 67.768C407.714 67.768 405.986 67.32 404.536 66.424C403.128 65.4853 402.04 64.2693 401.272 62.776C400.504 61.2827 400.12 59.6613 400.12 57.912C400.12 54.584 401.485 51.9813 404.216 50.104C406.946 48.184 411.384 46.904 417.528 46.264C417.272 44.0453 415.778 42.936 413.048 42.936C411.896 42.936 410.68 43.1707 409.4 43.64C408.12 44.0667 406.648 44.728 404.984 45.624L401.144 38.456C403.405 37.0907 405.688 36.0453 407.992 35.32C410.338 34.5947 412.749 34.232 415.224 34.232C419.405 34.232 422.669 35.4267 425.016 37.816C427.362 40.1627 428.536 43.96 428.536 49.208V67H419.576L418.808 63.928H418.552C417.272 65.08 415.906 66.0187 414.456 66.744C413.048 67.4267 411.469 67.768 409.72 67.768ZM413.56 59.32C414.456 59.32 415.202 59.128 415.8 58.744C416.397 58.3173 416.973 57.784 417.528 57.144V52.792C414.925 53.176 413.112 53.752 412.088 54.52C411.106 55.288 410.616 56.12 410.616 57.016C410.616 58.552 411.597 59.32 413.56 59.32ZM450.176 67.768C447.147 67.768 444.416 67.1067 441.984 65.784C439.552 64.4613 437.611 62.5627 436.16 60.088C434.752 57.5707 434.048 54.5413 434.048 51C434.048 47.4587 434.837 44.4507 436.416 41.976C438.037 39.4587 440.149 37.5387 442.752 36.216C445.355 34.8933 448.171 34.232 451.2 34.232C453.12 34.232 454.848 34.5307 456.384 35.128C457.963 35.7253 459.349 36.536 460.544 37.56L455.424 44.6C454.741 44.0027 454.123 43.5973 453.568 43.384C453.013 43.1707 452.437 43.064 451.84 43.064C449.792 43.064 448.192 43.7893 447.04 45.24C445.888 46.648 445.312 48.568 445.312 51C445.312 53.432 445.909 55.3733 447.104 56.824C448.299 58.232 449.749 58.936 451.456 58.936C452.352 58.936 453.227 58.744 454.08 58.36C454.976 57.9333 455.808 57.4427 456.576 56.888L460.8 64.056C459.221 65.464 457.472 66.4453 455.552 67C453.632 67.512 451.84 67.768 450.176 67.768ZM478.81 67.768C475.695 67.768 472.879 67.1067 470.362 65.784C467.887 64.4613 465.925 62.5627 464.474 60.088C463.023 57.5707 462.298 54.5413 462.298 51C462.298 47.544 463.023 44.5787 464.474 42.104C465.967 39.5867 467.887 37.6453 470.234 36.28C472.581 34.9147 475.055 34.232 477.658 34.232C480.815 34.232 483.418 34.936 485.466 36.344C487.557 37.7093 489.093 39.5653 490.074 41.912C491.098 44.2587 491.61 46.8613 491.61 49.72C491.61 50.6587 491.546 51.576 491.418 52.472C491.333 53.3253 491.247 53.944 491.162 54.328H472.922C473.434 56.248 474.351 57.592 475.674 58.36C476.997 59.0853 478.554 59.448 480.346 59.448C482.394 59.448 484.57 58.808 486.874 57.528L490.458 64.056C488.751 65.2507 486.831 66.168 484.698 66.808C482.565 67.448 480.602 67.768 478.81 67.768ZM472.794 47.16H482.266C482.266 45.9227 481.946 44.856 481.306 43.96C480.709 43.0213 479.578 42.552 477.914 42.552C476.719 42.552 475.653 42.9147 474.714 43.64C473.775 44.3653 473.135 45.5387 472.794 47.16Z" fill="black"></path></svg></div></div>
</div>



<p>Besides hosting models, Civitai also showcases user generated output from the models and has a discussion and review system to help the community gauge the quality of the models.</p>



<p>Hugging Face is the GitHub of machine learning. You find more than just Stable Diffusion models on the site. There are also other resources for computer vision, natural language processing, audio, and others.</p>



<p>Civitai is dedicated to Stable Diffusion and has a better features for the community. Due to Civitai’s popularity driving its rapid growth, the site was very unstable, though it has since improved vastly. So many creators upload their files on Hugging Face as a backup.&nbsp;</p>


<h2 class="wp-block-heading" id="start-your-stable-diffusion-journey">Start your Stable Diffusion journey</h2>


<p>Stable Diffusion is a powerful tool. It is the most powerful of out the options available as I mentioned above. Of course, there are prolific AI artists who combine Midjourney and Stable Diffusion to produce amazing artwork. However, I would suggest focusing on mastering one tool first if you’re just getting started in AI art.</p>



<figure class="wp-block-image size-large"><img data-recalc-dims="1" decoding="async" width="1140" height="599" src="https://i0.wp.com/jenxi.com/wp-content/uploads/mononoke-hime-2-by-jenxi-banner.webp?resize=1140%2C599&#038;ssl=1" alt="Mononoke Hime by Jenxi Seow" class="wp-image-15465" srcset="https://i0.wp.com/jenxi.com/wp-content/uploads/mononoke-hime-2-by-jenxi-banner.webp?resize=1140%2C599&amp;ssl=1 1140w, https://i0.wp.com/jenxi.com/wp-content/uploads/mononoke-hime-2-by-jenxi-banner.webp?resize=700%2C368&amp;ssl=1 700w, https://i0.wp.com/jenxi.com/wp-content/uploads/mononoke-hime-2-by-jenxi-banner.webp?resize=768%2C403&amp;ssl=1 768w, https://i0.wp.com/jenxi.com/wp-content/uploads/mononoke-hime-2-by-jenxi-banner.webp?w=1200&amp;ssl=1 1200w" sizes="(max-width: 1000px) 100vw, 1000px" /><figcaption class="wp-element-caption"><a href="https://www.deviantart.com/jenxi/art/Mononoke-Hime-wary-976445969" data-type="link" data-id="https://www.deviantart.com/jenxi/art/Mononoke-Hime-wary-976445969" target="_blank" rel="noopener">Mononoke Hime</a> by Jenxi</figcaption></figure>



<p>I hope this overview gives a good introduction to Stable Diffusion and demystifies AI image generation. Share this article the next time someone asks, “What is Stable Diffusion?”</p>



<p>I’ll be sharing guides on how I use Stable Diffusion to generate art and answer some of the common questions I get to help you get started in computer vision. <a href="https://www.patreon.com/jenxi" target="_blank" rel="noopener">Pateron members</a> get a peek at the behind-the-scenes of how certain pieces are made.</p>



<p>Ready to stay updated with the latest developments in Stable Diffusion, AI image generation, and explore exciting computer vision techniques? <a href="https://jenxi.com/subscribe/" data-type="page" data-id="2891">Subscribe to my newsletter</a> where I share my journey and learnings.</p>



<p>Remember, with Stable Diffusion, the canvas becomes your playground, and the possibilities are limitless!</p>
]]></content:encoded>
					
					<wfw:commentRss>https://jenxi.com/what-is-stable-diffusion/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
		<post-id xmlns="com-wordpress:feed-additions:1">15451</post-id>	</item>
		<item>
		<title>How I got started in generative AI art</title>
		<link>https://jenxi.com/how-i-got-started-in-generative-ai-art/</link>
					<comments>https://jenxi.com/how-i-got-started-in-generative-ai-art/#respond</comments>
		
		<dc:creator><![CDATA[Jenxi Seow]]></dc:creator>
		<pubDate>Fri, 15 Sep 2023 09:39:32 +0000</pubDate>
				<category><![CDATA[Blog]]></category>
		<category><![CDATA[AI art]]></category>
		<category><![CDATA[Artificial intelligence]]></category>
		<guid isPermaLink="false">https://jenxi.com/?p=15442</guid>

					<description><![CDATA[Check out how I started my generative AI art journey and some of my thoughts on the controversies.]]></description>
										<content:encoded><![CDATA[
<p>If you follow my <a href="https://instagram.com/jenxi.art" data-type="link" data-id="https://instagram.com/jenxi.art" target="_blank" rel="noopener">Instagram</a> and <a href="https://facebook.com/jenxi/" data-type="link" data-id="https://facebook.com/jenxi/" target="_blank" rel="noopener">Facebook</a> accounts, you’d have seen my generative AI art pieces. </p>



<p>Many people have been asking about how I create my AI art, so I decide to share about my journey in AI art, and what I learnt along the way.</p>



<div class="wp-block-group has-background has-global-padding is-layout-constrained wp-container-core-group-is-layout-10 wp-block-group-is-layout-constrained" style="border-radius:8px;background-color:#f7f7f7;padding-top:var(--wp--preset--spacing--small);padding-right:clamp(30px, 4vw, 40px);padding-bottom:var(--wp--preset--spacing--small);padding-left:clamp(30px, 4vw, 40px)"><h2 class="simpletoc-title">Contents</h2>
<ul class="simpletoc-list">
<li><a href="#how-it-all-started-and-stopped">How it all started, and stopped</a>

</li>
<li><a href="#chatgpt-midjourney-stable-diffusion">ChatGPT, Midjourney, Stable Diffusion</a>


<ul><li>
<a href="#chatgpt">ChatGPT</a>

</li>
<li><a href="#midjourney">Midjourney</a>

</li>
<li><a href="#stable-diffusion">Stable Diffusion</a>

</li>
</ul>
<li><a href="#ai-controversies">AI Controversies</a>


<ul><li>
<a href="#abuse-and-misuse">Abuse and misuse</a>

</li>
<li><a href="#plagiarism">Plagiarism</a>

</li>
</ul>
<li><a href="#model-training">Model training</a>


<ul><li>
<a href="#ethicallytrained-models">Ethically-trained models</a>

</li>
<li><a href="#train-your-own-style">Train your own style</a>

</li>
</ul>
<li><a href="#my-generative-ai-art">My generative AI art</a>
</li></ul></div>


<h2 class="wp-block-heading" id="how-it-all-started-and-stopped">How it all started, and stopped</h2>


<p>My dad is an oil painter, and I grew up exposed to art at a young age. I started drawing at two and I’ve always had an interest in creating art. I generated my <a href="https://jenxi.com/dragons/">first computational graphics artwork</a> using Apophysis and <a href="https://www.ultrafractal.com" target="_blank" rel="noopener">Ultra Fractal</a> back in 2004, before the current crop of generative AI tools came into existence.</p>



<p>I first dabbled in generative AI art in June 2022 when I tried out <a href="https://novelai.net/" target="_blank" rel="noopener">NovelAI</a>. As someone who dabbles in creative writing and <a href="https://play.adventdestiny.com" target="_blank" rel="noopener">text role playing</a>, I was intrigued by a text-generation platform that allowed AI-assisted storytelling. However, I thought it was just a novelty and was turned off by the subscription cost and lost interest in it.</p>



<p>Despite that, NovelAI was still on my radar and when it launched the text-to-image generation feature in October 2022, I gave it a try. To have a better idea of the development of AI image generation, I also experimented with OpenAI’s <a href="https://labs.openai.com/" target="_blank" rel="noopener">DALL-E 2</a> that had just gotten rid of its waitlist, the <a href="https://www.midjourney.com/" target="_blank" rel="noopener">Midjourney beta</a> that had just launched, and <a href="https://nightcafe.studio" target="_blank" rel="noopener">NightCafe</a> that ran <a href="https://stability.ai" target="_blank" rel="noopener">Stability AI</a>’s Stable Diffusion.</p>



<p>Back then, I had no idea what prompt engineering was and the images I generated were so horrible that I didn’t save a copy of them. I wish I did so I could show them here. If you saw those outputs, you would understand why I concluded that AI image generation was not ready for the mainstream.</p>


<h2 class="wp-block-heading" id="chatgpt-midjourney-stable-diffusion">ChatGPT, Midjourney, Stable Diffusion</h2>


<p>Then, OpenAI launched ChatGPT at the end of November 2022, and it blew up in December and took the world by storm in January.</p>


<h3 class="wp-block-heading" id="chatgpt">ChatGPT</h3>


<p>Being a tech geek working on content creation, it was inevitable that I jumped on the ChatGPT bandwagon early on. I won’t go into much details on ChatGPT since it’s a separate topic. In short, besides using ChatGPT to help generate content, I was using it to brainstorm ideas, structure strategies and plans, and even wrote a couple of WordPress plugins.</p>



<p>All these were done through giving the right instructions through <a href="https://en.wikipedia.org/wiki/Prompt_engineering" target="_blank" rel="noopener">prompt engineering</a>, the art of structuring instructions to get the generative AI model to perform tasks as intended. It was frustrating initially, having to fight ChatGPT to get the desired outcome, but very rewarding once you get the hang of it.</p>



<p>It was like having an AI assistant you could rely on, when it didn’t hallucinate.</p>


<h3 class="wp-block-heading" id="midjourney">Midjourney</h3>


<p>While ChatGPT was the most talked-about thing in December 2022, becoming the fastest-growing consumer software in history by gaining over 100 million users within a month, another software was also taking the creative industry by storm – Midjourney.</p>



<p>Visual artists and content creators were creating artwork with Midjourney. Images flooded all my social media feeds. People were gushing over what Midjourney is able to generate. On the other end of the spectrum, people were also protesting as loudly about the ethical issues, which I’ll briefly touch on in a bit.</p>



<p>I gave Midjourney another go, tapping on my prompt engineering abilities. This gave results that were a lot better than what I generated half a year ago.</p>


<h3 class="wp-block-heading" id="stable-diffusion">Stable Diffusion</h3>


<p>This rekindled my interest in generative AI art. I went around trying the different cloud platforms before I decided to give running a Stable Diffusion instance a go. First, I tried the Stable Diffusion macOS apps <a href="https://drawthings.ai" target="_blank" rel="noopener">Draw Things</a> and <a href="https://diffusionbee.com" target="_blank" rel="noopener">DiffusionBee</a>, but I found them lacking in a lot of ways, especially after I studied what was possible with Stable Diffusion.</p>



<p>I managed to install <a href="https://github.com/AUTOMATIC1111/stable-diffusion-webui" target="_blank" rel="noopener">Stable Diffusion Web UI</a> on my M1 Max Macbook, and that started me down my generative AI art journey as I discovered tricks to constantly improve my image generation output. However, it <a href="https://github.com/AUTOMATIC1111/stable-diffusion-webui/discussions/5461" target="_blank" rel="noopener">isn’t optimised for the Mac</a> and produced very slow generation. I get generation speeds of around 20 seconds per iteration for a simple 512&#215;512 image using the Euler sampler. Compare this to around 5 iterations per second on my PC.</p>



<p>There’s the <a href="https://machinelearning.apple.com/research/stable-diffusion-coreml-apple-silicon" target="_blank" rel="noopener">Stable Diffusion optimisation for Core ML</a> on macOS to leverage on the Apple Neural Engine, but it doesn’t perform as well as a PC with a decent Nvidia GPU. It also required converting the Stable Diffusion models from PyTorch to Core ML. That is quite a pain especially when I have over a thousand models.</p>



<p>To speed up my generation workflow, I built a cheap PC to run Stable Diffusion Web UI. There are many of these on Taobao targeting the AIGC (AI Generative Content) market. AIGC is huge in China and continues to grow rapidly. I run the Web UI on my local network so I can work on it from a browser on my Macbook.</p>



<figure class="wp-block-image aligncenter size-full"><img data-recalc-dims="1" decoding="async" width="512" height="768" src="https://i0.wp.com/jenxi.com/wp-content/uploads/cyber-ange-comparison.webp?resize=512%2C768&#038;ssl=1" alt="Learning generative AI art" class="wp-image-15445" srcset="https://i0.wp.com/jenxi.com/wp-content/uploads/cyber-ange-comparison.webp?w=512&amp;ssl=1 512w, https://i0.wp.com/jenxi.com/wp-content/uploads/cyber-ange-comparison.webp?resize=467%2C700&amp;ssl=1 467w" sizes="(max-width: 512px) 100vw, 512px" /><figcaption class="wp-element-caption">First and second pass compared.</figcaption></figure>



<p>There are many techniques that I learnt to help me improve my generative AI art. With my prior knowledge in photography, Photoshop image manipulation, art direction, and Python, I was able to grasp the nuances of Stable Diffusion quickly and find ways to hack my workflows.</p>



<p>I plan to share as I grow and develop a Learn In Public series. If you’re interested to follow my journey, <a href="https://jenxi.com/subscribe/" data-type="link" data-id="https://jenxi.com/subscribe/">sign up for my newsletter</a>!</p>



<p>Many people have also asked about workshops and lessons. I hear you and watch this space to be the first to know if that happens.</p>


<h2 class="wp-block-heading" id="ai-controversies">AI Controversies</h2>


<p>There are many concerns over generative AI images. The two major ones I come across most often are training of AI models without artists’ consent, and the potential for abuse and misuse.</p>



<p>I held off from diving into creating generative AI art because I wanted to learn more about the ethical issues. As always, I have <a href="/about/">strong opinions, weakly held</a>. My views are constantly changing as I gain more knowledge to make a better judgement.</p>


<h3 class="wp-block-heading" id="abuse-and-misuse">Abuse and misuse</h3>


<p>The latter is the lowering of the bar to abuse by bad actors. This includes creation of graphic and sensitive content, and spreading misinformation through fake images.</p>



<p>This is not something new that generative AI introduced. Photo manipulation has been around since the early 19th century, and deepfakes have been around for a few years. Yet, little has been done to deal with such deception and hoax.</p>



<p>Generative AI makes it a lot easier to produce a convincing fake image. And you can batch produce images at scale. Something needs to be done to make viewers know that the image is AI-generated and not real. While I don’t think there is any way to stop criminals from creating harmful content, there should be at least some form of safeguard.</p>


<h3 class="wp-block-heading" id="plagiarism">Plagiarism</h3>


<p>The other major source of outrage from those who oppose AI image generation is the unauthorised use of artists’ work to train the AI models. As an artist myself, I stand against any attempt to infringe upon the copyright of a creator.</p>



<p>However, once I understood how the diffusion models work, I realised that the choice to plagiarise artwork lies in the hands of the person creating the image, not the AI model.</p>



<p>Can you choose to take an exact copy of a photo that another photographer took? You can, but it makes you look bad. Can you imitate your favourite photographer’s style in the process of finding your own style? You can, and many amateur photographers do that while we attempt to discover our own style. Likewise for painting.</p>



<p>Just because these occur doesn’t mean that we should have an outright ban or boycott of AI image generation. People take images with their smartphones. And then there are those who engage in illegal photography such as up-skirt and other nonconsensual images. Should they be allowed to take such photos? No. Do these warrant a ban on smartphone cameras? No, because it won’t solve the problem and a ban would deprive people of the ability to take legit photos.</p>


<h2 class="wp-block-heading" id="model-training">Model training</h2>


<p>If your concern is infringement of copyright, I’m sure you won’t create an image that replicates another artist’s style even if the model allows it. You can take this one step further by choosing to use models that are trained ethically, meaning they use a training set sourced from images that consent to being used for training AI or from images in the public domain.</p>


<h3 class="wp-block-heading" id="ethicallytrained-models">Ethically-trained models</h3>


<p>A good model creator documents how they trained their models. This includes how they source their training data or the models used to create merged models. By keeping this transparent, it allows others who iterate on these models to train or fine-tune new models to make an informed decision.</p>



<p>Of course, there will be those who choose to train their models using unethical or even illegal training data sources. I think that these will remain as prevalent as the piracy of software, films, and books.</p>


<h3 class="wp-block-heading" id="train-your-own-style">Train your own style</h3>


<p>Instead of viewing AI as a threat, I believe it is important to learn how to use it properly to empower yourself.</p>



<p>While the debate over the ethical issues of generative AI art continues, some artists have already jumped on the technical advantages of generative AI and started training models based on their own photography or art style. By doing so, they are then able to generate images with their signature style using AI and experiment with concepts.</p>


<h2 class="wp-block-heading" id="my-generative-ai-art">My generative AI art</h2>


<p>To see more of my AI art, check out the <a href="https://jenxi.com/art/">overview page</a> and follow me on the various platforms.</p>



<p>Check out the <a href="https://jenxi.com/mai-shiranui-ai-art/">Mai Shiranui series</a>.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://jenxi.com/how-i-got-started-in-generative-ai-art/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
		<post-id xmlns="com-wordpress:feed-additions:1">15442</post-id>	</item>
	</channel>
</rss>

<!--
Performance optimized by W3 Total Cache. Learn more: https://www.boldgrid.com/w3-total-cache/

Object Caching 108/455 objects using APC
Page Caching using Disk: Enhanced (Page is feed) 
Database Caching 2/83 queries in 0.019 seconds using Memcached

Served from: jenxi.com @ 2024-11-09 20:22:40 by W3 Total Cache
-->