
The Concepts tab is where you tell OneTrainer where your inputs are. Concepts can be your training data, regularization data, or any other data you want to train on.

Concepts Tab UI Overview


The Concepts tab is made up of the following elements:

  • Dropdown menu (default: concepts) - You can set up multiple configs for your concepts, and the dropdown is how you select them. OneTrainer only trains from the currently selected config.
  • Add config - Pressing this button will bring up a UI element asking for a name for the new concepts config.
  • Add concept - Pressing this button will create a new, blank default concept in the current config.
  • Delete concept (red X) - Pressing this button on a concept will delete it from the config.
  • Duplicate concept (green plus) - Pressing this button will duplicate the concept, including all settings.
  • Enable concept (toggle, default: on) - This sliding toggle tells OneTrainer whether or not to train using this concept. The toggle is blue when enabled.
  • Edit concept - Clicking on any part of the concept image, outside of the buttons and toggles, will open the concept settings window. This is a separate window where the rest of the concept settings can be modified, and it is covered in the next part of this wiki section.

Concepts Settings - General


The General tab of the concept settings covers basic info, balancing, and caching settings.

  • Name (Default: Blank) - This field allows you to enter a name for your concept. If you do not choose a name, it will default to the folder name when you close the window.
  • Enabled (Default: True) - A toggle that mirrors the toggle on the tab itself. It controls whether or not OneTrainer will train on this data.
  • Path (Default: Blank) - This field allows you to type or paste the path to the location of your concept images. You can also use the button next to the field (...) to browse for the folder instead.
  • Prompt Source (Default: from text file per sample) - This dropdown has three options:
  1. From text file per sample - 0001.jpg will use the 0001.txt file as its prompt. Note that you can add multiple captions, one per line, in the txt file; a random line will be chosen for each epoch (see the sketch after this list).
  2. From single text file - the text file given in the field to the right of the dropdown will be used for all images.
  3. From image file name - tag1 tag2 tag3.jpg will use "tag1 tag2 tag3" as the prompt.
  • Include Subdirectories (Default: False) - This toggle lets you organize your images in subdirectories for convenience, while OneTrainer treats them all as one concept internally.
  • Image Variations (Default: 1) - This field controls how many variations of each image will be cached using the settings from the image augmentation tab. It is required when using image augmentations together with latent caching. There is no single best number, but keep in mind that it multiplies the number of images cached.
  • Text Variations (Default: 1) - This field controls how many prompt variations will be cached for each image. Note: if you are training the text encoder (or embeddings/additional embeddings), prompts are not cached and this setting does not need to be changed.
  • Balancing (Default: 1 Repeats) - These two fields control the balancing for the concept. Balancing allows you to, as the name suggests, balance one concept against the others. One use case is regularization images: if you have 100 source images but 10,000 reg images, you can use balancing to train on only a fraction of the reg images every epoch. There are two ways to use this setting (see the sketch after this list):
    • Repeats - The number of source images times this value will be used every epoch. For example, if you have 10,000 images and use 0.01, 100 images will be used every epoch.
    • Sample - Explicitly tells OneTrainer how many images to use each epoch. For example, using 100 samples will always use 100 images per epoch, whether you have 20 images or 10,000.
  • Loss Weight (Default: 1) - Another technique to balance your inputs. One use case is to set a value less than 1 for reg images if you find they are affecting the training run more than you would like.
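
To make the multi-line caption and balancing rules concrete, here is a minimal Python sketch of both behaviors. This is illustrative only, not OneTrainer's actual code; the function names are made up.

```python
import random

def pick_caption(caption_file_text):
    # "From text file per sample" with a multi-line .txt file: one caption
    # per line, and a random line is chosen each epoch.
    lines = [ln.strip() for ln in caption_file_text.splitlines() if ln.strip()]
    return random.choice(lines)

def images_per_epoch(num_images, repeats=None, samples=None):
    # Balancing: "Repeats" multiplies the image count, while "Samples" is
    # an exact per-epoch count regardless of dataset size.
    if samples is not None:
        return samples
    return round(num_images * repeats)

# 10,000 reg images with 0.01 repeats -> 100 images used per epoch
assert images_per_epoch(10_000, repeats=0.01) == 100
# Samples is absolute: 100 per epoch whether you have 20 or 10,000 images
assert images_per_epoch(20, samples=100) == 100
```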

Image Augmentation Tab


This tab focuses on image augmentation to help diversify your image set, which becomes more important as your dataset gets smaller. Most image augmentation options can be either random or fixed: random chooses a value up to the number entered, while fixed always uses exactly the number entered (see the sketch below). It is important to note that using image augmentations requires either caching every epoch (turning latent caching off) or using image variations. For small datasets this is not costly, but the more images you have, the more time image augmentations will take.
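
As a rough sketch of the random vs. fixed distinction, assuming a symmetric range for directional augmentations such as rotation (illustrative only, not OneTrainer's actual code):

```python
import random

def augmentation_value(strength, fixed):
    # Fixed always applies exactly the entered value; random draws a value
    # up to that cap, in either direction for augmentations like rotation.
    if fixed:
        return strength
    return random.uniform(-strength, strength)

print(augmentation_value(15.0, fixed=False))  # e.g. -7.3 degrees
print(augmentation_value(15.0, fixed=True))   # always 15.0 degrees
```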

  • Update Preview - Pressing this button gives you an idea of what your augmentations are doing. When using random augmentations, pressing the button multiple times gives a better sense of the range of possible results.
  • Crop Jitter (Default: On) - OneTrainer will try to pick the closest standard resolution bucket for your image. If OneTrainer needs to crop your image and this option is selected, it will perform a random, non-center crop so the crop differs between epochs.
  • Random Flip (Default: On) - The image will be flipped, mirrored about the vertical midpoint. Can be fixed (always flipped) or random.
  • Random Rotation (Default: Off - 0) - The image will be rotated. When random, it rotates in either direction by up to the specified number of degrees; when fixed, it always rotates by exactly that number of degrees.
  • Random Brightness (Default: Off - 0) - The brightness of the image will be changed. When random, it changes up or down with the specified value as a cap; when fixed, it changes by exactly the specified value.
  • Random Contrast (Default: Off - 0) - The contrast of the image will be changed. When random, it changes up or down with the specified value as a cap; when fixed, it changes by exactly the specified value.
  • Random Saturation (Default: Off - 0) - The saturation of the image will be changed. When random, it changes up or down with the specified value as a cap; when fixed, it changes by exactly the specified value.
  • Random Hue (Default: Off - 0) - The hue (color) of the image will be changed. When random, it changes up or down with the specified value as a cap; when fixed, it changes by exactly the specified value.
  • Resolution Override (Default: Off - 512) - This feature can be used to override the training resolution of the concept. When disabled, OneTrainer rescales the images to your training resolution(s).
    • When enabled, it can be used for two purposes:
      • With multi-resolution training (several training resolutions separated by commas), it will use the concept's images at their matching resolution.
      • To prevent image upscaling: you can train at 1024 (the target resolution) with 512 or 256 images that won't get upscaled; training will be done at 512 or 256. This can help with low-quality images.

Text Augmentation Tab


Text augmentations apply changes to the captions associated with each image. A "caption" is the full text associated with an image, as defined in the "Prompt Source" option on the first tab. Each caption is split into "tags" separated by a user-defined Delimiter, which is usually a comma. A new variation is generated for each image on each new epoch, so if you want to take full advantage of text variation, set up your training with more epochs and fewer repeats/samples per concept. In general, text variations can help training learn concepts without overfitting to specific groups or sequences of words, and they make prompting more flexible. Depending on the model being trained, it may respond better to "tag-based" captions or may prefer "natural language" captions; in the latter case, text augmentations may not be as beneficial.

Tag Shuffling

This text augmentation will randomize the order of the tags within a caption. Tags near the start of a caption are generally interpreted as more important, so shuffling can help avoid that effect if not intended. It will also reduce the chance that the concept will become too closely tied to the specific order of the tags in the caption.

Keep Tag Count will specify a number of tags to always keep at the front of the caption. If training a LoRA on a specific concept, it's a good idea to keep that concept's name (aka the "trigger word") at the front so that training focuses on it more closely.
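
A minimal sketch of shuffling with Keep Tag Count (illustrative only; not OneTrainer's implementation):

```python
import random

def shuffle_tags(caption, keep_tag_count=1, delimiter=","):
    # Split the caption into tags, pin the first `keep_tag_count` tags
    # (e.g. a trigger word) at the front, and shuffle the rest.
    tags = [t.strip() for t in caption.split(delimiter)]
    kept, rest = tags[:keep_tag_count], tags[keep_tag_count:]
    random.shuffle(rest)
    return (delimiter + " ").join(kept + rest)

# "trigger" always stays first; the remaining tags are reordered
print(shuffle_tags("trigger, red hair, smiling, outdoors"))
```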

Tag Dropout

"Tag Dropout" will randomly remove some tags from the caption. This can help the intended concept work better in short prompts without including everything from the training captions, and may prevent it from picking up too strongly on unintended aspects. However if too many tags are dropped, training may be unable to separate concepts as easily or it may form unintended associations.

The same Keep Tag Count and Delimiter values apply to dropout as to the shuffling augmentation. Dropout is applied before shuffling.

The Probability of dropout being applied can be specified between 0 and 1. There are three different Dropout Modes that control the method used to drop tags (sketched in code after this list):

  • Full will randomly either drop all tags at once with the defined Probability (except the ones preserved by Keep Tag Count) or leave the caption untouched.
    • For example, with caption "a, b, c, d, e" and Probability set to 0.5, it would have a 50% chance to train with the caption "a" and a 50% chance to train with the full caption "a, b, c, d, e".
  • Random will go through each tag in the caption one by one, and choose to drop or keep them individually with the defined Probability.
  • Random Weighted is similar to Random, but has a reduced chance to drop tags near the start of the caption, scaled linearly by position/caption length and reaching the full Probability only at the end. This can be useful if your captions have the more "important" tags at the front, making those less likely to be removed while still allowing more variation in the "less important" tags at the end.
    • For example, with a caption "a, b, c, d, e" with Probability set to 0.5, in Random mode each tag from "a" to "e" would have a 50% chance of being dropped. With Random Weighted, it would only have a 10% chance to drop "a", 20% for "b", 30% for "c", 40% for "d", and the full 50% for "e".
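
Here is a minimal Python sketch of the three modes as described above (illustrative only; not OneTrainer's actual code):

```python
import random

def drop_tags(tags, probability, mode, keep_tag_count=0):
    kept, candidates = tags[:keep_tag_count], tags[keep_tag_count:]
    if mode == "full":
        # One roll decides the whole caption: drop everything or nothing.
        if random.random() < probability:
            candidates = []
    elif mode == "random":
        # Independent roll per tag, all at the same probability.
        candidates = [t for t in candidates if random.random() >= probability]
    elif mode == "random weighted":
        # The drop chance ramps up linearly from the first tag to the
        # last, reaching the full probability only at the end.
        n = max(len(candidates), 1)
        candidates = [t for i, t in enumerate(candidates)
                      if random.random() >= probability * (i + 1) / n]
    return kept + candidates

# With probability 0.5, "a" has a 10% drop chance and "e" a 50% chance
print(drop_tags(["a", "b", "c", "d", "e"], 0.5, "random weighted"))
```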

In addition, you can specify a list of Special Tags which will behave differently than the rest. This can either be a comma-separated list in the input field, or a file path to a .txt or .csv file with each element separated by newlines. The Special Tags can be set to act as a Whitelist or Blacklist - as a Whitelist, the specified tags will always be kept in the caption, and anything else will be subject to the overall dropping rules. As a Blacklist, only the specified tags will potentially be dropped, and all others will be kept in the caption. With None, the special tag list is ignored, and all tags are susceptible to being dropped.
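
A sketch of how the three special-tag modes could partition a caption before the dropout rules run, based on the description above (assumed behavior; names are made up):

```python
def split_special_tags(tags, special, mode):
    # Returns (always_kept, droppable); only the droppable tags are then
    # passed through the dropout logic.
    if mode == "whitelist":
        # Listed tags are always kept; everything else may be dropped.
        return ([t for t in tags if t in special],
                [t for t in tags if t not in special])
    if mode == "blacklist":
        # Only listed tags may be dropped; everything else is kept.
        return ([t for t in tags if t not in special],
                [t for t in tags if t in special])
    # "none": the special list is ignored, all tags may be dropped
    return [], list(tags)

print(split_special_tags(["trigger", "red hair", "smiling"],
                         {"trigger"}, "whitelist"))
# -> (['trigger'], ['red hair', 'smiling'])
```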

The Special Tags list also allows matching based on regular expressions if Special Tags Regex is enabled. You can find full documentation on how they work at https://docs.python.org/3/library/re.html, but in short, one expression can match a broad range of tags. Some examples:

  • "photo.*" will match any tag starting with the word "photo", so it would match with "photo", "photograph", "photon", but would not match with "telephotography" - you would need to use ".*photo.*" to match that.
  • "\d.*" would match any tag starting with a decimal digit, such as "1girl", "4kidz", "2001 a Space Odyssey", etc.
  • "d.{1}g" will match "dog", "dig", "dug", but not "drag" or "domestic dog"

Note that if any tags in the special list contain characters that have special meaning in regex (anything in ".^$*+?{}[]|()\"), they are likely to be interpreted incorrectly and may fail to match. You can insert a backslash "\" in front of them to force regex to interpret them literally (e.g. instead of "Panic! at the Disco holding $100 bill" use "Panic! at the Disco holding \$100 bill"). OneTrainer already includes a find/replace rule to handle tags containing the "\(" and "\)" syntax that is common in many booru-style tags and autotagging models (such as in "watercolor \(artwork\)").
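
The regex examples above can be verified with Python's re module. Whole-tag matching with re.fullmatch is assumed here, since the examples imply a pattern must cover the entire tag:

```python
import re

assert re.fullmatch(r"photo.*", "photograph")
assert not re.fullmatch(r"photo.*", "telephotography")
assert re.fullmatch(r".*photo.*", "telephotography")
assert re.fullmatch(r"\d.*", "2001 a Space Odyssey")
assert re.fullmatch(r"d.{1}g", "dig")
assert not re.fullmatch(r"d.{1}g", "domestic dog")

# re.escape() produces a pattern that matches a literal tag even when it
# contains regex metacharacters:
pattern = re.escape("watercolor (artwork)")
assert re.fullmatch(pattern, "watercolor (artwork)")
```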

Randomize Capitalization

This will apply variations to the capitalization of tags within a caption. Each individual tag has a chance of a different variation being applied based on the defined Probability. If Force Lowercase is enabled, the entire caption will be converted to all lowercase before any other changes are applied (this can also be used on its own without any further capitalization variations).

The following types of capitalization variation can be applied. You can specify which ones to apply as a list in the Capitalization Mode input field separated by commas, such as "capslock, title, first", and a random method will be picked for each tag it modifies:

  • capslock: ALL CAPITAL LETTERS
  • title: First Letter Of Every Word Capitalized
  • first: Only the first word capitalized (equivalent to "title" for single-word tags)
  • random: ApPLiEs cAPs RanDOmLy tO EacH lETtEr
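
A sketch of the four modes (illustrative only; not OneTrainer's actual code):

```python
import random

def randomize_capitalization(tag, modes):
    # A random mode from the configured list is applied to the tag.
    mode = random.choice(modes)
    if mode == "capslock":
        return tag.upper()
    if mode == "title":
        return tag.title()
    if mode == "first":
        return tag.capitalize()
    if mode == "random":
        return "".join(random.choice((c.upper(), c.lower())) for c in tag)
    return tag

print(randomize_capitalization("red hair", ["capslock", "title", "first"]))
```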

Original Concepts Wiki Data

This is the place to put your dataset.

For a single run, you can simply add concepts and edit them with a link to your dataset (with or without captions), then select the prompt source ("from single text file" if you don't use any captions, "from text file per sample" if you set up text files for the captions, or "from image file name" if you put the caption in the image name). When working with multiple runs and subjects to train, you can create a configuration (config) and add concept(s) to it. You'll be able to call it up again later.

Note that for embeddings and additional embeddings, you need to include the embedding placeholder (default: <embedding>) in the caption, either in the image caption or in the single text file; it can be <embedding> or any other word(s) you want. Yes, it can be several words separated by spaces, so with a caption like "Blurry photograph of a man wearing a military suit", you could use "Blurry photograph", "man", and "military suit" as placeholders for three embeddings.

Concept options

You can use the default settings or enable image variations to enhance the training quality.

Each concept has four options:

  • Image variations: Specifies the number of differently cached image versions of a concept.
  • Text variations: The same for the text. This only applies if the text encoder is not trained.
  • Balancing: With this option you can adjust the number of images of a concept that are included in each epoch. Either use Repeats as a multiplier of the concept (it can be less than 1) or Samples to specify the exact number of images used per epoch (it can be higher than the number of images in the concept).
  • Loss Weight: This is a multiplier for the loss for this concept.

Here is an example:

  • Concept A: 10 images, 10 image variations, 5.0 repeats
  • Concept B: 100 images, 2 image variations, 1.0 repeats
  • Concept C: 5000 images, 1 image variation, 0.1 repeats

Concept A will be cached 10 times, concept B twice, and concept C only once. This is similar to the old latent caching epochs setting, which has been removed, but it is adjustable for each concept. Each epoch is trained on 50 images from concept A, 100 images from concept B, and 500 images from concept C.

Some additional remarks:

  • Resolution override for each concept. You can set the resolution individually for each concept.
  • Fixed image augmentations: instead of using random numbers, you can specify fixed values for image augmentations.
  • Enabling or disabling of concepts. Each concept can be disabled individually.
  • You can now add single concepts to the training run without caching the entire dataset again. Only the new concept will be cached.
  • Loss weight for each concept. This can be useful for things like regularization images that should not be trained as regular training images.
  • Text Variations can come either from tag shuffling (see that option) or from multi-line caption files. As noted, this only takes effect when the text encoder is not trained.