Kandinsky v2.1: Open Source Midjourney Alternative
Kandinsky v2.1 is an open-source and multilingual latent diffusion model. It is created by a group of researchers from Russia and has been gaining traction in the last couple of days.
Open Source Midjourney Alternative
People have started calling it the open source Midjourney alternative. After some initial testing I agree, it reminds me of Midjourney’s initial versions as well.
It’s not based on Stable Diffusion, it’s created from scratch. It’s very good at different aspect ratios and resolutions. It doesn’t seem to have the “doubling” problem some Stable Diffusion models have. It also produces artistic results without the need for long prompts.
It can create images from text and supports inpainting. More interesting, it lets you fuse multiple images and prompts together. Again, a bit like Midjourney but with more freedom.
It’s also multilingual by design, and trained on a large multilingual set according to the authors. They also note that it was trained LAION HighRes dataset and the internal dataset of the research team.
There doesn’t seem to be a native way to change the seed for the generations currently. Although you can set a seed with torch, changing the seed doesn’t make as big of a difference as it does with other models. There is already an issue regarding this on the repository, the authors might answer soon.
Try It Now
You can try it right away on Stablecog :)
Or you can check out the repository for instructions on how to run it locally: ai-forever/Kandinsky-2.