Stable Diffusion 3.5, released on June 2024 by Stability AI, is the third iteration in the Stable Diffusion family. The Turbo-Large and Large variants of the SD3.5 family are Stability AI’s most ...
Handwritten Text Recognition using OCR
Handwritten text documents are ubiquitous in the field of research and study. They are personalized to the user’s needs and often contain a style of writing difficult to comprehend by others. This ...
Fine Tuning Whisper on Custom Dataset
Whisper is a leading open-source model used for converting speech to text. Developed by OpenAI, Whisper has been trained on a diverse array of languages and speech conditions using extensive data. ...
SDXL Inpainting: Fusing Image Inpainting with Stable Diffusion
Suppose you have an old photo of your childhood with your parents which is close to your heart. Unfortunately, some parts of it have become damaged or corrupted over time. But what if I tell you that ...
Retrieval Augmented Generation – RAG with LLMs
In today's information age, we're constantly bombarded with questions. Whether it's researching a historical event, troubleshooting a tech issue, or simply satisfying our curiosity, finding the right ...
Dreambooth using Diffusers
Ever wanted to generate different images of your pet but in different settings and locations? Dreambooth training using the Diffuser library can help you achieve this. In the evolving landscape of ...