Language Models
In the groundbreaking 2017 paper “Attention Is All You Need”, Vaswani et al. introduced Sinusoidal Position Embeddings to help Transformers encode positional information without recurrence or convolution. This elegant, non-learned scheme assigns each position a fixed pattern of sine and cosine values at geometrically spaced frequencies, giving the model a sense of order with no extra trainable parameters.
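To make the mechanism concrete, here is a minimal NumPy sketch of the formula from the paper; the function name and the example dimensions are illustrative choices, not details from the article.

```python
# Minimal sketch of sinusoidal position embeddings, following
# "Attention Is All You Need":
#   PE(pos, 2i)   = sin(pos / 10000^(2i / d_model))
#   PE(pos, 2i+1) = cos(pos / 10000^(2i / d_model))
import numpy as np

def sinusoidal_position_embeddings(seq_len: int, d_model: int) -> np.ndarray:
    """Return a (seq_len, d_model) matrix of fixed position embeddings."""
    positions = np.arange(seq_len)[:, None]           # (seq_len, 1)
    dims = np.arange(0, d_model, 2)[None, :]          # (1, d_model / 2)
    angle_rates = 1.0 / np.power(10000.0, dims / d_model)
    angles = positions * angle_rates                   # (seq_len, d_model / 2)

    pe = np.zeros((seq_len, d_model))
    pe[:, 0::2] = np.sin(angles)   # even dimensions: sine
    pe[:, 1::2] = np.cos(angles)   # odd dimensions: cosine
    return pe

# Example: embeddings for a 128-token sequence in a 512-dimensional model
pe = sinusoidal_position_embeddings(128, 512)
print(pe.shape)  # (128, 512)
```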
Self-attention, the beating heart of Transformer architectures, treats its input as an unordered set. That mathematical elegance is also a curse: without extra signals, the model has no idea which token comes first and which comes last; shuffle a sentence's words and the attention outputs simply shuffle with them.
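The point is easy to verify numerically. The sketch below (an illustration, not code from the article) builds a single attention head and checks that permuting the input rows only permutes the output rows.

```python
# Plain self-attention is permutation-equivariant: shuffling the input
# tokens merely shuffles the outputs, so order carries no signal by itself.
import numpy as np

def softmax(x, axis=-1):
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def self_attention(x, wq, wk, wv):
    """Single-head scaled dot-product self-attention over x of shape (seq, d)."""
    q, k, v = x @ wq, x @ wk, x @ wv
    scores = q @ k.T / np.sqrt(k.shape[-1])
    return softmax(scores) @ v

rng = np.random.default_rng(0)
d = 8
x = rng.normal(size=(5, d))                       # 5 "tokens"
wq, wk, wv = (rng.normal(size=(d, d)) for _ in range(3))

perm = rng.permutation(5)
out_original = self_attention(x, wq, wk, wv)
out_shuffled = self_attention(x[perm], wq, wk, wv)

# Shuffled input yields exactly the shuffled output: no notion of position.
print(np.allclose(out_shuffled, out_original[perm]))  # True
```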
In the evolving landscape of open-source language models, SmolLM3 emerges as a breakthrough: a 3 billion-parameter, decoder-only transformer that rivals larger 4 billion-parameter peers on many benchmarks, while natively supporting long contexts, dual think/no-think reasoning modes, and multilingual use.
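A hedged usage sketch follows; the repository id HuggingFaceTB/SmolLM3-3B, the chat message, and the generation settings are assumptions to check against the model card, not details from this page.

```python
# Sketch: loading SmolLM3 with Hugging Face transformers and generating
# a short completion (assumed model id and settings).
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "HuggingFaceTB/SmolLM3-3B"  # assumed repository id
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

messages = [{"role": "user", "content": "Explain what a decoder-only transformer is."}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(inputs, max_new_tokens=128)
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```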
Alibaba Cloud just released Qwen3, the latest model in the popular Qwen series. It posts competitive results against other top-tier reasoning LLMs such as DeepSeek-R1, o1, o3-mini, Grok-3, and Gemini-2.5-Pro. Unlike its predecessors, Qwen3 blends thinking and non-thinking behavior in a single model, letting users trade reasoning depth for speed on a per-request basis.
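The sketch below illustrates the idea of switching modes through the chat template; the model id Qwen/Qwen3-8B and the enable_thinking flag reflect my reading of the Qwen3 model cards and should be verified there.

```python
# Sketch of Qwen3's hybrid thinking switch (assumed model id and flag).
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("Qwen/Qwen3-8B")
messages = [{"role": "user", "content": "Is 9.11 larger than 9.9?"}]

# Thinking mode: the template renders the prompt for a <think>...</think> trace.
thinking_prompt = tokenizer.apply_chat_template(
    messages, tokenize=False, add_generation_prompt=True, enable_thinking=True
)

# Non-thinking mode: the same conversation, rendered for a direct answer.
direct_prompt = tokenizer.apply_chat_template(
    messages, tokenize=False, add_generation_prompt=True, enable_thinking=False
)

print(thinking_prompt != direct_prompt)  # the rendered prompts differ
```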
As artificial intelligence continues to advance, Embedding Models have become fundamental to how machines interpret and interact with unstructured data. By translating inputs like text, images, audio, and video into dense numerical vectors, they let systems compare content by meaning rather than by exact keywords, which underpins semantic search, recommendation, clustering, and retrieval-augmented generation.
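As a hedged illustration, the snippet below embeds a few sentences with the sentence-transformers library and compares them by cosine similarity; the model name all-MiniLM-L6-v2 is an assumed example, not one named in the article.

```python
# Sketch: text embeddings and cosine similarity with sentence-transformers.
from sentence_transformers import SentenceTransformer
import numpy as np

model = SentenceTransformer("all-MiniLM-L6-v2")  # assumed example model

sentences = [
    "How do I reset my password?",
    "I forgot my login credentials.",
    "The weather is sunny today.",
]
embeddings = model.encode(sentences, normalize_embeddings=True)

# With unit-normalized vectors, the dot product is cosine similarity:
# the two account-related sentences score higher with each other than
# either does with the weather sentence.
similarities = embeddings @ embeddings.T
print(np.round(similarities, 2))
```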