#llm

10 posts · all tags

Jan 27, 2026 · H-Neurons: Hallucination at the Neuron Level

Gao et al. https://arxiv.org/abs/2512.01797 present a compelling investigation into the microscopic mechanisms of hallucination in LLMs. The central thesis is that hallucinations are not diffuse phenomena spread across millions of parameters, but are instead driven by a remarkably sparse subset of neurons -- fewer than 0.1% of the total -- which the authors term H-Neurons (Hallucination-associated Neurons).
Aug 4, 2025 · Mixture of Experts

In the pursuit of scaling neural networks to unprecedented parameter counts while maintaining computational tractability, the paradigm of conditional computation has emerged as a cornerstone of modern deep learning architectures. A prominent and highly successful incarnation of this principle is the Mixture of Experts (MoE) layer. At its core, an MoE model eschews the monolithic, dense activation of traditional networks, wherein every parameter is engaged for every input. Instead, it employs a collection of specialized subnetworks, termed experts, and dynamically selects a sparse combination of these experts to process each input token. This approach allows for a dramatic increase in model capacity without a commensurate rise in computational cost (FLOPs), as only a fraction of the network's parameters are utilized for any given forward pass.
Apr 12, 2025 · Mechanistic Interpretability - Some concepts

Here are some quick notes on concepts in Mechanistic Interpretability. The subject is vast and very recent and try to interpret features for neural networks, specifically transformers and LLM's.
Feb 6, 2025 · Group Relative Policy Optimization (GRPO)

PPO is a reinforcement learning algorithm originally designed to update policies in a stable and reliable way. In the context of LLM fine-tuning, the model (the “policy”) is trained using feedback from a reward model that represents human preferences. Value Function (Critic): Estimates the “goodness” of a state, used with Generalized Advantage Estimation (GAE) to balance bias and variance. Basically it works as follows:
Dec 28, 2024 · Deepseek, an overview and quick notes

Some notes of DeepSeek-V3
Nov 10, 2024 · Sparse Autoencoders

Sparse autoencoders are neural networks that learn compressed representations of data while enforcing sparsity - a constraint that ensures most neurons remain inactive for any given input. This approach leads to more robust and interpretable features, often capturing meaningful patterns in the data.
Sep 29, 2024 · Quantization of LLMs

The escalating complexity and scale of large language models (LLMs) have introduced substantial challenges concerning computational demands and resource allocation. These models, often comprising hundreds of billions of parameters, necessitate extensive memory and processing capabilities, making their deployment and real-time inference both costly and impractical for widespread use.
Aug 27, 2024 · Understanding and Implementing RAG (Retrieval-Augmented Generation)

Retrieval-Augmented Generation (RAG) is a powerful technique that combines the strengths of large language models with the ability to retrieve relevant information from external sources. This approach enhances the model's responses by grounding them in specific, up-to-date, or domain-specific knowledge.
Jul 21, 2024 · Encoder vs Decoder vs EncoderDecoder Architectures

Language models are a crucial component in natural language processing (NLP). The architecture of these models can be broadly categorized into three types: encoder-only, decoder-only, and encoder-decoder architectures. Each of these architectures has distinct characteristics and applications.
Mar 24, 2024 · Having fun with decoding and optimization

Hey. One topic very fascinating for me is coding theory. It can be very challenging and it can be pleasing for a more mathematical inclined person or someone like me that, likes a lot mathematics but like engineering as well. I think that the beginning of coding theory is strong related to Shannon work, A mathematical theory of communication but it can be interpreted in a very broad sense. What I mean by that is that a lot of natural phenomenum can be interpreted as an application of coding theory. For instance, you can consider the language as a coding theory application where what is done by expressing ourselfs in words is to find an optimal code for communicating thoughts. Other interesting example is what happens in natural evolution. Basically, there you can interpret the changes on environment and the DNA of species being on a communication channel where the DNA is coding the optimal way to survive on a given environment.

Jan 27, 2026 · H-Neurons: Hallucination at the Neuron Level

Aug 4, 2025 · Mixture of Experts

Apr 12, 2025 · Mechanistic Interpretability - Some concepts

Feb 6, 2025 · Group Relative Policy Optimization (GRPO)

Dec 28, 2024 · Deepseek, an overview and quick notes

Nov 10, 2024 · Sparse Autoencoders

Sep 29, 2024 · Quantization of LLMs

Aug 27, 2024 · Understanding and Implementing RAG (Retrieval-Augmented Generation)

Jul 21, 2024 · Encoder vs Decoder vs EncoderDecoder Architectures

Mar 24, 2024 · Having fun with decoding and optimization