Blog posts

2024

Probes and Cons: A Deep Dive into the Latent Geometry of Functional Triggers in Language Models

7 minute read

Published: December 15, 2024

An investigation into how language models internally represent abstract functions that can be invoked by diverse, semantically equivalent prompts. This work reveals surprising findings about the geometry of learned concepts in LLMs and the unexpected effects of supervised fine-tuning.

Unraveling the Mystery of Superposition in Large Language Models

3 minute read

Published: September 23, 2024

In the rapidly evolving field of artificial intelligence, Large Language Models (LLMs) like GPT, BERT, and their successors have emerged as powerful tools capable of understanding and generating human-like text. While these models demonstrate remarkable abilities, their inner workings remain largely opaque. One intriguing theory attempting to explain the efficiency and capability of LLMs is the concept of “superposition”, a term borrowed from quantum physics to describe how these models might represent information internally.

Vedant Gaur
