The Geometry of Concepts: Sparse Autoencoder Feature Structure

Yuxiao Li, Eric J. Michaud, David D. Baek, Joshua Engels, Xiaoqing Sun, Max Tegmark

Sparse autoencoders have recently produced dictionaries of high-dimensional vectors corresponding to the universe of concepts represented by large language models. We find that this concept universe has interesting structure at three levels: 1) The “atomic” small-scale structure contains “crystals” whose faces are parallelograms or trapezoids, generalizing well-known examples such as (man-woman-king-queen). We find that the quality of such parallelograms and associated function vectors improves greatly when projecting out global distractor directions such as word length, which is efficiently done with linear discriminant analysis. 2) The “brain” intermediate-scale structure has significant spatial modularity; for example, math and code features form a “lobe” akin to functional lobes seen in neural fMRI images. We quantify the spatial locality of these lobes with multiple metrics and find that clusters of co-occurring features, at coarse enough scale, also cluster together spatially far more than one would expect if feature geometry were random. 3) The “galaxy” large-scale structure of the feature point cloud is not isotropic, but instead has a power law of eigenvalues with steepest slope in middle layers. We also quantify how the clustering entropy depends on the layer.
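To make the first point concrete, here is a minimal sketch of why removing a distractor direction can sharpen a parallelogram such as (man-woman-king-queen). It is not the authors' pipeline: the abstract says distractor directions are found with linear discriminant analysis, whereas this toy example assumes the distractor direction is already known. All vectors are synthetic, and the names `project_out`, `parallelogram_error`, `gender`, `royalty`, and `distractor` are hypothetical, chosen only for illustration.

```python
import numpy as np

def project_out(vectors, direction):
    """Remove the component of each row vector along `direction`."""
    unit = direction / np.linalg.norm(direction)
    return vectors - np.outer(vectors @ unit, unit)

def parallelogram_error(a, b, c, d):
    """Relative mismatch between the difference vectors (b - a) and (d - c).

    A value of 0 means the four points form a perfect parallelogram.
    """
    diff1, diff2 = b - a, d - c
    scale = 0.5 * (np.linalg.norm(diff1) + np.linalg.norm(diff2))
    return np.linalg.norm(diff1 - diff2) / scale

# Synthetic data (an assumption, not real SAE features).
rng = np.random.default_rng(0)
dim = 16
base = rng.normal(size=dim)
gender = rng.normal(size=dim)      # hypothetical "man -> woman" offset
royalty = rng.normal(size=dim)     # hypothetical "man -> king" offset
distractor = rng.normal(size=dim)  # stand-in for a global direction such as word length

# A clean parallelogram: man, woman, king, queen.
clean = np.stack([base, base + gender, base + royalty, base + royalty + gender])

# Each point picks up a different amount of the distractor direction.
amounts = np.array([0.0, 3.0, -2.0, -1.0])
noisy = clean + np.outer(amounts, distractor)

err_before = parallelogram_error(*noisy)
err_after = parallelogram_error(*project_out(noisy, distractor))
print(f"error before projection: {err_before:.3f}")
print(f"error after projection:  {err_after:.3e}")
```

In this toy setup the error after projection drops to floating-point noise because the distractor is the only thing distorting the parallelogram. With real features the direction would have to be estimated, and the improvement would be partial.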
