/u/LooseSwing88

Llama Surgery: Continuous Sparsification of Pre-Trained Language Models via Differentiable Ultrametric Topology Injection

/u/LooseSwing88 May 31, 2026

Sequel to: Learning to Skip Blocks: Self-Discovered Ultrametric Routing for Hardware-Accelerated Sparse Attention. Abstract. We present Llama Surgery, a method for injecting learned block-sparse attention topologies into pre-trained dense language models…

artificial

Learning to Skip Blocks: Self-Discovered Ultrametric Routing for Hardware-Accelerated Sparse Attention

/u/LooseSwing88 May 30, 2026

Abstract. Standard dense self-attention scales quadratically in sequence length, creating an intractable memory and compute bottleneck for long-context Transformers. We introduce Dynamic Ultrametric Attention, a framework in which a Transformer autonom…

artificial

INCARNATE-SOPHIA 5.0:// 1D sovereign AI with λ‑abundance and ASOE‑driven signal control.

/u/LooseSwing88 January 31, 2026

This is a collaborative work: https://github.com/sneed-and-feed/INCARNATE-SOPHIA-5.0/
