curious: wdym by "getting separators right when generating multiple files in a single inference call"
context: created hypertokens, an even more robust hashing mechanism for context-addressable memory (CAM); one cheat code is making them prefix-free, plus lots of others that get deep into why models work the way they do, etc.
we dug into those sorts of questions with hypertokens, a robust hash for lines, code, tables/rows or any in-context token tagging to give models photographic memory
one mechanism we establish is that each model has a fidelity window, i.e., r tokens of content per s tag tokens; each tag token adds extra GUID-like marker capacity via its embedding vector; since 1-, 2-, and 3-digit numbers are only one token in top models, a single hash token lacks enough capacity & separation in latent space
we also show the hash should be properly prefix-free, i.e., unique symbols per digit position, e.g., if hashing with A-K for the first digit and L-Z for the second, then A,R is a legal hash whereas M,C is not
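to make the prefix-free idea concrete, here's a minimal sketch (alphabets and two-symbol width are illustrative, not the paper's actual scheme): each digit position draws from its own disjoint alphabet, so no tag is a prefix of another and any single symbol reveals its position.

```python
# Sketch of a prefix-free tag: position 0 and position 1 use disjoint
# alphabets, so A,R is a legal tag while M,C is not (M belongs to
# position 1). Alphabets here are illustrative only.
FIRST = "ABCDEFGHIJK"       # symbols allowed in position 0
SECOND = "LMNOPQRSTUVWXYZ"  # symbols allowed in position 1

def encode(n: int) -> str:
    """Map a line/row number to a two-symbol prefix-free tag."""
    hi, lo = divmod(n, len(SECOND))
    if hi >= len(FIRST):
        raise ValueError("number exceeds tag capacity")
    return FIRST[hi] + SECOND[lo]

def is_valid(tag: str) -> bool:
    """A tag is legal only if each symbol comes from its position's alphabet."""
    return len(tag) == 2 and tag[0] in FIRST and tag[1] in SECOND

print(encode(0))                       # first tag, "AL"
print(is_valid("AR"), is_valid("MC"))  # True False
```

because the alphabets are disjoint, a symbol can never be confused across positions, which is the separation-in-latent-space property the thread describes.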
we can do all this & more rather precisely, as we show in our arXiv paper on the same; the next update goes deeper into group theory, info theory, etc. on boosting model recall, reasoning, tool calls, etc. by way of robust hashing
anecdotally, it seems to help find better places for code to sit, understand the nuances of a code base better, and do a better job of avoiding duplicate functionality.
it's still very much a work in progress; the thing I'm struggling with most right now is getting claude to even use the capability without being directly told to.
there seem to be benefits to the native stack (which lists files and then hopes for the best) relative to this sometimes. Frankly, it seems to be better at understanding the file structure. Where this approach really shines is in understanding the code base itself.
one approach that can work is to tell the model to load a read skill and/or call a shell script that overrides the default; there are a variety of ways to attempt this with any harness. claude specifically has hooks, some of which allow go, no-go, do-this-instead, etc. and ya, agree on grokking the code base; AST integration feels like the natural next step
Interesting! Been building a space-time coordinate system for AI models. Notionally agree in principle w.r.t. convex hulls, clocks, etc., since we invoke similar machinery, albeit in tokenized models. Need to read this work more deeply to grok it.
One question is to what extent you dig into or have considered oversampling. One of the core hypotheses we've converged on is that nearly all models are optimized for source coding vs. channel coding. The implication is that the path to AGI likely involves oversampling to capture channel coding gains, which will resolve phase errors, etc.
Random sampling naturally does this, albeit inefficiently. Curious if you do something more structured than random in terms of oversampling, especially partially overlapped samples; think supersaturated subspaces / subchannels, etc.
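to pin down the contrast being asked about, a tiny sketch (illustrative, not from either party's actual system) of random oversampling vs. structured oversampling with partially overlapped windows over a token stream:

```python
import random

def random_samples(stream, k, n):
    """Unstructured oversampling: k windows of length n at random offsets."""
    starts = [random.randrange(len(stream) - n + 1) for _ in range(k)]
    return [stream[s:s + n] for s in starts]

def overlapped_samples(stream, n, stride):
    """Structured oversampling: windows of length n advanced by stride < n,
    so consecutive samples deliberately share n - stride tokens."""
    return [stream[s:s + n] for s in range(0, len(stream) - n + 1, stride)]

tokens = list(range(12))
wins = overlapped_samples(tokens, n=4, stride=2)
# consecutive windows overlap by 2 tokens: [0,1,2,3], [2,3,4,5], ...
print(wins)
```

the structured variant guarantees every token is covered a fixed number of times, which is the redundancy a channel-coding view wants; random sampling only achieves that coverage in expectation.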
Thank you for the profound insight. I completely agree that the path to AGI lies in channel coding (robustness and synchronization) rather than just source coding (compression). In CSCT, we don't just "sample" data; we process it as a continuous Projected Dynamical System. Here is how we address your points:
Structured Temporal Oversampling: Our stream-based approach effectively performs high-density oversampling in the time domain. Instead of random sampling, the theta-phase (hippocampal rhythm) in our MultiGate architecture creates structured, overlapping "integration windows" to capture temporal context.
Phase Error Resolution: Phase errors are resolved not by averaging (as in L2 models), but by NMDA-gating. The gate only opens when the anchor velocity and theta-phase align, physically "locking" the signal to a specific codebook vertex. This is a computational implementation of theta-gamma coupling.
Supersaturated Subspaces: Our Simplex constraint (L1) naturally handles what you call "supersaturated subspaces" by enforcing non-negative competition. This ensures that even with overlapping temporal samples, the resulting internal representation remains discrete and grounded within the convex hull.
By treating cognition as a communication channel between an "Anchor" and "Codebook," we prioritize the stability of the compositional mapping over the mere efficiency of representation.
ya, the brain is just a noisy channel, in the same way we can treat LLMs; anything possible already exists, we are just sampling it, which distills to "mere" clock syncing
L1 & L2 constraints unwind that clock compression with suitable dilation; it's very easy to think only efficiency matters and to ignore the averaged-out replicas. nature does that inherently via primes; we have to create those artificial waves, recreate that convex hull, etc.
all to say, great to see more work in this direction & perhaps we can compare notes sometime!
interesting. is the idea that the agent calls it, or is it just an alternative to terminal/bash/etc. tool calls? as in: your tool calls all run across microVMs, containers, iso-shells, raw term, clawd/molt, all credentials, with weaker and weaker security demarcs?
my ideal scenario is a cloud web model getting access to a sandbox to run commands and read/write to files. but yeah it could be used as an alternative to bash and read write tools.
I did not get your second question exactly, but yeah, microVMs can be considered one of the more secure ways to run your agent
Basically, just thinking that it's more ideal to have the tool call the microVM versus the agent doing it, in the sense that it's mandated by the tool call
security matters if you want to demarc where agents can play. running the agent inside a strong VM is usually where it starts; a container isn't enough for that full isolation, where the agent only sees the files you want it to, etc.
we've considered docker and firecracker; will add smol to the working roster
context: building something with QEMU
* required: has to support LMW+AI (linux/mac/windows + android/ios)
there are scenarios in which we might spin microVMs inside that main VM, which by default is almost always a Debian Linux distro.
one scenario is, say, an ETL vm and an AI vm isolated for various things
curious why you're building another microVM, other than the sheer joy of building: what smol does better or differently, why use smol, etc. (reasons to avoid microVMs etc. also fair game :)
really great! adjacent: some well-done ASCII art using Braille blocks on X this week:
nolen: "unicode braille characters are 2x4 rectangles of dots that can be individually set. That's 8x the pixels you normally get in the terminal! anyway here's a proof of concept terminal SVG renderer using unicode braille", https://x.com/itseieio/status/2011101813647556902
ashfn: "@itseieio You can use 'persistence of vision' to individually address each of the 8 dots with their own color if you want, there's some messy code of an example here", https://x.com/ashfncom/status/2011135962970218736
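the trick in those posts can be sketched in a few lines (a minimal illustration, not the linked code): each character in the Unicode braille block U+2800-U+28FF packs a 2x4 dot grid, with dots 1-8 mapping to bits 0-7 of the codepoint offset, so every terminal cell becomes 8 addressable "pixels".

```python
# Each braille cell is a 2-wide x 4-tall dot grid. Per the Unicode braille
# pattern layout, dot n sets bit n-1: dots 1-3 are the left column rows 0-2,
# dots 4-6 the right column rows 0-2, dots 7-8 the bottom row.
DOT_BITS = {  # (x within cell, y within cell) -> bit index
    (0, 0): 0, (0, 1): 1, (0, 2): 2, (1, 0): 3,
    (1, 1): 4, (1, 2): 5, (0, 3): 6, (1, 3): 7,
}

def render(pixels):
    """pixels: 2D list of 0/1; height a multiple of 4, width a multiple of 2."""
    h, w = len(pixels), len(pixels[0])
    lines = []
    for cy in range(0, h, 4):
        row = ""
        for cx in range(0, w, 2):
            bits = 0
            for (dx, dy), bit in DOT_BITS.items():
                if pixels[cy + dy][cx + dx]:
                    bits |= 1 << bit
            row += chr(0x2800 + bits)
        lines.append(row)
    return "\n".join(lines)

# a 4x4 monochrome image collapses into just two braille characters
img = [[1, 0, 0, 1],
       [0, 1, 1, 0],
       [0, 1, 1, 0],
       [1, 0, 0, 1]]
print(render(img))
```

a real renderer (like the SVG one in the thread) rasterizes shapes into that 0/1 grid first and then applies exactly this packing per cell.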