More

nibbleyou · 2026-05-30T13:21:43 1780147303

> If a one wants IP and rule of law (incl contracts) to be respected, one should not violate others rights when it is convenient.

Yes that's what should be said to OpenAI. Now they should not cry about their T&Cs not being respected when they never cared about others' copyrights.

nibbleyou · 2026-05-23T03:23:55 1779506635

I think a lot of us would be fine for AA to be a for-profit enterprise earning money from donations and deals with companies. The service it provides is invaluable - free and DRM-free access to millions of titles in the world.

nibbleyou · 2026-05-15T05:19:29 1778822369

I have only worked in startups and I have been an early engineer in both of them. I would always get high privileges within a short time where I would have the access to create and delete resources. I don't think it's that uncommon.

eecc · 2026-05-15T06:11:41 1778825501

I would never have these privileges granted directly to my account.

Indeed it’s a good practice to use roles where supported (AWS has them) and explicitly switch when needed

maccard · 2026-05-15T07:45:20 1778831120

The problem with agents is they regularly sidestep the guardrails and do what they want with a script anyway. The number of times I’ve seen Claude try to escape the folder it’s working in, and then for it to write a python script that does exactly what I told it it’s not allowed do supports that.

If you use SSO and have an AWS config that Claude is allowed to see to get the correct role in the first place, it will just pick the role and plough on anyway.

bigstrat2003 · 2026-05-15T07:56:21 1778831781

And this is why it is the height of irresponsibility to run LLMs on your system. We know they are unreliable and just make things up; it's extremely foolish to go "yeah I'm going to let that run commands".

maccard · 2026-05-15T08:25:23 1778833523

It's not _really_ any different to running an undocumented third party binary. Is it the height of irresponsibility to run Windows, or VSCode, or Spotify?

I think the model we've got now is wrong, and the harnesses should be OS-level sandboxed, and the agents should be running in harness managed sandboxes.

indentit · 2026-05-15T05:27:24 1778822844

But the correct way to do it is to have a separate account with more privileges, and only give AI access to your standard developer account

digitaltrees · 2026-05-15T05:36:01 1778823361

I have personally seen AI bypass this multiple times.

giancarlostoro · 2026-05-15T06:32:50 1778826770

Sounds like they're still giving the model the keys to the kingdom, which is my point, stop giving the model the avenue to do catastrophic mistakes, it makes no sense.

digitaltrees · 2026-05-15T07:50:33 1778831433

If you’re message is in response to me, which I think it is, I deliberately don’t give access to credentials and env variables. I’ve worked to create restrictions and seen AI models use very interesting methods to bypass them.

Even now my prompt says the AI must verify the path of the files it intends to edit, and get permission before editing one file at a time and only after permission. I stop it from ignoring those rules once a day at least.

suchar · 2026-05-15T08:38:01 1778834281

This is not privilege separation/sandboxing. Separate virtual machine for an agent with limited credentials is reasonably safe approach

digitaltrees · 2026-05-15T20:58:46 1778878726

I built www.propelcode.app with separate Linux containers, unless you disconnect the container and your computer from the internet the models can escape the sandbox and get information off of your machine.

I am open to being corrected and learning from you if you have a better method of sandboxing

Anon1096 · 2026-05-16T04:21:42 1778905302

The best way to use LLMs is via tmux where it's running on a disposable VM. 0 chance of it getting information from your local machine.

digitaltrees · 2026-05-17T05:19:54 1778995194

I am using tmux but not disposable vm. I have thought about something like that but honestly some of the debugging work makes ephemeral environments hard to work with. How are you doing that in your workflow?

Terr_ · 2026-05-15T05:52:02 1778824322

We kinda need to architect things with the assumption that all token-output from an LLM can be unpredictably sneaky and malicious.

Alas, humans suck at constant vigilance, we're built to avoid it whenever possible, so a "reverse centaur" future of "do what the AI says but only if you see it's good" is going to suck.

digitaltrees · 2026-05-15T07:46:48 1778831208

I built my own IDE to replace vscode / cursor so I could design the harness and ensure that the model tool access was secure and limited. But the rest of the industry is YOLO

trick-or-treat · 2026-05-15T08:57:07 1778835427

That's one way to do it, how about backup to a remote location every hour? There's more than one way to be careful.

ramraj07 · 2026-05-15T05:56:05 1778824565

The first step I do when I do any meaningful side project is to set up rds with snapshots. So any startup that doesnt do this one basic step already deserves to fail in my opinion.

Then next I've used AI agents like crazy, we even have linked mcp servers that let it query on the dev database. Haven't seen it try deleting everything a single time. I haven't seen any agent try to do anything destructive. Ever. Perhaps its just reflecting an outrageously bad engineer and nothing else.

nibbleyou · 2026-05-05T15:21:55 1777994515

I too have felt the same around me. There is this lack of faith in the institutions now, feeling of distrust. Someone on HN called this the era of shamelessness and I kind of agree to it. The top has gotten shameless and the people at the bottom are trying to scrabble whatever they can to become one of them so that they can escape this hellhole that has been created.

randomNumber7 · 2026-05-05T16:06:33 1777997193

Definitely the fish stinks from the head.

I'm also a bit confused about how the people on the top think this will play out.

A long time ago there was a french saying "noblesse oblige", or the german pendant "Wohlstand verpflichtet".

munificent · 2026-05-05T17:34:39 1778002479

> I'm also a bit confused about how the people on the top think this will play out.

I don't know if they are really capable of thinking of the second and third order effects of what they're doing. There is something psychologically broken about many of the ultra-rich today where their behavior comes across as compulsive.

When you have a hole in your soul that can't be filled with a billion dollars, it simply can't be filled, and that black hole drives much of their behavior. You look at people like Trump and Musk, and they seem... miserable. Like, have you ever heard Trump have a genuine laugh of joy? Not the sort of sneering snicker of a bully, but one that comes from delight? Because I haven't.

We are all at the mercy of their actions, but it's almost like they're at the mercy of their irrational compulsions too.

Not that I'm saying they are deserving of sympathy or aren't responsible for their actions. But if we're looking for someone to pump the brakes on the crazy that's happening these days, it's sure as hell not going to be those hollow men.

thatguy0900 · 2026-05-05T17:09:27 1778000967

I don't like being conspiratorial but it genuinely feels like the people at the top know some major catastrophe is coming and are just grabbing whatever resources they can while they can before retreating to their bunkers. Even the white house is trying to build a massive underground bunker using the ballroom on top as a excuse. I don't see why else they would all be willingly destroying society as they are right now unless they don't think it matters.

brokenmachine · 2026-05-06T01:31:17 1778031077

Everyone knows a major catastrophe is coming. Scientists have been talking about the tipping point for like five decades now.

It's a done deal, we were too stupid.

nibbleyou · 2026-04-28T11:13:41 1777374821

There's also a tool to automatically push it to multiple repos: https://github.com/prashantsengar/GitEcho

Disclaimer: the author is a colleague of mine

Though to be fair, what the parent meant by federated forges is different than this approach.

pabs3 · 2026-04-28T12:24:28 1777379068

git itself can push to multiple URLs btw:

https://stackoverflow.com/questions/849308/how-can-i-pull-pu...

nibbleyou · 2026-04-24T07:12:21 1777014741

Curious to know what kind of problems you are talking about here

hodgehog11 · 2026-04-24T07:20:39 1777015239

I don't want to give away too much due to anonymity reasons, but the problems are generally in the following areas (in order from hardest to easiest):

- One problem on using quantum mechanics and C*-algebra techniques for non-Markovian stochastic processes. The interchange between the physics and probability languages often trips the models up, so pretty much everything tends to fail here.

- Three problems in random matrix theory and free probability; these require strong combinatorial skills and a good understanding of novel definitions, requiring multiple papers for context.

- One problem in saddle-point approximation; I've just recently put together a manuscript for this one with a masters student, so it isn't trivial either, but does not require as much insight.

- One problem pertaining to bounds on integral probability metrics for time-series modelling.

MinimalAction · 2026-04-24T13:33:39 1777037619

Regarding the first problem: are you looking at NCP maps for non-Markovian processes given you mention C*-algebra? Or is it more of a continuous weak monitoring of a stochastic system that results in dynamics with memory effects?

I'd be very curious to know how any LLMs fare. I completely understand if you don't want to continue the discussion because of anonymity reasons.

hodgehog11 · 2026-04-24T15:46:27 1777045587

More of the latter. It's a pet project of mine, and all of the LLMs tend to utterly fail at getting anywhere with it, at least in chats. In an agentic setup, it can chip away at some aspects, but it needs serious guidance on relevant language, notation, and concepts. To me, it demonstrates that the LLMs are not particularly good at crossing literatures, but then again, humans rarely seem to be good at that either...

mdprock · 2026-04-26T22:01:37 1777240897

By agentic do you mean that you run these models through an harness in the cli? If yes which one? Thanks for sharing

pm2r · 2026-04-24T07:33:53 1777016033

It would be wonderful to have a deeper insight, but I understand that you can disclose your identity (I understand that you work in applied research field, right ? )

hodgehog11 · 2026-04-24T08:54:46 1777020886

Yes, I do mostly applied work, but I come from a background in pure probability so I sometimes dabble in the fundamental stuff when the mood strikes.

Happy to try to answer more specific questions if anyone has any, but yes, these are among my active research projects so there's only so much I can say.

pm2r · 2026-04-24T14:53:38 1777042418

Thanks a lot for your kind but detailed answer. I’m no more in the research field but you gave me good ideas to work on

nibbleyou · 2026-04-23T14:18:11 1776953891

I saw something like this for a book. It was under an Instagram reel where the person was describing ways to improve your self-esteem. In the comments section someone mentioned a book that worked for them and it had a few replies saying how it worked for them too. I searched for the book and it was a very new book from an unknown author and zero reviews everywhere.

nibbleyou · 2026-04-03T03:01:16 1775185276

Exact same story at my place. Upper management decided it's a good idea to build on Azure because Microsoft promised some benefits. Things that ran reliable on GCP now need active firefighting on Azure

nibbleyou · 2026-03-28T15:22:24 1774711344

I see this being said often but I don't understand.

A lot of people posting there are young and may well be in their first relationship. It makes sense for them to ask a question in the community they spend their most time in - which is reddit

nibbleyou · 2026-03-20T03:26:39 1773977199

What is your Workflow?