More

apwheele · 2026-06-03T13:40:36 1780494036

Spotcrime just allows you to sign up for email alerts to crime nearby an address of your choosing.

I am generally of the mind even if it results in negative externalities, knowledge is good. So even if it on average increases fear of crime, knowing the reported crime nearby your home is a good thing.

Zigurd · 2026-06-03T13:53:48 1780494828

The quality of the data matters. Over-policed minority neighborhoods don't provide the quality of data that supports rational decision-making. LA has a lot of problems with quality of policing.

snapcaster · 2026-06-03T14:05:01 1780495501

"knowledge is good" is such a naive take. Trivial example: You only have knowledge of crimes committed by immigrants but zero knowledge of crimes committed by citizens. How is that good?

wang_li · 2026-06-03T15:36:05 1780500965

Having partial knowledge is good, even in your YOU'RE RACCIIISSTTTT!!!! example. Let's explain.

Consider you have to perform a task that in some way can interact with something in the environment. You have two choices of where to perform this task. In the first location there are 20 red things in the environment. In the second location there are 20 red things and 10 blue things. You know that 1 in 10 of the blue things have a negative interaction with your task. You know nothing about the red interactions with your task. You obviously choose the location with no blue things.

snapcaster · 2026-06-04T12:25:49 1780575949

You're not modeling this adversarially and that's your fundamental error here. If you look at your "knowledge supplier" as an entity actively trying to deceive you and give you an incorrect world model you'll realize i'm right and you're living in a fantasy theory world

wang_li · 2026-06-04T14:42:05 1780584125

No. You are trying to change the rules to a different game and then use those rules to invalidate a completely unrelated point. Having bad information doesn't make the fact that partial information is better than no information. It just means you were deceived by the information and your attempt at making decisions was sabotaged. If you are so wound up in "everything and everyone is against me" you have a mental disorder. If you are just picking and choosing who you define as adversarial based on which information disagrees with your priors you are just close-minded.

_vertigo · 2026-06-03T15:33:47 1780500827

I could see it being good if it helps you estimate crimes committed by citizens. If you know where the gaps in your knowledge/data are, you can attempt to account for them. And that’s better than nothing.

I think misleading information is obviously bad, incomplete information is not necessarily misleading though.

On the other hand, it might be better to remove incomplete information if it is actively being used to mislead people.

snapcaster · 2026-06-04T12:25:58 1780575958

You're not modeling this adversarially and that's your fundamental error here. If you look at your "knowledge supplier" as an entity actively trying to deceive you and give you an incorrect world model you'll realize i'm right and you're living in a fantasy theory world

jvanderbot · 2026-06-03T14:08:56 1780495736

So, continue this train of thought - If only partial data is available, then no data should be available because the partial data might induce incorrect assumptions in the general populace.

Apply this to:

Vaccination / disease management

Housing availability ("if they only know of these areas, will those areas become swamped and drive up prices?")

Price of drugs / medical services, or even medical test results (how many more suicides "might" occur if someone gets a possible cancer diagnosis)

Climate change

or anything else.

I think you'll find you're quickly concentrating knowledge dissemination into a central authority who decides what is "right" and that is much more dangerous than incomplete information.

Zigurd · 2026-06-03T14:29:05 1780496945

We're not talking about "partial data." We're talking about tendentious data that propagates existing known bias, produced by brutal problematic low quality policing. At the very least, people making apps based on crime location data need to acknowledge and flag such problems and inform their users of the dubiousness of LAPD and LASD data.

Surveillance tech and cop tech generally don't contribute to society because of these problems.

If you wouldn't trust RFK Jr. about vaccines, you should also be skeptical about what many PDs tell you. LAPD is just a particularly notorious example.

jvanderbot · 2026-06-03T14:38:58 1780497538

> We're not talking about "partial data."

You might change the subject away from partial data, but the comment I replied to _was_ talking about partial data, and my rebuttal _is_ about partial data and the judgement of whether that partial data is worth releasing based soley on how we imaging people might react to it. Have you read "The Unthinkable"?

I wouldn't trust RFK Jr. about vaccines if I didn't trust his data. But establishing a body that was in charge of disseminating data about vaccines is in high likelihood going to be taken over by RFK Jr types. Such a body shouldn't exist. Such a body would write "Turtles all the way down."

> "knowledge is good" is such a naive take. Trivial example: You only have knowledge of crimes committed by immigrants but zero knowledge of crimes committed by citizens. How is that good?

That counter-logic is so fundamentally flawed b/c it rests exclusively on the prejudgement of others and prediction of their use of the data while "I", the good thinker, can determine that it is bad for "them" to have access to this data. That is just a very bad way to think and is precisely what RFK-types do all the time.

throawayonthe · 2026-06-03T19:52:23 1780516343

> even if it results in negative externalities, knowledge is good.

i can respect[0] that, perhaps the data should be available in some form; but would regardless call 'crime alert emails' a disservice.

[0]i'm skeptical of LAPD producing knowledge in this area, others have noted overpolicing etc. also this is tracking police activity/knowledge, not crimes directly (and tbf it's impossible for it to complete even with best effort)

apwheele · 2026-05-30T17:16:25 1780161385

I have my crazy notes on Quarto and word documents here, https://github.com/apwheele/Blog_Code/tree/master/Quarto/Rep.... Hopefully useful for others reading these comments.

I don't even know what magic buttons I need to push to get that template to correctly inherit the table format I wanted from pandoc, but it does. I tend to have other scripts though for more complicated tables though. So if I want a table to have a certain row highlighted a different color, I would write a Powershell script to run after the table was generated.

I was never able to figure out how to use LibreOffice to insert the table of contents and then export to PDF (although I can do it via the GUI).

apwheele · 2026-05-28T13:50:14 1779976214

I have a contact page on my business website. I get about 2 spam messages a day.

So far I have not taken the effort to better filter them. It is a nice check to make sure the site is still working at the moment given the low volume I get.

I should try to AI inject them though at this point with something silly, like I am more likely to respond if you email as a limerick.

apwheele · 2026-05-26T12:42:42 1779799362

A recent paper believes work from home reducing entry level gigs fits the data better, https://papers.ssrn.com/sol3/papers.cfm?abstract_id=6787638.

I hate it, but the X thread is the easiest review piece I can find, https://x.com/pj_lambert/status/2057477629528150369.

iso1631 · 2026-05-26T13:07:26 1779800846

https://xcancel.com/pj_lambert/status/2057477629528150369

I struggle to see how WFH, especially as that was far more common from 2020 to 2023 than 2023 to 2026

Rather than the post-covid slump we've seen globally

> WFH makes supervision, monitoring, and on-the-job learning harder

It makes it different. In many ways it makes it easier, if you have the right supervisors and mentors working in the right way.

The larger impact would be hotdesking. Going to an office and not sitting anywhere in your team makes collaboration harder than working from home.

The requirement to move job to progress in remuneration harms retention, and thus reduces willingness to invest in a junior, but it's the expectation to move job after 2-3 years.

cshimmin · 2026-05-26T12:59:21 1779800361

https://xcancel.com/pj_lambert/status/2057477629528150369

apwheele · 2026-05-24T12:20:04 1779625204

You should add in Calvin Schmid's Handbook of Graphic Presentation into your list Doug -- https://archive.org/details/HandbookOfGraphicPresentation/pa...

Unfortunately I do not see specific discussion of how to make the lines a consistent thickness. It does have notes on how to sharpen your pencil and how to use a carpenters spline to draw smooth curves though.

mauvehaus · 2026-05-24T15:24:35 1779636275

Technical drawings pens are held upright and have a circular tip that gives a specific line width based on the diameter of the tip.

If you're inking your drawings, you probably don't need to worry all that much about the exact line width and consistency of your pencil work.

N.b. I don't ink my drawings. I've used drafting pens a couple times to experiment, but it's not part of my regular workflow.

https://www.jetpens.com/blog/The-Best-Technical-Drawing-Pens

nakedneuron · 2026-05-25T08:37:38 1779698258

Thanks for sharing! Today I discovered the Jibun Techo. Always great to discover products made by people who put so much thought and craft in. Not surprising the Japanese excel in this art.

Set the mood for today.

Unfortunately can't upvote.. your karma says 6666. May return when the spell is broken.

dougdude3339 · 2026-05-24T17:04:59 1779642299

This will be an excellent addition to the list. Curve lines are a challenge I have yet to tackle in depth.

thaumaturgy · 2026-05-24T17:15:45 1779642945

You may also like Information Graphics: https://www.taschen.com/en/books/graphic-design/44653/inform...

antiquark · 2026-05-24T15:00:39 1779634839

> lines a consistent thickness

A "ruling pen" would help. It's like a fountain pen where you can adjust the width of the ink.

https://en.wikipedia.org/wiki/Ruling_pen

apwheele · 2026-05-15T11:03:38 1778843018

So not familiar with the caselaw around work product, but if you use an API tool directly and not the different chat tools, the queries are not permanently cached for anyone to give up in the end.

So basically if you use any of the CLI tools, there is nothing for OpenAI, Anthropic, etc. to give the courts.

Online ChatGPT (especially the free version), are apparently cached by OpenAI on their servers. (I am not sure if Claude Desktop caches the conversations locally or in the cloud as well, read the fine print if it matters!)

trollbridge · 2026-05-15T12:20:51 1778847651

Indeed, there is no way my terabytes per day of API calls is getting permanently stored anywhere.

Perhaps an AI generated summary of it is.

skeeter2020 · 2026-05-15T15:22:42 1778858562

interesting angle - how are/would compressed context (i.e. the parts of the user-LLM transcription likely to be saved) be treated by the courts? Would this be considered hearsay?

nerdsniper · 2026-05-15T15:57:42 1778860662

It would be treated as evidence, and the defense would be free to argue that it carries very little weight. The judge or jury would be free to decide how meaningful it is.

apwheele · 2026-05-10T15:14:04 1778426044

There needs to be a new name for people creating these with no obvious validation.

Skill spam?

siva7 · 2026-05-10T18:05:59 1778436359

Spam. It takes a minute in 2026 to create any app, any skill, any anything without any education that looks plausible that took five years ago a highly educated and skilled person at least months. Now it takes the highly skilled individual ten times the time to evaluate the vibeslopped spam it took the author to publish.

Daviey · 2026-05-10T20:46:09 1778445969

In job I am seeing it as an amplification DoS attack, the amount of content being produced is crippling the processes to protect the org.

AndyNemmity · 2026-05-10T16:01:27 1778428887

Define obviously validation? What is the signal that tells you one is reasonable vs another?

I find the only way to do that is to look at it, if it passes some visual tests, try it, and then a/b test if it's any better than without it.

theptip · 2026-05-10T16:30:42 1778430642

Some sort of eval. Eg TermBench, implemented in Harbor.

It’s an insane amount of effort to build shareable, reusable, comprehensive evals, hence why so almost all skills are stuck in the “vibes” phase.

That said I think it’s quite easy to skim/intuit these sort of skills and do horizontal gene transfer into your own vibes-based system. If you use the skills regularly you can construct a cheap personal eval that is a lot easier to maintain and use it to compare a new skill/plugin. Just things like “please write a paper on <my personal unpublished thesis>” is a good starting point here. You get a good feel for whether a skill is better than vanilla by running it a couple times and watching the failure modes.

AndyNemmity · 2026-05-10T16:38:40 1778431120

Yeah, I think we're in a phase honestly where you shouldn't use anyone elses skills, and you should instead point your stuff at a repo with skills, have it really read it, and then ask what of value there is to potentially rewrite in your style based on your preferences.

I have a complex setup with a lot of things based around what I do. I don't know how anyone could reasonably get their head around any of it. It's a research project in itself.

So I tell people, please don't use it. Just point your claude code at it, and see if there's anything useful for you.

bisonbear · 2026-05-11T02:38:05 1778467085

Agree, it's impossible to tell if someone else's workflow works with your codebase without actually trying it, which takes time/tokens. I've been thinking about how to make running quick, directional evals easier / more efficient to give more confidence in using / developing skills. Basically, how do we go from vibes to data?

apwheele · 2026-05-10T16:23:54 1778430234

So yes a/b broadly speaking is what I was saying (test cases and can show it is actually better).

Even this repo just the "b" showcase, showing the outputs as is (with no clear documentation how those were generated, is it headless in a CI pipeline somewhere?), is not good, https://github.com/Imbad0202/academic-research-skills/tree/m....

AndyNemmity · 2026-05-10T16:31:09 1778430669

I run a lot of a/b testing. But I'm not sure showing it actually communicates all that much. Since these are non deterministic systems, even showing you an a/b test from when i made the decision a month ago, doesn't really mean a whole lot.

I agree we need more clear indications of value, I don't quite understand how to legitimately do that in a fair, and honest way.

elashri · 2026-05-10T15:50:08 1778428208

Skill-slop.

whattheheckheck · 2026-05-10T17:25:43 1778433943

Its the same as whipping out some random python package why diminish it? Your comment could be called skeptic reply guy spam

mmooss · 2026-05-10T16:49:40 1778431780

The OP evaluates what it has developed with great rigor and describes the evaluation in detail. What do you feel is missing?

apwheele · 2026-05-10T19:17:03 1778440623

It actually does not -- and that is part of the issue. Consumers just see "oh gosh this looks very detailed" and superficially think someone must of spent quite a bit of time on this and it works well.

Skills are just prompts -- and most of what I am seeing are people using AI to write the (quite verbose) prompts. There should be a test, somewhere, that shows "my prompt does better than XYZ other prompt" for some model and some specific inputs. This is what is called a benchmark.

It may work well, I don't know. Just asking Claude "hey help me iterate on a paper" works pretty well out of the box too. Call me skeptical this actually works in any substantive way without seeing any evidence it works.

I agree writing a good benchmark takes time. How do people know if all these prompts they are writing are any good though? You could make an edit and it causes a regression overall. Or add too much info and it is just wasted space in the context window, or causes the model to go in loops between the different skills, or plenty of other errors.

AndyNemmity · 2026-05-10T20:02:31 1778443351

I really do run a/b tests. I really do test, and validate.

I do not believe me giving you that information is honest. If I do, I am pretending that you will get the same experience.

Maybe you're using a different model. Maybe you have stuff in your CLAUDE.md that will break it.

It is not honest to me to give you confidence in it, when no one can be confident in it.

mmooss · 2026-05-10T22:46:16 1778453176

> It actually does not

I read it, right there on the OP. Tests and test results, including discussions of flaws with earlier designs and how they are improved here. What are you talking about?

sumeno · 2026-05-10T19:34:39 1778441679

I seriously doubt any human has ever read the full readme for the project.

apwheele · 2026-05-04T17:33:11 1777915991

This data is just generally often available in the US, https://northcarolina.votermaps.org/?#16.76/35.78541/-78.779... (agree it is bad though!)

apwheele · 2026-05-01T16:46:33 1777653993

It is institutional in the sense that Flock and the individual PDs have not put steps in place (either post auditing or pre not allowing bad queries) that prevent the abuse.

Post auditing is obviously not taken seriously by these departments, and Flock could build tools to do this out of the box (identify weird search patterns) if they wanted to.

Edit -- I see Flock does have some audit tools, https://www.flocksafety.com/trust/compliance-tools. If those work as they should, it is more on PDs to use them properly.

apwheele · 2026-05-01T12:09:38 1777637378

FYI the first link, I copy-pasted the first few paragraphs into pangram and it correctly identifies as AI written, https://www.pangram.com/history/790fc2b8-6348-47fa-ad3e-8bae...