More

vadansky · 2026-06-01T01:21:10 1780276870

I would love something that you can open and it expands/pops out a split keyboard like the Voyager (https://www.zsa.io/voyager)

vadansky · 2026-05-28T18:01:17 1779991277

I don't know, maybe I'm doing it wrong but I feel LLMs add a slop debt, and each agent pass just exuberates it.

Like I had an LLM implement a spec and said it was done... Except it had a ton of `casts` everywhere. Okay, my bad, I should have been clear "NO CASTS", so I use the LLM to remove the casts, except it just kept making things more and more complicated and ugly.

It took me taking a break and having a shower thought to realize all the ugliness is because one type should have been broken up into 2, which would remove a ton of generics and code. But Claude never suggested that, it was always "we need at least one cast here, or we need 1000 LOC of generic factories". I tried multiple new sessions with various prompts too.

Maybe one day soon LLMs could pay off their own slop debt but at least right now I don't trust them to write code unseen.

Edit: Maybe the correct action should have been to delete everything and make it re-write everything from scratch with the clear "NO CASTS EVER" rule. But still the point is feels like having LLM clean up after an LLM doesn't work well enough to just have keep it in a loop and never look at what it does.

highwaylights · 2026-05-28T18:17:08 1779992228

This matches my experience.

I've had to put a fair chunk of effort in to skills that will run deterministic mechanisms to unslop a codebase (cyclomatic complexity grading has been really helpful here) as invariably some amount of guidance around principles will be missed over time. I've found it does help, though. Certainly I'm getting overall better results from Flash and Sonnet over multiple runs for fairly modest token increases. GPT 5.5 less so, but that's because it scores better in a first pass. I won't really know until I gauge it at the end of my sub month which has been more cost efficient for me all things considered.

NichoPaolucci · 2026-05-29T02:54:23 1780023263

I’m in a similar boat. I find that longer sessions will introduce “noise”. I have to be extremely explicit to avoid adding this noise, as it pollutes the future output of the models. Sometimes it’s innocuous, other times it can derail sessions as the 2nd or 3rd pass introduce even more of their own noise.

To me, it seems the models are inherently designed to do this. Creating more verbose output than input, generating plans introduce things I didn’t ask for, extras, more “defensive” code that makes sense at first but is completely unnecessary in practice… I find it exhausting, but it’s important to pare down the output / plans at each stage and trim the generated stuff that isn’t needed.

vinnymac · 2026-05-28T18:34:46 1779993286

The problem is that we have an ever growing and large number of constraints, and not following even a single one means the result is sloppy.

I don’t see them fixing this any time soon, and thus human in the loop is a requirement to use these tools effectively. That is unless you love your slot machine dopamine rush enough to ignore quality gates and respect for your peers time.

tomjakubowski · 2026-05-28T19:38:02 1779997082

I've been reading writing Rust for a long while now, since before 1.0. I'm capable of critically evaluating Rust code. I'm also a happy Claude Code user, mostly for lightweight uses like generating scaffolding, prototyping, and debugging.

The pure LLM, no human intervention vibe-coded PRs on Bun since the vibe-rewrite to Rust contain the worst coding horrors I've seen in 20 years of programming.

Setting aside the quality of the change itself (I would have done it differently, for sure: it is pretty straightforward to build a safe abstraction out of this type), the utterly pointless "source-text consistency test" added here is easily the worst example of "test repeats implementation" I have seen in my career:

https://github.com/oven-sh/bun/pull/30728/files#diff-863477b...

eithed · 2026-05-29T13:40:32 1780062032

Write a skill outlining your expectations of the code, put that skill into the pipeline, so that it can be included within your workflow.

Webdev here, but currently I have: - a skill where I outlined how the architecture of the system should look like, with guards (static analysis, architecture tests, linting) confirming that the code it generates adheres to standards

- a skill that tells it how tests should look like (use generators, write both feature / unit tests)

- a skill that tells it to generate docs from the code in a form of acceptance criteria (Given / When / Then)

- a skill that tells it to generate frontend uat tests + accompanying backend seeders given the AC

- a skill that tells it to verify that ticket objectives match what was delivered

At this point I still need to guide it to move task from one stage to the other (coding, testing, verification that indeed what was coded adheres to what was required), but I believe that these dynamic workflows can automate this work as well.

zmj · 2026-05-28T22:22:41 1780006961

If you want hard rules, use deterministic tools. Prompts are for fuzzy guidance.

erispoe · 2026-05-29T15:14:11 1780067651

How would you prevent a junior engineer doing this mistake? Presumably, you would setup a lint rule. Do the same for LLMs. Run the linter after each edit through a hook, give feedback to the LLM. Write your lint rules with clear explanations of why the behavior is a problem, and nudges to the good behavior.

grim_io · 2026-05-29T23:18:43 1780096723

You wouldn't prevent the junior from making this mistake.

You would correct them once or twice, and they won't make the mistake again.

It's something we can't do with LLM's currently, so we all just try(and fail) to predict any possible failure ever, and then somehow try to cram it into the limited context.

sevenseacat · 2026-06-03T15:48:09 1780501689

You would review their code, and give them the feedback. They would learn from that, and not make the mistake again (or not make it after receiving the same feedback again).

vadansky · 2026-05-12T14:48:39 1778597319

http://archive.is/UMMCx

cluckindan · 2026-05-12T17:27:45 1778606865

dang, please block these links.

https://en.wikipedia.org/wiki/Wikipedia:Archive.today_guidan...

NooneAtAll3 · 2026-05-12T19:49:42 1778615382

what's the alternative?

lobocinza · 2026-05-12T22:10:58 1778623858

archive.org

ameliaquining · 2026-05-13T02:14:10 1778638450

It was just demonstrated upthread that archive.org doesn't work for this purpose.

vadansky · 2026-04-29T19:24:51 1777490691

This is annoying since I have a side project I like to use alchemical names in, and HERMES.md sounds like something I would do. Guess I have to go with AGRIPPA.md, but Hermes Trismegistus is so much cooler...

vadansky · 2026-04-29T13:41:52 1777470112

I've been using Notepad Next, it supports leaving all your tabs open when you close the window which is the main feature I need. But I do miss the plugins.

vadansky · 2026-04-13T16:06:18 1776096378

I want to like Claude but I keep having to pop over to codex and I feel at some point I'll stop starting with Claude and just use Codex from the start.

zamalek · 2026-04-13T16:21:29 1776097289

Claude to plan, codex to implement. Claude's giant context is great for reading large amounts of code but it is currently incapable of following instructions/guidelines.

d0100 · 2026-04-13T17:11:17 1776100277

GPT 5.4 is working pretty well for me, both in Copilot and Codex vscode extensions

If you create a plan it follows it closely

vadansky · 2026-04-10T21:25:21 1775856321

Or just use it as an example to vibecode your own. Extension laundering through vibecoding.

vadansky · 2026-03-03T22:39:50 1772577590

I think the theory was he had a rare copy and wanted to drive the price of it up.

duskwuff · 2026-03-04T05:48:56 1772603336

That's hard to reconcile with actions like issuing DMCA takedowns on videos of the game (or even Discord messages which mention it). If fewer people know a game exists, there's less of a market for copies of it.

vadansky · 2026-03-03T04:30:51 1772512251

Good time to watch Shattered Glass.

Imagine what he could have gotten up to with LLMs.

thomassmith65 · 2026-03-03T10:27:04 1772533624

It's an excellent movie, regardless.

  "When this thing blows there isn't going to be a magazine anymore!"

https://youtube.com/watch?v=oj79mp2WEx0

vadansky · 2026-01-09T17:17:05 1767979025

Keep scrolling down, there is a Max option