Where they'd get training data? Source code generation is possible due to large ...

jimmaswell · 2025-12-16T22:37:25 1765924645

How long ago was this? I've had outstansingly impressive results asking Copilot Chat with Sonnet 4.5 or ChatGPT to debug difficult multithreaded C++.

oxag3n · 2025-12-16T23:00:42 1765926042

Few months back with ChatGPT 5. Multi-threaded Rust & custom async runtime, data integrity bug, reproduced every ~5th run.

simonw · 2025-12-16T22:38:03 1765924683

Have you tried running gdb from a Claude Code or Codex CLI session?

oxag3n · 2025-12-16T23:10:25 1765926625

No, I'm in academia and the goal is not code or product launch. I find research process to struggle a lot once someone solves a problem instead of you.

I understand that AI can help with writing, coding, analyzing code bases and summarizing other papers, but going through these myself makes a difference, at least for me. I tried ChatGPT 3.5 when I started and while I got a pile of work done, I had to throw it away at some point because I didn't fully understand it. AI could explain to me various parts, but it's different when you create it.

planckscnst · 2025-12-17T06:19:30 1765952370

For interactive programs like this, I use tmux and mention "send-keys" and "capture-pane" and it's able to use it to drive an interactive program. My demo/poc for this is making the agent play 20 questions with another agent via tmux

fragmede · 2025-12-16T23:36:28 1765928188

> Where they'd get training data?

They generated it, and had a compiler compile it, and then had it examine the output. Rinse, repeat.

RA_Fisher · 2025-12-17T00:26:18 1765931178

LLMs are okay at bisecting programs and identifying bugs in my experience. Sometimes they require guidance but often enough I can describe the symptom and they identify the code causing the issue (and recommend a fix). They’re fairly methodical, and often ask me to run diagnostic code (or do it themselves).

anon-3988 · 2025-12-16T23:47:40 1765928860

> I suspect debugging is not that straightforward to LLM'ize.

Debugging is not easy but there should be a lot of training corpus for "bug fixing" from all the commits that have ever existed.

christophilus · 2025-12-16T22:46:56 1765925216

Debugging has been excellent for me with Opus 4.5 and Claude Code.