If your application is pricing sensitive, check out DeepInfra.com - they have a variety of models in the pennies-per-mil range. Not quite as fast as Mercury, Groq or SambaNova though.
(I have no affiliation with this company aside from being a happy customer the last few years)
DeepInfra is amazing in terms of price - really, they have the Qwen3 embedding model for $0.002 per million tokens. That's an order of magnitude cheaper than most alternatives with better benchmark scores. But the P99 latency is slow and the variance is huge, which is problematic for latency-sensitive workloads. If they can fix that, using them will be a no-brainer. DeepInfra does tend to have the lowest prices of any API provider.
Sounds great to me! Just be sure the AI is capable of taking on various personas because not everyone you meet is a believer in nonviolent communication.
A tip, Miami guy to Miami guy - the best devs avoid PHP like the plague. The remaining PHPers are desperate workaday sorts who will not be bringing the latest hotness to your project. Hire an Elixir or Haskell expert and you're gonna get a much more well-traveled coder.
Kudos on your bold undertaking! I've been a side-lined QNX admirer for some time, though not a potential user in most cases. A good next step would be a series of blog posts where the author takes on common types of enthusiast projects and unpacks how QNX's strengths can be applied in those scenarios.
Do you find you really need that level of “resolution” with memories?
On our [1] chatbots we use one long memories text field per chatbot <-> user relationship.
Each bot response cycle suggests a new memory to add as part of its prompt (along with the message, etc.).
Then we feed that new memory, together with the existing memories text, to a separate “memory archivist” LLM prompt cycle that re-summarizes the whole thing, yielding a replacement for the stored memories with the new memory folded in.
Maybe overly simplistic, but easy to manage and pretty inexpensive. The archiving part is async and fast. The LLM seems pretty good at sussing out what's important and what isn't.
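For anyone curious, the loop described above can be sketched roughly like this. This is a minimal sketch, not the parent's actual code: `call_llm` is a stand-in for whatever completion API you use (here it's a dummy that just concatenates, so the example runs), and the prompt wording and function names are invented.

```python
# Sketch of the one-text-field memory pattern: one memories string per
# bot<->user relationship, rewritten by an "archivist" LLM each cycle.
# ARCHIVIST_PROMPT and call_llm are illustrative stand-ins.

ARCHIVIST_PROMPT = (
    "You maintain a running summary of facts about a user.\n"
    "Existing memories:\n{existing}\n\n"
    "New memory to incorporate:\n{new}\n\n"
    "Rewrite the memories as one concise text, keeping what matters "
    "and dropping what doesn't."
)

def call_llm(prompt: str) -> str:
    # Stand-in: a real system would call your model provider here.
    # This dummy just appends the new memory so the sketch is runnable.
    existing = prompt.split("Existing memories:\n")[1].split("\n\nNew memory")[0]
    new = prompt.split("New memory to incorporate:\n")[1].split("\n\nRewrite")[0]
    return (existing + "\n" + new).strip()

def archive_memory(stored_memories: str, new_memory: str) -> str:
    """The async 'archivist' step: fold one suggested memory into the
    single stored memories text, returning the replacement text."""
    prompt = ARCHIVIST_PROMPT.format(existing=stored_memories, new=new_memory)
    return call_llm(prompt)

memories = "Likes hiking. Allergic to peanuts."
memories = archive_memory(memories, "Mentioned a sister named Ana.")
print(memories)
```

The nice property is that storage stays O(1) per relationship - you only ever persist the one re-summarized string, and the archivist decides what survives each rewrite.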
I have already tried what you're doing, and it didn't perform well enough for me. I've been developing this project for two years now. Its memory isn't going to fit in a single prompt.
I imagine that your AI chatbots aren't as cheap or performant as they could be, given your potentially enormous prompts. Technical details aside, just like when talking to real people, it feels nice when they recall minor details you mentioned a long time ago.
If it's your personal assistant and has been helping you for months, it will pretty quickly start forgetting details and retain only a vague view of you and your preferences. So instead of being your personal assistant, it practically clusters your personality and gives you generic help with no reliance on real historical data.
I have read a lot of reports that the job market is pretty bad.
But, bad or good, I think all you can really do is keep trying! Don't let a few rejections stymie your long-term goals. Your family needs you to keep putting one foot in front of the other and applying to more and more places until you find that perfect role.
Maybe use this downtime to build yourself up: open source some stuff that defines you as a subject matter expert, or blog about some of your experiences, etc.
Wouldn't hurt to share your resume here too - lotta industry people lurking. :)
Sad day! This guy was a hilarious and talented writer. If anyone is looking for a book to pick up this weekend, I'd recommend checking out some of his work, especially if you like hard drinking Jewish nihilist detectives.
I loved Kinky. I first encountered him in a Washington Post interview in the early 1980s, in which he remarked "I'm searching for a lifestyle which does not require my presence." That's been my lodestar ever since. R.I.P. Kinkster.