Very interesting how Singapore ranks 2nd in terms of token volume. I wonder if this is potentially Chinese usage via VPN, or if Singaporean consumers and firms are dominating in AI adoption.
Also interesting how the 'roleplaying' category is so dominant. It makes me wonder whether Google's classifier sees a system prompt with "Act as a X" and classifies that as roleplay rather than the specific industry the roleplay was intended to serve.
Almost certainly VPN traffic. Most major LLMs block both China and Hong Kong (surprisingly, not the other way around), so Singapore ends up being the fastest nearby endpoint that isn't restricted.
Ah, you're right. Still, I wonder if it's because of Chinese people and companies using Singaporean bank accounts. It just seems odd that such a small country is so overrepresented here.
These were their own decisions, made long before the controls and pressure. Besides being in bed with the US government, the people who run big AI shops tend to be fervently nationalistic and politically ambitious in their own right. Leopold Aschenbrenner's dystopian rant [1] or Dario Amodei's [2] [3] are pretty representative.
Early on there was apparently a lot of distillation going on. Note that OpenAI introduced ID verification for high-volume accounts, and I think it was for that reason. It does raise questions about how much of Chinese models' performance is entirely home-grown. At least historically, it was quite hard to crawl the English web from behind the Great Firewall.
Bidaya AI | Senior Software Engineer (Full Stack) | Calgary, Canada | HYBRID | Full-time | Up to $140k CAD + Generous Equity
We're an early-stage (pre-seed) VC-backed startup automating RFP proposals for the AEC (Architecture, Engineering, Construction) industry.
We are building an agentic AI platform that embeds directly into Microsoft Word, helping firms find and win more work, while serving as their knowledge management hub for all business development.
You would be the first full-time engineering hire working directly with the founders (I'm the technical co-founder). We need someone who can ship production code across the whole stack.
Deep Word Integration: Building a high-performance, "Cursor-like" experience within the constraints of Office.js.
Agentic Workflows: Orchestrating AI agents that can read complex government requirements, reason about compliance, and generate winning output autonomously.
Evolving Knowledge Graph: Architecting a library system that doesn't just store files, but learns from project history and feedback loops.
If you want a chance to work on a hard problem, in an exciting space, with a strong team that has validated the market and de-risked the business, with major upside, let's talk!
Langfuse and Helicone work well for traditional LLM operations, but AI agents are different. We discovered that agents require fundamentally different tooling; here are some examples.
First, while LLMs simply respond to prompts, agents often get stuck in behavioral loops where they repeat the same actions; to address this, we built a graph visualization that automatically detects when an agent reaches the same state multiple times and groups these occurrences together, making loops immediately visible.
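The loop-detection idea above can be sketched roughly as follows. This is a minimal illustration, not their actual implementation: I'm assuming each agent step can be reduced to a hashable "state signature" (here, tool name plus arguments), so that repeated signatures mark the nodes a graph view would group together.

```python
# Hypothetical sketch: detect behavioral loops in an agent trace by
# grouping steps that share the same state signature.
from collections import defaultdict

def detect_loops(steps):
    """Map each (tool, args) signature to the step indices where it
    occurred; signatures seen more than once are candidate loops."""
    seen = defaultdict(list)
    for i, step in enumerate(steps):
        signature = (step["tool"], step["args"])
        seen[signature].append(i)
    return {sig: idxs for sig, idxs in seen.items() if len(idxs) > 1}

trace = [
    {"tool": "search", "args": "pricing page"},
    {"tool": "fetch", "args": "https://example.com"},
    {"tool": "search", "args": "pricing page"},  # same state again -> loop
]
print(detect_loops(trace))  # {('search', 'pricing page'): [0, 2]}
```

A real system would normalize the arguments (and probably include context hashes) before comparing, but the grouping step is the same.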
Second, our evaluations are much more tailored to AI agents. LLM-ops evaluations usually happen at the per-prompt level (e.g. hallucination, QA correctness), which makes sense for those use cases, but agent evaluations are usually per session or per run. Often a single prompt in isolation didn't cause the issue; rather, a downstream memory problem or an earlier action caused the current tool call to fail. So we spent a lot of time building a way for you to define a rubric. Then, to evaluate against the rubric without context overload, we created an agentic pipeline with tools like viewing rubric examples, zooming "in and out" of a session, and referencing previous examples.
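To make the per-session (rather than per-prompt) distinction concrete, here is a toy sketch of my own, not their pipeline: each rubric criterion is a function over the whole session, so it can catch failures (like an earlier tool error) that no single prompt would reveal. The criterion names and session fields are illustrative assumptions.

```python
# Hypothetical sketch: score a *whole* agent session against a rubric,
# instead of scoring individual prompts in isolation.
def evaluate_session(session, rubric):
    """Return a dict of criterion name -> pass/fail over the full session."""
    return {name: check(session) for name, check in rubric}

session = {
    "steps": [
        {"tool": "search", "ok": True},
        {"tool": "fetch", "ok": False},  # earlier failure poisons later steps
    ],
    "memory_reads": 1,
}

rubric = [
    ("all_tools_succeeded", lambda s: all(step["ok"] for step in s["steps"])),
    ("memory_used", lambda s: s["memory_reads"] > 0),
]
print(evaluate_session(session, rubric))
# {'all_tools_succeeded': False, 'memory_used': True}
```

Their actual evaluator is itself agentic (with zoom in/out tools), but the unit of evaluation is the same: the session, not the prompt.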
Third, time travel and clustering of similar responses. LLM debugging is straightforward because prompts are stateless and independent of one another, but agents maintain complex state through tools, context, and memory management. We addressed this with a "time travel" feature that captures the complete agent state at any point, letting developers modify variables like context or tool availability, replay from that exact moment, simulate the run 20-30 times, and group similar responses with our clustering algorithm.
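The snapshot-and-replay mechanic can be sketched in a few lines. This is a minimal stand-in, assuming the agent state is a plain dict and `toy_agent` is a placeholder for a real agent step loop; counting identical responses here stands in for their real clustering.

```python
# Hypothetical sketch of "time travel": deep-copy the agent state at a
# step, then replay from that snapshot with overridden variables and
# bucket the replayed responses.
import copy
from collections import Counter

def replay_from(snap, agent_fn, n=20, **overrides):
    """Re-run the agent n times from a saved snapshot, optionally
    overriding state (context, available tools, ...)."""
    outcomes = Counter()
    for _ in range(n):
        state = copy.deepcopy(snap)   # never mutate the saved snapshot
        state.update(overrides)
        outcomes[agent_fn(state)] += 1
    return outcomes

# Toy agent whose behavior depends on tool availability:
def toy_agent(state):
    return "uses_calculator" if "calculator" in state["tools"] else "guesses"

snap = {"context": "2 + 2?", "tools": ["calculator", "search"]}
print(replay_from(snap, toy_agent, n=5))                    # all 'uses_calculator'
print(replay_from(snap, toy_agent, n=5, tools=["search"]))  # all 'guesses'
```

With a real (stochastic) agent, the replays would diverge, which is where grouping similar responses becomes useful.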
Fourth, agents exhibit far more non-deterministic behavior than LLMs because a single tool call can completely change their trajectory; to handle this complexity, we developed workflow trajectory clustering that groups similar execution paths together, helping developers identify patterns and edge cases that would be impossible to spot in traditional LLM systems.
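One plausible way to cluster execution paths (purely my sketch; their algorithm isn't described) is to treat each run as a sequence of tool names and group runs whose normalized edit distance falls under a threshold:

```python
# Hypothetical sketch: greedy trajectory clustering by Levenshtein
# distance over tool-call sequences.
def edit_distance(a, b):
    """Levenshtein distance between two sequences, O(len(b)) memory."""
    dp = list(range(len(b) + 1))
    for i, x in enumerate(a, 1):
        prev, dp[0] = dp[0], i
        for j, y in enumerate(b, 1):
            prev, dp[j] = dp[j], min(dp[j] + 1, dp[j - 1] + 1, prev + (x != y))
    return dp[-1]

def cluster_trajectories(runs, threshold=0.3):
    """Assign each run to the first cluster whose representative is
    within the normalized-distance threshold, else start a new cluster."""
    clusters = []
    for run in runs:
        for cluster in clusters:
            rep = cluster[0]
            if edit_distance(run, rep) / max(len(run), len(rep)) <= threshold:
                cluster.append(run)
                break
        else:
            clusters.append([run])
    return clusters

runs = [
    ["search", "fetch", "answer"],
    ["search", "fetch", "answer"],
    ["search", "search", "fetch", "answer"],  # one extra search: close enough
    ["calc", "answer"],                       # entirely different path
]
print(cluster_trajectories(runs))  # two clusters: the search runs, and the calc run
```

A production version would likely embed trajectories rather than edit-distance them, but the idea of surfacing a handful of representative paths out of hundreds of runs is the same.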
As there is only one bottle of this wine in the world, I think what matters is the total dose, not the concentration, of Pb.
In addition, the 0.14 mg/L figure reported in the paper is at a similar level to the current safety standard. The International Organisation of Vine and Wine (OIV), an intergovernmental agency composed of 45 member states, has a current maximum acceptable limit of 0.15 mg/L for Pb in wine, starting from the 2007 harvest year.