Hacker Newsnew | past | comments | ask | show | jobs | submit | ojr's commentslogin

Google Search has gotten better unless you think AI mode is a downgrade, the alternative of having a wikipedia article, reddit post or random website as the first result is not better technically maybe morally for you but not matter of factually. The average user does less manual filtering.

I definitely think the AI mode is a downgrade. It has me seriously considering abandoning Google for different search engine. With a reddit post or a Wikipedia article, it's much easier to assess the credibility of the content.

The AI mode does at least attempt to list it's sources, but it's extra hoops to jump through.


For the average Google User, searching for things like who perform in the Superbowl it is not extra hoops and incredibly fast. It's going to power Siri soon, enjoy.

AI overview though is crap. It almost always gives wrong answers and contradicts things that are in the results.

hasn't been my experience, I am able to search for information faster, speed to correct answer has increased on average, I feel, if we start providing examples your argument starts to fall fast. Think about the average Google Search, you really think it gets it wrong? Your search query is probably more obscure than mainstream web users.

I’d infinitely prefer a relevant wikipedia article to an AI “Answer” that is almost always wrong.

Google lens image search used to be amazing, I tried a repeat of a search I did before of a piece of art, it showed the same piece but confidently listed the artist and year wrong by about 300 years.

I’ve had relatives do “research” about things I mentioned I needed to do, and they’ve just sent screenshots of the incorrect AI answer.

It’s made google almost entirely useless, there is zero incentive for them to try to make search better (vs incentive to make it worse) and even if they did want to make it better the sheer volumes of slop have made that even harder.

We’ve completely sabotaged out ability to collate information at scale as a civilization, for the benefit of a few companies that were already the largest in the world to begin with. And it turns out, very few people notice or even care about this.


almost always wrong is just incorrect. Ask it who made Hackernews and it says Paul Graham with a informational paragraph it scraped from Wikipedia, without me clicking into Wikipedia. I can provide so many examples.

Factually and environmentally as well.

I guess all the publishers and advertisers are worried and complaining about nothing

I primarily use Gemini 3 Flash with a GUI coding agent I made by myself and its been able to successfully one-shot mostly any task I throw at it. Why would I ever use a more expensive reasoning and slower reasoning model? I am impressed with the library knowledge Gemini knows, I don't use any skills or MCP and its able to implement functions to perfection. No one crawls more data than Google and their model reflects that in my experience.

My experience with Antigravity was that 3 Pro can reason itself out of Gemini’s typical loops, but won’t actually achieve it (it gets stuck).

3 Flash usually doesn't get into any loops, but then again, it’s also not really following prompts properly. I’ve tried all manner of harnesses around what it shouldn’t do, but it often ignores some instructions. It also doesn’t follow design specs at all, it will output React code that is 70% like what it was asked to do.

My experience with Stitch is the same. Gemini has nice free-use tiers, but it wastes a lot of my time with reprompting it.


I don't use Stitch it doesn't have the context of my codebase, I just tell Gemini to make the UI directly and its able to do it. The only time it failed is when my prompt and goal was bad. I told it to swap expo-audio with react-native-track-player and it was able to do it in one-shot. Implement Revenue Cat and it did it in one shot. I do task by task like all the other agent tools recommended. The harness I made doesn't install packages, it just provides code. I don't use Anitgravity or any Electron-based coding agent, mine has a Rust core and different prompt engineering, not sure why it works so well but it does.

Demo: https://www.youtube.com/watch?v=jKMrvh56F0M Website: https://slidebits.com/isogen

I need to implement a better free trial plan, it's reached enough maturity where its my only and primary way I write code, I also use web chats to help me craft prompts. Reach out to test. https://slidebits.com/support


I'm curious, what's the agent like?

If I were to build something for Gemini models I'd plan around ingesting a bunch of context then oneshotting it.


you can run into payload too large errors, ingesting bunch of context, I use vercel's ai sdk so I can interchange between models but have 0 OpenAI and Claude credits or subscriptions. I use a combination of grepping files like a terminal tool and implemented a vector search database for fuzzy searches, Gemini chooses what tool it wants to use, I provide it create, read, update, delete, functions. There's a few tricks I do as well but if I tell you, you can probably prompt a clone . Sharing the full implementation is basically open sourcing the code.

Demo: https://www.youtube.com/watch?v=jKMrvh56F0M Website: https://slidebits.com/isogen

if you want to try it out let me know, I'll provide free access and a gemini test key

https://slidebits.com/support


AI web agent that lives in a wasm sandbox with predefined skills, it creates websites, mobile apps, powerpoint slides (.pptx files), word documents (.docx files), visualize csv data, and creates remotion videos, no external sandbox servers, all in wasm.

Submitted for the Gemini 3 Hackathon https://devpost.com/software/slidebits-betty


talk is even cheaper, still show me the code, people claim 10x productivity that translates to 10x of work done in a month, even with Opus 4.5 out since November 2025 I haven't seen signs of this. AI makes the level of complexity with modern systems bearable, it was getting pretty bad before and AI kinda saved us. A non-trivial React app is still a pain to write. Also creating a harness for a non-deterministic api that AI provides is also pain. At least we don't have to fight through typing errors or search through relevant examples before copying and pasting. AI is good at automating typing, the lack of reasoning and the knowledge cutoff still makes coding very tedious though.


Best example of this is Claude's own terminal program. Apparently renders react at 60fps and then translates it into ANSI chars that then diff the content of the terminal and do an overwrite...

All to basically mimic what curses can do very easily.


This is the fun part of whole AI-built things that a lot of people don’t want to accept - it really, really, really does not matter if the code “nice, maintainable and etc.”. Does it work? Is it somewhat extendable with AI? Are users generally happy and adoption rate is high? That’s it.


If you have high time preference yes. Over longer time horizons I think the issues with vibe coded software will reveal themselves in the same way badly written software does.


In this case, no, it does not work. CC is full of reported rendering issues, including some that affect me.


Remember 8 years ago how vibe coding was awful? Like i never used it in prod apps. Now i'm a daily CC user, and every few months models are getting better, some problems are being solved, the other ones still being worked on and so on.


Opus was a non-deterministic probability machine in the past, present and the foreseeable future. The variance eventually shows up when you push it hard.


I don't get this sentiment, regressions still exist, you can't just prompt them away and a programmer will spend 10x more time fixing regressions, bug fixing and improvements than scaffolding in most projects that people pay for. If most of your time at work is not doing this, then you are already living a simple life.


I found out that Claude Code and OpenCode doesn't do vector search and embeddings but relies on grep to power their agents. Most people are satisfied with this even though I think vector search approach is way better and saves on tokens.


I remember thinking the same thing and this articles goes over most of the arguments here - https://milvus.io/blog/why-im-against-claude-codes-grep-only...

When it came out and I think it was Boris from Anthropic that said they experimented a lot with Vector Search and grep just worked better.

You can try it out using the Claude-Context MCP - https://github.com/zilliztech/claude-context


I built my own ai coding agent and do vector search and embeddings locally

https://slidebits.com/isogen



Windows still have the gamers. A lot of anti-cheat system completely block out Linux users. The Year of the Linux Desktop will still be a meme at the end of this year as well.


Except ProtonDB website reports that completely blocked games make up 3% of the top 1000 Steam games.

Meanwhile, 84% is perfectly playable (some with minor tweaks).

https://www.protondb.com/


My guess is that this is going to shift rapidly for new games. Once the Steam PC launches, most new games will probably run fine on it. There's no logical reason for game studios to throw way significant market share over weird legacy crap related to "anti cheat". I expect the already significant amount of Linux using Steam users will grow to the point where game studios can no longer ignore it in terms of revenue and angry users and will actively test and ensure their games work flawlessly.

Of course one point here is that MS owns some of the more problematic game studios. Anti cheat here might be less about users cheating and more about them using this as a control point to ensure gamers keep on preferring Windows. It will be interesting to see how this plays out. I don't think MS has much of a moat left for gaming. And it will be tempting for them as well to tap into the few percent of Linux using Steam users for selling them games. They've long stopped insisting on windows for things like Office or SQL Server as well. The whole of Azure is pretty much Linux based at this point. So, they might dig in for a bit but they'll be under growing pressure to give in.


So you’re still rolling a 6 sided dice every time you try a new game as to whether it works at all, and half the time you need to tweak it still? That’s a reasonably large barrier to entry then. I have arch Linux but I still boot into windows to play games that are supposed to be supported because I got sick of playing through 20 min or so of a game for it to crash in a specific spot and I’d have to start over in windows if I couldn’t find a reliable solution. After that happens a few times in a few games, I gave up and now I just go to windows to play games every time so I stop running into issues.


I buy games for my steam deck, only stopping to read up if they're "unsupported". Surprisingly, they seem to often STILL work in that case, this happened with Ghosts of Tsushima, which was unsupported because their online play didn't work (good riddance lol).

Never had an issue with any game running through proton. Only issue was Stardew Valley that couldn't play online. Turns out the Linux version (was default) had an unfixed bug, and choosing the Windows version with proton "Just Worked". Hilariously, "Win32 is the most stable Linux ABI"...


Yeah I’m not surprised, the steam deck hardware was picked with the best compatibility in mind. I’m aware in my case I have non-ideal hardware for Linux, such as an nvidia card.


This is also one of the primary reasons I'll buy the Steam Machine. Sure I could build whatever myself, but I actually want proper hardware support. I also don't mind that the profit margin that I pay to make my life easier goes to Valve and their Linux Gaming pursuits.


Indeed, the steam machine is very tempting to me too


I saw someone make a good point about this the other day that that 3% of games represents a much larger percentage of the gamer population - Pareto distribution comes into play with popular games where a small number of games account for a larger share of gamers' attention.


Valorant, Fortnite, Call of Duty, Apex Legends, Destiny 2 and Rainbox Six Siege are not supported on Linux, how much of those 30 games of 1000 are some of the most played games? I already name 20% of them.


Correct, for top 10 Steam games it reports 50% perfect play.


To be honest, I've found ProtonDB to be way too optimistic when saying that games are "playable" (for example, a game running with no multiplayer still counted as "playable").


I spent so much time getting the mocks right with AI tests and the tests could not be one shotted or done by an inexperienced intern. Certainly don't have the budget to through Claude Opus on it, I'll give it some time though maybe things change.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: