More

nabla9 · 2025-12-19T20:44:40 1766177080

You saw name "Noam Chomsky" and that started a process in your mind that generated the standard spiel about Syntactic Structures.

Chomsky Hierarchy is his more fundamental work that joins computer science and linguistics. It was published in IRE Transactions on Information Theory. Chomsky, Noam (1956). "Three models for the description of language" https://chomsky.info/wp-content/uploads/195609-.pdf

Type-3 grammar ≡ finite-state automaton

Type-2 grammar ≡ Non-deterministic pushdown automaton

Type-1 grammar ≡ Linear-bounded non-deterministic Turing machine

Type-0 grammar ≡ Turing machine

ps. Chomsky was already aware of Finite State Automata and Turing Machines and understood that they match Type-3 and Type-0. Pushdown Automata was invented later, and connection between Type-1 grammar and Linear Bounded Automata was made few years later.

PaulHoule · 2025-12-20T16:38:45 1766248725

An LLM is not type 0. It always finishes in finite time so it is not Turing complete.

I asked Copilot

   answer yes or no,  is the following Java method syntactically well formed
   
   static int aMethod() { return "5"; }

and got what I thought was the wrong answer

   No.
   It’s syntactically valid as Java code, but it will not compile because it returns a String where 
   an int is required.

because I hadn't specified clearly that I was talking about the type 2 CFG of the parser as opposed to the type 1 behavior of the compiler as a whole. [1] I had a good conversation with copilot about it and I'm sure I'd get better results with a better prompt... It would make a good arXiv paper to pose an LLM grammatical recognition problems by prompting

   here is a set of rules: ...  is the production ... in the grammar?

with a wide range of cases. Somebody who just tries a few examples might be impressed by its capability but if you were rigorous about it you would conclude that an LLM pretends to be able to recognize grammars but can't actually do it.

And that's true about everything they do, one could argue that "in an exact sense, LLM can't do anything at all") They'll make a try at a weird question like

   what percent of North Americans would recognize the kitsune hand gesture?

which is a public opinion research question similar in character to

   what is lowest mass eigenstate of the neutrino?

in that it could be answered rigorously (but still in terms of probability, even hep results have p-values)

[1] javac implements type 1 behavior in the java language which is a substrate

xg15 · 2025-12-19T21:00:52 1766178052

I think it could be useful to combine the two paradigms to maybe get a better understanding of what transformers can and cannot learn.

E.g. would it be possible to create an algorithm that takes a grammar (and maybe a desired context window size) as input and constructs a transformer network that generates sentences exactly from that grammar?

("Construct" meaning directly setting the weights, without any iterative training process)

nabla9 · 2025-12-19T21:08:41 1766178521

They are combined. Chomsky Hierarchies are the core of modern Computer science because they map perfectly into automata theory. They are always taught together in computer science.

>E.g. would it be possible to create an algorithm that takes a grammar (and maybe a desired context window size) as input and constructs a transformer network that generates sentences exactly from that grammar?

You don't need transformers for what you describe. That's 101 theory of computation class where you learn about automata, grammars, parsers, and generators.

xg15 · 2025-12-19T21:18:06 1766179086

Yeah, I know the theory of formal grammars and automata that the Chomsky hierarchy is part of. What I meant is that language models and specifically transformer networks are usually entirely separate from that theory, so it would be useful to build a bridge between "modern" language processing using GPTs/LLMs and the classical formal theory.

The most obvious overlap in usage is with programming languages: LLMs can parse and generate code in formal languages, but their processing model is completely different from syntax trees and parsers. So the question is, how do they store the formal structure of a programming language and could this be mapped back in any way to a grammar or automaton?

PaulHoule · 2025-12-20T01:05:29 1766192729

The way I see it is that attention is graph structured -- this token here is connected to that token there and so forth by the attention lighting up or also in the sense that there are a bunch of places in the document where people are talking about "Noam" or "Chomsky" or "Noam Chomsky" or "The Author" or "him", etc.

Alternately if you were looking it from a semantic web perspective the knowledge expressed in a document is a graph and that graph structure is more fundamental than the tree structure of a text because you could express the same knowledge in different orders. Serialization fundamentally requires putting things in some specific order which might specifically be chronological (work from 2005 to the present as a play with many acts) or could be organized around some conceptual hierarchy (kitsune legends, self psychology, character acting, animal and human behavior and physiology, ...) or the minimization or elimination of backward references (whatever it is that the C spec does a touch wrong but post-Common Lisp specs do right) , etc. Ultimately the graph is pruned away into a tree where the remaining links are denoted by syntactic features in the local scale of the document, you're kinda left filling in the rest of the links with some combination of pragmatics, logical inference, something like SAT solving, etc,

A conventional parsing point of view sees a Java program as a tree but for ordinary purposes it does not matter what order you put the fields and methods in and even though procedural programs are allegedly a sequence of operations done in a certain order it is frequently the case that it does not matter at all if you run line 71 or line 75 first so it is often the case that the graph is the real thing and the trees that we're so comfortable with are the shadows on the walls of the cave.

yongjik · 2025-12-19T22:37:56 1766183876

Chomsky Hierarchy pertains to "languages" defined in the theory of computation: i.e., it is a subset of the set of all finite sequence of alphabets (for some fixed notion of "alphabets"). If a sentence (a particular finite sequence of alphabets) is in the subset, then it is a "valid" sentence of the language. Otherwise it is invalid.

It should be already clear from this that this notion of language is rather different from natural languages. For example, if there is a formal language that contains "Good morning" and "My hovercraft is full of eels" as valid sentences, then nothing distinguishes these sentences any more. (Of course you could add annotations and build semantic values but they are not essential to the discussion of formal languages.)

It gets a bit more ridiculous when you try to connect LLMs to the Chomsky hierarchy. Modern LLMs do not really operate on the principle of "is this a valid sentence?" yet provide vastly superior results when it comes to generating naturally sounding sentences.

I think LLMs have put an end to any hope that formal language theory (in the style of Chomsky Hierarchy) will be relevant to understanding human languages.

littlestymaar · 2025-12-20T06:19:50 1766211590

> For example, if there is a formal language that contains "Good morning" and "My hovercraft is full of eels" as valid sentences, then nothing distinguishes these sentences any more.

Mind explaining a bit? Because I've no idea what you mean.

Legend2440 · 2025-12-19T21:15:45 1766178945

The trouble is that english doesn’t fit neatly into any of these categories. It has features that make it at least a context-free language, but can’t handle other features of context-free languages like unlimited nesting.

Ultimately these are categories of formal languages, and natural language is an entirely different kind of thing.

nabla9 · 2025-12-19T21:42:14 1766180534

Strictly speaking natural languages fit into Context-Sensitive (Type-1) in Chomsky Hierarchy, but that's too broad to be useful.

In practice they are classified into MCSL (Mildly Context-Sensitive) subcategory defined by Aravind K. Joshi.

krnlclnl · 2025-12-19T22:09:52 1766182192

Sure, if you accept and agree with Joshi.

No reason to do that though, except to validate some random persons perspective on language. The sky will not open and smash us with a giant foot if we reject such an obligation.

nabla9 · 2025-12-20T08:48:37 1766220517

Natural languages being in MCSL (Mildly Context-Sensitive) is the consensus among linguistics, not some random individual's viewpoint.

suddenlybananas · 2025-12-20T11:20:31 1766229631

You can also reject quantum physics and the sky will not open and smash us with a giant foot. However, to do so without serious knowledge of physics would be quite dumb.

nabla9 · 2025-12-19T20:33:52 1766176432

> Snowden revealed PRISM meant the US government just had straight access

People read this and think that US government had unhindered access to all data in major providers.

According to Edward Snowden, PRISM allowed the government to compel internet companies to turn over any data that matched specific court-approved search terms. such as email addresses, all under Section 702 of the FISA Amendments Act of 2008.

At least some parts of it were likely unconstitutional as it could target U.S. persons, but it was not free for all as "straight access" indicates. It was straight access after FISA court approval.

NSA runs much more invasive MUSCULAR program in the UK without FISA or other type warrant.

bogantech · 2025-12-19T22:05:51 1766181951

They were tapping fiber links between datacenters

mmh0000 · 2025-12-19T22:22:39 1766182959

And, they were directly installing compromised hardware in datacenters [1]

[1] https://arstechnica.com/tech-policy/2014/05/photos-of-an-nsa...

nabla9 · 2025-12-18T20:01:06 1766088066

Crypto is a zero sum game. "The Public" can't be making millions on aggregate.

Probably something like 90+% lose over longer term. And you make nothing until you sell.

burnerRhodov2 · 2025-12-18T20:05:13 1766088313

Explain to me how crypto is zero sum? It can be infinity rehypothecatated..

D13Fd · 2025-12-18T21:24:33 1766093073

Crypto is a box where a bunch of people put money in and get exactly the same amount out (just distributed differently).

jopython · 2025-12-19T12:48:59 1766148539

By that logic even the Stock market is a zero sum game.

D13Fd · 2025-12-19T15:36:43 1766158603

That's not true. The money you use to buy stocks gets you an ownership interest in a company that creates value. The money you put into crypto gets you a line on a distributed spreadsheet.

The money that comes out of your ownership share is tied to the success of the company, through dividends and buybacks.

stevenjgarner · 2025-12-18T20:12:44 1766088764

That depends if you are using crypto as 1) a store of value; 2) a medium of exchange; or 3) an alternative to permission-based monetary policy. All of it depends on the jurisdiction of the fiat-to-crypto and/or crypto-to-fiat transaction.

nabla9 · 2025-12-16T09:20:01 1765876801

Generating code has never been very valuable.

When you are building a reasonable software project (10k to 300k LOC), the actual generation of code is not a significant cost. If code generation were the true bottleneck, a 300k LOC project worth $50 million would take no more than 1–3 work-years to produce, even without AI. You could hire quality code monkey from India for $15 to $40/hour and get it done with less than $250,000.

The cost of management does not approach zero when you have Microsoft Exel.

nabla9 · 2025-12-16T08:05:45 1765872345

Do most people read the article and think "F-4 couldn't see the F-22 simply because of stealth, even at close range."

In reality, these fighters have forward-looking radars. An F-4 could do the same to a lone F-22. The real issue here is the F-22's stealth against Iranian ground radars. For whatever reason, they did not warn the F-4.

nabla9 · 2025-12-15T16:11:57 1765815117

As the top comment in MR points out this is not that impressive.

> 12 hour ahead prediction is .0581.

The "ahead" is Before Markets Resolve. Brier score 0.0581 12h before markets resolve is usually not impressive score and information gained has usually little value.

Another issue is the difficulty of questions. You can get arbitrarily low Brier score with easy questions.

Low Brier score when the information is not already there and it has value would be impressive. Accurate Brier score when things are often already settled and information has little value is not interesting.

nabla9 · 2025-12-14T12:10:59 1765714259

You don't get any important information from this single automated transaction.

Look at the Nvidia executives disposition/acquisition schedule. Here is Shoquist https://www.nasdaq.com/market-activity/insiders/shoquist-deb... Disposition means the selling or otherwise getting rid of a security. Acquisition means the buying or otherwise gaining ownership of a security.

All Nvidia executives sell Nvidia stock constantly year after year. They get more stock with their options. Almost all their worth is in NVDA and they want to spend and diversify.

Changes whatever narrative you made your mind does it?

nabla9 · 2025-12-13T15:05:01 1765638301

Agreed.

If the only way to get your digital property back is a public plea to your Lord, that's called feudalism. Everyone should be treated fairly, not only those who can get their public pleas heard.

novoreorx · 2025-12-13T15:32:27 1765639947

You just made it clear to me why I felt not resonated and a bit uncomfortable reading that article, despite I thought I should be. Because what I want to see is something straight like "fuck you Apple", not a begging and emphasis on how much the author has contributed to the megacorp.

nabla9 · 2025-12-13T17:05:47 1765645547

"fuck you Apple" is not a correct response either. Bad Apple, good Apple, is just more of the same. Asking Lords to be benevolent is not what we should want.

Just like landlord can't just lock you out of your house, with all your property inside, but has to go trough legal process, we need to have legislation and regulation for the same with digital property.

markus_zhang · 2025-12-13T16:54:28 1765644868

Feudalism never left. The only change is that the majority of the serfs don’t work on land anymore, and we have the freedoms o switch lords easily.

nabla9 · 2025-12-11T21:21:59 1765488119

The difference between Oracle and Microsoft, Meta, Google, Amazon, Nvidia is that Oracle takes debt to finance its AI investments.

If things go south, Oracle is left naked.

nabla9 · 2025-12-11T11:41:57 1765453317

I think we all can agree that Comic Sans MS reflects the current US government best, both spiritually and aesthetically.

ndkap · 2025-12-11T15:47:14 1765468034

As an aside, I didn't know what Comic Sans looks like, so I searched on Google and it rendered the whole page in that font. I tried with other Fonts too like Arial and Times New Roman, and it did the same there. So cool!

LucasFonts · 2025-12-12T14:49:45 1765550985

If you search for Lucas de Groot (the designer of Calibri) you will get the results displayed in Calibri.

rbanffy · 2025-12-11T20:27:48 1765484868

Sadly, it doesn't work with the coolest niche fonts... https://www.google.com/search?q=ibm+3270

tracker1 · 2025-12-11T17:26:05 1765473965

Very cool... but I can't seem to get it to do so for other fonts I can think of off the top of my head... Inconsolata, Consolas, Fira Code, etc. "Times New Roman" does work as well.

Would be cool to see google support this for at least all the fonts in Google Fonts' library, since they're already well supported web fonts.

lippihom · 2025-12-12T00:16:20 1765498580

This was super cool - nice little Google easter egg.

adolph · 2025-12-11T14:45:15 1765464315

Your comment may be in jest but there is some evidence that "easier to read" does not benefit "retain what was read."

  And that brings us back to these ugly fonts. Because their shapes are 
  unfamiliar, because they are less legible, they make the mind work a little 
  harder; the slight frisson of Comic Sans wakes us up or at least prevents us 
  from leaning on the usual efficiencies. “The complex fonts . . . function 
  like an alarm,” Alter writes. They signal “that we need to recruit additional 
  mental resources to overcome that sense of difficulty.”

https://lithub.com/the-ugliness-of-comic-sans-has-a-practica...

nimbius · 2025-12-11T14:33:22 1765463602

i tend to find the kerning issues noted by the calibri team are moot. most Times New Roman is perfectly legible with careful observation and maybe a fresh cup of covfefe.

lo_zamoyski · 2025-12-11T12:10:50 1765455050

I would say it’s worse than that. Read Plato’s “Republic” and you may come to appreciate a much more expansive appropriateness of Comic Sans, beyond just the current administration.

hilbert42 · 2025-12-11T16:47:07 1765471627

I have, many times, hence my earlier comment.

If Rubio read Republic then he's just demonstrated that he'd not have understood it.

butchcassidi · 2025-12-11T12:08:35 1765454915

I would rather see Wingdings.

VikingCoder · 2025-12-11T13:39:55 1765460395

·puᴉɯ oʇ ǝɯoɔ ʇɐɥʇ sʇuoɟ ɹǝɥʇo ǝɹɐ ǝɹǝɥꓕ

lenerdenator · 2025-12-11T14:09:44 1765462184

That's the official font of the Australian government.

ptdorf · 2025-12-11T16:09:51 1765469391

You meant: Austria. The lang of Kangaroos.

jpster · 2025-12-11T22:43:46 1765493026

I beg to differ. Wingdings is more like it.

amypetrik8 · 2025-12-11T17:17:23 1765473443

[flagged]

themadturk · 2025-12-11T23:43:58 1765496638

Maybe when (if?) the Democrats take back the House and Senate in 2026. Right now Congress is solidly right-wing and sees no reason to impeach...nor would a conviction ever happen, even if the trial was held.

dragonwriter · 2025-12-11T23:54:31 1765497271

> Honestly when are we going to impeach Trump, he's basically the same Hitler.

When did Germany impeach Hitler?

Also, Donald Trump has already been impeached as many times all other Presidents combined.