It's not that they're hard to understand, it's that they're much denser. From Factor's examples page:
> 2 3 + 4 * .
There's a lot more there to mentally parse than:
> (2 + 3) * 4
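For what it's worth, the postfix form becomes much less dense once you annotate each word with the stack state after it executes (a trace of the same Factor/Forth expression):

```forth
2   \ stack: 2
3   \ stack: 2 3
+   \ stack: 5        ( 2 + 3 )
4   \ stack: 5 4
*   \ stack: 20       ( 5 * 4 )
.   \ prints 20, leaving the stack empty
```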
It's the same as when Rob Pike decries syntax highlighting. No, it's very useful to me; I can read much more quickly with it.
It's the same principle behind how we use heuristics to read words much more quickly, by simply looking at the beginnings and ends of each word, and how most of the time we don't even notice typos.
Well, I guess it might boil down to how one "thinks"?
Some people prefer:
2 3 + 4 *
Some other people prefer:
(* 4 (+ 2 3))
And some other people prefer:
(2 + 3) * 4
I personally find the last one easier to read or understand, but I have had my fair share of Common Lisp and Factor. :D
Syntax highlighting is useful for many people, including me. I can read much quicker with it, too. I know of some people who write Common Lisp without syntax highlighting though. :)
Forth could be written devilishly where you have this
2 .... hundreds of words .... +
where the operands of + are 2 and the result produced by the hundreds of words!
Which could also be:
.... hundreds of words .... 2 +
which would be a lot easier to read!
If you're writing Forth, it likely behooves you to adhere to the latter style of chaining, where you take everything computed thus far and apply a small operation to it with a simple operand. Not sure if it's always possible, though.
Yes, this is why you are supposed to have short words. You should factor out the complex parts into short, self-contained, and descriptively named words, which is going to make your code much easier to read, test, and maintain.
For example:
Instead of:
a b + c d + * e f + g h + * /
You should probably have:
: compute-numerator a b + c d + * ;
: compute-denominator e f + g h + * ;
: compute-ratio compute-numerator compute-denominator / ;
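As a concrete sanity check (with made-up values for a through h, defined as constants purely for illustration):

```forth
5 CONSTANT a   7 CONSTANT b   3 CONSTANT c   4 CONSTANT d
1 CONSTANT e   2 CONSTANT f   2 CONSTANT g   2 CONSTANT h
: compute-numerator   a b + c d + * ;   \ (5+7) * (3+4) = 84
: compute-denominator e f + g h + * ;   \ (1+2) * (2+2) = 12
: compute-ratio compute-numerator compute-denominator / ;
compute-ratio .   \ prints 7
```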
Most (if not all) Forth books mention this as well.
Don't you now have actions in the middle of the computation that are putting names into a global dictionary? I'd at least give them names like tmp-numerator to put them into a namespace of local/temporary functions, and then "forget" them immediately after the computation that references them.
What's the compiled version of : compute-numerator a b + c d + * ; look like? I imagine at the very least that there has to be a call to some run-time support routine to insert a compiled thunk under a name into the dictionary.
Yes, defining words like "compute-numerator" does add entries to the dictionary, but that happens entirely at compile time. Forth doesn't insert a "compiled thunk" at runtime; the word is compiled as a name bound to a sequence of code field addresses (CFAs). When you invoke "compute-numerator" at runtime, the inner interpreter simply threads through those CFAs. There's no indirection beyond that, no JIT, and no dynamic thunk creation; there's no runtime cost for defining the word itself. The only runtime effect is the word being executed when called. All linking is resolved at compile time.
If you're concerned about polluting the global dictionary, a common idiom is (which you already know):
\ Define and forget immediately if temporary
: tmp-numerator a b + c d + * ;
tmp-numerator
FORGET tmp-numerator
Alternatively, you can isolate temporary definitions in a separate vocabulary:
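For example (Forth-83 style search-order words; the exact names vary between systems, and ANS Forth systems may spell this with WORDLIST and SET-CURRENT instead):

```forth
VOCABULARY scratch            \ a separate vocabulary for throwaway words
ALSO scratch DEFINITIONS      \ new definitions now go into scratch
: tmp-numerator a b + c d + * ;
PREVIOUS DEFINITIONS          \ restore the previous search order
```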
TL;DR: Defining intermediate words adds entries to the dictionary, but this happens at compile time, not runtime, so there's no additional runtime overhead. Naming conventions, FORGET, or vocabularies can mitigate dictionary clutter, but factoring remains the standard idiom in Forth.
Note: In some native-code-compiling or JIT-based Forth implementations, definitions may generate machine code or runtime objects rather than the simple CFA chains I mentioned, but even in these cases, compilation occurs before runtime execution, and no dynamic thunk insertion happens during word calls.
I hope I understood your comment correctly. Please let me know!