Meh. He only had $23.50 in his account.
NoSpotOfGround · 4 Posts · 4 Comments
NoSpotOfGround@lemmy.world to Technology@lemmy.world • Blocking real-world ads: is the future here? • English · 11 · 14 days ago
I’m all for the “SLEEP 8 HOURS” bit though. I need more of that in my life.
NoSpotOfGround@lemmy.world to Technology@lemmy.world • AI search finds publishers starved of referral traffic • English · 7 · 20 days ago
You had one job.
NoSpotOfGround@lemmy.world to Technology@lemmy.world • Implementing a spellchecker on 64 kB of RAM back in the 1970s led to a compression algorithm that’s technically unbeaten and part of it is still in use today • English · 1 · 3 months ago
The real meat of the story is in the referenced blog post: https://blog.codingconfessions.com/p/how-unix-spell-ran-in-64kb-ram
TL;DR
If you’re short on time, here’s the key engineering story:
- McIlroy’s first innovation was a clever linguistics-based stemming algorithm that reduced the dictionary to just 25,000 words while improving accuracy.
- For fast lookups, he initially used a Bloom filter (perhaps one of its first production uses). Interestingly, Dennis Ritchie provided the implementation. They tuned it to have such a low false positive rate that they could skip actual dictionary lookups. (A generic sketch follows after this list.)
- When the dictionary grew to 30,000 words, the Bloom filter approach became impractical, leading to innovative hash compression techniques.
- They computed that 27-bit hash codes would keep collision probability acceptably low, but the codes still needed compression. (Both numbers are sanity-checked in a short calculation after this list.)
- McIlroy’s solution was to store differences between sorted hash codes, after discovering these differences followed a geometric distribution.
- Using Golomb’s code, a compression scheme designed for geometric distributions, he achieved 13.60 bits per word, remarkably close to the theoretical minimum of 13.57 bits. (A toy gap-plus-Golomb encoder is sketched after this list.)
- Finally, he partitioned the compressed data to speed up lookups, trading a small memory increase (final size ~14 bits per word) for significantly faster performance. (See the two-level lookup sketch at the end.)
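To make the Bloom filter step concrete, here is a minimal generic sketch in Python. It is not Ritchie’s implementation (that was a hand-tuned bit array with its own cheap hash functions); the hashing scheme and sizes below are arbitrary choices for illustration.

```python
import hashlib

class BloomFilter:
    """Minimal Bloom filter: k bit positions over an m-bit array.
    Lookups can return false positives, never false negatives."""

    def __init__(self, m_bits: int, k_hashes: int):
        self.m = m_bits
        self.k = k_hashes
        self.bits = bytearray((m_bits + 7) // 8)

    def _positions(self, word: str):
        # Derive k bit positions from one SHA-256 digest; purely illustrative,
        # nothing like the hand-rolled hashes a 1970s implementation would use.
        digest = hashlib.sha256(word.encode("utf-8")).digest()
        for i in range(self.k):
            yield int.from_bytes(digest[4 * i:4 * i + 4], "big") % self.m

    def add(self, word: str) -> None:
        for p in self._positions(word):
            self.bits[p // 8] |= 1 << (p % 8)

    def might_contain(self, word: str) -> bool:
        return all(self.bits[p // 8] & (1 << (p % 8))
                   for p in self._positions(word))


# Usage: sizes here are arbitrary, not the ones spell used.
bf = BloomFilter(m_bits=400_000, k_hashes=6)
for w in ("receive", "separate", "definitely"):
    bf.add(w)
print(bf.might_contain("receive"))   # True
print(bf.might_contain("recieve"))   # False (with high probability)
```

The trade-off is the whole point: answers can be falsely positive but never falsely negative, and the false-positive rate is set by the bit-array size and hash count, which is what they tuned low enough to skip the real dictionary lookup.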
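The 27-bit and 13.57-bit figures can be sanity-checked with quick arithmetic. Assuming roughly 30,000 words hashed uniformly into 2^27 values (my framing, not a quote from the post): a word that is not in the dictionary collides with some stored code with probability about n/2^27, and storing an n-element subset of a 2^27-element universe costs at least log2(C(2^27, n))/n ≈ log2(2^27/n) + log2(e) bits per word.

```python
import math

n = 30_000        # dictionary size from the summary above
b = 27            # hash width in bits
space = 2 ** b

# A word NOT in the dictionary hashes onto one of the n stored codes
# with probability roughly n / 2**27.
false_accept = n / space
print(f"false-accept probability ~ {false_accept:.2e} (about 1 in {round(space / n):,})")

# Information-theoretic floor for storing an n-element subset of a
# 2**27-element universe: log2(C(2**27, n)) / n ~= log2(2**27 / n) + log2(e).
floor_bits = math.log2(space / n) + math.log2(math.e)
print(f"theoretical minimum ~ {floor_bits:.2f} bits per word")   # ~13.57
```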
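For the gap-plus-Golomb bullets, here is a toy end-to-end version under the same assumptions: generate 30,000 uniform 27-bit codes, take differences of the sorted list (which come out approximately geometrically distributed), and Golomb-code the gaps with a divisor near the mean gap. The parameter choice and bit layout are illustrative, not McIlroy’s exact format, so it lands a bit above his 13.60 bits per word.

```python
import random

def golomb_encode(value: int, m: int) -> str:
    """Golomb code for a non-negative integer with divisor m (m >= 2),
    returned as a string of '0'/'1' characters for readability."""
    q, r = divmod(value, m)
    out = "1" * q + "0"                      # quotient in unary, 0-terminated
    b = (m - 1).bit_length()                 # ceil(log2(m))
    cutoff = (1 << b) - m
    if r < cutoff:                           # truncated binary remainder
        out += format(r, f"0{b - 1}b")
    else:
        out += format(r + cutoff, f"0{b}b")
    return out

def golomb_decode(bits: str, m: int, pos: int) -> tuple[int, int]:
    """Decode one value starting at bit index pos; return (value, next_pos)."""
    q = 0
    while bits[pos] == "1":
        q, pos = q + 1, pos + 1
    pos += 1                                 # skip the terminating '0'
    b = (m - 1).bit_length()
    cutoff = (1 << b) - m
    r = int(bits[pos:pos + b - 1], 2) if b > 1 else 0
    if b > 1 and r < cutoff:
        pos += b - 1
    else:
        r = int(bits[pos:pos + b], 2) - cutoff
        pos += b
    return q * m + r, pos

# Simulate ~30,000 uniform 27-bit codes, delta-encode the sorted list,
# and Golomb-code the gaps with a divisor near the mean gap.
random.seed(0)
n, space = 30_000, 1 << 27
codes = sorted(random.sample(range(space), n))
gaps = [codes[0]] + [hi - lo for lo, hi in zip(codes, codes[1:])]

m = space // n                               # ~4473, roughly the mean gap
stream = "".join(golomb_encode(g, m) for g in gaps)
print(f"{len(stream) / n:.2f} bits per word")    # lands in the ~13-14 bit range

# Round trip: rebuild the codes from the bit stream.
pos, total, decoded = 0, 0, []
for _ in range(n):
    g, pos = golomb_decode(stream, m, pos)
    total += g
    decoded.append(total)
assert decoded == codes
```

Delta-encoding first is the key move: the raw codes need 27 bits each, but the gaps between them are small, skewed, and near-geometric, which is exactly the distribution Golomb’s code is optimal for.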
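Finally, the partitioning step. A delta-compressed stream can only be decoded sequentially, so a common way to speed up lookups (and, as I read the post, roughly the shape of what spell did) is to split the stream into blocks and keep a small index of where each block starts; a lookup then decodes only one block. The toy below skips compression entirely and just shows the two-level lookup shape; the block size and index layout are arbitrary, not spell’s actual format.

```python
import bisect

def build_partitions(sorted_codes, block_size=64):
    """Split a sorted list of hash codes into fixed-size blocks plus an
    index of each block's first code. In the real program each block would
    hold Golomb-coded gaps; plain ints keep the sketch short."""
    blocks = [sorted_codes[i:i + block_size]
              for i in range(0, len(sorted_codes), block_size)]
    index = [blk[0] for blk in blocks]
    return index, blocks

def contains(index, blocks, code) -> bool:
    """Binary-search the small index to pick one block, then search only
    that block instead of walking the whole table."""
    i = bisect.bisect_right(index, code) - 1
    return i >= 0 and code in blocks[i]

# Usage with a few made-up 27-bit codes:
codes = [3, 901, 4_000_000, 77_777_777, 120_000_000, 134_000_000]
index, blocks = build_partitions(codes, block_size=2)
print(contains(index, blocks, 77_777_777))   # True
print(contains(index, blocks, 42))           # False
```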