More

lccerina · 2025-12-03T13:11:50 1764767510

Most likely, and probably inferring the structure on texts with "similar" writing forms. Tried with my handwriting (in italian) and the performance wasn't that stellar. More annoyingly, it is still a LLM and not a "pure" OCR, so some sentences were partially rephrased with different words than the one in the text. This is crucially problematic if they would be used to transcribe historical documents

embedding-shape · 2025-12-03T13:15:38 1764767738

> Tried with my handwriting (in italian) and the performance wasn't that stellar.

Same here, for diaries/journals written in mixed Swedish/English/Spanish and with absolutely terrible hand-writing.

I'd love for the day where the writing is on the wall for handwriting recognition, which is something I bet on when I started with my journals, but seems that day has yet to come. I'm eager to get there though so I can archive all of it!

pbronez · 2025-12-03T16:43:41 1764780221

"it is still a LLM and not a "pure" OCR"

When does a character model become a language model?

If you're looking at block text with no connections between letter forms, each character mostly stands on its own. Except capital letters are much more likely at the beginning of a word or sentence than elsewhere, so you probably get a performance boost if you incorporate that.

Now we're considering two-character chunks. Cursive script connects the letterforms, and the connection changes based on both the source and target. We can definitely get a performance boost from looking at those.

Hmm you know these two-letter groupings aren't random. "ng" is much more likely if we just saw an "i". Maybe we need to take that into account.

Hmm actually whole words are related to each other! I can make a pretty good guess at what word that four-letter-wide smudge is if I can figure out the word before and after...

and now it's an LLM.

butlike · 2025-12-03T14:37:41 1764772661

So it doesn't work is what you're saying, right?

GaggiX · 2025-12-03T13:27:13 1764768433

Are you sure to have used the Gemini 3.0 pro model? Maybe try increasing the media resolution on the AI studio if the text is small

lccerina · 2025-12-03T13:01:49 1764766909

I call out the Lindy effect. Handwriting survived printed characters, typewriters, and the last 50-70 years of computers and keyboards, it will survive this too.

lccerina · 2025-11-03T09:41:08 1762162868

Utter disrespect for using the term "biology" relating to LLM. No one would call the analysis of a mechanical engine "car biology". It's an artificial system, call it system analysis.

lewtun · 2025-11-03T11:07:14 1762168034

The analogy stems from the notion that neural nets are "grown" rather than "engineered". Chris Olah has an old, but good post with some specific examples: https://colah.github.io/notes/bio-analogies/

UltraSane · 2025-11-03T20:50:04 1762203004

It makes sense if you define "biology" as "incredibly complicated system not designed by humans that we kind of poke at to try to understand it."

lccerina · 2025-11-04T09:03:00 1762246980

"not designed by humans"? Since when? Unless you count cortical organoids /wetware (grown in some instrumented petri dish) every artificial neural network, doesn't matter how complicated, it is designed by humans. With equations and rules designed by humans. Backpropagation, optimization algorithms, genetic selections etc... all designed by humans.

There is no biology here, and there are so many other words that describe perfectly what they are doing here, without twisting the meaning of another word.

UltraSane · 2025-11-04T18:23:27 1762280607

By not designed I'm talking about the synaptic weights

lccerina · 2025-11-05T09:06:11 1762333571

Still designed by humans. The loss function, backpropagation and all other mechanisms didn't just appear magically in the neural network. Someone decided which loss function to use, which architecture or which optimization techniques. Only because it takes a big GPU a lot of number crunching to assign those weights, it doesn't mean it's biological.

In the same way, a weather forecast model using a lot of complicated differential equations is not biological. A finite element model analyzing some complicated electromagnetic field, or the aerodynamics of a car is not biological. Just because someone around 70-75 years ago called them 'perceptrons' or 'neurons' instead of thingamajigs does not make them biology.

UltraSane · 2025-11-05T10:02:03 1762336923

"Still designed by humans." No they are not. They are learned via backpropagation. This is the entire reason why neural networks work so well and why we have no idea how they work when they get big.

lccerina · 2025-11-07T14:15:52 1762524952

And who designed backpropagation? It is not a magical property of artificial neurons or some law of nature or god's miracle. A bunch of mathematicians banged their head on the problem of backpropagation, tossed it to a computer, and voilà , neural networks made sense. Neural networks work so well because someone chooses the right loss function for the right problem. Wrong loss function -> wrong results. It's not magic. Nor it's biology.

addaon · 2025-11-03T23:45:41 1762213541

Sure, but it makes no sense at all if you define biology as “the smell of a freshly opened can of tennis balls.” The original comment is probably better understood using a standard definition of the words it used, rather than either of our definitions.

lccerina · 2025-10-29T09:05:33 1761728733

I have a framework: don't use it, if you never used it don't start using it, public shame people, stop talking about it. Slow down. Think long and deep about your problems. Write less code.

There is NOTHING inevitable about this stuff.

SideburnsOfDoom · 2025-10-29T09:54:15 1761731655

Indeed. "No." is perfectly clear.

lccerina · 2025-10-27T09:23:23 1761557003

"This will be a big business" No. It shouldn't be a "business", it should be laws that are enforced fast, education, public shaming of companies putting poison in their products. Volatile Organic compounds in paint were known to be poisonous since 17th century (see Bernardino Ramazzini's works). Just listen to the goddamn scientists for once. You can't solve a problem caused by capitalism corner cutting with more capitalism.

lccerina · 2025-10-27T09:19:00 1761556740

Damage from lead. The 'obsession' here is that the right level of lead in any product should be ZERO. There should be international pacts, like what was done for gases destroying the ozone layer, to remove lead entirely from products

lccerina · 2025-07-21T14:01:54 1753106514

The growing product is something that bolts in the worst of current internet (affiliation links). Hopefully it will fail too

moondowner · 2025-07-21T14:10:53 1753107053

Affiliate links have been here for three decades; (AutoWeb.com doing it since 1995)

olyjohn · 2025-07-21T14:51:38 1753109498

That doesn't mean anything. It's still a huge reason for all the junk and bullshit that's ruined the internet.

lccerina · 2025-02-10T10:56:13 1739184973

The error was to buy a second one after "the first one was just poor manufacturing". I never saw manufacturing quality improve over time from car companies.

rs186 · 2025-02-10T12:34:06 1739190846

This.

After my Nissan car started to have transmission problems that would cost thousands of dollars to fix (among various other small issues), I sold it as quickly as possibly and swore I'll touch the make again.

Loughla · 2025-02-10T12:56:31 1739192191

Subaru burned me on this. I bought my wife an outback. It started to have transmission issues with a full transmission failure at about 145k miles. This is after a life of small problems here and there that didn't really impact performance.

It was a known issue between 125 and 150k miles. Subaru's solution was to extend the warranty to 100k, as if that did anything at all.

We got rid of the broken one, and the one that I drove as well. I'll never go back. I loved those cars, but that's so shady.

aembleton · 2025-02-10T12:37:06 1739191026

Hyundai manufacturing quality has improved over time.

toomuchtodo · 2025-02-10T23:56:12 1739231772

The trick is to buy a used Tesla you’ve had inspected and taken for a test drive. This, mostly, avoids lemons and chronic issues.

lccerina · 2025-02-06T09:35:31 1738834531

Sumatra has also a portable version. Doesn't that work for you?

pletnes · 2025-02-06T11:25:20 1738841120

If IT finds an exe file they go bonkers. If they find a PDF, who will even care?

extraduder_ire · 2025-02-06T14:37:22 1738852642

Depends on how long they've been in IT. A good while back, exploits in adobe reader were so common that pdf files were a common malware vector.

TeMPOraL · 2025-02-06T11:27:35 1738841255

Pray Microsoft Defender will be nice enough to not look at the PDF too closely.

pletnes · 2025-02-06T13:08:53 1738847333

It can use the pdf reader in the pdf to look closely if it wants to

lccerina · 2025-02-05T15:16:07 1738768567

There are distros slightly optimized for games (e.g. Garuda is based on Arch) and the support from Valve and Proton is quite good at the moment. Problems appear only for games launcher that insist on being Windows-only (anything from EA and Ubisoft, Epic games launcher etc...)