Groups | Search | Server Info | Keyboard shortcuts | Login | Register [http] [https] [nntp] [nntps]
Groups > alt.comp.os.windows-10 > #182131
| Path | csiph.com!eternal-september.org!feeder3.eternal-september.org!news.eternal-september.org!eternal-september.org!.POSTED!not-for-mail |
|---|---|
| From | Paul <nospam@needed.invalid> |
| Newsgroups | alt.comp.os.windows-10, alt.comp.os.windows-11 |
| Subject | Re: Source Code: My Beginnings |
| Date | Sat, 8 Feb 2025 17:35:53 -0500 |
| Organization | A noiseless patient Spider |
| Lines | 50 |
| Message-ID | <vo8m8a$88ra$1@dont-email.me> (permalink) |
| References | <vo6i44$2gl41$1@paganini.bofh.team> <vo6ld8$3t8h1$1@dont-email.me> <vo6u8f$3ugea$1@dont-email.me> <vo7jjb$23gb$1@dont-email.me> |
| MIME-Version | 1.0 |
| Content-Type | text/plain; charset=windows-1252 |
| Content-Transfer-Encoding | 8bit |
| Injection-Date | Sat, 08 Feb 2025 23:35:54 +0100 (CET) |
| Injection-Info | dont-email.me; posting-host="d3aef225279ee0ead3cdcfd3f8799735"; logging-data="271210"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1/7V+TSogOjXDHJrIHrEOq7WrPsKfsbPNY=" |
| User-Agent | Ratcatcher/2.0.0.25 (Windows/20130802) |
| Cancel-Lock | sha1:kj36NHIb1J6Hnulra4MWpNNfDpY= |
| Content-Language | en-US |
| In-Reply-To | <vo7jjb$23gb$1@dont-email.me> |
| Xref | csiph.com alt.comp.os.windows-10:182131 alt.comp.os.windows-11:16911 |
Cross-posted to 2 groups.
Show key headers only | View raw
On Sat, 2/8/2025 7:45 AM, Newyana2 wrote:
> This is the first I've heard of "NPU". I'm surprised the author
> didn't explain the acronym, or TOPS.
The "Neural Processing Unit" does Fused Multiply Add (FMA) for matrix math.
Think of rows and columns of a matrix, being multiplied, then added.
If you can do 10^12 of those FMA per second, that is 1 TOPS.
Intel, in a first generation ("unusable") NPU, put *10000* FMA
on a silicon die. Some FPGAs intended for this sort of work
(inference), have *3000* DSP cores. And similar to the CUDA
cores on a video card, the DSP cores are programmable. The
RTX 5090, has around *26000* cores that can run shader programs.
The game is all about multitudes of fairly simple, reproducible blocks.
But there is also a degree of deception going on here, in that
to have real time performance ("an AI that talks to you"), that
requires hardware that at scale is 100x what is on your desktop.
What you get on the desktop, is something that might be able
to do a DNN or an OpenCL type calculation. These are older techniques
and standards. Maybe the first desktop offerings (like... Recall),
those won't be Large Language Models. They'll be more purpose built
neural networks or static calculation approaches. Maybe if there
is any language model at all, the OCR that Recall does, is
buttressed by grammer checks (it recognizes language better
than it recognizes single letters in isolation). Recall is
using a "very similar" OCR to what is in the W11 SnippingTool
right now (the Text Action button).
On a desktop system, even with some degree of acceleration,
there just isn't the memory bandwidth or facilities for
real time LLM AI answers. You'll be waiting some number of
seconds, before your AI says "Hello! Maya". If you distill the
models down, and use "INT1" math to do the calc, it just
makes the model dumber and more mistake prone. Note that
this is not the counting the letter "R" in strawberry problem,
which is an architectural/approach issue.
I can see your home AI computer, being air gapped and
dedicated to doing Wikipedia-type things. "How do you
mix green paint?" "why don't you drive to the Home
Depot and ask that guy in the paint section?" That's
the kind of help I expect.
Paul
Back to alt.comp.os.windows-10 | Previous | Next — Previous in thread | Next in thread | Find similar | Unroll thread
Source Code: My Beginnings Book Review <invalid@invalid.com> - 2025-02-08 03:00 +0000
Re: Source Code: My Beginnings Newyana2 <newyana@invalid.nospam> - 2025-02-07 23:10 -0500
Re: Source Code: My Beginnings Paul <nospam@needed.invalid> - 2025-02-08 01:40 -0500
Re: Source Code: My Beginnings Newyana2 <newyana@invalid.nospam> - 2025-02-08 07:45 -0500
Re: Source Code: My Beginnings Paul <nospam@needed.invalid> - 2025-02-08 17:35 -0500
Re: Source Code: My Beginnings Newyana2 <newyana@invalid.nospam> - 2025-02-08 19:29 -0500
Re: Source Code: My Beginnings Paul <nospam@needed.invalid> - 2025-02-08 23:39 -0500
Re: Source Code: My Beginnings Stan Brown <the_stan_brown@fastmail.fm> - 2025-02-08 05:03 -0800
Re: Source Code: My Beginnings Newyana2 <newyana@invalid.nospam> - 2025-02-08 10:24 -0500
Re: Source Code: My Beginnings sticks <wolverine01@charter.net> - 2025-02-09 17:17 -0600
Re: Source Code: My Beginnings Daniel70 <daniel47@eternal-september.org> - 2025-02-15 21:30 +1100
Re: Source Code: My Beginnings Chris <ithinkiam@gmail.com> - 2025-02-15 11:10 +0000
Re: Source Code: My Beginnings Anton Shepelev <anton.txt@gmail.moc> - 2025-02-08 19:32 +0300
Re: Source Code: My Beginnings MikeS <mikes@is.invalid> - 2025-02-10 21:15 +0000
csiph-web