Rocksolid Light

Welcome to Rocksolid Light

mail  files  register  newsreader  groups  login

Message-ID:  

Let's call it an accidental feature. -- Larry Wall


devel / comp.arch / Re: More of my philosophy about CISC and RISC instructions..

SubjectAuthor
* More of my philosophy about CISC and RISC instructions..Amine Moulay Ramdane
+* Re: More of my philosophy about CISC and RISC instructions..MitchAlsup
|`* Re: More of my philosophy about CISC and RISC instructions..pec...@gmail.com
| +* Re: More of my philosophy about CISC and RISC instructions..MitchAlsup
| |+* Re: More of my philosophy about CISC and RISC instructions..pec...@gmail.com
| ||+* Re: More of my philosophy about CISC and RISC instructions..MitchAlsup
| |||`* Re: More of my philosophy about CISC and RISC instructions..Scott Lurndal
| ||| `* Re: More of my philosophy about CISC and RISC instructions..BGB
| |||  `* Re: More of my philosophy about CISC and RISC instructions..MitchAlsup
| |||   `* Re: More of my philosophy about CISC and RISC instructions..BGB
| |||    +* Re: More of my philosophy about CISC and RISC instructions..MitchAlsup
| |||    |`- Re: More of my philosophy about CISC and RISC instructions..BGB
| |||    `* Re: More of my philosophy about CISC and RISC instructions..Terje Mathisen
| |||     +* Re: More of my philosophy about CISC and RISC instructions..BGB
| |||     |`* Re: More of my philosophy about CISC and RISC instructions..MitchAlsup
| |||     | `- Re: More of my philosophy about CISC and RISC instructions..BGB
| |||     `* Re: More of my philosophy about CISC and RISC instructions..MitchAlsup
| |||      +- Re: More of my philosophy about CISC and RISC instructions..BGB
| |||      `- Re: More of my philosophy about CISC and RISC instructions..Terje Mathisen
| ||`* Re: More of my philosophy about CISC and RISC instructions..BGB
| || +* Re: More of my philosophy about CISC and RISC instructions..pec...@gmail.com
| || |+- Re: More of my philosophy about CISC and RISC instructions..MitchAlsup
| || |+* Re: More of my philosophy about CISC and RISC instructions..BGB
| || ||`* Re: More of my philosophy about CISC and RISC instructions..MitchAlsup
| || || +* Re: More of my philosophy about CISC and RISC instructions..JimBrakefield
| || || |`* Re: More of my philosophy about CISC and RISC instructions..MitchAlsup
| || || | +- Re: More of my philosophy about CISC and RISC instructions..JimBrakefield
| || || | `* Re: More of my philosophy about CISC and RISC instructions..Scott Lurndal
| || || |  `* Re: More of my philosophy about CISC and RISC instructions..BGB
| || || |   `* Re: More of my philosophy about CISC and RISC instructions..MitchAlsup
| || || |    +* Re: More of my philosophy about CISC and RISC instructions..BGB
| || || |    |`* Re: More of my philosophy about CISC and RISC instructions..MitchAlsup
| || || |    | `* Re: More of my philosophy about CISC and RISC instructions..BGB
| || || |    |  `* Re: More of my philosophy about CISC and RISC instructions..MitchAlsup
| || || |    |   `* Re: More of my philosophy about CISC and RISC instructions..BGB
| || || |    |    `* Re: More of my philosophy about CISC and RISC instructions..MitchAlsup
| || || |    |     +* Re: More of my philosophy about CISC and RISC instructions..EricP
| || || |    |     |+- Re: More of my philosophy about CISC and RISC instructions..BGB
| || || |    |     |`- Re: More of my philosophy about CISC and RISC instructions..MitchAlsup
| || || |    |     `- Re: More of my philosophy about CISC and RISC instructions..BGB
| || || |    +- Re: More of my philosophy about CISC and RISC instructions..Scott Lurndal
| || || |    `* Re: More of my philosophy about CISC and RISC instructions..Paul A. Clayton
| || || |     `* Re: More of my philosophy about CISC and RISC instructions..MitchAlsup
| || || |      `* Re: More of my philosophy about CISC and RISC instructions..Stephen Fuld
| || || |       +* Re: More of my philosophy about CISC and RISC instructions..MitchAlsup
| || || |       |+- Re: More of my philosophy about CISC and RISC instructions..Stephen Fuld
| || || |       |`- Re: More of my philosophy about CISC and RISC instructions..Stephen Fuld
| || || |       `* Re: More of my philosophy about CISC and RISC instructions..Thomas Koenig
| || || |        +* Re: More of my philosophy about CISC and RISC instructions..MitchAlsup
| || || |        |`- Going fast, was Re: More of my philosophyJohn Levine
| || || |        `* Re: More of my philosophy about CISC and RISC instructions..aph
| || || |         `* Re: More of my philosophy about CISC and RISC instructions..luke.l...@gmail.com
| || || |          `* Re: More of my philosophy about CISC and RISC instructions..MitchAlsup
| || || |           `* Re: More of my philosophy about CISC and RISC instructions..Stefan Monnier
| || || |            `- Re: More of my philosophy about CISC and RISC instructions..MitchAlsup
| || || +* Re: More of my philosophy about CISC and RISC instructions..BGB
| || || |`* Re: More of my philosophy about CISC and RISC instructions..MitchAlsup
| || || | `* Re: More of my philosophy about CISC and RISC instructions..BGB
| || || |  +* Re: More of my philosophy about CISC and RISC instructions..MitchAlsup
| || || |  |`* Re: More of my philosophy about CISC and RISC instructions..BGB
| || || |  | `* Re: More of my philosophy about CISC and RISC instructions..MitchAlsup
| || || |  |  `* Re: More of my philosophy about CISC and RISC instructions..BGB
| || || |  |   `* Re: More of my philosophy about CISC and RISC instructions..MitchAlsup
| || || |  |    `* Re: More of my philosophy about CISC and RISC instructions..BGB
| || || |  |     `* Re: More of my philosophy about CISC and RISC instructions..MitchAlsup
| || || |  |      `* Re: More of my philosophy about CISC and RISC instructions..BGB
| || || |  |       `* Re: More of my philosophy about CISC and RISC instructions..MitchAlsup
| || || |  |        +- Re: More of my philosophy about CISC and RISC instructions..BGB
| || || |  |        `* Re: More of my philosophy about CISC and RISC instructions..MitchAlsup
| || || |  |         `- Re: More of my philosophy about CISC and RISC instructions..BGB
| || || |  `- Re: More of my philosophy about CISC and RISC instructions..Scott Lurndal
| || || `- Re: More of my philosophy about CISC and RISC instructions..Paul A. Clayton
| || |`- Re: More of my philosophy about CISC and RISC instructions..luke.l...@gmail.com
| || `* Re: More of my philosophy about CISC and RISC instructions..luke.l...@gmail.com
| ||  `- Re: More of my philosophy about CISC and RISC instructions..MitchAlsup
| |+* Re: More of my philosophy about CISC and RISC instructions..Brett
| ||`* Re: More of my philosophy about CISC and RISC instructions..pec...@gmail.com
| || +* Re: More of my philosophy about CISC and RISC instructions..Thomas Koenig
| || |+* Re: More of my philosophy about CISC and RISC instructions..pec...@gmail.com
| || ||`* Re: More of my philosophy about CISC and RISC instructions..Thomas Koenig
| || || `* Re: More of my philosophy about CISC and RISC instructions..MitchAlsup
| || ||  +* Re: More of my philosophy about CISC and RISC instructions..pec...@gmail.com
| || ||  |`* Re: More of my philosophy about CISC and RISC instructions..MitchAlsup
| || ||  | `- Re: More of my philosophy about CISC and RISC instructions..Paul A. Clayton
| || ||  `- register windows (was: More of my philosophy ...)Anton Ertl
| || |`* Re: More of my philosophy about CISC and RISC instructions..Anton Ertl
| || | +* Re: More of my philosophy about CISC and RISC instructions..John Levine
| || | |`* Re: More of my philosophy about CISC and RISC instructions..Anton Ertl
| || | | +- Re: More of my philosophy about CISC and RISC instructions..Scott Lurndal
| || | | `- Re: More of my philosophy about CISC and RISC instructions..John Levine
| || | `- Re: More of my philosophy about CISC and RISC instructions..Stephen Fuld
| || +* Re: More of my philosophy about CISC and RISC instructions..MitchAlsup
| || |`- Re: More of my philosophy about CISC and RISC instructions..Timothy McCaffrey
| || +* Re: More of my philosophy about CISC and RISC instructions..Timothy McCaffrey
| || |`- Re: More of my philosophy about CISC and RISC instructions..luke.l...@gmail.com
| || `* Re: More of my philosophy about CISC and RISC instructions..Timothy McCaffrey
| ||  +* Re: More of my philosophy about CISC and RISC instructions..Stephen Fuld
| ||  |`- Re: More of my philosophy about CISC and RISC instructions..BGB
| ||  +- Re: More of my philosophy about CISC and RISC instructions..MitchAlsup
| ||  `- Re: More of my philosophy about CISC and RISC instructions..luke.l...@gmail.com
| |`- Re: More of my philosophy about CISC and RISC instructions..luke.l...@gmail.com
| +- Re: More of my philosophy about CISC and RISC instructions..BGB
| `* Re: More of my philosophy about CISC and RISC instructions..JimBrakefield
`- Re: More of my philosophy about CISC and RISC instructions..Hogege NaN

Pages:123456
Re: More of my philosophy about CISC and RISC instructions..

<3f281981-896e-43c3-b050-48267206844en@googlegroups.com>

  copy mid

https://news.novabbs.org/devel/article-flat.php?id=33831&group=comp.arch#33831

  copy link   Newsgroups: comp.arch
X-Received: by 2002:a05:6214:b0c:b0:63d:a43:7b06 with SMTP id u12-20020a0562140b0c00b0063d0a437b06mr577448qvj.9.1693087752795;
Sat, 26 Aug 2023 15:09:12 -0700 (PDT)
X-Received: by 2002:a17:90a:d184:b0:269:2227:b290 with SMTP id
fu4-20020a17090ad18400b002692227b290mr4559590pjb.7.1693087752301; Sat, 26 Aug
2023 15:09:12 -0700 (PDT)
Path: i2pn2.org!i2pn.org!usenet.blueworldhosting.com!diablo1.usenet.blueworldhosting.com!peer03.iad!feed-me.highwinds-media.com!news.highwinds-media.com!news-out.google.com!nntp.google.com!postnews.google.com!google-groups.googlegroups.com!not-for-mail
Newsgroups: comp.arch
Date: Sat, 26 Aug 2023 15:09:11 -0700 (PDT)
In-Reply-To: <3b6e5488-022e-4299-ab6b-70a7436bb004n@googlegroups.com>
Injection-Info: google-groups.googlegroups.com; posting-host=92.8.132.48; posting-account=soFpvwoAAADIBXOYOBcm_mixNPAaxW9p
NNTP-Posting-Host: 92.8.132.48
References: <9dd7cbc7-ec85-4a8b-84f1-266cb47575b2n@googlegroups.com>
<7d6e8035-0402-4f29-ae39-c467cfa4245cn@googlegroups.com> <bab02209-0dc4-492e-8cf2-ede14635e4d4n@googlegroups.com>
<7c9ef480-989b-4cbb-ac7f-db8f3749e8f1n@googlegroups.com> <d1fc890d-e4e5-4afd-8718-86143628ea2an@googlegroups.com>
<ubdrp6$2e6ik$1@dont-email.me> <3b6e5488-022e-4299-ab6b-70a7436bb004n@googlegroups.com>
User-Agent: G2/1.0
MIME-Version: 1.0
Message-ID: <3f281981-896e-43c3-b050-48267206844en@googlegroups.com>
Subject: Re: More of my philosophy about CISC and RISC instructions..
From: luke.leighton@gmail.com (luke.l...@gmail.com)
Injection-Date: Sat, 26 Aug 2023 22:09:12 +0000
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable
X-Received-Bytes: 1761
 by: luke.l...@gmail.com - Sat, 26 Aug 2023 22:09 UTC

On Wednesday, August 16, 2023 at 6:04:50 PM UTC+1, pec...@gmail.com wrote:
> Big cores perform most of the floating point operations in the SIMD units.

ahh but bcoz RISK-5 Vectors, there ain't no SIMD satisfaction...

l.

Re: More of my philosophy about CISC and RISC instructions..

<b7b07f28-e2c7-4487-a6b2-69ebfc8d2378n@googlegroups.com>

  copy mid

https://news.novabbs.org/devel/article-flat.php?id=33832&group=comp.arch#33832

  copy link   Newsgroups: comp.arch
X-Received: by 2002:ad4:4f52:0:b0:63c:fc43:fd51 with SMTP id eu18-20020ad44f52000000b0063cfc43fd51mr624130qvb.11.1693087902696;
Sat, 26 Aug 2023 15:11:42 -0700 (PDT)
X-Received: by 2002:a4a:5857:0:b0:56c:95bc:fe6 with SMTP id
f84-20020a4a5857000000b0056c95bc0fe6mr1096248oob.0.1693087902490; Sat, 26 Aug
2023 15:11:42 -0700 (PDT)
Path: i2pn2.org!i2pn.org!usenet.blueworldhosting.com!diablo1.usenet.blueworldhosting.com!peer03.iad!feed-me.highwinds-media.com!news.highwinds-media.com!news-out.google.com!nntp.google.com!postnews.google.com!google-groups.googlegroups.com!not-for-mail
Newsgroups: comp.arch
Date: Sat, 26 Aug 2023 15:11:42 -0700 (PDT)
In-Reply-To: <23b38342-5d35-4fb5-97fb-3cb20432d343n@googlegroups.com>
Injection-Info: google-groups.googlegroups.com; posting-host=92.8.132.48; posting-account=soFpvwoAAADIBXOYOBcm_mixNPAaxW9p
NNTP-Posting-Host: 92.8.132.48
References: <9dd7cbc7-ec85-4a8b-84f1-266cb47575b2n@googlegroups.com>
<7d6e8035-0402-4f29-ae39-c467cfa4245cn@googlegroups.com> <bab02209-0dc4-492e-8cf2-ede14635e4d4n@googlegroups.com>
<ba6c5d7b-80f5-49d5-84a5-4fa92ee7f86bn@googlegroups.com> <ubesqp$2m85c$1@dont-email.me>
<23c1700c-e477-4130-b7f4-d8559f85165an@googlegroups.com> <ubgcn8$2tniu$1@dont-email.me>
<06e13dbd-db31-49e6-8f1d-262b0fc277b8n@googlegroups.com> <ubgqjp$2vppn$1@dont-email.me>
<01c75f2e-0471-4f2e-88fb-f45d9020138bn@googlegroups.com> <ubhqcc$375ak$1@dont-email.me>
<23b38342-5d35-4fb5-97fb-3cb20432d343n@googlegroups.com>
User-Agent: G2/1.0
MIME-Version: 1.0
Message-ID: <b7b07f28-e2c7-4487-a6b2-69ebfc8d2378n@googlegroups.com>
Subject: Re: More of my philosophy about CISC and RISC instructions..
From: luke.leighton@gmail.com (luke.l...@gmail.com)
Injection-Date: Sat, 26 Aug 2023 22:11:42 +0000
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable
X-Received-Bytes: 2099
 by: luke.l...@gmail.com - Sat, 26 Aug 2023 22:11 UTC

On Wednesday, August 16, 2023 at 7:15:42 PM UTC+1, MitchAlsup wrote:

> Conversely all static VLIW forms have failed.

could you give examples of "static VLIW"? i am currently
designing an 8-bit ISA, which is so insanely short it may
have to go VLIW, and would prefer it greatly if i actually
knew what had and hadn't failed in this space :)

l.

Re: More of my philosophy about CISC and RISC instructions..

<510bcd7e-11a4-45ba-b9c0-0daca8f4ddcfn@googlegroups.com>

  copy mid

https://news.novabbs.org/devel/article-flat.php?id=33833&group=comp.arch#33833

  copy link   Newsgroups: comp.arch
X-Received: by 2002:a05:6214:a4f:b0:641:8c92:29f2 with SMTP id ee15-20020a0562140a4f00b006418c9229f2mr654208qvb.5.1693088822255;
Sat, 26 Aug 2023 15:27:02 -0700 (PDT)
X-Received: by 2002:a9d:6381:0:b0:6b9:2381:af59 with SMTP id
w1-20020a9d6381000000b006b92381af59mr649913otk.2.1693088822047; Sat, 26 Aug
2023 15:27:02 -0700 (PDT)
Path: i2pn2.org!i2pn.org!usenet.blueworldhosting.com!diablo1.usenet.blueworldhosting.com!peer03.iad!feed-me.highwinds-media.com!news.highwinds-media.com!news-out.google.com!nntp.google.com!postnews.google.com!google-groups.googlegroups.com!not-for-mail
Newsgroups: comp.arch
Date: Sat, 26 Aug 2023 15:27:01 -0700 (PDT)
In-Reply-To: <444852de-bc0b-4f1c-a03c-61de46800c88n@googlegroups.com>
Injection-Info: google-groups.googlegroups.com; posting-host=92.8.132.48; posting-account=soFpvwoAAADIBXOYOBcm_mixNPAaxW9p
NNTP-Posting-Host: 92.8.132.48
References: <9dd7cbc7-ec85-4a8b-84f1-266cb47575b2n@googlegroups.com>
<7d6e8035-0402-4f29-ae39-c467cfa4245cn@googlegroups.com> <bab02209-0dc4-492e-8cf2-ede14635e4d4n@googlegroups.com>
<7c9ef480-989b-4cbb-ac7f-db8f3749e8f1n@googlegroups.com> <ubeclc$2gi1m$1@dont-email.me>
<36e3c863-a199-4824-9668-1d1c30227baan@googlegroups.com> <444852de-bc0b-4f1c-a03c-61de46800c88n@googlegroups.com>
User-Agent: G2/1.0
MIME-Version: 1.0
Message-ID: <510bcd7e-11a4-45ba-b9c0-0daca8f4ddcfn@googlegroups.com>
Subject: Re: More of my philosophy about CISC and RISC instructions..
From: luke.leighton@gmail.com (luke.l...@gmail.com)
Injection-Date: Sat, 26 Aug 2023 22:27:02 +0000
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable
X-Received-Bytes: 2656
 by: luke.l...@gmail.com - Sat, 26 Aug 2023 22:27 UTC

On Wednesday, August 16, 2023 at 9:09:23 PM UTC+1, Timothy McCaffrey wrote:

> I don't think it was purely religious. If you have a fixed size (power of 2 size) instruction it will:
> b) Your page fault handler never has to test to see if this
> instruction straddles a page boundary and make sure both pages are present.
>
> Are these a *real* problem?

(on D-Cache side)
when you have to solve multi-issue massive numbers of LD/ST issue
at the same time in the same clock cycle, looking for merges and
overlaps... yes! Power ISA solves this by setting limits @ (i think)
20-bit of addresses (where 4k/64k of that is the page size).
if you cross over a boundary of 20-bit *then* a page-fault must
be raised.

(on I-Cache side)
you can get away with doing what Power ISA does which is to set
limits on 64-word-aligned roll-over. the 64-bit instruction encoding
is prohibited from falling on a pair of 64-word-aligned blocks.
that gives pretty damn good multi-issue opportunities without
the disadvantage of wasting vast amounts of space with nops
or complexifying the compiler.

l.

Re: More of my philosophy about CISC and RISC instructions..

<6981f721-bc5c-4b0a-ae3d-ad19192a4bc3n@googlegroups.com>

  copy mid

https://news.novabbs.org/devel/article-flat.php?id=33834&group=comp.arch#33834

  copy link   Newsgroups: comp.arch
X-Received: by 2002:a05:6214:1867:b0:649:c819:1b15 with SMTP id eh7-20020a056214186700b00649c8191b15mr551171qvb.0.1693089237061;
Sat, 26 Aug 2023 15:33:57 -0700 (PDT)
X-Received: by 2002:a17:903:41d0:b0:1b8:84d9:dea6 with SMTP id
u16-20020a17090341d000b001b884d9dea6mr7735150ple.12.1693089236621; Sat, 26
Aug 2023 15:33:56 -0700 (PDT)
Path: i2pn2.org!i2pn.org!usenet.blueworldhosting.com!diablo1.usenet.blueworldhosting.com!peer03.iad!feed-me.highwinds-media.com!news.highwinds-media.com!news-out.google.com!nntp.google.com!postnews.google.com!google-groups.googlegroups.com!not-for-mail
Newsgroups: comp.arch
Date: Sat, 26 Aug 2023 15:33:55 -0700 (PDT)
In-Reply-To: <793a9545-066b-4819-9bd1-ba0453f92955n@googlegroups.com>
Injection-Info: google-groups.googlegroups.com; posting-host=92.8.132.48; posting-account=soFpvwoAAADIBXOYOBcm_mixNPAaxW9p
NNTP-Posting-Host: 92.8.132.48
References: <9dd7cbc7-ec85-4a8b-84f1-266cb47575b2n@googlegroups.com>
<7d6e8035-0402-4f29-ae39-c467cfa4245cn@googlegroups.com> <bab02209-0dc4-492e-8cf2-ede14635e4d4n@googlegroups.com>
<7c9ef480-989b-4cbb-ac7f-db8f3749e8f1n@googlegroups.com> <ubeclc$2gi1m$1@dont-email.me>
<36e3c863-a199-4824-9668-1d1c30227baan@googlegroups.com> <793a9545-066b-4819-9bd1-ba0453f92955n@googlegroups.com>
User-Agent: G2/1.0
MIME-Version: 1.0
Message-ID: <6981f721-bc5c-4b0a-ae3d-ad19192a4bc3n@googlegroups.com>
Subject: Re: More of my philosophy about CISC and RISC instructions..
From: luke.leighton@gmail.com (luke.l...@gmail.com)
Injection-Date: Sat, 26 Aug 2023 22:33:57 +0000
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable
X-Received-Bytes: 3216
 by: luke.l...@gmail.com - Sat, 26 Aug 2023 22:33 UTC

On Wednesday, August 16, 2023 at 9:19:52 PM UTC+1, Timothy McCaffrey wrote:

> Given a variable length instruction set, it seems to me it makes sense to encode the most used
> instructions into small instructions, if possible. I believe I have read that the most used
> instructions are load, compare, add and branch. The rest are in the single digits percentage wise.
> (I wish I had a reference, so take the above with a rock sized grain of salt). Anyway, if
> you could encode those instructions into a 16 bit word, and leave the longer instructions
> for all the useful but not used that much remainder, wouldn't that basically "compress"
> your instruction set (even if variants of the longer instructions "overlapped" the short instructions,
> it would probably still be a win).

it gets complicated, *real* fast.

the very act of adding the compressed instructions means that their
limitations (access to a reduced register range for example) now puts
pressure on those registers, such that only *some* functions gain
advantage but others (with more parameters in an ABI) would not...

plus there is a similar distortion based on the ops themselves.
if you happened to make the statistical assessment based on a
limited suite of software, you end up designing something that's
fantastic for those limited applications and crap at everything else.
(someone already mentioned FP a few posts back).

our team attempted to design Compressed encoding for Power ISA:
the code-size gains, based on register allocation by gcc of Power v3.0
compiled binaries, were... disappointing. i was quite surprised.

l.

Re: More of my philosophy about CISC and RISC instructions..

<8762e241-efc1-4755-a885-7256e9f463c2n@googlegroups.com>

  copy mid

https://news.novabbs.org/devel/article-flat.php?id=33835&group=comp.arch#33835

  copy link   Newsgroups: comp.arch
X-Received: by 2002:a05:6214:4c05:b0:641:8b8d:82da with SMTP id qh5-20020a0562144c0500b006418b8d82damr624548qvb.13.1693093286651;
Sat, 26 Aug 2023 16:41:26 -0700 (PDT)
X-Received: by 2002:a4a:45c6:0:b0:571:2c40:e2a5 with SMTP id
y189-20020a4a45c6000000b005712c40e2a5mr1049826ooa.0.1693093286318; Sat, 26
Aug 2023 16:41:26 -0700 (PDT)
Path: i2pn2.org!i2pn.org!weretis.net!feeder6.news.weretis.net!usenet.blueworldhosting.com!diablo1.usenet.blueworldhosting.com!peer03.iad!feed-me.highwinds-media.com!news.highwinds-media.com!news-out.google.com!nntp.google.com!postnews.google.com!google-groups.googlegroups.com!not-for-mail
Newsgroups: comp.arch
Date: Sat, 26 Aug 2023 16:41:26 -0700 (PDT)
In-Reply-To: <0d7d3af1-be36-4489-b5e2-828814602060n@googlegroups.com>
Injection-Info: google-groups.googlegroups.com; posting-host=2600:1700:291:29f0:88d8:58f6:8390:8089;
posting-account=H_G_JQkAAADS6onOMb-dqvUozKse7mcM
NNTP-Posting-Host: 2600:1700:291:29f0:88d8:58f6:8390:8089
References: <9dd7cbc7-ec85-4a8b-84f1-266cb47575b2n@googlegroups.com>
<7d6e8035-0402-4f29-ae39-c467cfa4245cn@googlegroups.com> <bab02209-0dc4-492e-8cf2-ede14635e4d4n@googlegroups.com>
<7c9ef480-989b-4cbb-ac7f-db8f3749e8f1n@googlegroups.com> <d1fc890d-e4e5-4afd-8718-86143628ea2an@googlegroups.com>
<ubdrp6$2e6ik$1@dont-email.me> <0d7d3af1-be36-4489-b5e2-828814602060n@googlegroups.com>
User-Agent: G2/1.0
MIME-Version: 1.0
Message-ID: <8762e241-efc1-4755-a885-7256e9f463c2n@googlegroups.com>
Subject: Re: More of my philosophy about CISC and RISC instructions..
From: MitchAlsup@aol.com (MitchAlsup)
Injection-Date: Sat, 26 Aug 2023 23:41:26 +0000
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable
X-Received-Bytes: 2639
 by: MitchAlsup - Sat, 26 Aug 2023 23:41 UTC

On Saturday, August 26, 2023 at 5:04:13 PM UTC-5, luke.l...@gmail.com wrote:
> On Monday, August 14, 2023 at 7:28:58 PM UTC+1, BGB wrote:
>
> > It matters if you care about a 10% to 30% delta in code-density, but is
> > mostly irrelevant for performance if one has a "sufficient" I-cache
> > (say, 16K or more).
>
> unfortunately the impact on power of increased code-density
> is a square law related to the i-cache size, not a linear one.
> thus a 10% increase in code size results in appx 1.1^2=21%
> power consumption increase.
<
Is there published literature on this ??
>
> On Tuesday, August 15, 2023 at 12:17:09 AM UTC+1, Brett wrote:
>
> > This is the killer argument that would have saved me from caring about 16
> > bit opcodes.
> > Only toy CPU’s can care about 16 bit opcodes.
>
> don't let Hitachi/Renesas (sh2 / sh4) or the J-Core team hear you say that :)
>
> https://en.wikipedia.org/wiki/SuperH
> https://j-core.org/
>
> l.

Re: More of my philosophy about CISC and RISC instructions..

<ecfae16e-dbd9-4c67-8eb5-7c0310968152n@googlegroups.com>

  copy mid

https://news.novabbs.org/devel/article-flat.php?id=33836&group=comp.arch#33836

  copy link   Newsgroups: comp.arch
X-Received: by 2002:a05:620a:1a92:b0:76f:8b7:1f9c with SMTP id bl18-20020a05620a1a9200b0076f08b71f9cmr109198qkb.3.1693093598489;
Sat, 26 Aug 2023 16:46:38 -0700 (PDT)
X-Received: by 2002:a05:6a00:139e:b0:68b:e801:e34c with SMTP id
t30-20020a056a00139e00b0068be801e34cmr3022381pfg.2.1693093598036; Sat, 26 Aug
2023 16:46:38 -0700 (PDT)
Path: i2pn2.org!i2pn.org!news.niel.me!glou.org!news.glou.org!fdn.fr!proxad.net!feeder1-2.proxad.net!209.85.160.216.MISMATCH!news-out.google.com!nntp.google.com!postnews.google.com!google-groups.googlegroups.com!not-for-mail
Newsgroups: comp.arch
Date: Sat, 26 Aug 2023 16:46:37 -0700 (PDT)
In-Reply-To: <b7b07f28-e2c7-4487-a6b2-69ebfc8d2378n@googlegroups.com>
Injection-Info: google-groups.googlegroups.com; posting-host=2600:1700:291:29f0:88d8:58f6:8390:8089;
posting-account=H_G_JQkAAADS6onOMb-dqvUozKse7mcM
NNTP-Posting-Host: 2600:1700:291:29f0:88d8:58f6:8390:8089
References: <9dd7cbc7-ec85-4a8b-84f1-266cb47575b2n@googlegroups.com>
<7d6e8035-0402-4f29-ae39-c467cfa4245cn@googlegroups.com> <bab02209-0dc4-492e-8cf2-ede14635e4d4n@googlegroups.com>
<ba6c5d7b-80f5-49d5-84a5-4fa92ee7f86bn@googlegroups.com> <ubesqp$2m85c$1@dont-email.me>
<23c1700c-e477-4130-b7f4-d8559f85165an@googlegroups.com> <ubgcn8$2tniu$1@dont-email.me>
<06e13dbd-db31-49e6-8f1d-262b0fc277b8n@googlegroups.com> <ubgqjp$2vppn$1@dont-email.me>
<01c75f2e-0471-4f2e-88fb-f45d9020138bn@googlegroups.com> <ubhqcc$375ak$1@dont-email.me>
<23b38342-5d35-4fb5-97fb-3cb20432d343n@googlegroups.com> <b7b07f28-e2c7-4487-a6b2-69ebfc8d2378n@googlegroups.com>
User-Agent: G2/1.0
MIME-Version: 1.0
Message-ID: <ecfae16e-dbd9-4c67-8eb5-7c0310968152n@googlegroups.com>
Subject: Re: More of my philosophy about CISC and RISC instructions..
From: MitchAlsup@aol.com (MitchAlsup)
Injection-Date: Sat, 26 Aug 2023 23:46:38 +0000
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable
 by: MitchAlsup - Sat, 26 Aug 2023 23:46 UTC

On Saturday, August 26, 2023 at 5:11:44 PM UTC-5, luke.l...@gmail.com wrote:
> On Wednesday, August 16, 2023 at 7:15:42 PM UTC+1, MitchAlsup wrote:
>
> > Conversely all static VLIW forms have failed.
>
> could you give examples of "static VLIW"? i am currently
> designing an 8-bit ISA, which is so insanely short it may
> have to go VLIW, and would prefer it greatly if i actually
> knew what had and hadn't failed in this space :)
>
Multiflow
Cydrome
I860
Trimedia
SHARC
Itanic
Elbrus
and maybe MILL
> l.

Re: More of my philosophy about CISC and RISC instructions..

<uceea7$10mg7$1@dont-email.me>

  copy mid

https://news.novabbs.org/devel/article-flat.php?id=33837&group=comp.arch#33837

  copy link   Newsgroups: comp.arch
Path: i2pn2.org!i2pn.org!eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: cr88192@gmail.com (BGB)
Newsgroups: comp.arch
Subject: Re: More of my philosophy about CISC and RISC instructions..
Date: Sat, 26 Aug 2023 22:01:23 -0500
Organization: A noiseless patient Spider
Lines: 44
Message-ID: <uceea7$10mg7$1@dont-email.me>
References: <9dd7cbc7-ec85-4a8b-84f1-266cb47575b2n@googlegroups.com>
<7d6e8035-0402-4f29-ae39-c467cfa4245cn@googlegroups.com>
<bab02209-0dc4-492e-8cf2-ede14635e4d4n@googlegroups.com>
<ba6c5d7b-80f5-49d5-84a5-4fa92ee7f86bn@googlegroups.com>
<ubesqp$2m85c$1@dont-email.me>
<23c1700c-e477-4130-b7f4-d8559f85165an@googlegroups.com>
<ubgcn8$2tniu$1@dont-email.me>
<06e13dbd-db31-49e6-8f1d-262b0fc277b8n@googlegroups.com>
<ubgqjp$2vppn$1@dont-email.me>
<01c75f2e-0471-4f2e-88fb-f45d9020138bn@googlegroups.com>
<ubhqcc$375ak$1@dont-email.me>
<23b38342-5d35-4fb5-97fb-3cb20432d343n@googlegroups.com>
<b7b07f28-e2c7-4487-a6b2-69ebfc8d2378n@googlegroups.com>
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8; format=flowed
Content-Transfer-Encoding: 8bit
Injection-Date: Sun, 27 Aug 2023 03:01:27 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="7578403700c948d313fb9d3a5f292da2";
logging-data="1071623"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX18zWwxRlSvC2gvCGMDx2LqX"
User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:102.0) Gecko/20100101
Thunderbird/102.14.0
Cancel-Lock: sha1:FuXyw3oTDQr2BkCXQEB44GwzMus=
Content-Language: en-US
In-Reply-To: <b7b07f28-e2c7-4487-a6b2-69ebfc8d2378n@googlegroups.com>
 by: BGB - Sun, 27 Aug 2023 03:01 UTC

On 8/26/2023 5:11 PM, luke.l...@gmail.com wrote:
> On Wednesday, August 16, 2023 at 7:15:42 PM UTC+1, MitchAlsup wrote:
>
>> Conversely all static VLIW forms have failed.
>
> could you give examples of "static VLIW"? i am currently
> designing an 8-bit ISA, which is so insanely short it may
> have to go VLIW, and would prefer it greatly if i actually
> knew what had and hadn't failed in this space :)
>

I guess, unambiguously failed:
IA-64;
TriMedia;
Transmeta (VLIW internally, x86 as surface-level ISA).

Moderate success:
TMS320C6x
Qualcomm Hexagon

Mixed:
ESP32
Its official successor jumped to RISC-V instead
Apparently as a single-issue RISC core.
But, ESP32 still seems to be popular.
AMD TeraScale
Replaced with GCN:
GCN was apparently a single-issue scalar design;
Followed by RDNA, which apparently kept GCN's ISA.

But, I can also note that the vast majority of RISC's that had once
existed have also effectively died off...

Apart from IA-64, most of the failure cases seem to be either:
The parent company died (TriMedia and Transmeta);
The successors typically went to single-issue / scalar cores.

This doesn't appear to be a good example of a critical weakness of VLIW
as a concept, as then one would have expected the typical successors to
have gone to superscalar or OoO.

Re: More of my philosophy about CISC and RISC instructions..

<uceqbq$126oc$1@dont-email.me>

  copy mid

https://news.novabbs.org/devel/article-flat.php?id=33840&group=comp.arch#33840

  copy link   Newsgroups: comp.arch
Path: i2pn2.org!i2pn.org!eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: sfuld@alumni.cmu.edu.invalid (Stephen Fuld)
Newsgroups: comp.arch
Subject: Re: More of my philosophy about CISC and RISC instructions..
Date: Sat, 26 Aug 2023 23:27:06 -0700
Organization: A noiseless patient Spider
Lines: 44
Message-ID: <uceqbq$126oc$1@dont-email.me>
References: <9dd7cbc7-ec85-4a8b-84f1-266cb47575b2n@googlegroups.com>
<7d6e8035-0402-4f29-ae39-c467cfa4245cn@googlegroups.com>
<bab02209-0dc4-492e-8cf2-ede14635e4d4n@googlegroups.com>
<ba6c5d7b-80f5-49d5-84a5-4fa92ee7f86bn@googlegroups.com>
<ubesqp$2m85c$1@dont-email.me>
<23c1700c-e477-4130-b7f4-d8559f85165an@googlegroups.com>
<ubgcn8$2tniu$1@dont-email.me>
<06e13dbd-db31-49e6-8f1d-262b0fc277b8n@googlegroups.com>
<ubgqjp$2vppn$1@dont-email.me>
<01c75f2e-0471-4f2e-88fb-f45d9020138bn@googlegroups.com>
<ubhqcc$375ak$1@dont-email.me>
<23b38342-5d35-4fb5-97fb-3cb20432d343n@googlegroups.com>
<b7b07f28-e2c7-4487-a6b2-69ebfc8d2378n@googlegroups.com>
<uceea7$10mg7$1@dont-email.me>
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8; format=flowed
Content-Transfer-Encoding: 8bit
Injection-Date: Sun, 27 Aug 2023 06:27:06 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="81175181331f6e27793f3a3571bd39c9";
logging-data="1121036"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX19xJVeAWlF4vaKAk7eyJ479pQboKC7Z5G0="
User-Agent: Mozilla Thunderbird
Cancel-Lock: sha1:4Ryk5WNlUjz/99UlLBYIz3Wh0TQ=
In-Reply-To: <uceea7$10mg7$1@dont-email.me>
Content-Language: en-US
 by: Stephen Fuld - Sun, 27 Aug 2023 06:27 UTC

On 8/26/2023 8:01 PM, BGB wrote:
> On 8/26/2023 5:11 PM, luke.l...@gmail.com wrote:
>> On Wednesday, August 16, 2023 at 7:15:42 PM UTC+1, MitchAlsup wrote:
>>
>>> Conversely all static VLIW forms have failed.
>>
>> could you give examples of "static VLIW"? i am currently
>> designing an 8-bit ISA, which is so insanely short it may
>> have to go VLIW, and would prefer it greatly if i actually
>> knew what had and hadn't failed in this space :)
>>
>
> I guess, unambiguously failed:
>   IA-64;
>   TriMedia;
>   Transmeta (VLIW internally, x86 as surface-level ISA).
>
> Moderate success:
>   TMS320C6x
>   Qualcomm Hexagon
>
> Mixed:
>   ESP32
>     Its official successor jumped to RISC-V instead
>       Apparently as a single-issue RISC core.
>     But, ESP32 still seems to be popular.
>   AMD TeraScale
>     Replaced with GCN:
>       GCN was apparently a single-issue scalar design;
>     Followed by RDNA, which apparently kept GCN's ISA.
>
>
> But, I can also note that the vast majority of RISC's that had once
> existed have also effectively died off...

I think that it is also true that the vast majority of CISCs that had
once existed have also effectively died off... :-(

--
- Stephen Fuld
(e-mail address disguised to prevent spam)

Re: More of my philosophy about CISC and RISC instructions..

<ucfbtj$30keu$2@newsreader4.netcologne.de>

  copy mid

https://news.novabbs.org/devel/article-flat.php?id=33842&group=comp.arch#33842

  copy link   Newsgroups: comp.arch
Path: i2pn2.org!i2pn.org!weretis.net!feeder8.news.weretis.net!newsreader4.netcologne.de!news.netcologne.de!.POSTED.2001-4dd4-f2a7-0-ac89-ce2c-47ac-ae4.ipv6dyn.netcologne.de!not-for-mail
From: tkoenig@netcologne.de (Thomas Koenig)
Newsgroups: comp.arch
Subject: Re: More of my philosophy about CISC and RISC instructions..
Date: Sun, 27 Aug 2023 11:26:43 -0000 (UTC)
Organization: news.netcologne.de
Distribution: world
Message-ID: <ucfbtj$30keu$2@newsreader4.netcologne.de>
References: <9dd7cbc7-ec85-4a8b-84f1-266cb47575b2n@googlegroups.com>
<7c9ef480-989b-4cbb-ac7f-db8f3749e8f1n@googlegroups.com>
<d1fc890d-e4e5-4afd-8718-86143628ea2an@googlegroups.com>
<ubdrp6$2e6ik$1@dont-email.me>
<3b6e5488-022e-4299-ab6b-70a7436bb004n@googlegroups.com>
<ubn20o$5d2d$1@dont-email.me>
<2e6de6de-b408-4aa7-a430-ead3f983ed78n@googlegroups.com>
<8eee43b7-cb96-49c1-9735-4bb8c004b5c3n@googlegroups.com>
<47020cd0-2ee6-4c51-91a0-885ba719137cn@googlegroups.com>
<8m4EM.686037$TPw2.506418@fx17.iad> <ubqphs$u0gp$1@dont-email.me>
<4941705f-ac14-4f98-b3d1-6fa62bdb4236n@googlegroups.com>
<uc3etu$2jjeu$1@dont-email.me>
<130086d6-3ea6-41de-a7d6-6e68317c24c5n@googlegroups.com>
<ucd0p2$jjfb$1@dont-email.me>
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit
Injection-Date: Sun, 27 Aug 2023 11:26:43 -0000 (UTC)
Injection-Info: newsreader4.netcologne.de; posting-host="2001-4dd4-f2a7-0-ac89-ce2c-47ac-ae4.ipv6dyn.netcologne.de:2001:4dd4:f2a7:0:ac89:ce2c:47ac:ae4";
logging-data="3166686"; mail-complaints-to="abuse@netcologne.de"
User-Agent: slrn/1.0.3 (Linux)
 by: Thomas Koenig - Sun, 27 Aug 2023 11:26 UTC

Stephen Fuld <sfuld@alumni.cmu.edu.invalid> schrieb:
> On 8/24/2023 12:02 PM, MitchAlsup wrote:
>> On Tuesday, August 22, 2023 at 6:04:34 PM UTC-5, Paul A. Clayton wrote:
>>> On 8/19/23 12:31 PM, MitchAlsup wrote:
>> [snip]
>>>
>>> While you, Mitch, have argued persuasively for a unified register
>>> set, there are some benefits to architectural specialization. Of
>>> course, microarchitectural specialization can be applied if there
>>> is a natural idiom which can be easily detected. An artificial
>>> convention (optimization recommendation) can also provide such an
>>> idiom.
>> <
>> Allow me to clarify::
>> <
>> I am not trying to create and architecture which is
>> a) a marvelous microcontroller CPU
>> b) a marvelous vector supercomputer CPU
>
> OK, I will ask. If you were trying to create a marvelous vector
> supercomputer CPU, how would it be different from MY66000?

Mitch already answered, just one remark...

Today's supercomputing is mainly about graphics cards, for those
workloads where this is an advantage. Memory bandwidth between
CPU and GPU can be a major bottleneck there, and registers on the
graphics card itself are also a very limited resource.

And high performance for graphics cards is very difficult to
achieve for compilers, you need to look into details (and tune a
lot by hand; fortunately the tuning tools are quite good) to get
close to the optimum performance. And divergent branches, when
threads within a warp take different paths, are also a potential
performance killer.

Some of you may remember "same sum of digits up to some prime
number" hobby project I was doing a couple of years ago (see
https://oeis.org/A345296 ). I am getting help from somebody with
a lot of graphics cards fu, and am running a combined GPU+CPU
algorithm. Finding 70911040973874056146188543 in a search that
checks for the same digit sum in basis 2, 3, 5, 7, 11, 13 and 17
now takes around an hour instead of around five days previously,
which is not bad. (We actually found one more number, but are
still looking to extend the number range before publishing on OEIS).

So, I am now learning a bit about graphics card programming.

And how to make a graphics cards ISA that makes things easy for
programmers and compilers... Ugh. This is not helped by having
something like PTX as an intermediate code, so the main compiler
does not even have the information about the available number
of registers.

Re: More of my philosophy about CISC and RISC instructions..

<c03aaa08-0b36-451f-836a-2ac6382c2308n@googlegroups.com>

  copy mid

https://news.novabbs.org/devel/article-flat.php?id=33843&group=comp.arch#33843

  copy link   Newsgroups: comp.arch
X-Received: by 2002:ad4:4baf:0:b0:64a:ef9d:cfba with SMTP id i15-20020ad44baf000000b0064aef9dcfbamr694844qvw.7.1693144679538;
Sun, 27 Aug 2023 06:57:59 -0700 (PDT)
X-Received: by 2002:a05:6870:3a05:b0:1c5:e4a5:698a with SMTP id
du5-20020a0568703a0500b001c5e4a5698amr639282oab.2.1693144679285; Sun, 27 Aug
2023 06:57:59 -0700 (PDT)
Path: i2pn2.org!i2pn.org!usenet.blueworldhosting.com!diablo1.usenet.blueworldhosting.com!peer03.iad!feed-me.highwinds-media.com!news.highwinds-media.com!news-out.google.com!nntp.google.com!postnews.google.com!google-groups.googlegroups.com!not-for-mail
Newsgroups: comp.arch
Date: Sun, 27 Aug 2023 06:57:59 -0700 (PDT)
In-Reply-To: <ucfbtj$30keu$2@newsreader4.netcologne.de>
Injection-Info: google-groups.googlegroups.com; posting-host=2600:1700:291:29f0:6c42:5a:b117:3809;
posting-account=H_G_JQkAAADS6onOMb-dqvUozKse7mcM
NNTP-Posting-Host: 2600:1700:291:29f0:6c42:5a:b117:3809
References: <9dd7cbc7-ec85-4a8b-84f1-266cb47575b2n@googlegroups.com>
<7c9ef480-989b-4cbb-ac7f-db8f3749e8f1n@googlegroups.com> <d1fc890d-e4e5-4afd-8718-86143628ea2an@googlegroups.com>
<ubdrp6$2e6ik$1@dont-email.me> <3b6e5488-022e-4299-ab6b-70a7436bb004n@googlegroups.com>
<ubn20o$5d2d$1@dont-email.me> <2e6de6de-b408-4aa7-a430-ead3f983ed78n@googlegroups.com>
<8eee43b7-cb96-49c1-9735-4bb8c004b5c3n@googlegroups.com> <47020cd0-2ee6-4c51-91a0-885ba719137cn@googlegroups.com>
<8m4EM.686037$TPw2.506418@fx17.iad> <ubqphs$u0gp$1@dont-email.me>
<4941705f-ac14-4f98-b3d1-6fa62bdb4236n@googlegroups.com> <uc3etu$2jjeu$1@dont-email.me>
<130086d6-3ea6-41de-a7d6-6e68317c24c5n@googlegroups.com> <ucd0p2$jjfb$1@dont-email.me>
<ucfbtj$30keu$2@newsreader4.netcologne.de>
User-Agent: G2/1.0
MIME-Version: 1.0
Message-ID: <c03aaa08-0b36-451f-836a-2ac6382c2308n@googlegroups.com>
Subject: Re: More of my philosophy about CISC and RISC instructions..
From: MitchAlsup@aol.com (MitchAlsup)
Injection-Date: Sun, 27 Aug 2023 13:57:59 +0000
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable
X-Received-Bytes: 4903
 by: MitchAlsup - Sun, 27 Aug 2023 13:57 UTC

On Sunday, August 27, 2023 at 6:26:46 AM UTC-5, Thomas Koenig wrote:
> Stephen Fuld <sf...@alumni.cmu.edu.invalid> schrieb:
> > On 8/24/2023 12:02 PM, MitchAlsup wrote:
> >> On Tuesday, August 22, 2023 at 6:04:34 PM UTC-5, Paul A. Clayton wrote:
> >>> On 8/19/23 12:31 PM, MitchAlsup wrote:
> >> [snip]
> >>>
> >>> While you, Mitch, have argued persuasively for a unified register
> >>> set, there are some benefits to architectural specialization. Of
> >>> course, microarchitectural specialization can be applied if there
> >>> is a natural idiom which can be easily detected. An artificial
> >>> convention (optimization recommendation) can also provide such an
> >>> idiom.
> >> <
> >> Allow me to clarify::
> >> <
> >> I am not trying to create and architecture which is
> >> a) a marvelous microcontroller CPU
> >> b) a marvelous vector supercomputer CPU
> >
> > OK, I will ask. If you were trying to create a marvelous vector
> > supercomputer CPU, how would it be different from MY66000?
>
> Mitch already answered, just one remark...
>
> Today's supercomputing is mainly about graphics cards, for those
> workloads where this is an advantage. Memory bandwidth between
> CPU and GPU can be a major bottleneck there, and registers on the
> graphics card itself are also a very limited resource.
<
Where limited means each shader core gets 1024-2048 of these registers.
>
> And high performance for graphics cards is very difficult to
> achieve for compilers, you need to look into details (and tune a
> lot by hand; fortunately the tuning tools are quite good) to get
> close to the optimum performance. And divergent branches, when
> threads within a warp take different paths, are also a potential
> performance killer.
<
They are also a performance killer in normal vector processing, too.
>
> Some of you may remember "same sum of digits up to some prime
> number" hobby project I was doing a couple of years ago (see
> https://oeis.org/A345296 ). I am getting help from somebody with
> a lot of graphics cards fu, and am running a combined GPU+CPU
> algorithm. Finding 70911040973874056146188543 in a search that
> checks for the same digit sum in basis 2, 3, 5, 7, 11, 13 and 17
> now takes around an hour instead of around five days previously,
> which is not bad. (We actually found one more number, but are
> still looking to extend the number range before publishing on OEIS).
>
> So, I am now learning a bit about graphics card programming.
>
> And how to make a graphics cards ISA that makes things easy for
> programmers and compilers... Ugh. This is not helped by having
> something like PTX as an intermediate code, so the main compiler
> does not even have the information about the available number
> of registers.

Re: More of my philosophy about CISC and RISC instructions..

<5152736a-4a23-4dc2-8224-ba086cae8692n@googlegroups.com>

  copy mid

https://news.novabbs.org/devel/article-flat.php?id=33844&group=comp.arch#33844

  copy link   Newsgroups: comp.arch
X-Received: by 2002:ac8:7f4c:0:b0:410:9a8b:bf52 with SMTP id g12-20020ac87f4c000000b004109a8bbf52mr737925qtk.6.1693149734668;
Sun, 27 Aug 2023 08:22:14 -0700 (PDT)
X-Received: by 2002:a63:778a:0:b0:56a:164b:c6ec with SMTP id
s132-20020a63778a000000b0056a164bc6ecmr3710627pgc.7.1693149734242; Sun, 27
Aug 2023 08:22:14 -0700 (PDT)
Path: i2pn2.org!i2pn.org!usenet.blueworldhosting.com!diablo1.usenet.blueworldhosting.com!peer03.iad!feed-me.highwinds-media.com!news.highwinds-media.com!news-out.google.com!nntp.google.com!postnews.google.com!google-groups.googlegroups.com!not-for-mail
Newsgroups: comp.arch
Date: Sun, 27 Aug 2023 08:22:13 -0700 (PDT)
In-Reply-To: <uceea7$10mg7$1@dont-email.me>
Injection-Info: google-groups.googlegroups.com; posting-host=92.8.132.48; posting-account=soFpvwoAAADIBXOYOBcm_mixNPAaxW9p
NNTP-Posting-Host: 92.8.132.48
References: <9dd7cbc7-ec85-4a8b-84f1-266cb47575b2n@googlegroups.com>
<7d6e8035-0402-4f29-ae39-c467cfa4245cn@googlegroups.com> <bab02209-0dc4-492e-8cf2-ede14635e4d4n@googlegroups.com>
<ba6c5d7b-80f5-49d5-84a5-4fa92ee7f86bn@googlegroups.com> <ubesqp$2m85c$1@dont-email.me>
<23c1700c-e477-4130-b7f4-d8559f85165an@googlegroups.com> <ubgcn8$2tniu$1@dont-email.me>
<06e13dbd-db31-49e6-8f1d-262b0fc277b8n@googlegroups.com> <ubgqjp$2vppn$1@dont-email.me>
<01c75f2e-0471-4f2e-88fb-f45d9020138bn@googlegroups.com> <ubhqcc$375ak$1@dont-email.me>
<23b38342-5d35-4fb5-97fb-3cb20432d343n@googlegroups.com> <b7b07f28-e2c7-4487-a6b2-69ebfc8d2378n@googlegroups.com>
<uceea7$10mg7$1@dont-email.me>
User-Agent: G2/1.0
MIME-Version: 1.0
Message-ID: <5152736a-4a23-4dc2-8224-ba086cae8692n@googlegroups.com>
Subject: Re: More of my philosophy about CISC and RISC instructions..
From: luke.leighton@gmail.com (luke.l...@gmail.com)
Injection-Date: Sun, 27 Aug 2023 15:22:14 +0000
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable
X-Received-Bytes: 4650
 by: luke.l...@gmail.com - Sun, 27 Aug 2023 15:22 UTC

On Sunday, August 27, 2023 at 4:01:32 AM UTC+1, BGB wrote:
> On 8/26/2023 5:11 PM, luke.l...@gmail.com wrote:
> > On Wednesday, August 16, 2023 at 7:15:42 PM UTC+1, MitchAlsup wrote:
> >
> >> Conversely all static VLIW forms have failed.
> >
> > could you give examples of "static VLIW"? i am currently
> > designing an 8-bit ISA, which is so insanely short it may
> > have to go VLIW, and would prefer it greatly if i actually
> > knew what had and hadn't failed in this space :)
> >
> I guess, unambiguously failed:
> IA-64;
> TriMedia;
> Transmeta (VLIW internally, x86 as surface-level ISA).

(thank you to both - ok i now get it).

> Moderate success:
> TMS320C6x
> Qualcomm Hexagon

ok Hexagon is entirely proprietary and is in *all* Qualcomm
ARM/SoC products as the R.F. base-band processor. not going away
soon.

TI's TMS320 series is... an anomaly that is likely down to
it being mostly used by military/defense customers of TI
(radar, signal processing) and just being damn good. CEDAR
Audio used it for example to achieve 50 MFLOPs (1000 ops
per audio sample) and was able to do real-time audio processing
*in 1990* for goodness sake. we had to do assembly-level
programming, because the compilers were shite, but it *worked*
and it was simple enough to do because the 7+7 VLIW slots
were allocated to Left+Right Audio Channels. blam. done.

> But, I can also note that the vast majority of RISC's that had once
> existed have also effectively died off...

like Elvis, ARC didn't die, it "just went away" (got bought by Synopsis)
and is used as the basis of Broadcom's VideoCore IV (big mass-volume,
ain't gonna die as long as Broadcom doesn't die)

likewise Tensilica got bought by Cadence a few years after they'd
sold their BILLIONTH license (!!!) back around 2010, mostly in Audio
(where do you think Maxim gets their Audio DSPs from?)

which just leaves the other-VLIWs, and having looked at a couple
they have "slots" (non-uniform instructions). whilst in the TI TMS320
series DSPs they were slow-enough clock rate to be hand-assembly
programmed (FMAC was *only* two cycles @ the 12.5 mhz clock,
Left-Right Odd-Even reg-nums hence 50 MFLOPs), the "slots" in
e.g. the TriMedia VLIW DSPs looked to be absolute hell to work with.

depending on the operation type it would have a *different* length
of completion?? *and* the operations that can be run have to be
in certain "slots", meaning all the hard work is done by the compiler??
i'm not bloody surprised at all that that went down like a lead balloon.

if the VLIW slots were orthogonal *or* the operations were uniform length
i'd say "yyeah that'd be manageable [by assembly-writers and compilers
alike]", but both at the same time?

naah.

l.

Re: More of my philosophy about CISC and RISC instructions..

<fb1da779-2ba5-4f0b-b696-429338ada033n@googlegroups.com>

  copy mid

https://news.novabbs.org/devel/article-flat.php?id=33846&group=comp.arch#33846

  copy link   Newsgroups: comp.arch
X-Received: by 2002:ad4:57cf:0:b0:635:b307:af36 with SMTP id y15-20020ad457cf000000b00635b307af36mr733560qvx.7.1693151088093;
Sun, 27 Aug 2023 08:44:48 -0700 (PDT)
X-Received: by 2002:a17:902:c609:b0:1b8:7f21:6d3 with SMTP id
r9-20020a170902c60900b001b87f2106d3mr7075881plr.6.1693151087831; Sun, 27 Aug
2023 08:44:47 -0700 (PDT)
Path: i2pn2.org!i2pn.org!usenet.blueworldhosting.com!diablo1.usenet.blueworldhosting.com!peer03.iad!feed-me.highwinds-media.com!news.highwinds-media.com!news-out.google.com!nntp.google.com!postnews.google.com!google-groups.googlegroups.com!not-for-mail
Newsgroups: comp.arch
Date: Sun, 27 Aug 2023 08:44:47 -0700 (PDT)
In-Reply-To: <uceqbq$126oc$1@dont-email.me>
Injection-Info: google-groups.googlegroups.com; posting-host=2600:1700:291:29f0:6c42:5a:b117:3809;
posting-account=H_G_JQkAAADS6onOMb-dqvUozKse7mcM
NNTP-Posting-Host: 2600:1700:291:29f0:6c42:5a:b117:3809
References: <9dd7cbc7-ec85-4a8b-84f1-266cb47575b2n@googlegroups.com>
<7d6e8035-0402-4f29-ae39-c467cfa4245cn@googlegroups.com> <bab02209-0dc4-492e-8cf2-ede14635e4d4n@googlegroups.com>
<ba6c5d7b-80f5-49d5-84a5-4fa92ee7f86bn@googlegroups.com> <ubesqp$2m85c$1@dont-email.me>
<23c1700c-e477-4130-b7f4-d8559f85165an@googlegroups.com> <ubgcn8$2tniu$1@dont-email.me>
<06e13dbd-db31-49e6-8f1d-262b0fc277b8n@googlegroups.com> <ubgqjp$2vppn$1@dont-email.me>
<01c75f2e-0471-4f2e-88fb-f45d9020138bn@googlegroups.com> <ubhqcc$375ak$1@dont-email.me>
<23b38342-5d35-4fb5-97fb-3cb20432d343n@googlegroups.com> <b7b07f28-e2c7-4487-a6b2-69ebfc8d2378n@googlegroups.com>
<uceea7$10mg7$1@dont-email.me> <uceqbq$126oc$1@dont-email.me>
User-Agent: G2/1.0
MIME-Version: 1.0
Message-ID: <fb1da779-2ba5-4f0b-b696-429338ada033n@googlegroups.com>
Subject: Re: More of my philosophy about CISC and RISC instructions..
From: MitchAlsup@aol.com (MitchAlsup)
Injection-Date: Sun, 27 Aug 2023 15:44:48 +0000
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable
X-Received-Bytes: 3601
 by: MitchAlsup - Sun, 27 Aug 2023 15:44 UTC

On Sunday, August 27, 2023 at 1:27:10 AM UTC-5, Stephen Fuld wrote:
> On 8/26/2023 8:01 PM, BGB wrote:
> > On 8/26/2023 5:11 PM, luke.l...@gmail.com wrote:
> >> On Wednesday, August 16, 2023 at 7:15:42 PM UTC+1, MitchAlsup wrote:
> >>
> >>> Conversely all static VLIW forms have failed.
> >>
> >> could you give examples of "static VLIW"? i am currently
> >> designing an 8-bit ISA, which is so insanely short it may
> >> have to go VLIW, and would prefer it greatly if i actually
> >> knew what had and hadn't failed in this space :)
> >>
> >
> > I guess, unambiguously failed:
> > IA-64;
> > TriMedia;
> > Transmeta (VLIW internally, x86 as surface-level ISA).
> >
> > Moderate success:
> > TMS320C6x
> > Qualcomm Hexagon
> >
> > Mixed:
> > ESP32
> > Its official successor jumped to RISC-V instead
> > Apparently as a single-issue RISC core.
> > But, ESP32 still seems to be popular.
> > AMD TeraScale
> > Replaced with GCN:
> > GCN was apparently a single-issue scalar design;
> > Followed by RDNA, which apparently kept GCN's ISA.
> >
> >
> > But, I can also note that the vast majority of RISC's that had once
> > existed have also effectively died off...
<
> I think that it is also true that the vast majority of CISCs that had
> once existed have also effectively died off... :-(
<
One could argue that the original x86 has also died off, and that
all but a few of the CISCs have similarly died off, especially the
stack machines.
>
>
>
> --
> - Stephen Fuld
> (e-mail address disguised to prevent spam)

Re: More of my philosophy about CISC and RISC instructions..

<GIKGM.314626$Fgta.135409@fx10.iad>

  copy mid

https://news.novabbs.org/devel/article-flat.php?id=33847&group=comp.arch#33847

  copy link   Newsgroups: comp.arch
Path: i2pn2.org!i2pn.org!usenet.blueworldhosting.com!diablo1.usenet.blueworldhosting.com!peer01.iad!feed-me.highwinds-media.com!news.highwinds-media.com!fx10.iad.POSTED!not-for-mail
X-newsreader: xrn 9.03-beta-14-64bit
Sender: scott@dragon.sl.home (Scott Lurndal)
From: scott@slp53.sl.home (Scott Lurndal)
Reply-To: slp53@pacbell.net
Subject: Re: More of my philosophy about CISC and RISC instructions..
Newsgroups: comp.arch
References: <9dd7cbc7-ec85-4a8b-84f1-266cb47575b2n@googlegroups.com> <ubgcn8$2tniu$1@dont-email.me> <06e13dbd-db31-49e6-8f1d-262b0fc277b8n@googlegroups.com> <ubgqjp$2vppn$1@dont-email.me> <01c75f2e-0471-4f2e-88fb-f45d9020138bn@googlegroups.com> <ubhqcc$375ak$1@dont-email.me> <23b38342-5d35-4fb5-97fb-3cb20432d343n@googlegroups.com> <b7b07f28-e2c7-4487-a6b2-69ebfc8d2378n@googlegroups.com> <uceea7$10mg7$1@dont-email.me> <uceqbq$126oc$1@dont-email.me> <fb1da779-2ba5-4f0b-b696-429338ada033n@googlegroups.com>
Lines: 49
Message-ID: <GIKGM.314626$Fgta.135409@fx10.iad>
X-Complaints-To: abuse@usenetserver.com
NNTP-Posting-Date: Sun, 27 Aug 2023 16:19:50 UTC
Organization: UsenetServer - www.usenetserver.com
Date: Sun, 27 Aug 2023 16:19:50 GMT
X-Received-Bytes: 2872
 by: Scott Lurndal - Sun, 27 Aug 2023 16:19 UTC

MitchAlsup <MitchAlsup@aol.com> writes:
>On Sunday, August 27, 2023 at 1:27:10=E2=80=AFAM UTC-5, Stephen Fuld wrote:
>> On 8/26/2023 8:01 PM, BGB wrote:=20
>> > On 8/26/2023 5:11 PM, luke.l...@gmail.com wrote:=20
>> >> On Wednesday, August 16, 2023 at 7:15:42=E2=80=AFPM UTC+1, MitchAlsup =
>wrote:=20
>> >>=20
>> >>> Conversely all static VLIW forms have failed.=20
>> >>=20
>> >> could you give examples of "static VLIW"? i am currently=20
>> >> designing an 8-bit ISA, which is so insanely short it may=20
>> >> have to go VLIW, and would prefer it greatly if i actually=20
>> >> knew what had and hadn't failed in this space :)=20
>> >>=20
>> >=20
>> > I guess, unambiguously failed:=20
>> > IA-64;=20
>> > TriMedia;=20
>> > Transmeta (VLIW internally, x86 as surface-level ISA).=20
>> >=20
>> > Moderate success:=20
>> > TMS320C6x=20
>> > Qualcomm Hexagon=20
>> >=20
>> > Mixed:=20
>> > ESP32=20
>> > Its official successor jumped to RISC-V instead=20
>> > Apparently as a single-issue RISC core.=20
>> > But, ESP32 still seems to be popular.=20
>> > AMD TeraScale=20
>> > Replaced with GCN:=20
>> > GCN was apparently a single-issue scalar design;=20
>> > Followed by RDNA, which apparently kept GCN's ISA.=20
>> >=20
>> >=20
>> > But, I can also note that the vast majority of RISC's that had once=20
>> > existed have also effectively died off...
><
>> I think that it is also true that the vast majority of CISCs that had=20
>> once existed have also effectively died off... :-(=20
><
>One could argue that the original x86 has also died off, and that
>all but a few of the CISCs have similarly died off, especially the
>stack machines.

Some stack machines still exist, mostly now in emulation.

See clearpath libra.

Re: More of my philosophy about CISC and RISC instructions..

<dd281d06-b423-4a36-8895-6962f5fee186n@googlegroups.com>

  copy mid

https://news.novabbs.org/devel/article-flat.php?id=33848&group=comp.arch#33848

  copy link   Newsgroups: comp.arch
X-Received: by 2002:ac8:5dcb:0:b0:40f:fc40:87d8 with SMTP id e11-20020ac85dcb000000b0040ffc4087d8mr716763qtx.6.1693154460413;
Sun, 27 Aug 2023 09:41:00 -0700 (PDT)
X-Received: by 2002:a17:903:11c8:b0:1bc:6f8c:7c15 with SMTP id
q8-20020a17090311c800b001bc6f8c7c15mr8644509plh.7.1693154460188; Sun, 27 Aug
2023 09:41:00 -0700 (PDT)
Path: i2pn2.org!i2pn.org!newsfeed.endofthelinebbs.com!usenet.blueworldhosting.com!diablo1.usenet.blueworldhosting.com!peer03.iad!feed-me.highwinds-media.com!news.highwinds-media.com!news-out.google.com!nntp.google.com!postnews.google.com!google-groups.googlegroups.com!not-for-mail
Newsgroups: comp.arch
Date: Sun, 27 Aug 2023 09:40:59 -0700 (PDT)
In-Reply-To: <GIKGM.314626$Fgta.135409@fx10.iad>
Injection-Info: google-groups.googlegroups.com; posting-host=2600:1700:291:29f0:6c42:5a:b117:3809;
posting-account=H_G_JQkAAADS6onOMb-dqvUozKse7mcM
NNTP-Posting-Host: 2600:1700:291:29f0:6c42:5a:b117:3809
References: <9dd7cbc7-ec85-4a8b-84f1-266cb47575b2n@googlegroups.com>
<ubgcn8$2tniu$1@dont-email.me> <06e13dbd-db31-49e6-8f1d-262b0fc277b8n@googlegroups.com>
<ubgqjp$2vppn$1@dont-email.me> <01c75f2e-0471-4f2e-88fb-f45d9020138bn@googlegroups.com>
<ubhqcc$375ak$1@dont-email.me> <23b38342-5d35-4fb5-97fb-3cb20432d343n@googlegroups.com>
<b7b07f28-e2c7-4487-a6b2-69ebfc8d2378n@googlegroups.com> <uceea7$10mg7$1@dont-email.me>
<uceqbq$126oc$1@dont-email.me> <fb1da779-2ba5-4f0b-b696-429338ada033n@googlegroups.com>
<GIKGM.314626$Fgta.135409@fx10.iad>
User-Agent: G2/1.0
MIME-Version: 1.0
Message-ID: <dd281d06-b423-4a36-8895-6962f5fee186n@googlegroups.com>
Subject: Re: More of my philosophy about CISC and RISC instructions..
From: MitchAlsup@aol.com (MitchAlsup)
Injection-Date: Sun, 27 Aug 2023 16:41:00 +0000
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable
X-Received-Bytes: 3964
 by: MitchAlsup - Sun, 27 Aug 2023 16:40 UTC

On Sunday, August 27, 2023 at 11:19:54 AM UTC-5, Scott Lurndal wrote:
> MitchAlsup <Mitch...@aol.com> writes:
> >On Sunday, August 27, 2023 at 1:27:10=E2=80=AFAM UTC-5, Stephen Fuld wrote:
> >> On 8/26/2023 8:01 PM, BGB wrote:=20
> >> > On 8/26/2023 5:11 PM, luke.l...@gmail.com wrote:=20
> >> >> On Wednesday, August 16, 2023 at 7:15:42=E2=80=AFPM UTC+1, MitchAlsup =
> >wrote:=20
> >> >>=20
> >> >>> Conversely all static VLIW forms have failed.=20
> >> >>=20
> >> >> could you give examples of "static VLIW"? i am currently=20
> >> >> designing an 8-bit ISA, which is so insanely short it may=20
> >> >> have to go VLIW, and would prefer it greatly if i actually=20
> >> >> knew what had and hadn't failed in this space :)=20
> >> >>=20
> >> >=20
> >> > I guess, unambiguously failed:=20
> >> > IA-64;=20
> >> > TriMedia;=20
> >> > Transmeta (VLIW internally, x86 as surface-level ISA).=20
> >> >=20
> >> > Moderate success:=20
> >> > TMS320C6x=20
> >> > Qualcomm Hexagon=20
> >> >=20
> >> > Mixed:=20
> >> > ESP32=20
> >> > Its official successor jumped to RISC-V instead=20
> >> > Apparently as a single-issue RISC core.=20
> >> > But, ESP32 still seems to be popular.=20
> >> > AMD TeraScale=20
> >> > Replaced with GCN:=20
> >> > GCN was apparently a single-issue scalar design;=20
> >> > Followed by RDNA, which apparently kept GCN's ISA.=20
> >> >=20
> >> >=20
> >> > But, I can also note that the vast majority of RISC's that had once=20
> >> > existed have also effectively died off...
> ><
> >> I think that it is also true that the vast majority of CISCs that had=20
> >> once existed have also effectively died off... :-(=20
> ><
> >One could argue that the original x86 has also died off, and that
> >all but a few of the CISCs have similarly died off, especially the
> >stack machines.
<
> Some stack machines still exist, mostly now in emulation.
<
Would you consider a fountain using an electrical pump to be "like"
Heron's fountain ??
>
> See clearpath libra.

Going fast, was Re: More of my philosophy

<ucg3gd$k2b$1@gal.iecc.com>

  copy mid

https://news.novabbs.org/devel/article-flat.php?id=33850&group=comp.arch#33850

  copy link   Newsgroups: comp.arch
Path: i2pn2.org!i2pn.org!weretis.net!feeder6.news.weretis.net!news.misty.com!news.iecc.com!.POSTED.news.iecc.com!not-for-mail
From: johnl@taugh.com (John Levine)
Newsgroups: comp.arch
Subject: Going fast, was Re: More of my philosophy
Date: Sun, 27 Aug 2023 18:09:17 -0000 (UTC)
Organization: Taughannock Networks
Message-ID: <ucg3gd$k2b$1@gal.iecc.com>
References: <9dd7cbc7-ec85-4a8b-84f1-266cb47575b2n@googlegroups.com> <ucd0p2$jjfb$1@dont-email.me> <ucfbtj$30keu$2@newsreader4.netcologne.de> <c03aaa08-0b36-451f-836a-2ac6382c2308n@googlegroups.com>
Injection-Date: Sun, 27 Aug 2023 18:09:17 -0000 (UTC)
Injection-Info: gal.iecc.com; posting-host="news.iecc.com:2001:470:1f07:1126:0:676f:7373:6970";
logging-data="20555"; mail-complaints-to="abuse@iecc.com"
In-Reply-To: <9dd7cbc7-ec85-4a8b-84f1-266cb47575b2n@googlegroups.com> <ucd0p2$jjfb$1@dont-email.me> <ucfbtj$30keu$2@newsreader4.netcologne.de> <c03aaa08-0b36-451f-836a-2ac6382c2308n@googlegroups.com>
Cleverness: some
X-Newsreader: trn 4.0-test77 (Sep 1, 2010)
Originator: johnl@iecc.com (John Levine)
 by: John Levine - Sun, 27 Aug 2023 18:09 UTC

According to MitchAlsup <MitchAlsup@aol.com>:
>> Today's supercomputing is mainly about graphics cards, for those
>> workloads where this is an advantage. ...

Right, for trivially paralellizable workloads, GPUs are great. (It's not
the workload that's trivial, it's finding work to do in parallel that is.)

Europe's latest supercomputer is LUMI which combines 200,000 AMD CPU cores with
some number of AMD Instinct GPUs. It's located in northern Finland so the
electricity comes from a hydro station and the waste heat will provide 20%
of the local city's district heating so it's nominally carbon negative.

It's doing a lot of stuff that GPUs can't do or at least can't do cost
effectively.

The fastest supercomputer is Frontier at Oak Ridge, with 8 million AMD cores
and Instinct accelerators.

--
Regards,
John Levine, johnl@taugh.com, Primary Perpetrator of "The Internet for Dummies",
Please consider the environment before reading this e-mail. https://jl.ly

Re: More of my philosophy about CISC and RISC instructions..

<LE1HM.144948$ftCb.127994@fx34.iad>

  copy mid

https://news.novabbs.org/devel/article-flat.php?id=33857&group=comp.arch#33857

  copy link   Newsgroups: comp.arch
Path: i2pn2.org!i2pn.org!usenet.blueworldhosting.com!diablo1.usenet.blueworldhosting.com!peer02.iad!feed-me.highwinds-media.com!news.highwinds-media.com!fx34.iad.POSTED!not-for-mail
X-newsreader: xrn 9.03-beta-14-64bit
Sender: scott@dragon.sl.home (Scott Lurndal)
From: scott@slp53.sl.home (Scott Lurndal)
Reply-To: slp53@pacbell.net
Subject: Re: More of my philosophy about CISC and RISC instructions..
Newsgroups: comp.arch
References: <9dd7cbc7-ec85-4a8b-84f1-266cb47575b2n@googlegroups.com> <01c75f2e-0471-4f2e-88fb-f45d9020138bn@googlegroups.com> <ubhqcc$375ak$1@dont-email.me> <23b38342-5d35-4fb5-97fb-3cb20432d343n@googlegroups.com> <b7b07f28-e2c7-4487-a6b2-69ebfc8d2378n@googlegroups.com> <uceea7$10mg7$1@dont-email.me> <uceqbq$126oc$1@dont-email.me> <fb1da779-2ba5-4f0b-b696-429338ada033n@googlegroups.com> <GIKGM.314626$Fgta.135409@fx10.iad> <dd281d06-b423-4a36-8895-6962f5fee186n@googlegroups.com>
Lines: 61
Message-ID: <LE1HM.144948$ftCb.127994@fx34.iad>
X-Complaints-To: abuse@usenetserver.com
NNTP-Posting-Date: Mon, 28 Aug 2023 13:52:43 UTC
Organization: UsenetServer - www.usenetserver.com
Date: Mon, 28 Aug 2023 13:52:43 GMT
X-Received-Bytes: 3525
 by: Scott Lurndal - Mon, 28 Aug 2023 13:52 UTC

MitchAlsup <MitchAlsup@aol.com> writes:
>On Sunday, August 27, 2023 at 11:19:54=E2=80=AFAM UTC-5, Scott Lurndal wrot=
>e:
>> MitchAlsup <Mitch...@aol.com> writes:=20
>> >On Sunday, August 27, 2023 at 1:27:10=3DE2=3D80=3DAFAM UTC-5, Stephen Fu=
>ld wrote:=20
>> >> On 8/26/2023 8:01 PM, BGB wrote:=3D20=20
>> >> > On 8/26/2023 5:11 PM, luke.l...@gmail.com wrote:=3D20=20
>> >> >> On Wednesday, August 16, 2023 at 7:15:42=3DE2=3D80=3DAFPM UTC+1, Mi=
>tchAlsup =3D=20
>> >wrote:=3D20=20
>> >> >>=3D20=20
>> >> >>> Conversely all static VLIW forms have failed.=3D20=20
>> >> >>=3D20=20
>> >> >> could you give examples of "static VLIW"? i am currently=3D20=20
>> >> >> designing an 8-bit ISA, which is so insanely short it may=3D20=20
>> >> >> have to go VLIW, and would prefer it greatly if i actually=3D20=20
>> >> >> knew what had and hadn't failed in this space :)=3D20=20
>> >> >>=3D20=20
>> >> >=3D20=20
>> >> > I guess, unambiguously failed:=3D20=20
>> >> > IA-64;=3D20=20
>> >> > TriMedia;=3D20=20
>> >> > Transmeta (VLIW internally, x86 as surface-level ISA).=3D20=20
>> >> >=3D20=20
>> >> > Moderate success:=3D20=20
>> >> > TMS320C6x=3D20=20
>> >> > Qualcomm Hexagon=3D20=20
>> >> >=3D20=20
>> >> > Mixed:=3D20=20
>> >> > ESP32=3D20=20
>> >> > Its official successor jumped to RISC-V instead=3D20=20
>> >> > Apparently as a single-issue RISC core.=3D20=20
>> >> > But, ESP32 still seems to be popular.=3D20=20
>> >> > AMD TeraScale=3D20=20
>> >> > Replaced with GCN:=3D20=20
>> >> > GCN was apparently a single-issue scalar design;=3D20=20
>> >> > Followed by RDNA, which apparently kept GCN's ISA.=3D20=20
>> >> >=3D20=20
>> >> >=3D20=20
>> >> > But, I can also note that the vast majority of RISC's that had once=
>=3D20
>> >> > existed have also effectively died off...=20
>> ><
>> >> I think that it is also true that the vast majority of CISCs that had=
>=3D20=20
>> >> once existed have also effectively died off... :-(=3D20
>> ><=20
>> >One could argue that the original x86 has also died off, and that=20
>> >all but a few of the CISCs have similarly died off, especially the=20
>> >stack machines.
><
>> Some stack machines still exist, mostly now in emulation.=20
><
>Would you consider a fountain using an electrical pump to be "like"=20
>Heron's fountain ??

I wouldn't try to make that analogy in the first place.

The software running on the system believes it is running on
a stack machine. Virtual or otherwise.

Re: More of my philosophy about CISC and RISC instructions..

<3ffd2ea1-8e3a-4bf1-84cb-b6f75c15afffn@googlegroups.com>

  copy mid

https://news.novabbs.org/devel/article-flat.php?id=33858&group=comp.arch#33858

  copy link   Newsgroups: comp.arch
X-Received: by 2002:ad4:550b:0:b0:640:14ce:e0dd with SMTP id pz11-20020ad4550b000000b0064014cee0ddmr724629qvb.1.1693236359157;
Mon, 28 Aug 2023 08:25:59 -0700 (PDT)
X-Received: by 2002:a05:6a00:17a8:b0:68a:5449:73f9 with SMTP id
s40-20020a056a0017a800b0068a544973f9mr9070222pfg.2.1693236358895; Mon, 28 Aug
2023 08:25:58 -0700 (PDT)
Path: i2pn2.org!i2pn.org!usenet.blueworldhosting.com!diablo1.usenet.blueworldhosting.com!peer02.iad!feed-me.highwinds-media.com!news.highwinds-media.com!news-out.google.com!nntp.google.com!postnews.google.com!google-groups.googlegroups.com!not-for-mail
Newsgroups: comp.arch
Date: Mon, 28 Aug 2023 08:25:58 -0700 (PDT)
In-Reply-To: <ecfae16e-dbd9-4c67-8eb5-7c0310968152n@googlegroups.com>
Injection-Info: google-groups.googlegroups.com; posting-host=199.203.251.52; posting-account=ow8VOgoAAAAfiGNvoH__Y4ADRwQF1hZW
NNTP-Posting-Host: 199.203.251.52
References: <9dd7cbc7-ec85-4a8b-84f1-266cb47575b2n@googlegroups.com>
<7d6e8035-0402-4f29-ae39-c467cfa4245cn@googlegroups.com> <bab02209-0dc4-492e-8cf2-ede14635e4d4n@googlegroups.com>
<ba6c5d7b-80f5-49d5-84a5-4fa92ee7f86bn@googlegroups.com> <ubesqp$2m85c$1@dont-email.me>
<23c1700c-e477-4130-b7f4-d8559f85165an@googlegroups.com> <ubgcn8$2tniu$1@dont-email.me>
<06e13dbd-db31-49e6-8f1d-262b0fc277b8n@googlegroups.com> <ubgqjp$2vppn$1@dont-email.me>
<01c75f2e-0471-4f2e-88fb-f45d9020138bn@googlegroups.com> <ubhqcc$375ak$1@dont-email.me>
<23b38342-5d35-4fb5-97fb-3cb20432d343n@googlegroups.com> <b7b07f28-e2c7-4487-a6b2-69ebfc8d2378n@googlegroups.com>
<ecfae16e-dbd9-4c67-8eb5-7c0310968152n@googlegroups.com>
User-Agent: G2/1.0
MIME-Version: 1.0
Message-ID: <3ffd2ea1-8e3a-4bf1-84cb-b6f75c15afffn@googlegroups.com>
Subject: Re: More of my philosophy about CISC and RISC instructions..
From: already5chosen@yahoo.com (Michael S)
Injection-Date: Mon, 28 Aug 2023 15:25:59 +0000
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable
X-Received-Bytes: 2827
 by: Michael S - Mon, 28 Aug 2023 15:25 UTC

On Sunday, August 27, 2023 at 2:46:40 AM UTC+3, MitchAlsup wrote:
> On Saturday, August 26, 2023 at 5:11:44 PM UTC-5, luke.l...@gmail..com wrote:
> > On Wednesday, August 16, 2023 at 7:15:42 PM UTC+1, MitchAlsup wrote:
> >
> > > Conversely all static VLIW forms have failed.
> >
> > could you give examples of "static VLIW"? i am currently
> > designing an 8-bit ISA, which is so insanely short it may
> > have to go VLIW, and would prefer it greatly if i actually
> > knew what had and hadn't failed in this space :)
> >
> Multiflow
> Cydrome
> I860

Only 2 verbs per instruction group.

> Trimedia
> SHARC

SHARC/HammerSHARC are not VLIW. One can argue that they are LIW.
Another ADI architecture, TigerSHARC, is VLIW, but uncharacteristically
narrow one.

> Itanic

It lacks one of the key characteristics of VLIW - exposed pipeline.

> Elbrus
> and maybe MILL
> > l.

Re: More of my philosophy about CISC and RISC instructions..

<ucimf9$1plna$1@dont-email.me>

  copy mid

https://news.novabbs.org/devel/article-flat.php?id=33861&group=comp.arch#33861

  copy link   Newsgroups: comp.arch
Path: i2pn2.org!i2pn.org!weretis.net!feeder8.news.weretis.net!eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: sfuld@alumni.cmu.edu.invalid (Stephen Fuld)
Newsgroups: comp.arch
Subject: Re: More of my philosophy about CISC and RISC instructions..
Date: Mon, 28 Aug 2023 10:45:12 -0700
Organization: A noiseless patient Spider
Lines: 83
Message-ID: <ucimf9$1plna$1@dont-email.me>
References: <9dd7cbc7-ec85-4a8b-84f1-266cb47575b2n@googlegroups.com>
<7c9ef480-989b-4cbb-ac7f-db8f3749e8f1n@googlegroups.com>
<d1fc890d-e4e5-4afd-8718-86143628ea2an@googlegroups.com>
<ubdrp6$2e6ik$1@dont-email.me>
<3b6e5488-022e-4299-ab6b-70a7436bb004n@googlegroups.com>
<ubn20o$5d2d$1@dont-email.me>
<2e6de6de-b408-4aa7-a430-ead3f983ed78n@googlegroups.com>
<8eee43b7-cb96-49c1-9735-4bb8c004b5c3n@googlegroups.com>
<47020cd0-2ee6-4c51-91a0-885ba719137cn@googlegroups.com>
<8m4EM.686037$TPw2.506418@fx17.iad> <ubqphs$u0gp$1@dont-email.me>
<4941705f-ac14-4f98-b3d1-6fa62bdb4236n@googlegroups.com>
<uc3etu$2jjeu$1@dont-email.me>
<130086d6-3ea6-41de-a7d6-6e68317c24c5n@googlegroups.com>
<ucd0p2$jjfb$1@dont-email.me>
<1d34de28-7fdf-4368-bb34-f7b69aeb4f89n@googlegroups.com>
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8; format=flowed
Content-Transfer-Encoding: 8bit
Injection-Date: Mon, 28 Aug 2023 17:45:13 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="9b1a4d3d5004ecdb44ea4a60bdfce027";
logging-data="1890026"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX19MEW5GJjnp0hmtraSq6w9nkdaI+EjUIpw="
User-Agent: Mozilla Thunderbird
Cancel-Lock: sha1:oNmMYT93f3pMq+IXnDzXM5tXbLc=
In-Reply-To: <1d34de28-7fdf-4368-bb34-f7b69aeb4f89n@googlegroups.com>
Content-Language: en-US
 by: Stephen Fuld - Mon, 28 Aug 2023 17:45 UTC

On 8/26/2023 12:33 PM, MitchAlsup wrote:
> On Saturday, August 26, 2023 at 9:04:22 AM UTC-5, Stephen Fuld wrote:
>> On 8/24/2023 12:02 PM, MitchAlsup wrote:
>>> On Tuesday, August 22, 2023 at 6:04:34 PM UTC-5, Paul A. Clayton wrote:
>>>> On 8/19/23 12:31 PM, MitchAlsup wrote:
>>> [snip]
>>>>
>>>> While you, Mitch, have argued persuasively for a unified register
>>>> set, there are some benefits to architectural specialization. Of
>>>> course, microarchitectural specialization can be applied if there
>>>> is a natural idiom which can be easily detected. An artificial
>>>> convention (optimization recommendation) can also provide such an
>>>> idiom.
>>> <
>>> Allow me to clarify::
>>> <
>>> I am not trying to create and architecture which is
>>> a) a marvelous microcontroller CPU
>>> b) a marvelous vector supercomputer CPU
> <
>> OK, I will ask. If you were trying to create a marvelous vector
>> supercomputer CPU, how would it be different from MY66000?
> <
> What an intriguing question !!

I thought you might like it. :-)

>>
>> Specifically, would you still use the VVM mechanism? How different
>> would the ISA be (or would it just use more FP functional units and
>> perhaps more/bigger buffers for VVM use?)? Would you provide full
>> support for 128 bit FP? etc., etc.
> <
> After thinking about this for an hour::
> <
> ISA would probably be pretty much the same, the memory system and
> interconnect would be vastly beefier. I would shoot for a cache line
> width of FPUs (8)×{FADD, FMAC, FDIV/SQRT} 8 to 16 cache line staging
> buffers, 4 AGENs per cycle, all feeding off the 1MB 16-banked L2, taking
> 4 caches misses per cycle. Then over in the memory/DRAM area there
> would be a minimum of 16 DIMMs (or HBMs) operating at 2 speed
> grades below maximum BW DDR <of that generation> could muster.
> Every lane would be capable of integer and logical calculations.

Your emphasis on memory bandwidth is in line with Seymour Cray's designs
for CDC and Cray, where the vector instructions were supported by a huge
(for the time) memory bandwidth. Of course, you have the "advantage" of
things like DRAMS which have faster access to "close" or even sequential
locations that Cray's SRAM based and lack of cache designs didn't.

This all raises a question about modern supercomputers. As others have
pointed out, most of them toady have large numbers of relatively
standard CPU chips, with or without GPUs with some kink of high speed
interconnect. So the question is, how much of requirement for the large
number of chips is due to adding compute power versus adding memory
bandwidth to the system?

> <
> After writing the above and more thought, I can't see any changes in
> ISA, as we already get gather (LDD->LD) and Scater (LDD-ST) falling
> out for free.

A question about the workloads. Is the use of scatter/gather to
multiple arbitrary locations, or to locations at some "fixed" stride.
If the latter, would it make sens to add some sort of stride detector to
the VVM buffers?

> <
> My hope would be that it would not melt when operating at full throughput.

:-)

--
- Stephen Fuld
(e-mail address disguised to prevent spam)

Re: More of my philosophy about CISC and RISC instructions..

<ucip45$1qiio$1@dont-email.me>

  copy mid

https://news.novabbs.org/devel/article-flat.php?id=33862&group=comp.arch#33862

  copy link   Newsgroups: comp.arch
Path: i2pn2.org!i2pn.org!eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: cr88192@gmail.com (BGB)
Newsgroups: comp.arch
Subject: Re: More of my philosophy about CISC and RISC instructions..
Date: Mon, 28 Aug 2023 13:30:24 -0500
Organization: A noiseless patient Spider
Lines: 88
Message-ID: <ucip45$1qiio$1@dont-email.me>
References: <9dd7cbc7-ec85-4a8b-84f1-266cb47575b2n@googlegroups.com>
<7d6e8035-0402-4f29-ae39-c467cfa4245cn@googlegroups.com>
<bab02209-0dc4-492e-8cf2-ede14635e4d4n@googlegroups.com>
<ba6c5d7b-80f5-49d5-84a5-4fa92ee7f86bn@googlegroups.com>
<ubesqp$2m85c$1@dont-email.me>
<23c1700c-e477-4130-b7f4-d8559f85165an@googlegroups.com>
<ubgcn8$2tniu$1@dont-email.me>
<06e13dbd-db31-49e6-8f1d-262b0fc277b8n@googlegroups.com>
<ubgqjp$2vppn$1@dont-email.me>
<01c75f2e-0471-4f2e-88fb-f45d9020138bn@googlegroups.com>
<ubhqcc$375ak$1@dont-email.me>
<23b38342-5d35-4fb5-97fb-3cb20432d343n@googlegroups.com>
<b7b07f28-e2c7-4487-a6b2-69ebfc8d2378n@googlegroups.com>
<ecfae16e-dbd9-4c67-8eb5-7c0310968152n@googlegroups.com>
<3ffd2ea1-8e3a-4bf1-84cb-b6f75c15afffn@googlegroups.com>
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8; format=flowed
Content-Transfer-Encoding: 8bit
Injection-Date: Mon, 28 Aug 2023 18:30:29 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="34de03c70e71cb5409835def68793f89";
logging-data="1919576"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1+fL/AV5UvZZAZtAFk5bXAO"
User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:102.0) Gecko/20100101
Thunderbird/102.14.0
Cancel-Lock: sha1:vmCDn0Xqx/XHWoWMnuCY9ANm4z8=
In-Reply-To: <3ffd2ea1-8e3a-4bf1-84cb-b6f75c15afffn@googlegroups.com>
Content-Language: en-US
 by: BGB - Mon, 28 Aug 2023 18:30 UTC

On 8/28/2023 10:25 AM, Michael S wrote:
> On Sunday, August 27, 2023 at 2:46:40 AM UTC+3, MitchAlsup wrote:
>> On Saturday, August 26, 2023 at 5:11:44 PM UTC-5, luke.l...@gmail.com wrote:
>>> On Wednesday, August 16, 2023 at 7:15:42 PM UTC+1, MitchAlsup wrote:
>>>
>>>> Conversely all static VLIW forms have failed.
>>>
>>> could you give examples of "static VLIW"? i am currently
>>> designing an 8-bit ISA, which is so insanely short it may
>>> have to go VLIW, and would prefer it greatly if i actually
>>> knew what had and hadn't failed in this space :)
>>>
>> Multiflow
>> Cydrome
>> I860
>
> Only 2 verbs per instruction group.
>
>> Trimedia
>> SHARC
>
> SHARC/HammerSHARC are not VLIW. One can argue that they are LIW.
> Another ADI architecture, TigerSHARC, is VLIW, but uncharacteristically
> narrow one.
>

If one respects the LIW/VLIW distinction, than LIW machines (2 or 3 wide
with groups of 1 to 3 RISC-like instructions able to be flagged to
execute in parallel) likely far outweigh "actual" VLIW machines in terms
of use (eg: 6 or 8 wide, often with a fixed-size / densely packed bundle
format).

So, likely, things Hexagon and Tensilica/Xtensa would more likely fall
under the LIW camp in this sense.

>> Itanic
>
> It lacks one of the key characteristics of VLIW - exposed pipeline.
>

If you mean the property that trying to use a result before it is ready
will cause the pipeline to be delayed until the result is ready (like
RISC's and most other non-VLIW ISAs), rather than give a stale value
(many other VLIWs).

Then, yes, my BJX2 ISA also has this property FWIW.

There is also an imposed restraint that bundled instructions will be
required to give the same results as-if the instructions had been
executed sequentially on a serial machine.

Though, this still does leave the matter of instructions which can't
really be implemented on a 1-wide implementation, like MOV.X (which
requires at least a 4R2W register file to pull off; or alternatively a
mechanism to break it apart into multiple logical instructions, *).

*: I have noted that by the time one pays for a larger register file
(and associated decode-path machinery), they almost may as well throw in
another ALU and have a lane there (the cost of the register ports being
seemingly more significant than the cost difference of adding/removing
an ALU and similar).

Well, and one also needs to pay the cost of a more expensive register
file to be able to support some bigger/more-complex instructions (and it
would be silly to have a 6R2W register file on a 1-wide machine just for
sake of 128-bit SIMD FMAC...).

Granted, one "could" argue for a machine with 2R1W but with native
128-bit register ports (rather than, say, doing 128-bit inputs/outputs
by using register ports in pairs).

Under the LIW / VLIW distinction, BJX2 would also fall on the LIW side.

Things like exposing instruction latency within the pipeline, or things
like branch-delay slots and similar, weren't really a bridge I was
willing to cross.

>> Elbrus
>> and maybe MILL
>>> l.

Re: More of my philosophy about CISC and RISC instructions..

<160e5672-554f-4836-a390-9b3cd946c6c3n@googlegroups.com>

  copy mid

https://news.novabbs.org/devel/article-flat.php?id=33863&group=comp.arch#33863

  copy link   Newsgroups: comp.arch
X-Received: by 2002:a05:6214:ab3:b0:64f:3e89:5060 with SMTP id ew19-20020a0562140ab300b0064f3e895060mr775944qvb.1.1693250306086;
Mon, 28 Aug 2023 12:18:26 -0700 (PDT)
X-Received: by 2002:a05:6870:5a92:b0:1bb:4d41:e929 with SMTP id
dt18-20020a0568705a9200b001bb4d41e929mr749528oab.3.1693250305831; Mon, 28 Aug
2023 12:18:25 -0700 (PDT)
Path: i2pn2.org!i2pn.org!usenet.blueworldhosting.com!diablo1.usenet.blueworldhosting.com!peer02.iad!feed-me.highwinds-media.com!news.highwinds-media.com!news-out.google.com!nntp.google.com!postnews.google.com!google-groups.googlegroups.com!not-for-mail
Newsgroups: comp.arch
Date: Mon, 28 Aug 2023 12:18:25 -0700 (PDT)
In-Reply-To: <ucip45$1qiio$1@dont-email.me>
Injection-Info: google-groups.googlegroups.com; posting-host=2600:1700:291:29f0:44e6:31cb:83df:64b7;
posting-account=H_G_JQkAAADS6onOMb-dqvUozKse7mcM
NNTP-Posting-Host: 2600:1700:291:29f0:44e6:31cb:83df:64b7
References: <9dd7cbc7-ec85-4a8b-84f1-266cb47575b2n@googlegroups.com>
<7d6e8035-0402-4f29-ae39-c467cfa4245cn@googlegroups.com> <bab02209-0dc4-492e-8cf2-ede14635e4d4n@googlegroups.com>
<ba6c5d7b-80f5-49d5-84a5-4fa92ee7f86bn@googlegroups.com> <ubesqp$2m85c$1@dont-email.me>
<23c1700c-e477-4130-b7f4-d8559f85165an@googlegroups.com> <ubgcn8$2tniu$1@dont-email.me>
<06e13dbd-db31-49e6-8f1d-262b0fc277b8n@googlegroups.com> <ubgqjp$2vppn$1@dont-email.me>
<01c75f2e-0471-4f2e-88fb-f45d9020138bn@googlegroups.com> <ubhqcc$375ak$1@dont-email.me>
<23b38342-5d35-4fb5-97fb-3cb20432d343n@googlegroups.com> <b7b07f28-e2c7-4487-a6b2-69ebfc8d2378n@googlegroups.com>
<ecfae16e-dbd9-4c67-8eb5-7c0310968152n@googlegroups.com> <3ffd2ea1-8e3a-4bf1-84cb-b6f75c15afffn@googlegroups.com>
<ucip45$1qiio$1@dont-email.me>
User-Agent: G2/1.0
MIME-Version: 1.0
Message-ID: <160e5672-554f-4836-a390-9b3cd946c6c3n@googlegroups.com>
Subject: Re: More of my philosophy about CISC and RISC instructions..
From: MitchAlsup@aol.com (MitchAlsup)
Injection-Date: Mon, 28 Aug 2023 19:18:26 +0000
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable
X-Received-Bytes: 7690
 by: MitchAlsup - Mon, 28 Aug 2023 19:18 UTC

On Monday, August 28, 2023 at 1:30:33 PM UTC-5, BGB wrote:
> On 8/28/2023 10:25 AM, Michael S wrote:
> > On Sunday, August 27, 2023 at 2:46:40 AM UTC+3, MitchAlsup wrote:
> >> On Saturday, August 26, 2023 at 5:11:44 PM UTC-5, luke.l...@gmail.com wrote:
> >>> On Wednesday, August 16, 2023 at 7:15:42 PM UTC+1, MitchAlsup wrote:
> >>>
> >>>> Conversely all static VLIW forms have failed.
> >>>
> >>> could you give examples of "static VLIW"? i am currently
> >>> designing an 8-bit ISA, which is so insanely short it may
> >>> have to go VLIW, and would prefer it greatly if i actually
> >>> knew what had and hadn't failed in this space :)
> >>>
> >> Multiflow
> >> Cydrome
> >> I860
> >
> > Only 2 verbs per instruction group.
> >
> >> Trimedia
> >> SHARC
> >
> > SHARC/HammerSHARC are not VLIW. One can argue that they are LIW.
> > Another ADI architecture, TigerSHARC, is VLIW, but uncharacteristically
> > narrow one.
> >
> If one respects the LIW/VLIW distinction, than LIW machines (2 or 3 wide
> with groups of 1 to 3 RISC-like instructions able to be flagged to
> execute in parallel) likely far outweigh "actual" VLIW machines in terms
> of use (eg: 6 or 8 wide, often with a fixed-size / densely packed bundle
> format).
>
>
> So, likely, things Hexagon and Tensilica/Xtensa would more likely fall
> under the LIW camp in this sense.
> >> Itanic
> >
> > It lacks one of the key characteristics of VLIW - exposed pipeline.
> >
> If you mean the property that trying to use a result before it is ready
> will cause the pipeline to be delayed until the result is ready (like
> RISC's and most other non-VLIW ISAs), rather than give a stale value
> (many other VLIWs).
>
>
> Then, yes, my BJX2 ISA also has this property FWIW.
<
The former (stall) or the later (stale) ??
>
> There is also an imposed restraint that bundled instructions will be
> required to give the same results as-if the instructions had been
> executed sequentially on a serial machine.
<
This tenet prevents the use of stale.
>
> Though, this still does leave the matter of instructions which can't
> really be implemented on a 1-wide implementation, like MOV.X (which
> requires at least a 4R2W register file to pull off; or alternatively a
> mechanism to break it apart into multiple logical instructions, *).
>
I disagree (in the small) I still consider my 66100 to be a 1-wide machine
even when using CARRY spread across several serial instructions. This
may appear as 4R2W but the µArchitecture has a Carry-Loop which
provides 1R1W without consuming RF ports (right then and there).
{{On the low end machines, CARRY itself supplies the 1R, after the
Carry-Shadow, forwarding supplies the 1R, and sooner or later there
will be an instruction without a result and Carry can steal that port on
this cycle. Higher end machines have many more options and this is
seldom an issue.}}
>
> *: I have noted that by the time one pays for a larger register file
> (and associated decode-path machinery), they almost may as well throw in
> another ALU and have a lane there (the cost of the register ports being
> seemingly more significant than the cost difference of adding/removing
> an ALU and similar).
<
The area footprint of an ALU is smaller that the area footprint delta of
adding 1 more read port to an already 3 read ported file. The ALU
footprint is way smaller than adding a second write port to a 1W RF.
So, the ALU is "almost free". In many of my machines, the forwarding
logic was bigger than the integer ALU. In my 6-wide and 8-wide designs
we simply threw an ALU on the front of every (but 1) function unit.
<
AGEN units can do integer additions and have integer shifters (we
typically call aligners).
FADD is so big you can hardly see an integer ALU pasted on the front.
FMAC is even bigger than FADD....
*CMP IS an integer ALU (both integer and FP with a few special checks)
.....
>
> Well, and one also needs to pay the cost of a more expensive register
> file to be able to support some bigger/more-complex instructions (and it
> would be silly to have a 6R2W register file on a 1-wide machine just for
> sake of 128-bit SIMD FMAC...).
<
It is beyond silly.....and when the ISA supports generous constants, a
RF with 6R ports should be supporting a 3-wide µArchitecture.
>
> Granted, one "could" argue for a machine with 2R1W but with native
> 128-bit register ports (rather than, say, doing 128-bit inputs/outputs
> by using register ports in pairs).
>
You are making assumptions that DECODE is setup ONLY to push decoded
instructions into execution and/or that it has no facility to repeatedly
perform a "cycle of work" until it is time to move forward. Under your
assumption, what you said is true enough, it is not when DECODE can
sequence an instruction into execution.
>
> Under the LIW / VLIW distinction, BJX2 would also fall on the LIW side.
>
>
> Things like exposing instruction latency within the pipeline, or things
> like branch-delay slots and similar, weren't really a bridge I was
> willing to cross.
<
It is not the exposing (code scheduling uses exposure of the pipeline details)
It is the Architecture not presenting a vonNeumann execution paradigm!
If the code sequence is wrong you get stale data rather than architectural
data.
<
> >> Elbrus
> >> and maybe MILL
> >>> l.

Re: More of my philosophy about CISC and RISC instructions..

<uckb14$26517$1@dont-email.me>

  copy mid

https://news.novabbs.org/devel/article-flat.php?id=33865&group=comp.arch#33865

  copy link   Newsgroups: comp.arch
Path: i2pn2.org!i2pn.org!eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: cr88192@gmail.com (BGB)
Newsgroups: comp.arch
Subject: Re: More of my philosophy about CISC and RISC instructions..
Date: Tue, 29 Aug 2023 03:42:09 -0500
Organization: A noiseless patient Spider
Lines: 340
Message-ID: <uckb14$26517$1@dont-email.me>
References: <9dd7cbc7-ec85-4a8b-84f1-266cb47575b2n@googlegroups.com>
<7d6e8035-0402-4f29-ae39-c467cfa4245cn@googlegroups.com>
<bab02209-0dc4-492e-8cf2-ede14635e4d4n@googlegroups.com>
<ba6c5d7b-80f5-49d5-84a5-4fa92ee7f86bn@googlegroups.com>
<ubesqp$2m85c$1@dont-email.me>
<23c1700c-e477-4130-b7f4-d8559f85165an@googlegroups.com>
<ubgcn8$2tniu$1@dont-email.me>
<06e13dbd-db31-49e6-8f1d-262b0fc277b8n@googlegroups.com>
<ubgqjp$2vppn$1@dont-email.me>
<01c75f2e-0471-4f2e-88fb-f45d9020138bn@googlegroups.com>
<ubhqcc$375ak$1@dont-email.me>
<23b38342-5d35-4fb5-97fb-3cb20432d343n@googlegroups.com>
<b7b07f28-e2c7-4487-a6b2-69ebfc8d2378n@googlegroups.com>
<ecfae16e-dbd9-4c67-8eb5-7c0310968152n@googlegroups.com>
<3ffd2ea1-8e3a-4bf1-84cb-b6f75c15afffn@googlegroups.com>
<ucip45$1qiio$1@dont-email.me>
<160e5672-554f-4836-a390-9b3cd946c6c3n@googlegroups.com>
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8; format=flowed
Content-Transfer-Encoding: 8bit
Injection-Date: Tue, 29 Aug 2023 08:42:12 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="a03c254b8130bc3d024bf75004f75064";
logging-data="2298919"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1+UUVQd0SPJZTQRNEy2GHgI"
User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:102.0) Gecko/20100101
Thunderbird/102.14.0
Cancel-Lock: sha1:J+2X6YupsM599QuFsXMaK/kvzXU=
In-Reply-To: <160e5672-554f-4836-a390-9b3cd946c6c3n@googlegroups.com>
Content-Language: en-US
 by: BGB - Tue, 29 Aug 2023 08:42 UTC

On 8/28/2023 2:18 PM, MitchAlsup wrote:
> On Monday, August 28, 2023 at 1:30:33 PM UTC-5, BGB wrote:
>> On 8/28/2023 10:25 AM, Michael S wrote:
>>> On Sunday, August 27, 2023 at 2:46:40 AM UTC+3, MitchAlsup wrote:
>>>> On Saturday, August 26, 2023 at 5:11:44 PM UTC-5, luke.l...@gmail.com wrote:
>>>>> On Wednesday, August 16, 2023 at 7:15:42 PM UTC+1, MitchAlsup wrote:
>>>>>
>>>>>> Conversely all static VLIW forms have failed.
>>>>>
>>>>> could you give examples of "static VLIW"? i am currently
>>>>> designing an 8-bit ISA, which is so insanely short it may
>>>>> have to go VLIW, and would prefer it greatly if i actually
>>>>> knew what had and hadn't failed in this space :)
>>>>>
>>>> Multiflow
>>>> Cydrome
>>>> I860
>>>
>>> Only 2 verbs per instruction group.
>>>
>>>> Trimedia
>>>> SHARC
>>>
>>> SHARC/HammerSHARC are not VLIW. One can argue that they are LIW.
>>> Another ADI architecture, TigerSHARC, is VLIW, but uncharacteristically
>>> narrow one.
>>>
>> If one respects the LIW/VLIW distinction, than LIW machines (2 or 3 wide
>> with groups of 1 to 3 RISC-like instructions able to be flagged to
>> execute in parallel) likely far outweigh "actual" VLIW machines in terms
>> of use (eg: 6 or 8 wide, often with a fixed-size / densely packed bundle
>> format).
>>
>>
>> So, likely, things Hexagon and Tensilica/Xtensa would more likely fall
>> under the LIW camp in this sense.
>>>> Itanic
>>>
>>> It lacks one of the key characteristics of VLIW - exposed pipeline.
>>>
>> If you mean the property that trying to use a result before it is ready
>> will cause the pipeline to be delayed until the result is ready (like
>> RISC's and most other non-VLIW ISAs), rather than give a stale value
>> (many other VLIWs).
>>
>>
>> Then, yes, my BJX2 ISA also has this property FWIW.
> <
> The former (stall) or the later (stale) ??

It will stall the "front end" of the pipeline until the former
instructions have completed.

So, dependencies between bundles will be resolved using pipeline stalls
much like sequential instructions in a more traditional architecture
(and scheduling for latency is mostly used to avoid wasting clock-cycles
via pipeline stalls).

>>
>> There is also an imposed restraint that bundled instructions will be
>> required to give the same results as-if the instructions had been
>> executed sequentially on a serial machine.
> <
> This tenet prevents the use of stale.

Yes.

Stale results will not be visible within any well-formed instruction
sequence.

Bundles which attempt to modify an register and then reference it as an
input within the same bundle could potentially see stale values but are
not allowed (except in cases where predication makes the instructions
mutually exclusive; effectively preventing any stale results from being
possible).

So:
ADD R4, 1, R5 | SUB R5, 1, R6
Is invalid.

But:
ADD?T R4, 1, R5 | SUB?F R5, 1, R6
Is allowed, as these instructions are mutually exclusive.

Technically, ill-formed bundles which violate the above rules may result
in stale data (in the verilog implementation). However, my emulator has
a lint feature to detect these cases and turn them into breakpoints, and
they are still considered as invalid as per the ISA spec.

Since my emulator is used quite extensively in my testing, code which
has such bundles (such as by poorly written ASM or compiler bugs) is
unlikely to go unnoticed.

For a wider VLIW, it would be "more compelling" to relax some of these
rules, but "there be dragons here..."

So, current plan remains to, if a wider implementation is needed, to
likely go over to an OoO strategy instead (likely leaving 3 as the
"canonical width" even if the underlying machine is wider).

>>
>> Though, this still does leave the matter of instructions which can't
>> really be implemented on a 1-wide implementation, like MOV.X (which
>> requires at least a 4R2W register file to pull off; or alternatively a
>> mechanism to break it apart into multiple logical instructions, *).
>>
> I disagree (in the small) I still consider my 66100 to be a 1-wide machine
> even when using CARRY spread across several serial instructions. This
> may appear as 4R2W but the µArchitecture has a Carry-Loop which
> provides 1R1W without consuming RF ports (right then and there).
> {{On the low end machines, CARRY itself supplies the 1R, after the
> Carry-Shadow, forwarding supplies the 1R, and sooner or later there
> will be an instruction without a result and Carry can steal that port on
> this cycle. Higher end machines have many more options and this is
> seldom an issue.}}

My pipeline is naive:
There is a 1:1 mapping between instructions/bundles and the pipeline;
Any instruction can be executed needs all the register ports to serve
the instruction in question;
There is not currently any mechanism to "decompose" any instructions;
....

It is possible that an "instruction splitter" could be added to the ID2
stage. The most likely option would be to require any such instructions
to be "scalar only", and then use the Lanes 2/3 signals to encode the steps.

Say:
Executes Lane 3 op first (if not NOP);
Executes Lane 2 op second (if not NOP);
Then lets Lane 1 op pass through.
With the Lane 2/3 ops being forwarded via Lane 1.

In this case, multi-stage ops would be handled similar to multi-lane ops
in the decoder (but with different control bits).

ID2 would keep the "front-end" stages stalled while this is happening.

>>
>> *: I have noted that by the time one pays for a larger register file
>> (and associated decode-path machinery), they almost may as well throw in
>> another ALU and have a lane there (the cost of the register ports being
>> seemingly more significant than the cost difference of adding/removing
>> an ALU and similar).
> <
> The area footprint of an ALU is smaller that the area footprint delta of
> adding 1 more read port to an already 3 read ported file. The ALU
> footprint is way smaller than adding a second write port to a 1W RF.
> So, the ALU is "almost free". In many of my machines, the forwarding
> logic was bigger than the integer ALU. In my 6-wide and 8-wide designs
> we simply threw an ALU on the front of every (but 1) function unit.
> <
> AGEN units can do integer additions and have integer shifters (we
> typically call aligners).
> FADD is so big you can hardly see an integer ALU pasted on the front.
> FMAC is even bigger than FADD....
> *CMP IS an integer ALU (both integer and FP with a few special checks)
> ....

My lanes are slightly asymmetric:
Lane 1, does everything;
Lane 2, does some stuff:
ALU, Shift, FPU (if Lane 1 is not FPU);
Lane 2 ALU and Shift can combine with Lane 1 for 128-bit ops;
Similar for FPU;
...
Lane 3, does ALU ops and maybe Shift.
Basically nothing else though.

Lane 3 justifies the 6R register file, and the extra ALU is sometimes
useful for stuff.

Much of the cost of the core is due to "features" (mostly in Lane 1
FUs), rather than due to it being 3 wide (since, as noted, Lane 3 lacks
most of the "fancy" FUs).

To what extent "fancy" features exist in Lane 2, it is mostly because
the required plumbing also needed to exist:
Lane 2 gets FPU mostly because I already needed Lane 2 plumbed into the
FPU for 128-bit SIMD ops (and it doesn't add much cost to allow normal
requests to also be issued from Lane 2).

The low-precision SIMD unit can also allow a potential feature of
co-issuing 2x Binary32 SIMD ops (since this isn't too far off from how
128-bit SIMD works already with this unit).

So, I am debating whether to officially allow things like:
PADD.F R23, R21, R25 | PADD.F R27, R29, R31

Since, technically, the SIMD unit can already do this, just it requires
effectively confirming that, yes, the dedicated SIMD unit "actually
exists" (otherwise, it is ambiguous; the main FPU can also do SIMD, just
it takes 10 clock-cycles to do each SIMD op; vs 3 cycles with the
dedicated unit).

However, it would be a little ambiguous, as this co-issuing would only
be possible for 2x Binary32 SIMD ops, and in certain matching
combinations (can't co-issue Binary16 or Binary64 ops; or mix/match).

Theoretically, could potentially allow mixing SIMD with Scalar-FPU ops
in this case, say:
PADD.H ... | FMUL ...
Since, with the dedicated SIMD unit enabled, these are technically two
different FUs.


Click here to read the complete article
Re: More of my philosophy about CISC and RISC instructions..

<20a9e947-5d3a-47d4-abcf-ff7cd8fb16b9n@googlegroups.com>

  copy mid

https://news.novabbs.org/devel/article-flat.php?id=33875&group=comp.arch#33875

  copy link   Newsgroups: comp.arch
X-Received: by 2002:a05:620a:22ed:b0:76f:1450:230 with SMTP id p13-20020a05620a22ed00b0076f14500230mr20619qki.4.1693427579329;
Wed, 30 Aug 2023 13:32:59 -0700 (PDT)
X-Received: by 2002:a05:6870:d347:b0:1c8:bec5:59c4 with SMTP id
h7-20020a056870d34700b001c8bec559c4mr253109oag.1.1693427579082; Wed, 30 Aug
2023 13:32:59 -0700 (PDT)
Path: i2pn2.org!i2pn.org!usenet.blueworldhosting.com!diablo1.usenet.blueworldhosting.com!peer02.iad!feed-me.highwinds-media.com!news.highwinds-media.com!news-out.google.com!nntp.google.com!postnews.google.com!google-groups.googlegroups.com!not-for-mail
Newsgroups: comp.arch
Date: Wed, 30 Aug 2023 13:32:58 -0700 (PDT)
In-Reply-To: <uckb14$26517$1@dont-email.me>
Injection-Info: google-groups.googlegroups.com; posting-host=2600:1700:291:29f0:3465:37f5:b458:f4c6;
posting-account=H_G_JQkAAADS6onOMb-dqvUozKse7mcM
NNTP-Posting-Host: 2600:1700:291:29f0:3465:37f5:b458:f4c6
References: <9dd7cbc7-ec85-4a8b-84f1-266cb47575b2n@googlegroups.com>
<7d6e8035-0402-4f29-ae39-c467cfa4245cn@googlegroups.com> <bab02209-0dc4-492e-8cf2-ede14635e4d4n@googlegroups.com>
<ba6c5d7b-80f5-49d5-84a5-4fa92ee7f86bn@googlegroups.com> <ubesqp$2m85c$1@dont-email.me>
<23c1700c-e477-4130-b7f4-d8559f85165an@googlegroups.com> <ubgcn8$2tniu$1@dont-email.me>
<06e13dbd-db31-49e6-8f1d-262b0fc277b8n@googlegroups.com> <ubgqjp$2vppn$1@dont-email.me>
<01c75f2e-0471-4f2e-88fb-f45d9020138bn@googlegroups.com> <ubhqcc$375ak$1@dont-email.me>
<23b38342-5d35-4fb5-97fb-3cb20432d343n@googlegroups.com> <b7b07f28-e2c7-4487-a6b2-69ebfc8d2378n@googlegroups.com>
<ecfae16e-dbd9-4c67-8eb5-7c0310968152n@googlegroups.com> <3ffd2ea1-8e3a-4bf1-84cb-b6f75c15afffn@googlegroups.com>
<ucip45$1qiio$1@dont-email.me> <160e5672-554f-4836-a390-9b3cd946c6c3n@googlegroups.com>
<uckb14$26517$1@dont-email.me>
User-Agent: G2/1.0
MIME-Version: 1.0
Message-ID: <20a9e947-5d3a-47d4-abcf-ff7cd8fb16b9n@googlegroups.com>
Subject: Re: More of my philosophy about CISC and RISC instructions..
From: MitchAlsup@aol.com (MitchAlsup)
Injection-Date: Wed, 30 Aug 2023 20:32:59 +0000
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable
X-Received-Bytes: 6989
 by: MitchAlsup - Wed, 30 Aug 2023 20:32 UTC

On Tuesday, August 29, 2023 at 3:42:17 AM UTC-5, BGB wrote:
> On 8/28/2023 2:18 PM, MitchAlsup wrote:
> >> Then, yes, my BJX2 ISA also has this property FWIW.
> > <
> > The former (stall) or the later (stale) ??
> It will stall the "front end" of the pipeline until the former
> instructions have completed.
>
> So, dependencies between bundles will be resolved using pipeline stalls
> much like sequential instructions in a more traditional architecture
> (and scheduling for latency is mostly used to avoid wasting clock-cycles
> via pipeline stalls).
<
Under the nonNeumann paradigm:: Realistically, there are only 2 choices
a) stall before issue
b) stall after issue (in instruction queueing mechanism)
<
> >>
> >> There is also an imposed restraint that bundled instructions will be
> >> required to give the same results as-if the instructions had been
> >> executed sequentially on a serial machine.
> > <
> > This tenet prevents the use of stale.
> Yes.
>
> Stale results will not be visible within any well-formed instruction
> sequence.
>
> Bundles which attempt to modify an register and then reference it as an
> input within the same bundle could potentially see stale values but are
> not allowed (except in cases where predication makes the instructions
> mutually exclusive; effectively preventing any stale results from being
> possible).
<
There are a variety of ways around this, but basically the operand register
consumes a result register and you can mark the operand register so it
knows which result to consume within the instruction issue width.
<
In Mc 88120 we expanded the 5-bit register specifiers to 6-bits.
bit<5> == 0 -> reg[bits<4..0>]
bit<5> == 1 -> slot[bits<3..0>]
doing this saves register porting in normal code ! since once you get
great and big 80%+ of register operands arrive on the forwarding bus
and you don't, then, need to read a register file port.
<
Without this marking you basically n×(n-1) 5-bit comparators to determine
intra-width DECODE register forwarding. {{And in fact, this is what gives
you said markings for the next time(s).}}
>
> So:
> ADD R4, 1, R5 | SUB R5, 1, R6
> Is invalid.
<
But::
ADD R4,1,R5 | SUB S0,1,R6
is !!
>
> But:
> ADD?T R4, 1, R5 | SUB?F R5, 1, R6
> Is allowed, as these instructions are mutually exclusive.
>
>
> Technically, ill-formed bundles which violate the above rules may result
> in stale data (in the verilog implementation). However, my emulator has
> a lint feature to detect these cases and turn them into breakpoints, and
> they are still considered as invalid as per the ISA spec.
<
marking eliminates this--but marking either requires a deeper DECODE pipeline
or it requires storage in the Instruction Cache. A Trace or Packet Cache organ-
ization provides this space at low cost.
>
<snip>
> > I disagree (in the small) I still consider my 66100 to be a 1-wide machine
> > even when using CARRY spread across several serial instructions. This
> > may appear as 4R2W but the µArchitecture has a Carry-Loop which
> > provides 1R1W without consuming RF ports (right then and there).
> > {{On the low end machines, CARRY itself supplies the 1R, after the
> > Carry-Shadow, forwarding supplies the 1R, and sooner or later there
> > will be an instruction without a result and Carry can steal that port on
> > this cycle. Higher end machines have many more options and this is
> > seldom an issue.}}
> My pipeline is naive:
> There is a 1:1 mapping between instructions/bundles and the pipeline;
> Any instruction can be executed needs all the register ports to serve
> the instruction in question;
> There is not currently any mechanism to "decompose" any instructions;
Nor recompose.....
> ...
><snip>
> My lanes are slightly asymmetric:
> Lane 1, does everything;
> Lane 2, does some stuff:
> ALU, Shift, FPU (if Lane 1 is not FPU);
> Lane 2 ALU and Shift can combine with Lane 1 for 128-bit ops;
> Similar for FPU;
> ...
> Lane 3, does ALU ops and maybe Shift.
> Basically nothing else though.
<
This reminds me of the 2-wide superscalar Mc 88110 (and 860)
<
But I got rid of all association between an instruction in the IStream
and which FU it gets routed to. Mc 88120 had 6 FUs {3×AGEN, FADD,
FMUL, CND}. Where we went different, is that the ICache used pre-
routed instructions with annotated registers above so all instructions
in a "packet" could issue (so long as the window was not full.)
<
The inst in slot[j] is performed by FU in slot[j], and uses slot[j] result bus;
It gets its renamed result[j] from history[j] and writes PRF[history[j]] when
complete.
>

Re: More of my philosophy about CISC and RISC instructions..

<ucq9lt$3b3p4$1@dont-email.me>

  copy mid

https://news.novabbs.org/devel/article-flat.php?id=33880&group=comp.arch#33880

  copy link   Newsgroups: comp.arch
Path: i2pn2.org!i2pn.org!eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: sfuld@alumni.cmu.edu.invalid (Stephen Fuld)
Newsgroups: comp.arch
Subject: Re: More of my philosophy about CISC and RISC instructions..
Date: Thu, 31 Aug 2023 07:55:57 -0700
Organization: A noiseless patient Spider
Lines: 16
Message-ID: <ucq9lt$3b3p4$1@dont-email.me>
References: <9dd7cbc7-ec85-4a8b-84f1-266cb47575b2n@googlegroups.com>
<7c9ef480-989b-4cbb-ac7f-db8f3749e8f1n@googlegroups.com>
<d1fc890d-e4e5-4afd-8718-86143628ea2an@googlegroups.com>
<ubdrp6$2e6ik$1@dont-email.me>
<3b6e5488-022e-4299-ab6b-70a7436bb004n@googlegroups.com>
<ubn20o$5d2d$1@dont-email.me>
<2e6de6de-b408-4aa7-a430-ead3f983ed78n@googlegroups.com>
<8eee43b7-cb96-49c1-9735-4bb8c004b5c3n@googlegroups.com>
<47020cd0-2ee6-4c51-91a0-885ba719137cn@googlegroups.com>
<8m4EM.686037$TPw2.506418@fx17.iad> <ubqphs$u0gp$1@dont-email.me>
<4941705f-ac14-4f98-b3d1-6fa62bdb4236n@googlegroups.com>
<uc3etu$2jjeu$1@dont-email.me>
<130086d6-3ea6-41de-a7d6-6e68317c24c5n@googlegroups.com>
<ucd0p2$jjfb$1@dont-email.me>
<1d34de28-7fdf-4368-bb34-f7b69aeb4f89n@googlegroups.com>
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8; format=flowed
Content-Transfer-Encoding: 7bit
Injection-Date: Thu, 31 Aug 2023 14:55:57 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="e53acb899b7c0efb0e9d2ca17b18c58c";
logging-data="3510052"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX18agNClK0ZDZ2rkerGms5A86IEBlMtGhqI="
User-Agent: Mozilla Thunderbird
Cancel-Lock: sha1:LJLZ63023xuhP7yWVMqQlk2ZdEk=
Content-Language: en-US
In-Reply-To: <1d34de28-7fdf-4368-bb34-f7b69aeb4f89n@googlegroups.com>
 by: Stephen Fuld - Thu, 31 Aug 2023 14:55 UTC

On 8/26/2023 12:33 PM, MitchAlsup wrote:

snipped discussion of very high memory bandwidth system proposal.

> My hope would be that it would not melt when operating at full throughput.

Of course, it is a custom one-off, and likely would be very expensive, but

https://www.tomshardware.com/news/intel-demoes-8-core-528-thread-puma-chip-with-1-tbs-silicon-photonics

--
- Stephen Fuld
(e-mail address disguised to prevent spam)

Re: More of my philosophy about CISC and RISC instructions..

<Gs2dnUS1ha690WX5nZ2dnZfqn_SdnZ2d@supernews.com>

  copy mid

https://news.novabbs.org/devel/article-flat.php?id=33901&group=comp.arch#33901

  copy link   Newsgroups: comp.arch
Path: i2pn2.org!i2pn.org!weretis.net!feeder6.news.weretis.net!news.misty.com!border-2.nntp.ord.giganews.com!nntp.giganews.com!Xl.tags.giganews.com!local-1.nntp.ord.giganews.com!nntp.supernews.com!news.supernews.com.POSTED!not-for-mail
NNTP-Posting-Date: Wed, 06 Sep 2023 09:53:36 +0000
Sender: Andrew Haley <aph@zarquon.pink>
From: aph@littlepinkcloud.invalid
Subject: Re: More of my philosophy about CISC and RISC instructions..
Newsgroups: comp.arch
References: <9dd7cbc7-ec85-4a8b-84f1-266cb47575b2n@googlegroups.com> <7c9ef480-989b-4cbb-ac7f-db8f3749e8f1n@googlegroups.com> <d1fc890d-e4e5-4afd-8718-86143628ea2an@googlegroups.com> <ubdrp6$2e6ik$1@dont-email.me> <3b6e5488-022e-4299-ab6b-70a7436bb004n@googlegroups.com> <ubn20o$5d2d$1@dont-email.me> <2e6de6de-b408-4aa7-a430-ead3f983ed78n@googlegroups.com> <8eee43b7-cb96-49c1-9735-4bb8c004b5c3n@googlegroups.com> <47020cd0-2ee6-4c51-91a0-885ba719137cn@googlegroups.com> <8m4EM.686037$TPw2.506418@fx17.iad> <ubqphs$u0gp$1@dont-email.me> <4941705f-ac14-4f98-b3d1-6fa62bdb4236n@googlegroups.com> <uc3etu$2jjeu$1@dont-email.me> <130086d6-3ea6-41de-a7d6-6e68317c24c5n@googlegroups.com> <ucd0p2$jjfb$1@dont-email.me> <ucfbtj$30keu$2@newsreader4.netcologne.de>
Distribution: world
User-Agent: tin/1.9.2-20070201 ("Dalaruan") (UNIX) (Linux/4.18.0-477.13.1.el8_8.x86_64 (x86_64))
Message-ID: <Gs2dnUS1ha690WX5nZ2dnZfqn_SdnZ2d@supernews.com>
Date: Wed, 06 Sep 2023 09:53:36 +0000
Lines: 18
X-Trace: sv3-EWjRxGrO5vQU1Hdt13gdOe+xiU04c1LQhlOK1EjxfYX0J9pFahMNar7LlTt57cPTh0cQyKfTXv82nR4!xOLhP+5Q1WXnQY0J9C/r6vt3aSRSIGrMB6sFnnrZY5VyzH9/mJTb2VtjlEGt8DzWSSBZUV+gXVo/!9NZhlGACWqo=
X-Complaints-To: www.supernews.com/docs/abuse.html
X-DMCA-Complaints-To: www.supernews.com/docs/dmca.html
X-Abuse-and-DMCA-Info: Please be sure to forward a copy of ALL headers
X-Abuse-and-DMCA-Info: Otherwise we will be unable to process your complaint properly
X-Postfilter: 1.3.40
 by: aph@littlepinkcloud.invalid - Wed, 6 Sep 2023 09:53 UTC

Thomas Koenig <tkoenig@netcologne.de> wrote:
> Stephen Fuld <sfuld@alumni.cmu.edu.invalid> schrieb:
>> On 8/24/2023 12:02 PM, MitchAlsup wrote:
>>> On Tuesday, August 22, 2023 at 6:04:34?PM UTC-5, Paul A. Clayton wrote:
>> OK, I will ask. If you were trying to create a marvelous vector
>> supercomputer CPU, how would it be different from MY66000?
>
> Mitch already answered, just one remark...
>
> Today's supercomputing is mainly about graphics cards, for those
> workloads where this is an advantage.

With one interesting exception: Fugaku. It's still #2 in the TOP500
list, and uses 512-bit wide SVE. I'd guess that without having to
generate code for some GPU, it should be a lot easier to access all
that performance.

Andrew.

Re: More of my philosophy about CISC and RISC instructions..

<442a006e-f88f-4104-8172-58e5409567c3n@googlegroups.com>

  copy mid

https://news.novabbs.org/devel/article-flat.php?id=33902&group=comp.arch#33902

  copy link   Newsgroups: comp.arch
X-Received: by 2002:a05:6214:8ed:b0:641:8885:5010 with SMTP id dr13-20020a05621408ed00b0064188855010mr301409qvb.9.1694000781355;
Wed, 06 Sep 2023 04:46:21 -0700 (PDT)
X-Received: by 2002:a17:902:cec4:b0:1c1:f658:7cfa with SMTP id
d4-20020a170902cec400b001c1f6587cfamr5414035plg.9.1694000781078; Wed, 06 Sep
2023 04:46:21 -0700 (PDT)
Path: i2pn2.org!i2pn.org!usenet.blueworldhosting.com!diablo1.usenet.blueworldhosting.com!peer03.iad!feed-me.highwinds-media.com!news.highwinds-media.com!news-out.google.com!nntp.google.com!postnews.google.com!google-groups.googlegroups.com!not-for-mail
Newsgroups: comp.arch
Date: Wed, 6 Sep 2023 04:46:20 -0700 (PDT)
In-Reply-To: <Gs2dnUS1ha690WX5nZ2dnZfqn_SdnZ2d@supernews.com>
Injection-Info: google-groups.googlegroups.com; posting-host=79.76.91.23; posting-account=soFpvwoAAADIBXOYOBcm_mixNPAaxW9p
NNTP-Posting-Host: 79.76.91.23
References: <9dd7cbc7-ec85-4a8b-84f1-266cb47575b2n@googlegroups.com>
<7c9ef480-989b-4cbb-ac7f-db8f3749e8f1n@googlegroups.com> <d1fc890d-e4e5-4afd-8718-86143628ea2an@googlegroups.com>
<ubdrp6$2e6ik$1@dont-email.me> <3b6e5488-022e-4299-ab6b-70a7436bb004n@googlegroups.com>
<ubn20o$5d2d$1@dont-email.me> <2e6de6de-b408-4aa7-a430-ead3f983ed78n@googlegroups.com>
<8eee43b7-cb96-49c1-9735-4bb8c004b5c3n@googlegroups.com> <47020cd0-2ee6-4c51-91a0-885ba719137cn@googlegroups.com>
<8m4EM.686037$TPw2.506418@fx17.iad> <ubqphs$u0gp$1@dont-email.me>
<4941705f-ac14-4f98-b3d1-6fa62bdb4236n@googlegroups.com> <uc3etu$2jjeu$1@dont-email.me>
<130086d6-3ea6-41de-a7d6-6e68317c24c5n@googlegroups.com> <ucd0p2$jjfb$1@dont-email.me>
<ucfbtj$30keu$2@newsreader4.netcologne.de> <Gs2dnUS1ha690WX5nZ2dnZfqn_SdnZ2d@supernews.com>
User-Agent: G2/1.0
MIME-Version: 1.0
Message-ID: <442a006e-f88f-4104-8172-58e5409567c3n@googlegroups.com>
Subject: Re: More of my philosophy about CISC and RISC instructions..
From: luke.leighton@gmail.com (luke.l...@gmail.com)
Injection-Date: Wed, 06 Sep 2023 11:46:21 +0000
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable
X-Received-Bytes: 2802
 by: luke.l...@gmail.com - Wed, 6 Sep 2023 11:46 UTC

On Wednesday, September 6, 2023 at 10:53:47 AM UTC+1, a...@littlepinkcloud.invalid wrote:

> With one interesting exception: Fugaku. It's still #2 in the TOP500
> list, and uses 512-bit wide SVE. I'd guess that without having to
> generate code for some GPU, it should be a lot easier to access all
> that performance.

cough all you'd now have to do is access efficiently the SVE instructions
Qty-6-to-7000 cough :)

@begin caveat(big generalisation)
GPUs when used for Scientific Compute are no more "efficient"
than a CPU which has corresponding memory and back-end ALU
capacity. it's only when you target *3D* workloads that you find
"whoops they have hard-wired texture opcodes and etc. etc."
and performance/watt shoots up 4x over Supercomputing CPUs
@end caveat

appreciate the insight / exception Andrew.

l.


devel / comp.arch / Re: More of my philosophy about CISC and RISC instructions..

Pages:123456
server_pubkey.txt

rocksolid light 0.9.81
clearnet tor