Fastest Nubus Video Card Redux

jeremywork

Well-known member
I had this question when I was getting started, and it's come up fairly often since. In the case of Quadras it's generally good advice that onboard video will offer a better experience than any Nubus card can, excepting resolution/color depth or featureset. For this reason I started testing with the IIfx; the fastest machine that essentially requires a Nubus video card to be useful. Picking a fast one makes sense here, and my IIfx is equipped with a NewerTech Variable Speed Overdrive so I can easily take multiple readings at different speeds. My collection pretty much covers the usual suspects these days, so it should be possible to make some headway.

My test environment is a fresh install of 7.5.3, set to 'Easy Install' on the IIfx. 32-bit addressing is enabled, disk cache is set to 768KB, and virtual memory is disabled. AppleTalk is on, but file sharing is disabled. The desktop resolution is 640x480@67Hz/8-bit color (M1212 compatible.) The boot drive is a 128GB SanDisk 120MB/s CF, inserted into a Stratos CF AztecMonster, plugged into a Sixty Eight Thousand SCSI Bolt (fx/PDS.) This can sustain over 9.5MB/s and should accurately indicate any I/O bottleneck. The factory ROM SIMM is swapped with a BMOW Rominator II MEGA, ROM unaltered from shipped.

OS has been updated with Open Transport 1.3 and the A/ROSE extension for use with the Apple Ethernet NB. I did make a few test passes to quantify the impact of this, and interestingly the only significant performance penalty (1-2%) appeared when the A/ROSE extension was loaded but the Nubus card was not present. Inserting the card seemed to reduce these penalties to within margin of error vs stock. On the other hand, I found the MediaVision Nubus ProAudioSpectrum 16 was happy to eat about 5% of the total performance of the system across the board when it was installed. I swapped it for a Digidesign AudioMedia II and confirmed this card and its drivers do not affect the system beyond margin of error.

I also found the NewerTech IIFX VSO driver which sets the clockspeed caused several QuickDraw tests to crash when color depth needed to be changed before running. Since the IIfx/VSO always cold boots at stock speed, the solution is to boot with the VSO driver loaded, verify the speed is boosted, then remove the extension and soft reboot. Testing then runs uninhibited at the faster speed until hard shutdown.

The scope of benchmarking up to this point has purely been for performance data. Card features, resolutions, color depths, etc haven't been considered on this post.

Test procedure should be repeated identically, especially on L2-cache-enabled systems, but generally always.

• Once booted with requisite extensions, Speedometer 4.01 is launched and the Performance Rating test is run five times in all four categories. A location on the boot drive is selected for the disk test. The cursor is moved to the corner of the screen immediately preceeding each test. (Rarely the disk test will report a zero and the average will dramatically increase for some reason. Other tests do this too, but even more scarcely. The disk test and the third FPU test are the worst offenders, and still very infrequent.)

• Five iterations are then run of each of the ten Benchmark Mix tests.

• Five iterations of each of three FPU tests are then run.

• Five iterations of each available Color Quickdraw test is run.

• These results are recorded, Speedometer closed, then DayStar PowerDemo v5 is launched.

• Without altering the test parameters (and assuming the desktop color mode is still 256 colors) each test from the Benchmarks menu is selected, and results recorded. The mouse is left over the start button in every case. The lines test runs infinitely, averaging the live result. After 45-90 seconds, I swiftly move the cursor from its idle position into the test area and click to stop the lines test and record the output. PowerDemo is then closed.

• Apple Personal Diagnostics v 1.1.3 is then launched, Benchmark Checks are opened, and each of the Math; CPU Speed; Hard Drive; and Video buttons is pressed in sequence, allowing each test to complete with the cursor moved off of the window/QuickDraw area. The results compared to IIci are recorded (as well as Quadra 950.)

• Video is then set to Black & White, 4, 16, Thousands, Millions; each available test beyond 256 is also recorded. Personal Diagnostics is then closed.

• Finally, Jigsaw puzzle is opened alongside the Monitors control panel. A new puzzle is started (large pieces is easiest.) The six pieces should be tesselated in the margin of the puzzle, not in the center. Once all six pieces have formed one large rectangle which is not snapped into place, it can be dragged as a single element. Most cards present this large dragged puzzle very fluidly in 8-bit color and 1-bit (B&W), 8-bit grayscale is often less performant and occasionally dropped entirely. The contrast between fluid and non-fluid is stark and unmistakeable. I haven't found a single card which presents the puzzle fluidly in modes beyond these three.

The entire test procedure is then repeated with DiiMO 030 control panel enabled/SANE patches checked; then a third time with DayStar Power Central's PowerMath enabled instead. This provides a greater test resolution for unaffected tests, and opportunity to detect optimizations. So far the only wildcard I may have caught is in the PowerDemo Sieve test, only at 40MHz, only with Diimo SANE, and only with the Interware GrandVimage 21i, the common time of 74 seconds is reduced to 63.7; double verified.

All three complete cycles were completed at both 40MHz (stock) and 50MHz speeds.

-- Presentation --

Since Speedometer compares QuickDraw data to the Quadra 605 and I've generally used the IIci video numbers from Apple Personal Diagnostics, I've doubled the IIci percentages so the graphs are easier to discern.

ONE BIT QUICKDRAW
Screen Shot 2022-10-28 at 3.35.01 AM.png

TWO BIT QUICKDRAW
Screen Shot 2022-10-28 at 3.36.08 AM.png

FOUR BIT QUICKDRAW
Screen Shot 2022-10-28 at 3.37.01 AM.png

EIGHT BIT QUICKDRAW
Screen Shot 2022-10-28 at 3.37.49 AM.png

SIXTEEN BIT QUICKDRAW
Screen Shot 2022-10-28 at 3.39.23 AM.png

TWENTY-FOUR BIT QUICKDRAW
Screen Shot 2022-10-28 at 3.40.18 AM.png

DAYSTARS (seconds)
Screen Shot 2022-10-28 at 3.41.02 AM.png

SIEVE (Seconds)
Screen Shot 2022-10-28 at 3.42.20 AM.png

COLOR SORT (seconds)
Screen Shot 2022-10-28 at 3.43.56 AM.png

DISK IO (seconds)
Screen Shot 2022-10-28 at 3.45.12 AM.png

FRACTAL (seconds)
Screen Shot 2022-10-28 at 3.45.54 AM.png

LINES/sec
Screen Shot 2022-10-28 at 3.46.44 AM.png

QUICKDRAW (data)
Screen Shot 2022-10-28 at 3.58.43 AM.png

POWERDEMO (data)
Screen Shot 2022-10-28 at 4.00.25 AM.png

The raw data is attached in a spreadsheet. If you dig in you'll notice I've also done the onboard benchmarks for some caches and accelerators in the IIci and IIvx, so more video card benchmarks will follow eventually. The most "live" version of this is on the IIfx page of my site: http://www.jeremywork.com/mac_iifx.html
The site is still messy and incomplete, a few things are blatantly incorrect, but I think it's past the point of sharable so feel free to make use of what's there.

...My screenshots grabbed random Excel tooltips, but it's 4AM and they don't seem to be in the way :sleep:
 

Attachments

  • 68030MacBenchmarks.xlsx.zip
    99.8 KB · Views: 10

MrFahrenheit

Well-known member
Wow! Incredible work!!

As they say on Reddit, username checks out. You really worked hard to generate this data. I haven’t yet poured over the data, just read your methodology first.

This is the type of thing we need more of in the community, especially from YouTubers who are showing things off.

I’ll have a look at your data later today. Thanks for doing this!
 

jeremywork

Well-known member
I'm happy to have made a dent! A few days in I began to realize why this hasn't been done very comprehensively. The more time I spent, the more questions I seemed to have, though with precise control and enough repetition I began to understand the layout of the field. I think if we built a more disciplined benchmark there would be even more to learn.

Both of the QuickDraw tests concatenate many different aspects into a single duration test so it's not easy to visualize that, for instance, the Horizon 24 is essentially as fast as the Thunder IV cards but simply doesn't accelerate the drawing of colored text. As a result, the Apple Personal Diagnostics test practically flops compared to the Speedometer test. Having sat in front of the screen watching the tests draw, I can contemplate their similarities, but anyone looking only at the data would be unlikely to draw the connection.

This said, I'm not sure what would be most informative from here: expand the test regiment, test this same set of cards in more Macs, or start compiling resolution/color depth and feature attributes?

I'll add more cards when I find them. Happy to hear if anyone has one of the following they'd like to sell/trade:

RasterOps PaintBoard Prism GT (7") (and PaintBoard Prism (7"))
MicroConversions 2124NB II (7")
SuperMac Thunder II GX (I have a 1360 but no GX module and I need to repair the card before testing)
 

Phipli

Well-known member
Amazing work! I'll use this for reference in the future!

If you are actually looking for extra things to test, picking one fast card with a broad system support and testing it in a number of System versions (6.0.8, 7.1, 7.5.3, 7.6.1, 8.1??) might be informative to see if the System version has a significant impact on graphics speed. I'm not saying you need to do this, as you have already done a huge amount!

I find myself wondering what nubus machine more of us have than any other? IIci? Q650? Q700? 9150? If we defined a baseline machine that as many people as possible had, and provided a disk image with a prepared OS and benchmark tools, we could submit results for cards we each had in a hopefully controlled manner?

I sadly don't have a IIfx!
 

jeremywork

Well-known member
"It'd only take a few minutes if I can find the cards and the systems"

-clueless, 2018

It's been a process, but absolutely worthwhile to me to know the numbers are clean and can at very least be compared against each other.

I'll certainly run an OS comparison with some of the cards, all if there's a story to tell. I've been curious myself.

I'll get an image cleaned up, packaged with the tests and drivers I've been using. I might take a couple days away from this though :)
I do have a number of systems lined up in anticipation of completing a large rainbow table, maybe even this decade:

IIx (with DayStar Universal adapter) (not powering on; needs attention)
IIcx (with DayStar Universal adapter) (not powering on; needs attention)
IIci (stable; 32k, 64k, 128k cache modules working but for some reason any DayStar software currently disables the Micron 128 module...)
IIvx (stable)
DayStar P33/33MHz, DayStar P33/50MHz, DiiMO 50MHz, DayStar Turbo 040/40MHz/128 on hand

Q700 (stock, and with cache-less Sonnet QuadDoubler)
Q950 (with NewerTech PowerPump for variable speeds up to around 45MHz)
Q650 (stock, 8M/Ethernet)
Q800 (stock)
DayStar FastCache Quadra (128k), Apple Power Macintosh Upgrade Card (601/2x/1M), DayStar PowerPro (601/80MHz/1M; 4x 72-pin SIMMs), Sonnet Presto PPC (601v/3x-4x/1M) on hand

Q840AV (out of four logic boards I will eventually make one work for more than a few days, surely...)
PM8100/100 (256k cache)
PM8100/110 (256k cache)
Sonnet Crescendo G3/Nubus 266/1M; NewerTech MaxPowr G3 266/1M; Sonnet Crescendo G3/Nubus 500/1M; Sonnet Crescendo G4/Nubus 360/1M (9x); Sonnet PDS extension bracket; NewerTech PDS extension bracket.
 

slomacuser

Well-known member
Did you turned off the GWorld in tests? When I was testing horizon without GWorld it test were nothing special mostly like other cards but when testing with GWorld on some tests were done 600% over others. Setting the GWorld to 2 MB made better results than 4 MB also.
 

jeremywork

Well-known member
Did you turned off the GWorld in tests? When I was testing horizon without GWorld it test were nothing special mostly like other cards but when testing with GWorld on some tests were done 600% over others. Setting the GWorld to 2 MB made better results than 4 MB also.
This is interesting. In the tests above, both the Horizon 24 and the Accelerator Board II had all 4MB of their DRAM set to GWorld use. I faintly remember running a single pass with all 4MB set to RAM disk instead but not recording the entry because none of the results were out of margin, but I'll make it a point to retest it in several configurations. I don't think I tried 2MB at all.

I'm also awaiting a 27c512 so I can run the Thunder/24 with a v3 ROM and see what control I get over its GWorld/DRAM.
 

David Cook

Well-known member
I'm also awaiting a 27c512

1. Are you using classic UV erasable 27C512 or EEPROMs like W27C512?
2. Do you know the speed requirement? Is 150 ns too slow?
3. A number of people on the forum have mentioned ROM v3.1 was available at some point, but it appears only the v3.0 bin has been uploaded. Does anyone know of a source for the 3.1 bin?
 

jeremywork

Well-known member
1. Are you using classic UV erasable 27C512 or EEPROMs like W27C512?
2. Do you know the speed requirement? Is 150 ns too slow?
3. A number of people on the forum have mentioned ROM v3.1 was available at some point, but it appears only the v3.0 bin has been uploaded. Does anyone know of a source for the 3.1 bin?
The ones I have en route are UV eraseable; the factory v1.7.0 ROM is on a TMS27C512-15, and I'd also be interested in the v3.1 ROM if anyone has it.

FWIW the ROM dumps and photos of all the cards I'm testing are organized in here: http://www.jeremywork.com/rd.html
 
Top