Depends... back in the days when I ran a IIci under NetBSD as my home server, I saw a 30% performance improvement.Overally though cloning a cache card is a bit of a "why bother" as the improvement is very incremental. A simple 030 clock doubler accelerator would give greater performance at a similar cost. I don't have much interest in the IIci though so haven't done more than some testing in IIsi.
A 68030 clock doubler achieves 100% improvement best case, 60% on typical performance trace, and a minimum of 10% on worst case. Cache achieves about 30% in best case scenarios (memory bound) and 0% or sometimes negative in worst case.Depends... back in the days when I ran a IIci under NetBSD as my home server, I saw a 30% performance improvement.
Yes, tag SRAM (which include comparators so they can do both storing and checking of tags) haven't been made in years, are expensive if you can find them, and won't help making bigger/better caches. Using a programmable device (i.e. FPGA) with embedded SRAM for tags (and external discrete SRAM for data) is theoretically possible, but would make for an expensive cache as you would need a fairly large FPGA just to get enough embedded SRAM for the tags.I don't think anyone's bothered. Micron has a 128K cache card that is just GALs and discrete logic, definitely clonable. However, it requires TAG SRAM which can be difficult to come by.
Yeah, the TAG SRAM is a big stumbling point for accelerators targeting machines of this vintage. Especially at high speed operation there's not much of a choice as unless you've an extremely fast SRAM and comparator it's going to be slower than a dedicated TAG which I usually get a match/no match decision faster than the rated speed. The 12ns TAGs I'm using on the Hyperdrive accelerator usually have a valid result by 8ns and definitely at 10ns. Impossible to beat in discrete logic. I was thinking to possibly adopt some of the later Pentium TAGs if I need to increase the cache size, or a FPGA. As you said it becomes fairly awful though as you'd need a largish FPGA and level shifters....Yes, tag SRAM (which include comparators so they can do both storing and checking of tags) haven't been made in years, are expensive if you can find them, and won't help making bigger/better caches. Using a programmable device (i.e. FPGA) with embedded SRAM for tags (and external discrete SRAM for data) is theoretically possible, but would make for an expensive cache as you would need a fairly large FPGA just to get enough embedded SRAM for the tags.
Also, the best an external cache can do is 2-1-1-1 burst (which may or may not offer benefits over non-burst 2+2+2+2 timings, which is equally as hard to achieve timing-wise). If your memory can already achieve that, then a cache is pointless (and in fact will make things worse by slowing things on a miss). And while it's closed-source (so it can't be easily reworked for a different machine) and somewhat expensive, the Synchr030/S proves it can be done for the 16 MHz '030 in a SE/30 using 'modern' components. The best option for a IIci (or any '030 machine!) would be to have something similar, negating the need for a cache. Even if only 3-1-1-1 burst could be achieved on faster '030 with SRAM, and external cache would likely still be useless.
Did you find a viable option? At 5V, 16 Mbit SRAM are expensive and with big MOQ, while 4 Mbit are more viable but you'd need 32 of them - that's a large PCB, and I'm not sure what the signal integrity would look like with all those extra chips/traces. I've thought about it as well, but couldn't find a satisfying design trade-off.I've been flirting with the idea of sticking 16MB of SRAM on a booster just to see how fast an 030 operating entirely out of SRAM could be, with ~15ns SRAM 0 waits should be doable at 50mhz.
Mmm, does the NuBus controller has the memory timings of the host memory controller baked in so that using different ones would be an issue ? Assuming it's IIsi or later and using /STERM couldn't it adapt to faster turnaround ? (... I'm not sure on which side of the fence the IIci is falling actually, if it's /DSACK only then 3-cycles and no burst at best).Of course, the local-fast-ram approach only works easily if you're willing to forsake DMA access and the fancy NuBus cards that do DMA.
No, that was part of it - not really any viable options unless we're overvolting 3.3v SRAMs or looking at limited availability/expensive chips. For my purposes I treated more than 8 SRAMs as being problematically complex. Doing 3.3v properly would be a double-whammy as you'd need a buffered high-speed 5v domain for at least the data bus as well as a 3.3v domain connected to that. Either way you're committed to 4 bus transceivers at minimum, and probably at least 8 SRAM chips for a 5v-only design. It gets out of control fast. I concur that a high speed DRAM based design would be the more viable choice. I'd be curious if any of the amiga folks have done local DRAM + 030 as DRAM+040 is common combo for amiga accelerators.Did you find a viable option? At 5V, 16 Mbit SRAM are expensive and with big MOQ, while 4 Mbit are more viable but you'd need 32 of them - that's a large PCB, and I'm not sure what the signal integrity would look like with all those extra chips/traces. I've thought about it as well, but couldn't find a satisfying design trade-off.
PSRAM are cheaper for the capacity, but much slower are 70ns for those available. Maybe for slower-clocked '020/'030, but not for fast ones. And they are 3.3V so the level shifters rear their expensive heads again.
Ultimately, fast (but not necessarily big) FPGA + SDRAM is likely the most appropriate solution for fast memory on a '020/'030 (still with some shifters). I might have a shot at that one someday, though probably not targeting speed at all cost (the Synchr030/S needs access to /ECS to get maximum performance, but it's not available on the PDS so it's a bit hackish).
Mmm, does the NuBus controller has the memory timings of the host memory controller baked in so that using different ones would be an issue ? Assuming it's IIsi or later and using /STERM couldn't it adapt to faster turnaround ? (... I'm not sure on which side of the fence the IIci is falling actually, if it's /DSACK only then 3-cycles and no burst at best).