pcamen

  • Hi, love your site, especially your scan collection. I noticed that there's a missing link in the Macintosh technical documentation series:

    [image: fej8.jpg]


    Macintosh Family Hardware Reference sits between Technical Introduction to the Macintosh Family and where Guide to the Macintosh Family Hardware would be in a more recent version of the roadmap, which you already host on your site, and I couldn't find a PDF anywhere online.

    I really want to keep mine bound, but I'd love to PayPal a donation of $8.50 toward the purchase of one. Are you in the USA? I'm poor as a church mouse, but I'm positive I can get a couple of others to donate the remainder of the cost via PM, if you'd like.

    https://www.amazon.com/Macintosh-family-hardware-reference $28.50

    It could prove most helpful in the "Reverse Engineering the Macintosh SE PCB & Custom Chips for 1:1 reproduction" thread, and that gang will surely chip in the remainder.

    _________________

    On a side note, I've always wondered if a formatted, text-only version of MacUser and Macworld could be done, sans the artwork. One big PDF of each series (or both?) in that format would be wonderful for keyword searching for info on specific products. You could then go to the artwork-encumbered versions of specific issues for illustrations, historical context and such. I'd started a listing of the yearly indexes, but they are hard to locate: their placement varies from year to year, and they seem to be missing from some years.

    Thanks so much for your efforts, especially hosting Gamba2, love your breakout of specific articles!

    jt

    pcamen
    Hey JT,

    No worries, I am happy to cover this for the team.  I just ordered a copy. 

    In terms of text-only MacUser and Macworld, I could probably hack together a Python or Perl script with some PDF libraries to extract the text.  But I'm not sure about formatting.  Hmm.  The PDF files are first and foremost images, with text added after the fact via OCR.  So if you remove the images, you would just have text left, unformatted.
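
    A rough sketch of what such an extraction script might look like in Python. It assumes the third-party pypdf library is installed and that the OCR layer is readable through its text extraction; the function names and the page-marker format are made up for illustration. The page numbers are positions in the PDF, not the numbers printed on the magazine pages.

```python
from pathlib import Path


def extract_pages(pdf_path):
    """Return the text layer of each page as a list of strings."""
    # pypdf is a third-party library (pip install pypdf). It is imported here
    # so that the formatting helper below can be used without it.
    from pypdf import PdfReader

    reader = PdfReader(pdf_path)
    # Pages without a text layer may yield an empty string.
    return [page.extract_text() or "" for page in reader.pages]


def format_pages(pages):
    """Join page texts, marking each page so a search hit can be traced back."""
    chunks = []
    for number, text in enumerate(pages, start=1):
        chunks.append(f"=== page {number} ===\n{text.strip()}\n")
    return "\n".join(chunks)


def pdf_to_text(pdf_path, out_dir):
    """Write <out_dir>/<pdf name>.txt and return the path of the new file."""
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    out_path = out_dir / (Path(pdf_path).stem + ".txt")
    out_path.write_text(format_pages(extract_pages(pdf_path)), encoding="utf-8")
    return out_path
```

    Calling pdf_to_text() on each magazine PDF would leave one plain text file per issue. The quality of the result depends entirely on the OCR layer, and column order on multi-column pages may not survive.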

    What I typically do is just search using Spotlight on my Mac at the top level where I have all my magazines and books, all of which have been OCR'd.  That brings up matches and shows which file each one came from, which is how I typically find things.

    Separate text files could also be used with grep, for a little more search control than Spotlight.
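
    For comparison, here is a minimal grep-like search in Python over a folder of extracted text files. It is only a sketch: it assumes the files end in .txt and are UTF-8 encoded, and search_texts is a hypothetical helper name.

```python
import re
from pathlib import Path


def search_texts(root, pattern, ignore_case=True):
    """Yield (file, line_number, line) for every line matching the regex."""
    flags = re.IGNORECASE if ignore_case else 0
    regex = re.compile(pattern, flags)
    # rglob walks the folder and all of its subfolders.
    for path in sorted(Path(root).rglob("*.txt")):
        with path.open(encoding="utf-8", errors="replace") as handle:
            for line_number, line in enumerate(handle, start=1):
                if regex.search(line):
                    yield path, line_number, line.rstrip("\n")
```

    Looping over search_texts("magazines", "rocket") and printing each hit gives roughly what grep -rin "rocket" magazines would show: file name, line number, and the matching line.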

    What do you think?

    Peter

    Trash80toHP_Mini
    Thanks much, Peter. I'm sure the SE builders will really appreciate your contribution. Each volume builds upon the last and hopefully there are a lot of cues in MFHR to better aid cryptological methods for deducing and recreating the PALs/ASICs as needed.

    About the search thing: using Reader an issue at a time is as far as I've gotten, so I'll have to research Spotlight.  [:I]   Unformatted text from each complete series would probably be workable enough for down and dirty research purposes in terms of issue/page spelunking.

    I've always wanted a good reason to learn to GREP something.  :lol:

    BTW, I just found/remembered that the link to Ports_Pinouts.PDF in your (all?) Gamba2 hosting pulls up a DOA Apple page. Directly hosting it along with the books on the development tree would be much appreciated, maybe redirecting that link in Gamba too? I imagine it's out there, but didn't find it in a few seconds of search today. I have it if you can't easily scare it up.

    again, really love your site,

    jt

    Trash80toHP_Mini
    Hi again and happy Thanksgiving!

    Was musing and now rationally thinking about the Image/text thing regarding text extraction from magazine pages within a PDF.

    As a longtime AI (Adobe Illustrator) user, I've been opening development book PDFs to grab artwork and text since they became available. Text characters, lines and paragraphs are objects with their own coordinates on the baseline page grid. My guess is that the OCR'd text is much the same: probably an overlay of invisible text objects located by page coordinates. I can't imagine it's not in its own layer above every graphical page layer and fully formatted within the PDF, page returns and all. I can't open a volume file using AI 9.02 on my QS, likely due to file size. It normally asks me which page I want to open, which is a guessing game, as the intro/contents pages in Roman numerals count as pages.
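
    If the text layer really is a set of positioned objects, rebuilding lines from it could look something like the sketch below. The (x, y, text) tuples are a hypothetical input format, standing in for whatever a PDF library reports; y is taken to be the baseline height with larger values higher on the page, and a single column of text is assumed.

```python
def fragments_to_lines(fragments, tolerance=2.0):
    """Rebuild reading-order lines from (x, y, text) fragments.

    Fragments whose baselines are within `tolerance` units of each other
    are treated as one line; within a line they are ordered left to right.
    """
    rows = []  # each row is [baseline_y, [(x, text), ...]]
    # Walk the fragments from the top of the page to the bottom.
    for x, y, text in sorted(fragments, key=lambda f: -f[1]):
        if rows and abs(rows[-1][0] - y) <= tolerance:
            rows[-1][1].append((x, text))
        else:
            rows.append([y, [(x, text)]])
    return [" ".join(t for _, t in sorted(items)) for _, items in rows]
```

    On a multi-column magazine page this would interleave the columns, so the fragments would first need to be split by x range into columns.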

    I haven't got any graphics programs set up since the move to X. Nothing appealed to me about the upgrades to my apps, and X was HORRIBLE when I tried it at first release. I still haven't tried it on my 12" AlBook; I head right to my OS9 bubble and still love it.

    If you're not set up with a graphic design menagerie (sounds like you're a programmer?), bounce this notion off your Photoshop guy to test the theory. If it's correct, I imagine it would be a lot more productive to tweak the GIMP as an automated paged text extraction engine than to work within the bars of Adobe's proprietary prison?

    jt
