How Do I Find Broken Links? - Questions For Corbett #075

04/08/202120 Comments

Many viewers write in to ask about broken links in the archives. In the spirit of finding solutions, today James runs through a few basic methods you can use to replace broken links when you encounter the dreaded 404 error online.

Watch on Archive / BitChute / LBRY / Minds.com / YouTube or Download the mp4

SHOW NOTES:
How to Find Broken Links Online

The Library of Alexandria is on Fire

You Can’t Win. Don’t Even Try!

Trying to Cure Depression, but Inspiring Torture​

Committee Study of the Central Intelligence Agency's Detention and Interrogation Program

Wayback Machine

Filed in: Questions For Corbett
Tagged with:

Comments (20)

Trackback URL | Comments RSS Feed

  1. Victor says:

    Thanks James. Some great info here.

    I do have a question though. Do you (or anyone else) have any tips on how to organize your files/documents and etc. after a downloaded? Nothing worse than having 10000+ files in a single directory/folder and trying to trawl through them, this can get very messy very quickly.
    Just some ideas here. 1:Folder names for websites/article dates for example all your articles go in to ‘corbett-report’. 2: Linking files into other folders. 3: Setting up some type of database. 4 permuted indexes (great utility ‘ptx’). 5. changing the name of files to at least include a relevant date.

    • Duck says:

      Victor

      You could get “Obsidian” notes (free software for Mac, windows and Linux) and you could just use it as an electronic FILE CARD system….. copy paste the FILE NAME into a note and THEN under it list search terms prefixed with # and then just search the # in the app for a list of every file you marked with that #
      BEST thing about that is you can open those ‘cards’ in any text editor so its mostly future proof..

      https://obsidian.md/ you can get it free

      https://www.youtube.com/watch?v=bBk2kg8Gm_U tags are about 3 min in… you really SHOULD watch the whole video and the one below

      https://www.youtube.com/watch?v=QgbLb6QCK88
      BEGINNER video… watch the hipster and he will show you all kinds of tricks… but you really only need the search bar and note making if your making a searchable index

      The issue is that you need to take the time to # and list the ideas the article holds… on the bright side note taking improves your recall and comprehension

      Httrack can to grab a website and organize it for offline storage and browsing…but you CAN link your local files with Obsidian if you want to

      You can use also “The Brain” mindmaping software but its not as future proof or nice

      • Victor says:

        Thanks Duck,

        Didn’t know about obsidian. I’ll check all this out later tonight.

      • RighteouslyCurious says:

        I knew this was likely being discussed somewhere here recently as we desperately try to save the Library of Alexandria while it’s actually ablaze.

        If I understand the information you’ve offered here correctly, Obsidian is a system to organize information we download, right? I’m positive the videos you linked will help a lot to learn how to use it so thank you very much. Good thing, too, as I actually hadn’t given a thought to organization yet.

        I’m sure that ppl on this site understand the sheer amount of information I want to download and save. Just know that this site is only a portion of it.
        I would love any suggestions on how & where to store it. I don’t just want it on my laptop or PC. I don’t want it in a cloud – I don’t care for those clouds & had no idea I even had a personal cloud filled with stuff until I got a notification it was full & an offer to buy more cloud space.

        It was mentioned the Obsidian system for organizing the documents was somewhat ‘future proof’ so that’s another thing I’d want in the storage of it.
        This may sound like a stupid question but I’d rather ask it than end up w/ insufficient storage or something.
        Thank you & whoever may read this, for any suggestions how to go about getting set up to start getting information safe & secured.
        🙂

    • pill says:

      Common question: How to organize random files found online?

      Common reaction: I don’t know where to put it, so I won’t even download it.

      If you don’t have a folder to put it, make a generic one like /data/Documents/ALL and save it there until you find a better place. If possible, make sure the file name contains some meaningful keyword. Don’t waste time thinking of a great name but if it’s just called “123.pdf”, maybe name it “cia 123.pdf”.

      I’m also searching for a better way to organize such things – but in any case, save important stuff in a generic place rather than discarding it because it doesn’t seem to fit anywhere.

    • Mielia says:

      [SNIP – No urls without titles and explanations, please. -JC]

  2. Peripatêtikos says:

    “Amazon: The Best of All Worlds.”

    An incontinence-inducing quip.

  3. 8Gc58 says:

    On this topic, two very useful browser extensions/add-ons:

    SingleFile
    SingleFile is a Web Extension (and a CLI tool) compatible with Chrome, Firefox (Desktop and Mobile), Microsoft Edge, Vivaldi, Brave, Waterfox, Yandex browser, and Opera. It helps you to save a complete web page into a single HTML file.
    https://github.com/gildas-lormeau/SingleFile

    Web Archives
    Web Archives is a browser extension which helps you to find archived and cached versions of web pages. Searches can be initiated from the right-click context menu, the browser toolbar, or the address bar in case of a server error.
    https://github.com/dessant/web-archives

  4. Duck says:

    Thanks for that.. will take a look at single file which looks like it could tidy up my storage

  5. kbh says:

    If there is no proper search function on a website or if you know something you want is buried on a specific website you can use the “site:” function in any search engine. Lets say you want to find topic “broken links” on corbettreport.com, you go to your search engine of choice and type the following:
    site: corbettreport.com broken links

    • important says:

      Here is another simple first trick that works surprisingly often. If you are looking for ‘The Descent of Man’ like the person who contacted James, and it does not come up if you just enter the name of the book in the search field, try to add a space and pdf (just the three letters), and many times the book or document will come up as result. Not an earth-moving trick, and it doesn’t work every time, but useful sometimes.

  6. HyperSimian says:

    Why is Duck-Duck-Go bad?

    • Duck says:

      Duck-duck-go is “mostly” ok but they are getting more and more google-ized in how they slant results or at least thats what I heard

      https://www.youtube.com/watch?v=Rkv4rrdFBU0 video on Searx..
      THIS is probably the best way to search right now… always pick a random instance.
      sorry for the YT link but I’m on my other computer

      EVERYTHING in tech is getting to need watching for when to hop off… once its good enough that people start using it because its free’er the big money comes and starts screwing with it….Its like Linux, its still ok for now but the SJW incursion is in no way a natural or accidental thing… they are the tools of big money.

    • 8Gc58 says:

      We can do better than DuckDuckGo – by Drew DeVault

      DuckDuckGo is one of the long-time darlings of the technophile’s pro-privacy recommendations, and in fact the search engine that I use myself on the daily. They certainly present a more compelling option than many of the incumbents, like Google or Bing. Even so, DuckDuckGo is not good enough, and we ought to do better.

      I have three grievances with DuckDuckGo:

      1. It’s not open source. Almost all of DDG’s software is proprietary, and they’ve demonstrated gross incompetence in privacy in what little software they have made open source. Who knows what else is going on in the proprietary code?

      2. DuckDuckGo is not a search engine. It’s more aptly described as a search engine frontend. They do handle features like bangs and instant answers internally, but their actual search results come from third-parties like Bing. They don’t operate a crawler for their search results, and are not independent.

      3. The search results suck! The authoritative sources for anything I want to find are almost always buried beneath 2-5 results from content scrapers and blogspam. This is also true of other search engines like Google. Search engines are highly vulnerable to abuse and they aren’t doing enough to address it.

      There are some FOSS attempts to do better here, but they all fall flat. searX is also a false search engine — that is, they serve someone else’s results. YaCy has their own crawler, but the distributed design makes results untolerably slow, poor quality, and vulnerable to abuse, and it’s missing strong central project leadership.

      We need a real, working FOSS search engine, complete with its own crawler.

      https://drewdevault.com/2020/11/17/Better-than-DuckDuckGo.html

  7. GrowYourFood says:

    Thanks for this info, it is much appreciated 🙂

  8. catmad says:

    I think we need to save podcasts as well.
    I can no longer subscribe to The Corbett Report on Google Podcast player.

    Not only did it unsubscribe me but when I try to subscribe again it freezes the whole app and it crashes.

    I do also use RSS now and get emails but they can stop that at any time these days.

Leave a Reply

You must be logged in to post a comment.

Back to Top