• stardust@lemmy.ca
      link
      fedilink
      English
      arrow-up
      93
      arrow-down
      1
      ·
      1 day ago

      With internet archive head quarters being in the US what would happen if the administration went after them? Would people from other countries be able to keep the project going?

      • Petter1@lemm.ee
        link
        fedilink
        English
        arrow-up
        20
        ·
        edit-2
        19 hours ago

        We need to distribute all that data, for the sake of history in the future. Data hoarding and torrenting is a service to humanity.

        • Echo Dot@feddit.uk
          link
          fedilink
          English
          arrow-up
          8
          ·
          8 hours ago

          Not just data but in important scientific research too. The republicans want to drive the US back to the stone age, because that’s when they were last relevant.

          • Petter1@lemm.ee
            link
            fedilink
            English
            arrow-up
            2
            ·
            7 hours ago

            😇 in my mind, I included those datasets in data

            But sure! I agree! We need free and open science, worldwide

            People need to understand the art of scientific thinking

      • bobs_monkey@lemm.ee
        link
        fedilink
        English
        arrow-up
        48
        ·
        24 hours ago

        That’s unfortunately a very valid point. Iirc the big problem IA has is the sheer amount of disk space required to store everything.

        • ⛓️‍💥@sh.itjust.works
          link
          fedilink
          English
          arrow-up
          23
          ·
          23 hours ago

          I wish I had the necessary petabytes of storage to at least store an offline copy. I wonder how many disks that would be and how redundant disks you’d need.

          • bobs_monkey@lemm.ee
            link
            fedilink
            English
            arrow-up
            27
            ·
            22 hours ago

            Here’s this from 2021. They say they have about 200PB of raw storage across some 20k spinning drives at the time of writing (with more being added constantly, about 25%/yr), and capacities are mixed from 4TB to 16TB, across 750 servers housed on about 75 racks. I have 6x16TB WD red pros that ran me about $355/ea new with tax, and my bill was a smidge over $2100. Assuming you used all 16TB, you’d need about 12,500 16TB disks, which would run you about $4,437,500 without a bulk discount. How much of that is redundancy I’m not sure, but that’s just HDDs, not the hardware to actually run everything between storage enclosures, OS, disks, memory, clustering, etc. They say they say a single copy with 16TB drives would be about 15 racks., but how that breaks down I’m not sure.

            • Petter1@lemm.ee
              link
              fedilink
              English
              arrow-up
              10
              arrow-down
              1
              ·
              edit-2
              19 hours ago

              I once made this calculation for a database of 700Tb, even that blew my mind 🤣