this post was submitted on 07 Jun 2024
1006 points (98.9% liked)

Internet Archive is in danger (www.battleforlibraries.com)
submitted 5 months ago* (last edited 5 months ago) by [email protected] to c/[email protected]
 

It looks like the Internet Archive needs assistance. I just heard about this today and figured Lemmy could help spread the message around.

[–] [email protected] 4 points 5 months ago (1 children)

You don't need a blockchain to accomplish what the Internet Archive does, just a network of computers that each share part of their disk space with the others. At the end of the day, this is just a torrent network.

[–] [email protected] 1 points 5 months ago (2 children)

Except you can take down piratebay and send the founders to jail. You can't take down Ethereum, or anything hosted on it.

[–] [email protected] 5 points 5 months ago (1 children)

Blockchain is great for when you need global consensus on the ordering of events (e.g. Alice gave all her 5 ETH to Bob first, so a later transaction to give 5 ETH to Charlie is invalid). It is an unnecessarily expensive solution just for archival, since it necessitates storing the data on every node forever.
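
To make the ordering point concrete, here is a minimal sketch in Python. It is not how Ethereum actually validates transactions; the `Transfer` class and `apply_in_order` function are hypothetical names made up for illustration. It only shows that the same two transfers give different outcomes depending on which one the network agrees came first.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Transfer:
    sender: str
    recipient: str
    amount: int  # whole ETH, to keep the example simple


def apply_in_order(balances, transfers):
    """Apply transfers in the agreed order; reject any the sender cannot cover."""
    balances = dict(balances)  # work on a copy so the input is not mutated
    accepted, rejected = [], []
    for t in transfers:
        if balances.get(t.sender, 0) >= t.amount:
            balances[t.sender] -= t.amount
            balances[t.recipient] = balances.get(t.recipient, 0) + t.amount
            accepted.append(t)
        else:
            rejected.append(t)
    return balances, accepted, rejected


start = {"Alice": 5}
to_bob = Transfer("Alice", "Bob", 5)
to_charlie = Transfer("Alice", "Charlie", 5)

# If everyone agrees the transfer to Bob came first, the one to Charlie is invalid.
balances, accepted, rejected = apply_in_order(start, [to_bob, to_charlie])
print(balances, [t.recipient for t in rejected])
```

Swapping the order of the two transfers makes Charlie the one who gets paid, which is why every node has to agree on a single ordering.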

Ethereum charges 'gas' fees per transaction, which helps ensure it doesn't collapse under the weight of excess usage. Blocks have transaction limits, and transactions have size limits. It currently works out at about US$7,500 per MB of block data (which is stored forever, and replicated to every node in the network). The Internet Archive apparently has ~50 PB of data, which would cost roughly US$371 trillion to put onto Ethereum (in practice, attempting this would push up the price of ETH further, and if they succeeded, most nodes would not be able to keep up with the network). Really, this is just telling us that blockchain is not appropriate for that use case, and the designers of real-world blockchains have created mechanisms to make it financially unviable to attempt at that scale, because it would effectively destroy the ability to operate nodes.
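
For anyone who wants to check the order of magnitude, here is a back-of-the-envelope version in Python. It uses the rounded figures from this comment and assumes decimal units (1 PB = 10^9 MB); the constant and function names are just for illustration. With these rounded inputs it comes out at about US$375 trillion, in line with the figure above (the small gap is presumably down to rounding of the per-MB price).

```python
COST_PER_MB_USD = 7_500   # approximate price per MB of block data quoted above
ARCHIVE_SIZE_PB = 50      # approximate size of the Internet Archive quoted above
MB_PER_PB = 10**9         # assumption: decimal units


def onchain_cost_usd(size_pb, cost_per_mb=COST_PER_MB_USD):
    """Estimated cost of writing size_pb petabytes into block data."""
    return size_pb * MB_PER_PB * cost_per_mb


total = onchain_cost_usd(ARCHIVE_SIZE_PB)
print(f"US${total / 10**12:,.0f} trillion")  # prints "US$375 trillion"
```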

The only real reason to use an existing blockchain anyway would be on the theory that you could argue it is too big to fail due to legitimate business use cases, and too hard to remove censorship resistant data. However, if it became used in the majority for censorship resistant data sharing, and transactions were the minority, I doubt that this would stop authorities going after node operators and so on.

The real problems that an archival project faces are:

  • The cost of storing and retrieving large amounts of data. That could be decentralised using a solution where not all data is stored on a chain - for example, IPFS.
  • The problem of curating data and deciding what is worth archiving, and what is a true-to-source archive vs fake copy. This probably requires either a centralised trusted party, or maybe a voting system.
  • The problem of censorship. Anonymity and opaqueness about what is on a particular node can help - but they might in some cases undermine the other goals of archival.
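
On the "true-to-source vs fake copy" point, the usual building block is content addressing: identify data by a hash of its bytes, so anyone holding the identifier can check a copy they received from an untrusted peer. A minimal sketch, using plain SHA-256 from Python's standard library as a stand-in (this is not IPFS's actual CID format, and `content_id` and `is_true_copy` are made-up names):

```python
import hashlib


def content_id(data: bytes) -> str:
    """Identifier derived only from the bytes themselves."""
    return hashlib.sha256(data).hexdigest()


def is_true_copy(data: bytes, expected_id: str) -> bool:
    """Check a copy from an untrusted peer against a trusted identifier."""
    return content_id(data) == expected_id


original = b"<html>archived page</html>"
trusted_id = content_id(original)  # published by whoever did the archiving

print(is_true_copy(original, trusted_id))                      # True
print(is_true_copy(b"<html>tampered page</html>", trusted_id))  # False
```

This only moves the trust problem: a hash proves a copy matches what was archived, but someone still has to vouch that the archived bytes matched the source in the first place.
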
[–] [email protected] 1 points 5 months ago

You suggest IPFS, but isn't that what web3 is?

Web3 is blockchain + IPFS and/or torrents or whatever p2p protocol.

I am not suggesting storing the data itself on the blockchain, only the index: the equivalent of simple HTML pages, kept on the blockchain so we never lose track of the data we share over torrents or whatever peer-to-peer protocol.
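
As a rough illustration of how small such an index entry is, here is a Python sketch that builds a magnet link and prices it at the ~US$7,500 per MB quoted earlier in the thread. The info hash is a placeholder rather than a real torrent, the helper names are hypothetical, and real on-chain costs would depend on how the entry is actually stored.

```python
from urllib.parse import quote

COST_PER_MB_USD = 7_500  # approximate figure quoted earlier in the thread


def magnet_link(info_hash_hex: str, name: str) -> str:
    """Build a basic magnet URI from a hex info hash and a display name."""
    return f"magnet:?xt=urn:btih:{info_hash_hex}&dn={quote(name)}"


def index_cost_usd(entry: str, cost_per_mb: float = COST_PER_MB_USD) -> float:
    """Estimated cost of storing one index entry, assuming 1 MB = 10^6 bytes."""
    size_mb = len(entry.encode("utf-8")) / 10**6
    return size_mb * cost_per_mb


# Placeholder hash for illustration only; it does not point at real data.
link = magnet_link("0123456789abcdef0123456789abcdef01234567", "example snapshot")
print(link)
print(f"about US${index_cost_usd(link):.2f} for this entry")
```

Under those assumptions a single link costs well under a dollar, so the price depends almost entirely on how many entries the index needs.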

However, if it became used in the majority for censorship resistant data sharing, and transactions were the minority, I doubt that this would stop authorities going after node operators and so on.

I doubt it would exceed transactions, but if it did, authorities would need a global agreement with every single nation to take down all nodes, and that is never happening.

The problem of curating data and deciding what is worth archiving, and what is a true-to-source archive vs fake copy. This probably requires either a centralised trusted party, or maybe a voting system

I agree with you on this, but a voting system doesn't sound too difficult to implement. Alternatively, the Internet Archive could be that centralized trusted party. Arresting them for reporting on what data is correct would surely be unconstitutional.

[–] [email protected] 5 points 5 months ago (2 children)

???
Taking down PirateBay didn't kill the torrents it hosted

[–] [email protected] 2 points 5 months ago

No, but it landed the founders in jail. Are you suggesting we just accept jail as an outcome if we want to save the internet archive?

[–] [email protected] 1 points 5 months ago (1 children)

Piratebay itself was taken down. You need piratebay to distribute the magnet links.

[–] [email protected] 3 points 5 months ago (1 children)

Well then, just use an anonymous service to distribute magnet links (I2P, Tor, blockchain).

[–] [email protected] 2 points 5 months ago (1 children)

Fully agree. In fact, that's what I'm suggesting in my original comment.

Web3 is essentially just indexing links. But indexing links to pirated data is illegal, which is why the blockchain is needed. Sure, Tor is also viable, but riskier for the people hosting the websites.

[–] [email protected] 2 points 5 months ago

I agree, but I find blockchain technology too costly hardware-wise; a simple anonymizing network may be enough.