this post was submitted on 31 Mar 2024
242 points (100.0% liked)

Piracy: ꜱᴀɪʟ ᴛʜᴇ ʜɪɢʜ ꜱᴇᴀꜱ

1440 readers
45 users here now

⚓ Dedicated to the discussion of digital piracy, including ethical problems and legal advancements.

Rules • Full Version

1. Posts must be related to the discussion of digital piracy

2. Don't request invites, trade, sell, or self-promote

3. Don't request or link to specific pirated titles, including DMs

4. Don't submit low-quality posts, be entitled, or harass others



Loot, Pillage, & Plunder


💰 Please help cover server costs.

Ko-FiLiberapay


founded 1 year ago
MODERATORS
top 32 comments
sorted by: hot top controversial new old
[–] maxprime@lemmy.ml 55 points 6 months ago (2 children)

For anyone wanting to contribute but on a smaller and more feasible scale, you can help distribute their database using torrents.

https://annas-archive.org/torrents

[–] empireOfLove2@lemmy.dbzer0.com 41 points 6 months ago* (last edited 6 months ago) (1 children)

I know the last time this came up there was a lot of user resistance to the torrent scheme. I'd be willing to seed 200-500gb but having minimum torrent archive sizes of like 1.5TB and larger really limits the number of people willing to give up that storage, as well as defeats a lot of the resiliency of torrents with how bloody long it takes to get a complete copy. I know that 1.5TB takes a massive chunk out of my already pretty full NAS, and I passed on seeding the first time for that reason.

It feels like they didn't really subdivide the database as much as they should have...

[–] maxprime@lemmy.ml 23 points 6 months ago (1 children)

There are plenty of small torrents. Use the torrent generator and tell the script how much space you have and it will give you the “best” (least seeded) torrents whose sum is the size you give it. It doesn’t have to be big, even a few GB is suitable for some smaller torrents.

[–] empireOfLove2@lemmy.dbzer0.com 20 points 6 months ago* (last edited 6 months ago)

Almost all the small torrents that I see pop up are already seeded relatively good (~10 seeders) though, which reinforces the fact that A. the torrents most desperately needing seeders are the older, largest ones and B. large torrents don't attract seeders because of unreasonable space requirements.

Admittedly, newer torrents seem to be split into 300gb or less pieces, which is good, but there's still a lot of monster torrents in that list.

[–] GravitySpoiled@lemmy.ml 6 points 6 months ago (1 children)

Thx.

Do you know how useful it is to host such a torrent? Who is accessing the content via that torrent?

[–] maxprime@lemmy.ml 7 points 6 months ago (1 children)

Anyone who wants to. I think a lot of LLM trainers access them.

[–] GravitySpoiled@lemmy.ml 1 points 6 months ago

Doesn't sound like I should host some of it. I'd be more down to host it for endusers

[–] HeartyOfGlass@lemm.ee 25 points 6 months ago (2 children)

Could anyone broad-stroke the security requirements for something like this? Looks like they'll pay for hosting up to a certain amount, and between that and a pipeline to keep the mirror updated I'd think it wouldn't be tough to get one up and running.

Just looking for theory - what are the logistics behind keeping a mirror like this secure?

[–] thanksforallthefish@literature.cafe 20 points 6 months ago* (last edited 6 months ago) (2 children)

Could be worth asking on selfhosted (how do I link a sub on lemmy ?) They probably have more relevant experience at this sort of thing.

Edit

Does this work ?

https://lemmy.world/c/selfhosted

[–] rufus@discuss.tchncs.de 8 points 6 months ago* (last edited 6 months ago) (1 children)

!datahoarder@lemmy.ml

Is probably more suitable. I'd be interested in the total size, though.

[–] catloaf@lemm.ee 3 points 6 months ago (1 children)

900 TB, according to other comments here.

[–] Illecors@lemmy.cafe 1 points 6 months ago (1 children)

Is it all or nothing sort of deal?

[–] catloaf@lemm.ee 2 points 6 months ago

There are partial torrents, also according to the other comments.

[–] pfaca@lemm.ee 6 points 6 months ago

It does. 😉

[–] umbrella@lemmy.ml 23 points 6 months ago (2 children)

how big is the database?

books can't be that big, but i'm guessing the selection is simply huge?

[–] xrtxn@lemmy.sdf.org 43 points 6 months ago (16 children)

The selection is literally all books that can be found on the internet.

[–] spiderman@ani.social 2 points 6 months ago* (last edited 6 months ago)

bigger than zlib or project Gutenberg?

load more comments (15 replies)
[–] redcalcium@lemmy.institute 14 points 6 months ago (1 children)

It is huge! They claimed to have preserved about 5% of the world’s books.

[–] umbrella@lemmy.ml 2 points 6 months ago

oh i actually tought it was way more! there wasnt a single book i wanted (or even tought to look up) that i didnt actually find in there.

[–] matcha_addict@lemy.lol 8 points 6 months ago (1 children)

I had no idea about this project. Is it like a better search engine for libgen etc?

[–] Andromxda@lemmy.dbzer0.com 13 points 6 months ago

It has way more content than Libgen