It's A Digital Disease!

This is a sub that aims at bringing data hoarders together to share their passion with like minded people.

101
 
 
The original post: /r/datahoarder by /u/I_LOVE_OIL_RIGS on 2024-12-30 20:34:47.

Sorry if this is the wrong place to ask.

I am trying to build a DAS, or something similar to a QNAP, with spare parts and things I have. I have four 4TB hard drives. I am connecting them via SATA to a USB C 3.2 to SATA adapter (https://www.amazon.com/dp/B0DHRSSHJB?linkId=a7d614b4ea73cee11d91c4bb59c5890e&language=en%5C%5C_US&ref%5C%5C_=as%5C%5C_li%5C%5C_ss%5C%5C_tl)

Not sure if anyone has used these in their setups or has experience with the JMB575/JMS580, but whenever I connect more than two drives, it is as if it cannot sync the drives in RAID, and write speeds drop from about 300MB/s to about 40MB/s. The drive lights do not come on at the same time and blink rapidly, which tells me these adapters are not suitable for this. This happens with two different adapters (2 drives per adapter), with all 4 drives on one adapter, with both adapters plugged into different USB C ports, etc.

Does anyone know of a good way to accomplish what I am wanting? I have the drives powered, I just need the data cables connected. Ideally all going down to one USB C cable to connect to anything.

102
 
 
The original post: /r/datahoarder by /u/Competitive_Fix3519 on 2024-12-30 19:53:13.

I recently learned you could download metadata, but I don't really get how to use it. Can someone explain it? Thanks in advance.

103
 
 
The original post: /r/datahoarder by /u/joshm1301 on 2024-12-30 19:43:32.

So I have been trying to find a source for numerous maps from the middle ages. Geographical maps, maps of battlefields, siege maps, etc.

I found this website: https://www.themaparchive.com/product/personal-subscription/

It looks like a solid source for a little bit of everything that I am wanting, however it costs around $3 per map, or you can buy a personal annual subscription for just under $90 and it allows you to download all the maps they have.

My question is, does anybody know of this website? I haven't been able to find much information from other people's experiences, and I want to make sure before dropping almost $100. Are there any other sources that cover maps from this long ago? I know David Rumsey is a good source, however they mostly specialize in maps from the 16th through the 21st century.

Thanks in advance!

104
 
 
The original post: /r/datahoarder by /u/bitman2049 on 2024-12-30 19:11:05.
105
 
 
The original post: /r/datahoarder by /u/handawanda on 2024-12-30 19:05:46.

Hey guys. I am trying to use the Chrome extension for Wayback Machine to save web pages that I have a paid subscription to, so that I can archive and share articles (which are not already captured on the Internet Archive) with friends. Unfortunately, when I do this, it seems to only capture the paywalled version, even though I have access. Here's an example of a page I tried to capture:

https://web.archive.org/web/20241230185013/https://www.washingtonpost.com/sports/2023/11/13/nfl-sacks-interceptions-qb-rating/

Is there a way to use the Wayback Machine how I'm intending? If not, is there a better alternative? I suppose I could just print to PDF and email the PDF, but I'm really wanting to circulate a hyperlink. Thanks!

106
 
 
The original post: /r/datahoarder by /u/illuanonx1 on 2024-12-30 18:46:38.

I had a Seagate EXOS 16TB disk to RMA after it developed sector errors. The disk is 4½ years old. These things happen over time, nothing special.

I paid for the shipment with UPS and sent the disk to the Netherlands (I'm in the EU), and Seagate confirmed receiving it. They sent a replacement a week later via UPS.

When it arrived at the package shop in my town, it was nowhere to be found. It was lost. Mistakes happen, that's okay. Maybe an early Christmas for the driver, who knows :)

I reported it to Seagate. They told me I could not open a case until they had confirmation from UPS that it was lost. That was my problem to obtain.

The first time I contacted UPS, they told me I needed Seagate to create the case, because they are the sender; I was not allowed to do that. Seagate disputed that when I called them back. Then I called UPS again, and now they suddenly could create the needed documentation. I got it and sent it to Seagate. Seagate created the claim for the lost drive 8 days later. So it should be easy from now on, right? Ha. Seagate: hold my beer and let me show my evil side.

It has now been 3 weeks. UPS closed the case last week, but Seagate is somehow still investigating. I cannot get any information from Seagate; the support supervisors don't know anything (not their department, and they cannot communicate with the right department). UPS has an NDA with Seagate, so they are not allowed to share any information about the case conclusion from their side. So I'm out of luck.

I have now spent 5 hours in Seagate chat, talking to multiple supervisors. The only way to describe it is support from hell. They know nothing, cannot do anything, and only apologize in every sentence. After 5 hours of back and forth they agreed to send a new drive. But they will not tell me if the other package was found or what happened to it. It's apparently none of my damn business. How dare I ask about it. So bizarre.

I can see from the tracking number on the UPS website:

12/10/2024 Available for pickup at package shop

12/18/2024 Claim in Progress

12/28/2024 Returning to Sender: Tracking number: xxx

So maybe they found it and sent it back to Seagate and not to me. Hard to tell. There is a tracking number, but the package doesn't seem to have been sent yet. It's only registered.

I tried to escalate the case and make Seagate aware of the problems. But the more I have interacted with Seagate, the more I think it's done deliberately. There is no real interest in providing good support. They could just call UPS for the information. Plain and simple.

Do others have the same experience and maybe some stories to share?

107
 
 
The original post: /r/datahoarder by /u/shiftysnowman on 2024-12-30 18:38:19.

tl;dr: New Stacher7 Available at https://stacher.io/


Hi all!

First of all, I want to thank everyone here who is reading this right now. Your support, feedback, and encouragement have been super uplifting and motivating.

Stacher version 6 was released back in 2019. It was a learning project for me. I have continued pushing out updates and features over the last few years, but frankly, the project didn't have a great foundation for building upon and its maintainability was poor.

Rather than continue updating version 6, I decided to take everything I learned and re-build Stacher from the ground up into a new version, Stacher 7.

Stacher7

Stacher 7 introduces the concept of having multiple yt-dlp configurations that you can quickly switch between. This should save you time from having to go into the settings every time you need to change something. Subscriptions are based on configurations so if you need to change a bunch of subscriptions at once, just change the single configuration rather than edit each subscription one by one.

Create A Configuration - [?] Button shows help

Use the cog wheel/settings icon in the upper right corner of Stacher 7 to access all the settings for your current configuration. You can change your current configuration from the upper left corner of the settings window.

Settings Window - Editing Default Configuration (see upper left)

Stacher 7 surfaces many more yt-dlp options which may be slow or tricky to find at times. You can search for a configuration and change it quickly with the CTRL + P hotkey (see full list of hotkeys in the Settings window) to open the "Configuration Spotlight"

Configuration Spotlight

Stacher 7 should do everything that the current Stacher 6.x can do, plus more. It can be as simple or as sophisticated as you need it to be. A new "Pro Mode" allows you to access the more advanced features in Stacher and yt-dlp.

Many of you have reported bugs and feature requests in the sub and have been patiently waiting for them to arrive, and I haven't forgotten about you. Hopefully a lot of those requests have been addressed in Stacher 7. A few things (like yt-dlp plugin support) aren't in just yet, but I still intend on getting those pushed out in a future update.

The subreddit sees regular posts related to ffmpeg not being installed or having trouble with getting it installed. Stacher 7 will detect if ffmpeg is not installed and will show a status indicator with options to install ffmpeg manually from a built zip or automatically by pulling from the official ffmpeg releases.

(Some) Feature Highlights

The primary goal with this release is to ensure there is no regression in features from 6 to 7. Because Stacher 7 was built to be more maintainable and to follow best practices, adding features should come easier and updates should be more frequent.

Although the UI is very similar, Stacher 7 is a big change from Stacher 6. Because of this, Stacher 7 WILL NOT be pushed out as an automatic update for Stacher 6. Instead, you can have both of these installed on your system at the same time. Stacher 7 will install as "Stacher7".

Stacher 7 is available for:

  • Windows
  • MacOS (Intel)
  • MacOS (Silicon)
  • Ubuntu/Debian

For more information and download, check the official homepage at: https://stacher.io/

If you have any questions, comments, concerns, feedback, or whatever, don't hesitate to comment in this thread or post in the subreddit directly. You can also use the in-app feedback form in the lower left corner of Stacher7. The feedback form allows you to attach yt-dlp logs from failed downloads if you are having trouble with something specifically.

I'm sure there will be a few bugs here and there that might require quick updates. If you run into anything that doesn't seem right, please let me know!

-shiftysnowman

108
 
 
The original post: /r/datahoarder by /u/Pretty-Road-8538 on 2024-12-30 18:24:46.

I plan to set up a small home NAS with the sole purpose of running Calibre, so that I can store ebooks and PDFs of books and quickly access them for consultation. I plan to store several thousand books, so I do not need a large amount of space. But I do want to be able to access them quickly from anywhere.

I am based in Europe, and between the NAS (e.g., DS723+) and HDDs, it would cost me more than 700 Euros. Are there cheaper alternatives that offer similar benefits? Thank you.

109
 
 
The original post: /r/datahoarder by /u/sendlewdzpls on 2024-12-30 18:20:35.

Looking to pick up a multi-bay DAS so I have room to expand. I’d ideally like 4 bays and to keep it as inexpensive as possible. I was looking at the Mediasonic HFR7-SU31CH, Mediasonic HF2-SU3S3, and Orico 9758C3, but I figured I’d ask here before pulling the trigger.

What do you guys think? Any of these worth getting? Or is there something else I should be looking at?

110
 
 
The original post: /r/datahoarder by /u/BuritoBear on 2024-12-30 17:47:14.
111
 
 
The original post: /r/datahoarder by /u/Vast_Understanding_1 on 2024-12-30 16:24:59.

I now have 6 drives of 12TB each. This NAS will be used for media storage (non-important: movies, shows, and some bulky home videos), and I want minimum redundancy until I can afford a true external backup solution. I heard that SnapRAID is good for media redundancy, since there are not a lot of writes per drive and realtime parity isn't necessary. There is also RAID5 / RAID6.

Only 2 drives are already in use; 4 were added recently and are still empty, due to my mergerfs policy and the thorough testing I did before adding them (SMART / check disk).
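
For comparing the options mentioned above, here is a rough capacity sketch. It assumes six equal 12TB drives and counts only the space given up to parity; the function name `usable_tb` is made up for illustration, and filesystem overhead and TB/TiB differences are ignored.

```python
def usable_tb(drive_count: int, drive_tb: float, parity_drives: int) -> float:
    """Usable capacity when `parity_drives` drives' worth of space holds parity."""
    if parity_drives >= drive_count:
        raise ValueError("need at least one data drive")
    return (drive_count - parity_drives) * drive_tb


# Parity counts for the layouts named in the post (equal-size drives assumed).
layouts = {
    "RAID5 (1 parity)": 1,
    "RAID6 (2 parity)": 2,
    "SnapRAID, 1 parity drive": 1,
    "SnapRAID, 2 parity drives": 2,
}

for name, parity in layouts.items():
    print(f"{name}: {usable_tb(6, 12, parity):.0f} TB usable")
```

On raw capacity the options come out the same for a given parity count, so the choice is more about realtime versus scheduled parity and how the drives are pooled than about space.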

112
 
 
The original post: /r/datahoarder by /u/BigMickDo on 2024-12-30 15:23:00.

Using AI to proofread my post, so pardon the robotic sound.

Subject: Seeking Advice: Organizing Medical Records with Paperless-ngx (No EMR Available)

Introduction

  • I'm from Egypt, where Electronic Medical Records (EMRs) are not commonly used.
  • I'm experiencing chronic health issues and have accumulated a large volume of medical documents over the years.
  • These documents are currently disorganized and scattered.
  • I'm struggling to find doctors I trust, and each new doctor essentially restarts the diagnostic process, even when I provide history.
  • My goal is to take control of my medical records by digitizing and organizing them myself.

My Plan

  • I intend to use Paperless-ngx to manage my digital medical records.
  • I will use Microsoft Lens to scan paper records, lab results, and imaging reports (CT/MRI). Unfortunately, I don't have access to DICOM files from previous labs.
  • I plan to ask doctors for detailed notes after each visit and add them to my digital records.
  • Initially, I'll scan all related documents for each specialty into a single PDF (e.g., all eye scans in one file, all ENT documents in another).
  • New documents for each specialty will be scanned into separate PDFs.

Challenges and Questions

  1. Organization System:
    • I'm looking for a robust organizational system (methodology) within Paperless-ngx to keep my records easily accessible and understandable.
    • Any tips or best practices for tagging, categorizing, or structuring medical documents within Paperless-ngx would be greatly appreciated.
  2. Scanning Quality:
    • My phone camera quality isn't ideal, and I have shaky hands, which can affect scan clarity.
    • I'm currently using an overhead lamp positioned opposite me to minimize shadows, but I'm open to better techniques.
    • Any suggestions for improving scan quality with Microsoft Lens or alternative scanning methods are welcome.
    • Consider a phone stand or tripod for stability.
  3. Data Extraction and Tracking:
    • Paperless-ngx offers OCR, which is helpful, but I also need a way to track specific data points over time (e.g., eye pressure readings on different dates). I would also need to attach notes to certain data points.
    • Are there any features within Paperless-ngx or complementary tools that can help with this?
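
One possible complementary approach for the tracking question, independent of Paperless-ngx, is a plain CSV log kept next to the scans. The sketch below is only an illustration: the `Reading` fields, the metric name, and the sample values are made-up placeholders, not real measurements or a Paperless-ngx feature.

```python
import csv
import io
from dataclasses import dataclass
from datetime import date


@dataclass
class Reading:
    taken_on: date
    metric: str       # hypothetical label, e.g. "eye_pressure_right"
    value: float
    unit: str
    note: str = ""    # free-text note attached to this data point


def history(readings: list[Reading], metric: str) -> list[tuple[date, float]]:
    """Return (date, value) pairs for one metric, oldest first."""
    picked = [r for r in readings if r.metric == metric]
    return [(r.taken_on, r.value) for r in sorted(picked, key=lambda r: r.taken_on)]


def to_csv(readings: list[Reading]) -> str:
    """Serialize all readings to CSV text, sorted by date."""
    buf = io.StringIO()
    writer = csv.writer(buf)
    writer.writerow(["date", "metric", "value", "unit", "note"])
    for r in sorted(readings, key=lambda r: r.taken_on):
        writer.writerow([r.taken_on.isoformat(), r.metric, r.value, r.unit, r.note])
    return buf.getvalue()


# Placeholder data, for illustration only.
log = [
    Reading(date(2024, 6, 1), "eye_pressure_right", 18.0, "mmHg", "after new drops"),
    Reading(date(2024, 3, 1), "eye_pressure_right", 21.0, "mmHg"),
]
print(history(log, "eye_pressure_right"))
print(to_csv(log))
```

A file like this opens in any spreadsheet program, and each row can mention the matching scanned document in the note column.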

Specific to Egypt

  • If anyone has experience managing medical records in a similar context (without EMRs), I'd love to hear your insights.

Thank You

113
 
 
The original post: /r/datahoarder by /u/impreza233 on 2024-12-30 15:15:57.

I'm saying this here because I don't know where I can get support. Here's my issue:

-I bought HDTunePro 6.01 some months ago. I have my serial and the email from FastSpring about my purchase.

-The website says that upgrades to a minor version are free: https://www.hdtune.com/buy.html In theory I should be able to download and install version 6.10 with my serial for free. Problem: I can't. I sent an email to [email protected] and they have not answered my question. I have also written to [email protected] and got nothing.

-Also I can't redownload HDTunePro 6.01.

114
 
 
The original post: /r/datahoarder by /u/V12daimler on 2024-12-30 13:36:13.

First I'm no IT person but as an industrial engineer I'm not completely stupid so please bear with me.

I currently have GSuite for my family domain (six users), which everyone has used and loved for years now. I have the low tier one, which is 30GB per user. Total £360 per year.

Additionally, I myself have a 2TB Dropbox account for many years now, which is about half full. Total £360 per year.

Additionally still, five out of the six users have iPhones so I'm paying for a family account for that for password/Whatsapp backups and so on. Total £40 per year.

I really like Dropbox but I'm paying for too many different things. I first thought just to increase the Google storage to 2TB and get rid of my Dropbox account but doing that would double the GSuite cost.

Then I thought about migrating to Office 365 at 1TB per user for £80 per year in total BUT the main issue with that is that in Gmail I have thirty years of meticulously labelled emails and that would be ruined or duplicated.
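
To put the figures above side by side, here is a small tally. It uses only the prices quoted in the post; the option groupings are assumptions (for example, that Office 365 would replace both GSuite and Dropbox while the iPhone family plan stays), and the labels are made up for illustration.

```python
# Yearly costs in GBP, as quoted in the post.
GSUITE = 360          # six users, 30GB each
DROPBOX = 360         # 2TB personal account
IPHONE_FAMILY = 40    # family plan used for phone backups
OFFICE_365 = 80       # quoted total for 1TB per user

options = {
    "Current setup": GSUITE + DROPBOX + IPHONE_FAMILY,
    # Assumption: the 2TB Google upgrade doubles GSuite and replaces Dropbox.
    "Upgrade GSuite, drop Dropbox": 2 * GSUITE + IPHONE_FAMILY,
    # Assumption: Office 365 replaces GSuite and Dropbox entirely.
    "Move to Office 365": OFFICE_365 + IPHONE_FAMILY,
}

for name, total in options.items():
    print(f"{name}: £{total} per year")
```

Under these assumptions, upgrading GSuite costs the same per year as the current setup, so it simplifies things without saving money, while the Office 365 route is far cheaper but runs into the Gmail label problem.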

My friend has suggested to get a VPS and run OwnCloud but frankly I have too much to do in my life without also learning to become a sysadmin.

So I am looking for any and all advice - what would you do?

115
 
 
The original post: /r/datahoarder by /u/ChillyPotatoFries on 2024-12-30 11:46:49.

https://www.youtube.com/watch?v=Gnfa7bajnWw

If someone can please send me an mp4 or mp3 file: I've been trying to retrieve this video with many methods, but I couldn't.

116
 
 
The original post: /r/datahoarder by /u/Agreeable_Repeat_568 on 2024-12-30 11:09:09.

I need a low-power way to add SATA ports to my Unraid VM on Proxmox. I am passing through the onboard SATA now, but I need more. I have been reluctant to use an HBA due to power use, as I am trying to keep this server as low power as possible (at least at idle).

I was originally thinking of using a dual M.2 NVMe PCIe card with 2x ASM1166 M.2 SATA adapters for low-power SATA expansion. My problem: when testing with a 6x SATA PCIe expansion card (ASM1166), the card shared the ahci driver with the onboard SATA that is running the Proxmox OS, making it really hard to blacklist the expansion card's driver so that Proxmox can never load the disks on the expansion card. I am hoping a proper HBA would get around this.

117
 
 
The original post: /r/datahoarder by /u/blackberriesandthink on 2024-12-30 09:08:43.

Or is that already done at the factory behind the scenes?

What about in a Synology SHR1 NAS environment?

I'm going to run 6x 8TB QVO drives in a DS620slim in SHR1 config.

If the QVO SSDs do need overprovisioning, what percentage is recommended?

Bought the NAS and SSDs; now I need to buy the backup drives to finish the setup.

I'm trying to figure out and buy my NAS's backup external USB hard drives while the holiday sales are still on and still in stock. Stupid good deal on WD 18TB USB external hard drive right now, two of them might just be enough if I do need to overprovision, might not be if I don't. Not such a great deal on the WD 20TB USB hard drives if I have to go that route.
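
The backup sizing question can be sketched with some quick arithmetic. This assumes SHR1 with six equal 8TB drives gives up one drive to redundancy, and that overprovisioning simply leaves a percentage of the pool unused; the percentages below are illustrative values, not recommendations, and filesystem overhead is ignored.

```python
DRIVES = 6
DRIVE_TB = 8
BACKUP_TB = 2 * 18  # two 18TB external drives

# SHR1 with equal-size drives: one drive's worth of space goes to redundancy.
pool_tb = (DRIVES - 1) * DRIVE_TB


def usable_after_op(pool: float, op_percent: int) -> float:
    """Space left for data if `op_percent` of the pool is left unallocated."""
    return pool * (100 - op_percent) / 100


for op in (0, 7, 10, 20):  # illustrative overprovisioning percentages
    usable = usable_after_op(pool_tb, op)
    verdict = "fits" if usable <= BACKUP_TB else "does not fit"
    print(f"{op}% OP: {usable:.1f} TB usable, {verdict} on {BACKUP_TB} TB of backup")
```

Under these assumptions a full pool only fits on two 18TB drives once about 10% is set aside, which lines up with the "might just be enough" hunch in the post. A pool that is not full needs less backup space either way.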

118
 
 
The original post: /r/datahoarder by /u/ds3534534 on 2024-12-30 08:31:18.

Thought I'd share an interesting (probably well-known) TIL observation.

I'm backing up around 900,000 JPEGs and XMP - even locally is fine, it's just a 'get out of jail free' copy before I start messing about on the original.

These files are stored on a 12yo HP Microserver Gen8, running a Celeron CPU, with a 4x10TB hardware RAID5 5200rpm array, running VMWare ESXi 5.5, and Win10 on top of that. Horribly slow, of course.

I tried a few different options, but trying to copy those files was going to take at least a day, maybe 20-30.

The optimum method I've currently landed on, is this:

  • External (old 5200rpm SATA) drive in spare USB3 caddy
  • USB3 caddy mounted as new USB device in VMWare
  • Use 7Zip in Windows to archive an entire folder of ~100,000 JPGs, with 0 compression, from the source (Win VDisk on Hardware RAID5) to dest (NTFS-formatted USB Drive)

I had tested the USB drive at 150MB/s write speed using large movie files, which is acceptable enough. It was also twice as fast as an internal drive-to-drive copy within the RAID5 array, even though it's on a max-RAM hardware RAID card.

However, Windows-copying the small JPGs to backup to the external NTFS was running at only 100kB/s, no doubt due to NTFS overhead on an old spinning rust drive.

So - what I've found is fastest, is to use 7Zip at 0 compression to write the backups to the external drive. Even with my puny 2-core Celeron CPU, I'm getting 60-90MB/s sustained rates from the RAID5 array to external drive, against the previous best-case of 150MB/s for single large files.

Surprisingly, running 10 x 7Zip archive jobs at zero compression in parallel, it seems the parallel runs are faster than a single run (which ran at 20-30MB/s). I would have thought the high parallel copies would be slower than some optimal lower count of 2-3 copies, but it seems not.

At this rate, I'll back up 900,000 small files totalling 1TB in around 3-4hrs, which is way better than every other solution I had tried.
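
As a sanity check on that estimate, here is the arithmetic using the rates quoted above. It assumes 1 TB means 1,000,000 MB (decimal units) and that each rate is sustained for the whole copy; the function name is made up for illustration.

```python
def hours_to_copy(total_mb: float, rate_mb_per_s: float) -> float:
    """Hours needed to move `total_mb` at a constant `rate_mb_per_s`."""
    return total_mb / rate_mb_per_s / 3600


TOTAL_MB = 1_000_000  # 1 TB in decimal megabytes

rates = [
    ("7-Zip store, low end (60 MB/s)", 60),
    ("7-Zip store, high end (90 MB/s)", 90),
    ("plain Windows copy of small files (100 kB/s)", 0.1),
]

for label, rate in rates:
    print(f"{label}: {hours_to_copy(TOTAL_MB, rate):.1f} h")
```

The 60-90MB/s range works out to roughly 3.1 to 4.6 hours, while the 100kB/s plain copy would take thousands of hours, which is why the difference between methods matters so much here.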

So my learning is that it seems 7Zip with 0 compression is the answer for copying small files far faster than other methods, running at near-(old) disk speeds even on a 12yo small Celeron CPU.
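
The same idea, writing many small files into one uncompressed archive so the destination sees a single large sequential write, can be sketched with Python's standard `zipfile` module. This is not 7-Zip and has not been benchmarked against it; the function name and the paths in the usage comment are hypothetical.

```python
import zipfile
from pathlib import Path


def store_folder(src: Path, archive: Path) -> int:
    """Bundle every file under `src` into one uncompressed ZIP; return the file count.

    ZIP_STORED writes the bytes as-is (no compression), comparable to
    7-Zip's "0 compression" setting described in the post.
    """
    count = 0
    with zipfile.ZipFile(archive, "w", compression=zipfile.ZIP_STORED) as zf:
        for path in sorted(src.rglob("*")):
            if path.is_file():
                # Store paths relative to the source folder.
                zf.write(path, arcname=path.relative_to(src).as_posix())
                count += 1
    return count


# Hypothetical usage; keep the archive outside the source folder:
# store_folder(Path("D:/photos/2012"), Path("E:/backup/photos-2012.zip"))
```

Running one call per top-level folder would mirror the "one archive per ~100,000 JPGs" layout from the post.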

119
 
 
The original post: /r/datahoarder by /u/Striker3737 on 2024-12-30 07:23:30.

Hello,

I'm looking to get a backup HD for my Plex server. I already have a Seagate Ironwolf Pro 12 TB in an enclosure. Amazon has a 20 TB Seagate external drive for $280, and a WD 16 TB external for $269. I can't find internal/Ironwolf Pro/WD Red drives for anywhere near that price. What should I do?

120
 
 
The original post: /r/datahoarder by /u/milkygirl21 on 2024-12-30 07:05:16.

https://i.imgur.com/cNqvXHw.png

Just got my 12TB Barracuda Pro, ran the long generic test, and it passed. I started transferring large videos (e.g. 30GB each) from my SN980, and I noticed many troughs in the transfer speed. I didn't notice this on normal consumer drives, so I just want to ask: are those dips normal and expected? If I were transferring only small files, I would understand, but it doesn't make sense for large files.

121
 
 
The original post: /r/datahoarder by /u/pororoca_surfer on 2024-12-30 06:42:12.
122
 
 
The original post: /r/datahoarder by /u/testmyfist on 2024-12-30 05:55:32.

Hey everyone, i'm planning to purchase a NAS sometime in 2025 and i wanted to get as much info as i can before then so i don't waste time when i have to buy these. Here are the things that i want to purchase; please let me know if this is a good setup and what else i can add to make this as easy as possible. I've added the links and the product names in case you don't want to open the links, along with a few doubts and plans for this setup.

NAS Device: https://www.amazon.in/Synology-DS923-Desktop-Ryzen-R1600/dp/B0BKPXZDCY?nsdOptOutParam=true

Synology DS923+ 4-Bay Diskstation NAS (AMD Ryzen™ 4 Threads R1600 Dual-Core 4GB Ram 2xRJ-45 1GbE LAN-Port)

HDD for NAS : https://www.amazon.in/dp/B0CS49MZ1X

Synology HAT3310 8TB Plus Series SATA HDD 3.5" (HAT3310-8T)

Router it'll be connected to: https://www.amazon.in/TP-Link-M4-Seamless-Parental-Qualcomm/dp/B07L44RHC2

TP-Link Deco M4 Whole Home Mesh Wi-Fi System, Seamless Roaming and Speedy (Ac1200), Dual_Band, Work with Amazon Echo/Alexa, Router and Wi-Fi Booster, Parental Control, Pack of 1, Qualcomm CPU

I want to set up a NAS with 32 TB in total, with one of the 8TB drives used for RAID (i'm guessing that's the backup in case a drive fails?). I want to set up the NAS for 4 people including me.

I want all 4 users to have their own username and storage so each of us cannot see what's in the other's folder. i saw the phone app having a feature like this but i did not really understand how this works.

I'll be the one who'll be using this to its full extent, not only for my personal photos but also for archiving my work files and a large number of YT videos i've been collecting for a while now.

I want to be able to drop files into the NAS from my main PC, which has ethernet connection from the router. I'm guessing this would work more like an external HDD that i can just drop files to?

Also what are these 10 gig ethernet ports? i don't understand them

If i'm missing something or add something more to this, please let me know and i can plan my budget accordingly.

Edit: changed the link for the NAS device as i added one with the drives which i did not want to add

123
 
 
The original post: /r/datahoarder by /u/PM_ME_UR_BOOOBSS on 2024-12-30 05:38:21.

First, if you clicked this and haven't heard of Stash and would like to keep your more... sensitive... collections organized, it's pretty neat and can be found here: https://github.com/stashapp/stash. (I'm not affiliated with Stash, I just use it every day.)

Second, if you are using Stash but haven't configured StashDB you're missing out. Don't be like me and accumulate about 8TB of videos and just find out about it. Information can be located here:

https://guidelines.stashdb.org/docs/faq_getting-started/stashdb/

In short, StashDB along with ThePornDB (and subsequently fansDB) make properly tagging and organizing your collection a breeze and much better than the normal community scrapers. It'll add associated performers, scene codes, tags, links to the scene, good scene covers, etc.

Properly tagged, dated, and linked scenes just warm my heart.

That is all. I'm sure quite a few people in the sub knew about that little addition, but if not, there ya go.

Edit: Follow-up tip. StashDB is good for professional scenes, but may cause some issues with improper tagging of some of your more amateur or semi-pro content. It's ok, ThePornDB references fansDB which scrapes from some of the more popular amateur stuff and does a pretty good job of recognizing some scenes.

Make sure you generate phashes before attempting to use these, as that's what the DBs use.

124
 
 
The original post: /r/datahoarder by /u/Career-Acceptable on 2024-12-30 01:35:32.

Not having a ton of luck trying to scrape some scenes I ripped off of DVDs with handbrake. Does it even work like that? Literally just set it up today.

125
 
 
The original post: /r/datahoarder by /u/onesixzoo on 2024-12-30 04:57:20.

I have a Samsung t7 2tb portable hard drive filled with movies.

My question is, which is better for hard drive longevity - is it better to copy a movie from the hard drive each time to my desktop and watch it from there, or just always play it off the hard drive?

I watch a couple of movies a night. Is it better to plug n play and absorb the 3 hours a day of usage and wear and tear, or is it better to read and copy 2 movies (approx 2 GB) a day? Which is better for the SSD lifespan?

Any help much appreciated.
