Aussie living in the San Francisco Bay Area.
Coding since 1998.
.NET Foundation member. C# fan
https://d.sb/
Mastodon: @[email protected]

  • 5 Posts
  • 1.63K Comments
Joined 3 years ago
cake
Cake day: June 14th, 2023

help-circle

  • Syncthing is pretty good.

    I tried seafile and it kept going down and corrupted a lot of files after an unexpected server shutdown. It shared the corruption to all the local files on every app/pc I had it shared to.

    This sounds like an issue with your server rather than with Seafile specifically. Was the unexpected shutdown due to a power outage? You should have a UPS so that it can properly shut down during outages. You’ll hit similar issues with any other system otherwise.




  • If your scanner supports scanning to a network share, install Samba on your Pi and share the paperless-ngx incoming directory. My ScanSnap iX1600 supports this, but I’m not familiar with other models. I had to configure the scanner using the Windows app to add the SMB details, but once it’s configured, it works without a computer attached.

    Paperless-ngx also supports email. You can set up a separate email account for it, then forward it any documents you want to keep to it.

    For documents you need to keep a physical copy of, use ASNs (archive serial numbers) to correlate the physical and virtual copy. You can use QR code stickers to automatically set the ASN in paperless-ngx. I posted a nested comment with more details about this.

    Consider using paperless-ai to use an LLM to tag and title your scanned documents automatically. It needs a webhook to be configured. Consider a local model if possible, and if you want to use a hosted model, review the provider’s privacy policy to ensure they do NOT train the AI on user content.


  • And file away your scanned papers separately,

    I’d recommend using ASN (archive serial numbers) for documents you store a physical copy of, following the recommended flow

    I printed ASN QR code stickers, using the smallest Avery labels I could find (Avery 5267 in the USA, L4731REV-25 in Europe) along with their free online design app.

    For documents I want to keep, I stick a QR code sticker on them before scanning. Paperless-ngx automatically detects the QR code and sets the ASN. I then file it away in a folder that’s sorted by ASN. When I need to find the physical copy again, I first look in Paperless to find the ASN, then find the document in the folder (pretty quick since all documents are sorted).

    You’ll need to set the following settings:

    PAPERLESS_CONSUMER_ENABLE_BARCODES=true
    PAPERLESS_CONSUMER_ENABLE_ASN_BARCODE=true
    PAPERLESS_CONSUMER_BARCODE_SCANNER=zxing
    




  • Open source projects are particularly vulnerable here since anybody can just grab the source and throw an LLM at it to see if it can find exploits.

    On the other hand, this means that they should end up more secure. Open-source projects get far, far more vulnerability testing than closed-source projects. Security holes in closed-source systems can exist for years at a time, which is how things like the Pegasus malware work (undisclosed security holes).






  • Companies sometimes sell their own first-party data, but not nearly as often as people think. If a company has data that other companies don’t have, a lot of the time they’ll want to keep it for themselves, since it can give them a competitive advantage over other platforms.

    If Amazon knows what movies and TV shows you like, they’re going to use that data to improve ad performance on their own platforms - suggested content on Prime Video, product ads on Amazon, etc. They’re not going to give it to some other company to use.

    The one major exception to that are data brokers. These are companies that only exist to sell data. These are less well known companies. They often use public data and combine it with things like supermarket loyalty data and purchase history.


  • For a beginner, I’d probably stick to Github initially, just because there’s so many guides and tutorials on how to use it, and their free plan is still pretty generous.

    A lot of the knowledge is transferable though. If you do want to try something else, Codeberg is pretty good for open-source.

    To just learn about Git, you don’t even need a host like Github or Codeberg. You can have a Git repo just on your computer, and still get a bunch of the benefits of source control - a full history of everything, separate branches and worktrees so you can have multiple incomplete changes and switch between them, etc.




  • dan@upvote.autoSelfhosted@lemmy.worldPaperless
    link
    fedilink
    English
    arrow-up
    7
    ·
    15 days ago

    I felt like a grown up once I got my paperless-ngx setup up and running.

    I have a Scansnap ix1600 scanner. Everything is automated once I insert a document and click the button to scan it.

    1. Scanned documents are saved to an SMB share on my home server - it’s a built-in feature on the scanner.
    2. Paperless-ngx is watching that folder and grabs the files.
    3. Paperless-ai uses AI to add metadata to document (title, tags, correspondent).

    For documents I need to keep a physical copy of, I give each document a consecutive ASN (archive serial number) using QR code stickers. When importing the document, paperless-ngx sees the barcode and attached the correct archive number to the document.

    If I need to find the physical copy, I first find it in Paperless-ngx, look at the archive number, then look in a folder where the documents are arranged by archive number. Easy.