• ColeSloth@discuss.tchncs.de
    link
    fedilink
    English
    arrow-up
    9
    ·
    5 hours ago

    I know there’s a lot of music out there, but I would have guessed the number was still less than 86,000,000 on Spotify.

      • ColeSloth@discuss.tchncs.de
        link
        fedilink
        English
        arrow-up
        2
        ·
        2 hours ago

        I mean, if one song was released every 10 seconds, it’d take 28 years to hit 86 million songs. That sounds like a lot of Spotify.

    • thatsTheCatch@lemmy.nz
      link
      fedilink
      English
      arrow-up
      5
      ·
      edit-2
      3 hours ago

      I read a bit of Anna’s article and if I remember correctly there are something like 256 million tracks on Spotify. Mind you this includes things like white noise tracks.

      But 86 million tracks represent 99.6% of listens or something like that. Most tracks don’t get played much if at all

    • addie@feddit.uk
      link
      fedilink
      English
      arrow-up
      16
      ·
      9 hours ago

      Looks like you can get refurbished 26TB drives for about £340, so 12 of those. PCIe -> 6x SATA adaptors run you about £40 each. Molex to SATA power adaptors about £5. So £4200 will let you store all that with a bit left over for postage and some duct tape to make a storage bay out of the boxes it all came in.

      I’d probably want a few more drives for RAID6 and some hot spares, but if you go JBoD then at least you can just download the torrent again ;-)

      • rekabis@lemmy.ca
        link
        fedilink
        English
        arrow-up
        5
        arrow-down
        3
        ·
        6 hours ago

        Research RAID more effectively.

        RAID-10 is far more efficient not only as a transfer speed but also as redundancy across large arrays. It’s only nerf is storage inefficiency.

        RAID-6 requires serious computing oomph to create the parity bits, which dramatically slows down writes and rebuilds. It also needs only two drive losses across any one array before the whole array dies. Conversely RAID-10 has only duplication, no parity, so compute load is far lower and writes/rebuilds are a lot faster, and it can have up to half of all drives fail before the array is irretrievably broken.

        • addie@feddit.uk
          link
          fedilink
          English
          arrow-up
          4
          ·
          4 hours ago

          Yeah yeah, I know RAID.

          If OP can’t afford the storage for ‘just a bunch of disks’, then paying twice as much for 100% redundancy in RAID10 is doubly unaffordable.

          Also, consider what is being stored here. It’s music files that we obtained from a torrent. We need sufficient raw performance to read a few megabytes per minute so we can listen to them. As a bonus, we may wish to upload the torrent again, and can use any spare capacity for that. What benefit are you going to obtain from your very expensive storage solution?

          RAID6 can lose any two drives, but at most two. RAID-10 can lose only 1 drive with guaranteed no data loss. Losing two might lose the cluster, if you lose a drive and its mirror. Yes, if you’re really lucky, you can lose up to half, but ‘feeling lucky’ isn’t how we plan data storage. Doesn’t matter, we’ve got a backup - download the torrent again ;-)

  • Foni@lemmy.zip
    link
    fedilink
    English
    arrow-up
    170
    ·
    16 hours ago

    If they get caught, they can always say it was to train an AI model

    • red_bull_of_juarez@lemmy.dbzer0.com
      link
      fedilink
      English
      arrow-up
      2
      ·
      4 hours ago

      News like this make me sad. I have not been able to replicate the amazing experience I had with Audiogalaxy or Soulseek. By looking for things you liked you could then search what the people who had that music also had lying around and I discovered so much great stuff that way. My taste in music is extremely eclectic but since I’ve been limited to Spotify, I am rarely discovering weird music anymore. Fuck corporations.

      • Hate@lemmy.dbzer0.com
        link
        fedilink
        English
        arrow-up
        1
        ·
        3 hours ago

        I have not been able to replicate the amazing experience I had with Audiogalaxy or Soulseek. […] My taste in music is extremely eclectic but since I’ve been limited to Spotify, I am rarely discovering weird music anymore. Fuck corporations.

        Soulseek is still alive

    • LesserAbe@lemmy.world
      link
      fedilink
      English
      arrow-up
      7
      ·
      10 hours ago

      This is interesting, thanks!

      Don’t know that I would be active enough for what.cd, but I am drawn to archiving, lossless files and comprehensive metadata.

      I ripped my hundreds of CDs to FLAC and spent a good while organizing before Spotify came along with its temptations. Have started buying again on Bandcamp lately and been thinking about spinning up jellyfin.

      • xvertigox@lemmy.world
        link
        fedilink
        English
        arrow-up
        2
        ·
        4 hours ago

        Musicbrainz Picard + Emby/Jellyfin is a great combo. Emby servers double as navidrome servers so you can use myriad apps to listen.

  • shameless@lemmy.world
    link
    fedilink
    English
    arrow-up
    2
    ·
    10 hours ago

    Damn, I’d be open to if somone could host this, I’d actually pay a monthly subscription to an individual rather than spotify

  • CallMeAnAI@lemmy.world
    link
    fedilink
    English
    arrow-up
    4
    arrow-down
    20
    ·
    edit-2
    12 hours ago

    Who actually cares? This stuff was already available.

    It is this just bait to get y’all riled over “sticking it” to Spotify over what amounts to an utterly inconsequential action?

    • eli@lemmy.world
      link
      fedilink
      English
      arrow-up
      1
      ·
      4 hours ago

      Personally I think the metadata alone is pretty valuable. Being able to use it as an agent for something like my Plex library would be great from my understanding.

    • LesserAbe@lemmy.world
      link
      fedilink
      English
      arrow-up
      3
      arrow-down
      1
      ·
      10 hours ago

      “Most music is terrible” so I’m not particularly drawn to having this vast archive. I want to listen to the things I really like.

      It’s also not lossless, so from an archive standpoint that seems to diminish its value.

      That said, I do think the insights they post on their blog about statistics and distribution are interesting. And just because that music is currently available via paid services doesn’t mean it will always be as accessible in one place. There would be a lot of manual collection and labeling you’d have to do to get something like this otherwise. And you wouldn’t have nearly as much confidence about how comprehensive such a database was if you did it yourself.