Did I just brick my SAS drive?

I was trying to make a pool with the other 5 drives and this one kept giving errors. As a complete beginner I turned to gpt…

What can I do? Is that drive bricked for good?

Don’t clown on me, I understand my mistake in running shell scripts from AI…

EMPTY DRIVES NO DATA

The initial error was:

Edit: sde and sda are the same drive; the name just changed for some reason. Also, I know it was 100% my fault and preventable 😞

**Edit:** from LM22 (Linux Mint 22), output of sudo sg_format -vv /dev/sda

BIG EDIT:

For people that can help (btw, thx a lot), some more relevant info:

Exact drive model: SEAGATE ST4000NM0023 XMGG

HBA model and firmware: lspci | grep -i raid shows 00:17.0 RAID bus controller: Intel Corporation SATA Controller [RAID mode]. It’s an LSI card. Bought it here

Kernel version / distro: I was using TrueNAS when I formatted it. Now troubleshooting on another PC running Linux Mint 22 (kernel 6.8.0-38-generic)

Whether the controller supports DIF/DIX (T10 PI): output of lspci -vv

Whether other identical drives still work in the same slot/cable: yes, all the other 5 drives worked when I set up a RAIDZ2, and a couple of them are the exact same model of HDD

COMMANDS This is what I got for each command: verbatim output from

Thanks for all the help 😁

  • rook@lemmy.zip (OP)
    22 hours ago

    Thanks for the continued support! ❤

    I’ve attached an identical Seagate SAS drive from the server.

    To confirm, it is the same LSI card that was in the TrueNAS server. I pulled it out of the server and put it into the troubleshooting machine, where I run the commands.

    It is this one: 01:00.0 Serial Attached SCSI controller [0107]: Broadcom / LSI SAS2308 PCI-Express Fusion-MPT SAS-2 [1000:0087] (rev 05)

    I did not see your other reply lol, I will also try this command that you recommended:

    sudo sg_format --format --size=512 --fmtpinfo=0 --pfu=0 /dev/sdb

    Also, the sg_format ran for less than 5 minutes, very quick. However, if I can recall, it did say it was completed.

    **Note:** the “bricked” drive is now sdb

    Identical working drive installed as sda

    Here is the dmesg -T > dmesg-full.txt with the identical drive

    Here is the output of the following commands (for each drive, separately):

    sudo lspci -nnkvv

    sudo lsblk -o NAME,MODEL,SIZE,PHY-SEC,LOG-SEC,ROTA

    sudo fdisk -l /dev/sdX

    sudo sg_inq -vv /dev/sdX

    sudo sg_readcap -ll /dev/sdX

    sudo sg_modes -a /dev/sdX

    sudo sg_vpd -a /dev/sdX

    Thanks again for all the help, I await your reply. :)

    I will let you know the results of (sudo sg_format --format --size=512 --fmtpinfo=0 --pfu=0 /dev/sdb) as soon as it’s done.

    • y0din@lemmy.world
      16 hours ago

      Thanks for the update, that’s helpful.

      Confirming that the controller is a Broadcom / LSI SAS2308 and that it’s the same HBA that was used in the original TrueNAS system removes one major variable. It means the drive is now being tested under the same controller path it was previously attached to.

      The device mapping you described is clear:

      sda = known-good identical drive

      sdb = the problematic drive

      Running:

      sudo sg_format --format --size=512 --fmtpinfo=0 --pfu=0 /dev/sdb

      as you did is the correct next step to normalize the drive’s format and protection settings.

      A few general notes while this is in progress:

      • Some drives report completion before all internal state has fully settled; the operation then continues in the background, and performance may be reduced until it finishes
      • A power cycle after completion is recommended before testing the drive again

      At this point it makes sense to pause any further investigation until the current sg_format has fully completed and the system has been power-cycled.

      Once that’s done, the next step will be a direct comparison between sdb and the known-good sda using:

      sudo sg_readcap -lla

      • Reported logical and physical sector sizes

      • Protection / PI status
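A rough sketch of that comparison, assuming sg3_utils is installed and that `sg_readcap -l` (the long form) is the option that works, as the replies further down settle on. The helper name `compare_readcap` and the `READCAP` override variable are made up for illustration; they are not part of sg3_utils.

```shell
#!/bin/sh
# Hypothetical helper: run the same read-capacity query on two drives and
# diff the reports. READCAP can be overridden, which makes it possible to
# try the logic without real hardware.
READCAP="${READCAP:-sudo sg_readcap -l}"

compare_readcap() {
    good="$1"   # known-good drive, e.g. /dev/sda
    bad="$2"    # drive under test, e.g. /dev/sdb
    tmp_good=$(mktemp) || return 2
    tmp_bad=$(mktemp) || return 2
    # Capture both reports, including any error text the tool prints.
    $READCAP "$good" > "$tmp_good" 2>&1
    $READCAP "$bad" > "$tmp_bad" 2>&1
    # diff prints the differing lines and exits non-zero when reports differ.
    if diff "$tmp_good" "$tmp_bad"; then
        echo "identical"
        status=0
    else
        status=1
    fi
    rm -f "$tmp_good" "$tmp_bad"
    return "$status"
}
```

Usage would be `compare_readcap /dev/sda /dev/sdb`. Any line that diff prints (block length, protection fields, capacity) is where the two drives disagree.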

      As a general note going forward: on Linux and FreeBSD it’s safer to reference disks by persistent identifiers (e.g. /dev/disk/by-id/ or UUID on Linux, the latter being safer but less human-readable, or glabel on FreeBSD) rather than /dev/sdX, because device names can change across boots or hardware reordering, as you have now experienced.

      Post the results once the sg_format has completed and you’re ready, and we can continue from there.
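To see which persistent name currently points at which /dev/sdX on Linux, something like the sketch below should do. The function name `list_persistent_names` is hypothetical, and it assumes a `readlink` that supports `-f` (GNU coreutils does).

```shell
#!/bin/sh
# Hypothetical helper: print each persistent name next to the kernel device
# it currently resolves to. The directory defaults to /dev/disk/by-id.
list_persistent_names() {
    dir="${1:-/dev/disk/by-id}"
    for link in "$dir"/*; do
        # Skip anything that is not a symlink (also covers an empty directory).
        [ -L "$link" ] || continue
        printf '%s -> %s\n' "$(basename "$link")" "$(readlink -f "$link")"
    done
}
```

Running `list_persistent_names` before and after a reboot shows whether a given drive moved from one sdX name to another while its by-id name stayed the same.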

      • rook@lemmy.zip (OP)
        5 hours ago

        Great News!

        Format completed, and the drive is now viewable in “Disks” (however, it still shows as unknown compared to the other one; it might just need a normal format).

        The comparison command returns “invalid option”, so I assumed you meant just -l:

        sudo sg_readcap -l /dev/sdb and sudo sg_readcap -l /dev/sda

        One question I have is: what do you mean by powercycle? Is that another command to run on the problematic drive? If you mean turn off the pc and turn it back on, I will do that right now, just after the drive has completed formatting.

        After the power cycle (turned the PC off and on):

        sudo sg_readcap -l /dev/sdb and sudo sg_readcap -l /dev/sda

        Would the next step be formatting of some kind?

        • y0din@lemmy.world
          5 hours ago

          That’s good news — what you’re seeing now is the expected state.

          A quick clarification first:

          Power cycle means exactly what you did: shut the machine down completely and turn it back on. There is no command involved. You did the right thing.

          Regarding the current status:

          The drive showing up in Disks but marked as unknown is normal

          At this point the disk has:

          • No partition table

          • No filesystem

          “Unknown” here does not indicate a problem, only that nothing has been created on it yet

          About sg_readcap:

          sg_readcap -l is correct

          There is no direct “comparison” mode; running it separately on sda and sdb is exactly what was intended

          The important thing is that both drives now report sane, consistent values (logical block size, capacity, no protection enabled)

          Next steps:

          Yes, the next step is normal disk setup, just like with any new drive:

          1. Create a partition table (GPT is typical)

          2. Create one or more partitions

          3. Create a filesystem (or add it back into ZFS if that’s your goal)
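For the plain-Linux route, the three steps above might look roughly like the following. This is only a dry-run sketch: the helper prints the commands so they can be reviewed before anything is run by hand. The ext4 filesystem, the single full-disk partition, and the function name `provision_plan` are assumptions for illustration, and the `-part1` suffix only applies to /dev/disk/by-id/ style paths.

```shell
#!/bin/sh
# Hypothetical dry-run helper: print (do not execute) the commands for
# GPT label -> one partition -> filesystem on the given disk.
provision_plan() {
    disk="$1"   # placeholder, e.g. a /dev/disk/by-id/... path
    # 1. Create a GPT partition table.
    echo "parted -s $disk mklabel gpt"
    # 2. Create one partition spanning the disk (assumed layout).
    echo "parted -s $disk mkpart primary ext4 1MiB 100%"
    # 3. Create the filesystem (ext4 is an assumption, not a recommendation).
    echo "mkfs.ext4 ${disk}-part1"
}
```

Every printed command is destructive when actually run, so the disk path deserves a second look first. None of this is needed if the drive goes back into TrueNAS.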

          At this stage the drive has transitioned from “unusable” to functionally recovered. From here on, you’re no longer fixing a problem — you’re just provisioning storage.

          If you plan to put it back into TrueNAS/ZFS, it’s usually best to let TrueNAS handle partitioning and formatting itself rather than doing it manually on Linux.

          Nice work sticking with the process and verifying things step by step.