Automating Your Pure Storage Infrastructure: Part 1 - Getting Started

After a decently long hiatus from writing anything in a series-fashion, I’m back to share a blog series based around automating your Pure Storage environment. We’re going to begin this series with a few posts about advancing both your understanding of your Pure Storage environment and understanding the options available to you for automating and monitoring the infrastructure.

This series will begin with some requests that came from Pure Storage customers, and other Pure employees that asked for some help in delivering the solution, or just some help with understanding how to accomplish the goal. These are normally the most enjoyable tasks to automate, as it gives a chance to understand what sorts of tasks a customer is trying to accomplish and what needs they have, and to help educate others along the way.

I have always found that the most effective way to learn automation is focusing on a practical application, where you have a goal in mind, and starting with breaking the task down into simple steps. We aim to have our automation handled in sequential steps which give us a repetitive process, where we can measure our results to both measure performance and work towards incremental improvement over time.

The first project that I want to cover was a request from a Pure customer that was preparing for their first non-disruptive software upgrade (NDU) across two storage arrays. While this upgrade was going to be handled by Pure support, as there were no personnel on-site at the customer datacenter, they wanted to understand how their IT staff and their NOC could remotely monitor the infrastructure before, during, and after the upgrade. The primary question that the customer had was how to observe the host path connections from each server to their FlashArray.

The biggest concern from the customer was with being new to the process and needing to ensure proper monitoring of their environment to meet their internal process and procedures for performing upgrades of critical infrastructure. Part of their requirements also include submitting an upgrade plan to the change approval board before the upgrade could be approved, especially with the upgrade being performed remotely and not by on-site personnel.

As they did not have operations experience with the platform and how a non-disruptive upgrade is performed (potentially also not trusting a vendor promise of “no downtime”), they wanted us to help make this an easy process for upgrades, beginning with the first. The customer was requesting assistance with a “storage pathing report”.

As part of their upgrade process, they needed to ensure they were able to determine that the correct number of host paths were seen active throughout the upgrade process, and they were just unsure of the capabilities of scripting such a solution. They were also smart enough to simply ask their account teams if one of our nerds could tell them which knobs to turn to accomplish this.

Although the software upgrade on a FlashArray does not taking much time for minor jumps, they needed to have this information automatically provided to their NOC every few minutes. They also needed to get the details displayed on the ‘Health > Connections’ tab for this detail, but avoid having to observe the array UI, as this is not a feasible way to monitor upgrades. (Seriously, that should never be an acceptable answer from a vendor… having a dedicated resource available to watch the UI through the upgrade)

After a discussion with the customer about what would be the ideal final result for them, they said they would be able to do two things:

  1. Observe the current number of connected initiators to get regular/timed updates of the count of active host paths
  2. Get the detailed output of all connected & not connected host initiators so that a comparison can be made before and after the upgrade

Now one final wrinkle in this, is that the customer also has some VMs which are running in-guest iSCSI for some testing, but these are not critical to report against, they would like to limit this effort to only fiber-channel hosts.

So, the breakdown of all of the required steps for processing both sets of output requested for these automated updates are:

  1. Allow for a script that can remotely access the array with externally stored credentials
  2. List out the number of initiators currently connected to the array (only fiber-channel)
  3. List out which array ports each host is currently connected to (only fiber-channel)
  4. Produce an object or text output that can be picked up and automatically sent to another alerting process (we only need to handle output, they would handle sending the notification/alert)

Now, by taking our request and breaking it down into specific steps, it becomes almost a simple checklist for us to follow. This also means that we can determine the basics of what is necessary and then add the code for each until we achieve our (MVP) minimum viable product.

And yes, it may be easy to think that step 1 is very basic and simple, and maybe doesn’t require being written down but the specific reason I wrote it down is honestly due to repetition. After many years of delivering example code to customers and users, I try to focus on the parts that are important and that are within my subject matter area. I don’t know what method the customer running this script will use to input a credential for the connection to the FlashArray and that is not my primary concern. With basic functionality built into the script to use a credential, the customer is free to add additional code to deal with the credentials in whatever manner is necessary for their internal systems.

We will be using the PureStoragePowerShellSDK (the original v1 SDK) module to connect to the array, just because I know the simple way to pull some of these details from the FlashArray via CLI, which means that my focus is just in taking these results which we can gather by CLI, and wrapping this in PowerShell. As a pre-requisite, this means that the PowerShell host running this code will need SSH access to the FlashArray, and we’ll highlight where we see this in the help for the cmdlet itself.

Let’s with step 1 and finding the cmdlet that we’ll use to connect to the FlashArray and run our CLI command. We’ll start looking through all of the commands within our module by running Get-Command -Module PureStoragePowerShellSDK

Get-Command -Module PureStoragePowerShellSDK

CommandType     Name                                               Version    Source
-----------     ----                                               -------    ------
Cmdlet          Add-PfaArrayToPod                                  1.19.49.0  PureStoragePowerShellSDK
Cmdlet          Add-PfaHostGroupsToProtectionGroup                 1.19.49.0  PureStoragePowerShellSDK
Cmdlet          Add-PfaHostIqns                                    1.19.49.0  PureStoragePowerShellSDK
Cmdlet          Add-PfaHostNqn                                     1.19.49.0  PureStoragePowerShellSDK
Cmdlet          Add-PfaHosts                                       1.19.49.0  PureStoragePowerShellSDK
Cmdlet          Add-PfaHostsToProtectionGroup                      1.19.49.0  PureStoragePowerShellSDK
Cmdlet          Add-PfaHostWwns                                    1.19.49.0  PureStoragePowerShellSDK
Cmdlet          Add-PfaPodReplicaLink                              1.19.49.0  PureStoragePowerShellSDK

skipping a few lines...

Cmdlet          Unlock-PfaUser                                     1.19.49.0  PureStoragePowerShellSDK
Cmdlet          Update-PfaArrayConnectionsThrottlingInformation    1.19.49.0  PureStoragePowerShellSDK
Cmdlet          Update-PfaItemFlag                                 1.19.49.0  PureStoragePowerShellSDK
Cmdlet          Update-PfaKmip                                     1.19.49.0  PureStoragePowerShellSDK
Cmdlet          Update-PfaSubnetSettings                           1.19.49.0  PureStoragePowerShellSDK
Cmdlet          Update-PfaSubnetStatus                             1.19.49.0  PureStoragePowerShellSDK

This returns a lot of cmdlets to us, and we can find this exact number by piping the previous code to ‘Measure-Object’

Get-Command -Module PureStoragePowerShellSDK | Measure-Object

Count    : 466

So, this module has 466 cmdlets in the current ‘1.19.49.0’ version. Now let’s go about finding the cmdlet that we need for getting our CLI command run and returning the output – we’ll do this by using our current “Get-Command” code and looking for a cmdlet with “CLI” as part of the cmdlet noun.

Get-Command -Module PureStoragePowerShellSDK -Noun *CLI*

CommandType     Name                                               Version    Source
-----------     ----                                               -------    ------
Cmdlet          New-PfaCLICommand                                  1.19.49.0  PureStoragePowerShellSDK

This returns a single cmdlet for us within the module, so now we only need to look at the cmdlet help to see what information we need to run it.

Get-Help New-PfaCLICommand -Detailed

NAME
    New-PfaCLICommand

SYNOPSIS
    Execute CLI Command on the FlashArray.


SYNTAX
    New-PfaCLICommand -EndPoint <String> -UserName <String> -Password <SecureString> [-DisableLoggingToArray     <SwitchParameter>] [-TimeOutInMilliSeconds <Nullable[Int32]>] -CommandText <String> [<CommonParameters>]

    New-PfaCLICommand -EndPoint <String> -Credentials <PSCredential> [-DisableLoggingToArray     <SwitchParameter>] [-TimeOutInMilliSeconds <Nullable[Int32]>] -CommandText <String> [<CommonParameters>]


DESCRIPTION
    Execute CLI Command on the FlashArray via the SSH protocol. SSH connectivity to the array is required.


PARAMETERS
    -EndPoint <String>
        The management address or FQDN of the FlashArray.

    -UserName <String>
        The SSH username for the FlashArray

    -Password <SecureString>
        The secure password. Must be a SecureString.

    -DisableLoggingToArray [<SwitchParameter>]
        Disable phone home logging for this connection. This reduces the amount of logging on the array for     any commands that are executed.

    -TimeOutInMilliSeconds <Nullable[Int32]>
        Timeout in milliseconds for the command to complete.

    -CommandText <String>
        The CLI command to run on the FlashArray

    -Credentials <PSCredential>
        The credentials for SSH access to the FlashArray. Used in place of -Username and -Password. Obtained     as a PSCredential object. Example $creds=Get-Credential.

    <CommonParameters>
        This cmdlet supports the common parameters: Verbose, Debug,
        ErrorAction, ErrorVariable, WarningAction, WarningVariable,
        OutBuffer, PipelineVariable, and OutVariable. For more information, see
        about_CommonParameters (https:/go.microsoft.com/fwlink/?LinkID=113216).

    ------ EXAMPLE ------

    New-PfaCLICommand -Endpoint array1 -Username pureuser -Password $securepassword -CommandText "purearray     list --controller"

    Executes the CLI command 'purearray list --controller' on Array array1 using the username and password     credentials supplied
    ------ EXAMPLE ------

    $creds=Get-Credential
    New-PfaCLICommand -Endpoint array1 -Credentials $creds -CommandText "purearray list --controller"     -DisableLoggingToArray

    Creates a PSCredential object with the username and password supplied when prompted.
    Executes the CLI command 'purearray list --controller' on Array 'array1' and disables the logging of this     connection for phone home logging.
REMARKS
    To see the examples, type: "get-help New-PfaCLICommand -examples".
    For more information, type: "get-help New-PfaCLICommand -detailed".
    For technical information, type: "get-help New-PfaCLICommand -full".

The help for our cmdlet gives us a parameter set which will take a PSCredential object, as shown here:

New-PfaCLICommand -EndPoint <String> -Credentials <PSCredential> [-DisableLoggingToArray <SwitchParameter>] [-TimeOutInMilliSeconds <Nullable[Int32]>] -CommandText <String>

So we’ve already got all of step 1 accomplished with just finding the right cmdlet. Score.

We’ll continue soon with the next blog post, where we parse this output that we have, and continue with our next steps.