DEVASC Data Encoding Formats Hands-On Lab Guide¶

Overview¶

What's the big fuss over IT automation? Well, more than anything, the excitement is about the sorts of things that you don't have to do when automation is on your side. Things like not having to either copy and paste configuration changes to dozens (maybe hundreds) of different systems or repeat the same click, click, click, click, click-through-the-UI marathon over, and over, and over...every single time there's a need to make a bulk change . To automate these sorts of workflows, we need to learn to write some form of automation-specific code. To make that possible, we must first learn to read and write the data structures that our code sends to and receives from the systems we automate.

This guide will walk you through some hands-on exercises that help teach and also give you a place to practice using various data encoding formats with Python. You'll get the most from these exercises if you have at least a little bit of exposure to Python fundamentals, although we put the step-by-step directions together in a way that welcomes all experience levels.

Unstructured vs Structured Data¶

Data Meant for Humans to Read¶

When we interact with the CLI of a network device, server, storage system, cloud compute platform, etc., we typically send text commands and receive text responses. Usually, the responses to our commands are in plain text, in an unstructured format. We often see the raw text responses formatted with various spaces, tabs, numbering, tables, or even text-based graphics, to make the text easier for humans to read.

Data Meant for Computers to Read¶

When a computer, such as an automation system, tries to read that same, unstructured data, we usually have to tell the computer precisely how it needs to find the information we need. That is, we have to write some form of a search pattern, or parser, to sift through all of the spaces, special characters, and text graphics, etc., to find the specific, raw data we want. Parsing through unstructured data is often difficult to configure, temperamental to test successfully, and a headache to maintain. Plus, anytime an IT system undergoes a software or hardware change, the unstructured command response a parser expects can change, even if by a single character, and cause the parser to fail or work improperly.

APIs and Data Formats¶

So, how do we implement IT automation workflows and avoid having our lives consumed by the "joy" of writing and maintaining text parsers? Fortunately, most IT systems have programmatic interfaces, referred to as APIs, which can send and receive configuration or state/status data in a structured format. Some of the more common structured data formats include JSON, XML, and YAML.

Each of these structured data encoding formats is much easier for computers to read because the structure of the data adheres to a standard format which IT systems and programming languages can read without text parsers.

A Practical Example¶

Here is a real example of some data that a network device sent in response to a request from its CLI and its API. Now, unless you happen to be a computer, you will probably find the CLI example (unstructured data) a bit easier to read than the API example (structured data).

A Side-by-Side Comparision¶

Example Network Device CLI and API Response Data

Even though the CLI and API data formats look drastically different, they both provide the exact same data.

CLI Response Output - Unstructured Data

text

API Response Output - Structured Data

xml

Comparing the Code¶

If you're thinking, "Why would I ever want to deal with structured data? It looks like a mess!" Well, to put it simply, structured data is way easier to work with programmatically. Take a look at how we might parse both the CLI and API responses with some Python code:

Example Code Exercise

This exercise aims to parse the interface ID from each of the CLI and API response data sets. For reference, the correct result of a search for the interface ID is the text 1/10. These examples assume the CLI and API data are already available to the Python interpreter in the variable with the name data.

Even though the CLI response data is only a few lines long, the Python code to parse the interface ID from that data, with a regular expression, is lengthy and somewhat complex to read.
By contrast, even though the API response data is lengthy, the Python code to parse the interface ID is short and far less complex to read.

Click to view the code examples

Parse the Unstructured (CLI) Response

python linenums="1" hl_lines="5-17 19 21-25"

Parsing the Structured (API) Response

python linenums="1" hl_lines="5-9"

All right, that's enough reading for now. It's time to get into the lab exercises and get some real practice. If everything to this point makes perfect sense to you, great! If not, don't worry because you're about to get plenty of hands-on repetitions with functional code. Click here to continue to the next section.