Automating Docs as Code

Posted by Jason Bolden on Jul 17, 2021

This may be an unpopular opinion, but software engineers are notoriously bad at maintaining documentation. I don't believe that's any fault of our own. Moving at break-neck speeds to keep up with changing technology while pausing to record the trials and tribulations of the journey takes that much more time away from your Product Owner's tight deadlines. Docs as Code as a methodology is arguably magnitudes more convenient than maintaining a Sharepoint site or Word document, but what if we could make things just a little simpler for the lazy coder?

Objective

In this article, we're going to explore a handful of methods to make capturing documentation a little more convenient. As a working example, we'll walk through a python module that generates templated blog posts. Github is used to maintain our documentation and actions are leveraged to orchestrate all of our tedious steps.

Figure 1 - Conceptual of our 3 phased approach to automation

Attention

There's a lot of moving pieces in this setup. It's important to note that this write up is focused on the art of the possible and snippets shared are enough for a presentable proof of concept.

The Breakdown

Assuming you have a fresh repo ready to go, let's start at the top.

Automating Inception

We're going to use GitHub's functionality as our starting point when creating a new blog, specifically Issues. Just as an issue would typically signal the beginning of a new feature for application development, we're going to treat this as the place where we capture the inspiration for our next blog entry.

Since we're trying to cut out as much manual work as possible, let's take advantage of the template functionality GitHub supports for issues.

---
name: Blog Post template
about: Automation entry point for creation of new blog posts.
title: "[BLOG]"
labels: documentation.blog
assignees: ''

---

title: <TITLE>
type: <tech, insight, idea>
summary: <brief summary of post>

Pushing this .md file to your repo under the .github/ISSUE_TEMPLATE directory will result in a pre-filled issue template that presents information we'll need to provide every time we'd like to make a new blog entry. The body is formatted using yaml syntax and the fields will be used as follows:

title - The title of the blog entry.
type - This will be more apparent later, but depending on the type of blog entry, a different template will be used.
summary - This will appear as the first paragraph in the new entry.

Figure 2 - Result of our new issue template

Important to note is the label used for this template. This will be used in the workflow that gets triggered next.

What happens when we create our new issue? Nothing, without a workflow defined. We want this workflow to trigger when a new Issue has been assigned.

name: Orchestrate Issue triggered workflows based on labels
on:
  issues:
    types: [assigned]

jobs:
  blog_issue_assigned:
    if: ${{github.event.issue.labels[0].name == 'documentation.blog'}}
    name: Blog Issue Assigned

    runs-on: ubuntu-latest
    strategy:
      matrix:
        python-version: [3.8]

Now we can use that templated information within a linux environment to take us into phase 2 of our automation workflow.

- name: Generate Blog Entry and Clean Up
  run: |
    echo "${{github.event.issue.body}}" > src/config.yml
    cat src/config.yml

Note

The job for building a new blog entry only runs if the issue that triggered the GitHub workflow has a label of documentation.blog.

Automating Boilerplate Code

With a brand new issue created, for a moment assume we had to manually create this new blog entry. For our blog, we're using mkdocs. We would need to spend valuable time adding the following to our directory structure:

New .md file under the blog directory
Type the file name with the correct naming convention
Add the new file to the mkdocs.yml file's nav definition

.
├── docs
.   .
.   .
.   .
│   ├── blog
│   │   ├── 2021-07-17-automating-docs-as-code.md
│   │   └── readme.md
│   └── index.md
└── mkdocs.yml

nav:
  - Home: index.md
  - About: about.md
  - Blogs:
    - Content: blog/readme.md
    - 2021:
      - Automating Docs as Code: blog/2021-07-17-automating-docs-as-code.md

These steps may appear small and low effort, but over time they accumulate and become the annoyances that cause one to avoid capturing documentation all together. We're going to address this by using some python code.

Attention

The code referenced in this post are snippets of the profile_builder module used to create it. The entire module is outside of the scope of this discussion, but feel free to explore how the profile_builder works under the hood as an exercise.

The logic of the profile_builder is straightforward:

Load our config file as a dictionary, builder()
Load the appropriate template file (based on the blog type), generate_blog(...)
Create the filenames and output directories based on date and config parameters, lines 5-8

# Perform the initial build
# Build the .md file and write to blog directory
# templates - directory location of ".j2" files
config = builder()        
config['publish_date'] = date.today()
template_file = f"{templates}/blog-post-{config['type']}.md.j2"
blog_filename = f"{config['publish_date']:%Y-%m-%d}-{slugify(config['title'])}.md"
dest_file = f"{output_dir}/{blog_filename}"
generate_blog(template_file, config, dest_file)

Note

Notice that we've parameterized the template filename using the type variable from the issue config.

1	`template_file = f"{templates}/blog-post-{config['type']}.md.j2"`

Jinja is a great template engine for programmatically creating documents. Because blogs are pretty much rinse and repeat of the same formatting, why should we spend the time typing it out every time (or even copy pasting for that matter)? Jinja allows us to use the dict elements in our config variable to populate the {{...}} expression fields of our template. Once generated, our new entry is written to the dest_file directory.

<div id="banner" class="page-image">
    <img src="../img/{{ publish_date.strftime('%Y-%m-%d') }}-blog-banner.drawio.svg" alt="">
    <div class="page-image-caption">
        <p>
            <a href="">AUTHOR</a>
        </p>
    </div>
</div>

<!-- Reposition the banner image on top -->
<script>
    var article = document.getElementsByTagName("article")[0];
    article.insertBefore(document.getElementById("banner"), article.childNodes[0]);
</script>

# {{ title }}

{{ publish_date.strftime('%Y %b %d') }} by [Jason Bolden](../about.md)


{{ summary }}

## Objective
## Steps
## Conclusion
## References

Lastly, we need to update mkdocs.yml file to include our new 2021-07-17-automating-docs-as-code.md file.

# get the relative path from within the docs directory
p = Path(dest_file)
entry = Path(*p.parts[2:]) 

# Update the mkdocs.yml file nav with the new structure
mkdocs_config = load_config(config_file=mkdocs_file)
update_blog(mkdocs_config, config['publish_date'], {config['title']: str(entry)})

Automating Dev Prep

We're almost done. The only thing remaining is to stitch our profile_builder into our issues_automation workflow and create our development branch. Again, we could create a new branch manually and run the profile_builder module from the command line on a our local, but where's the fun in that?

1
2
3

- name: Create new Blog Branch
  run: |
    git checkout -b ${{BRANCH_NAME}}

- name: Generate Blog Entry and Clean Up
  run: |
    echo "${{github.event.issue.body}}" > src/config.yml
    cat src/config.yml
    cd src
    python -m profile_builder blog -t ./templates -o ../docs/blog -c ./config.yml -v
    rm config.yml
    cd ..
    git config user.name github-actions
    git config user.email github-actions@github.com
    git add .
    git commit -m "generated ${{BRANCH_NAME}}"
    git push --set-upstream origin ${{BRANCH_NAME}}

This code snippet builds on our GitHub Workflow step from phase 1. All we're doing is scripting out the actions previously described in bash. To kick the whole thing off, simply create a new Blog issue, fill in the details, assign it to yourself, and switch to the newly created blog branch.

Figure 4 - Completed Action workflow following issue assignment

Conclusion

At first glance, this may seem like overkill for a simple blog. A Wordpress or Squarespace site would be much easier to put together. So instead, let's think about a repo that holds engineering artifacts for the application your team supports. Or how about a knowledge base of training material for your team of developers. Incident response playbooks for your Security Operations Center, IT Help Desk procedures, etc.

This workflow allows for someone to suggest an addition to the team's documentation, and automate a lot of the repetitive actions that demotivates the individual from creating the documentation in the first place. In addition to the bonus of treating the docs as managed source code, the GitHub repo facilitates a collaborative environment so documentation is less likely to be created in a vacuum without peer review.