Frazier Phillips — Senior Systems & Infrastructure Engineer

About

Two decades in, still learning out loud.

I started on the data-center floor — racking and cabling servers, planning power and cooling for 42U racks, running cage consolidations. Two decades later I'm architecting high-availability clusters, writing automation in Bash, Python, and Go, and building tooling that lets AI operate production systems without breaking them. I've done the full stack of infrastructure, top (OS) to bottom (hardware).

Today I run day-to-day operations for roughly 215 production servers as a remote systems administrator — web, database, Node, API, and Redis fleets — building new environments, HA web clusters, and the scripts that keep the lights on. Before that I led a NOC team of eight and oversaw 500+ servers through the kind of 2 a.m. incidents you only forget if you were lucky.

My approach is boring on purpose: build on the tools a team already trusts instead of forcing a migration, make the safeguards visible, and let the documentation write itself. The flashy part is what that approach lets you do safely — like the projects below.

Skills

The toolbox

Linux & Virtualization

Linux (RHEL/Oracle/CentOS/Debian)
Proxmox HA
VMware vCenter/ESXi
Nutanix
KVM / LXC

Web, Load Balancing & Databases

NGINX
HA web clusters
load balancing
Apache
Redis
MariaDB

Automation & Development

Bash
Python
Go
REST APIs
Git / Forgejo
CI/CD

Observability & Monitoring

Prometheus
Grafana
Loki
Nagios / Livestatus

Security & Networking

Wazuh SIEM
Suricata IDS
PKI / step-ca
VPN
Cisco / Brocade / Fujitsu
firewalls

Storage

ZFS
RAID
NFS
multi-TB arrays
backup / restore

AI-Ops

agentic AI tooling
AI-assisted operations
safe automated change pipelines

Datacenter & Hardware

bare-metal (Dell/HP/SuperMicro)
42U rack / power / cooling
RMA lifecycle
structured cabling

Featured Work

Projects & case studies

Personal Project

Maven — AI Operations Framework self-built

A system I designed that gives AI a persistent memory and safe, gated access to real infrastructure tools. Instead of a chat assistant that forgets everything between sessions, Maven loads the full state of the work each time — so any task can be started or resumed instantly. It connects to existing tools through visible safeguards: every change is validated, recorded in version control, and rolled back automatically if it fails. The documentation gets produced as a byproduct of the work, not as a chore.

BashGoPythonRESTGit

Reliability Engineering

Safe AI-Driven Config Deploys

An atomic deploy pipeline for an enterprise monitoring fleet: stage → validate against a copy → commit → deploy → reload, with automatic rollback if validation fails. A real config change rolled out across 200+ hosts and 3,000+ services in about one second with zero errors — and a deliberately broken change was rejected and rolled back before it could ever touch production. Safety you can watch happen.

GoGitREST APImonitoring internals

Tooling

Inventory Platform Fork + REST API

Forked and extended an open-source asset-management tool to add a REST API, an OS-lifecycle / package model, and human-in-the-loop confirmation dialogs that show a clear before/after diff on every change. Then built automation on top that can resolve, create, and update hosts safely. Built on what the team already used — no migration tax.

PHPMariaDBRESTApache

ML Infrastructure

Real-Time GPU Inference Pipeline

Stood up a real-time video processing pipeline on a Tesla V100 — GPU passthrough into a VM, NVIDIA DeepStream / Triton / TensorRT — and tuned it for stable single-stream throughput. A deep dive into GPU performance and the realities of ML infrastructure.

CUDADeepStreamTritonTensorRTProxmox

Platform

Self-Hosted Home Lab

A production-grade lab I run to keep my edge: a Proxmox HA hypervisor with ZFS storage pools, self-hosted Git (Forgejo), a reverse proxy with automated TLS, a full observability stack (Prometheus / Grafana / Loki), and a security stack (Wazuh SIEM + Suricata IDS). Where I prototype everything before it's real.

ProxmoxZFSForgejoNginxPrometheusWazuh

Experience

Where I've done it

Systems Administrator · Remote

HyperMedia Systems (via PCowens.com)

2022 — Present

Operate ~215 production servers — web, database, Node, API, and Redis fleets.
Build new environments and high-availability web clusters with load balancers.
Develop automation scripts for day-to-day operations (DevOps).
Full-stack troubleshooting from OS down to hardware; database backup/restore, deployments, user management.

NOC Technician → NOC Supervisor → Systems Admin

HyperMedia Systems · Los Angeles, CA

2008 — 2022

Led a NOC team of 8; oversaw 500+ servers off-hours and escalated major incidents (DoS attacks, circuit failures).
Built bare-metal and virtual servers; planned and built 42U racks with redundant power and cooling.
Ran data-center cage consolidation (~11 racks) and managed structured cabling with dual-redundant power.
Maintained server inventory and coordinated the full RMA lifecycle with vendors.

Server / Call-Center Technician

Aberdeen Inc · Santa Fe Springs, CA

2005 — 2008

Created and maintained large RAID arrays (4–30 TB).
System builds, remote troubleshooting (WebEx), VMware Workstation/ESX problem recreation, OS imaging.

Computer Technician

Schat.Net · Bishop, CA

2004 — 2005

Desktop troubleshooting, virus removal, data recovery, system upgrades, on-site networking and user training.

I keep infrastructure running —
from the rack to VM, and everything in between.

Two decades in, still learning out loud.

The toolbox

Linux & Virtualization

Web, Load Balancing & Databases

Automation & Development

Observability & Monitoring

Security & Networking

Storage

AI-Ops

Datacenter & Hardware

Projects & case studies

Maven — AI Operations Framework self-built

Safe AI-Driven Config Deploys

Inventory Platform Fork + REST API

Real-Time GPU Inference Pipeline

Self-Hosted Home Lab

Where I've done it

Let's talk infrastructure.

I keep infrastructure running —from the rack to VM, and everything in between.

Two decades in, still learning out loud.

The toolbox

Linux & Virtualization

Web, Load Balancing & Databases

Automation & Development

Observability & Monitoring

Security & Networking

Storage

AI-Ops

Datacenter & Hardware

Projects & case studies

Maven — AI Operations Framework self-built

Safe AI-Driven Config Deploys

Inventory Platform Fork + REST API

Real-Time GPU Inference Pipeline

Self-Hosted Home Lab

Where I've done it

Let's talk infrastructure.

I keep infrastructure running —
from the rack to VM, and everything in between.