Senior Systems & Infrastructure Engineer

I keep infrastructure running โ€”
from the rack to VM, and everything in between.

21+ years across Linux, virtualization, networking, and observability โ€” with a current obsession for making AI safely useful for real operations.

๐Ÿ“ Big Pine, CA ยท Remote ๐Ÿ› ๏ธ Linux ยท Proxmox ยท ZFS ยท Prometheus/Grafana ๐Ÿค– AI-Ops & automation
About

Two decades in, still learning out loud.

I started on the data-center floor โ€” racking and cabling servers, planning power and cooling for 42U racks, running cage consolidations. Two decades later I'm architecting high-availability clusters, writing automation in Bash, Python, and Go, and building tooling that lets AI operate production systems without breaking them. I've done the full stack of infrastructure, top (OS) to bottom (hardware).

Today I run day-to-day operations for roughly 215 production servers as a remote systems administrator โ€” web, database, Node, API, and Redis fleets โ€” building new environments, HA web clusters, and the scripts that keep the lights on. Before that I led a NOC team of eight and oversaw 500+ servers through the kind of 2 a.m. incidents you only forget if you were lucky.

My approach is boring on purpose: build on the tools a team already trusts instead of forcing a migration, make the safeguards visible, and let the documentation write itself. The flashy part is what that approach lets you do safely โ€” like the projects below.

Skills

The toolbox

Linux & Virtualization

  • Linux (RHEL/Oracle/CentOS/Debian)
  • Proxmox HA
  • VMware vCenter/ESXi
  • Nutanix
  • KVM / LXC

Web, Load Balancing & Databases

  • NGINX
  • HA web clusters
  • load balancing
  • Apache
  • Redis
  • MariaDB

Automation & Development

  • Bash
  • Python
  • Go
  • REST APIs
  • Git / Forgejo
  • CI/CD

Observability & Monitoring

  • Prometheus
  • Grafana
  • Loki
  • Nagios / Livestatus

Security & Networking

  • Wazuh SIEM
  • Suricata IDS
  • PKI / step-ca
  • VPN
  • Cisco / Brocade / Fujitsu
  • firewalls

Storage

  • ZFS
  • RAID
  • NFS
  • multi-TB arrays
  • backup / restore

AI-Ops

  • agentic AI tooling
  • AI-assisted operations
  • safe automated change pipelines

Datacenter & Hardware

  • bare-metal (Dell/HP/SuperMicro)
  • 42U rack / power / cooling
  • RMA lifecycle
  • structured cabling
Featured Work

Projects & case studies

Personal Project

Maven โ€” AI Operations Framework self-built

A system I designed that gives AI a persistent memory and safe, gated access to real infrastructure tools. Instead of a chat assistant that forgets everything between sessions, Maven loads the full state of the work each time โ€” so any task can be started or resumed instantly. It connects to existing tools through visible safeguards: every change is validated, recorded in version control, and rolled back automatically if it fails. The documentation gets produced as a byproduct of the work, not as a chore.

BashGoPythonRESTGit
Reliability Engineering

Safe AI-Driven Config Deploys

An atomic deploy pipeline for an enterprise monitoring fleet: stage โ†’ validate against a copy โ†’ commit โ†’ deploy โ†’ reload, with automatic rollback if validation fails. A real config change rolled out across 200+ hosts and 3,000+ services in about one second with zero errors โ€” and a deliberately broken change was rejected and rolled back before it could ever touch production. Safety you can watch happen.

GoGitREST APImonitoring internals
Tooling

Inventory Platform Fork + REST API

Forked and extended an open-source asset-management tool to add a REST API, an OS-lifecycle / package model, and human-in-the-loop confirmation dialogs that show a clear before/after diff on every change. Then built automation on top that can resolve, create, and update hosts safely. Built on what the team already used โ€” no migration tax.

PHPMariaDBRESTApache
ML Infrastructure

Real-Time GPU Inference Pipeline

Stood up a real-time video processing pipeline on a Tesla V100 โ€” GPU passthrough into a VM, NVIDIA DeepStream / Triton / TensorRT โ€” and tuned it for stable single-stream throughput. A deep dive into GPU performance and the realities of ML infrastructure.

CUDADeepStreamTritonTensorRTProxmox
Platform

Self-Hosted Home Lab

A production-grade lab I run to keep my edge: a Proxmox HA hypervisor with ZFS storage pools, self-hosted Git (Forgejo), a reverse proxy with automated TLS, a full observability stack (Prometheus / Grafana / Loki), and a security stack (Wazuh SIEM + Suricata IDS). Where I prototype everything before it's real.

ProxmoxZFSForgejoNginxPrometheusWazuh
Experience

Where I've done it

Systems Administrator ยท Remote
HyperMedia Systems (via PCowens.com)
2022 โ€” Present
  • Operate ~215 production servers โ€” web, database, Node, API, and Redis fleets.
  • Build new environments and high-availability web clusters with load balancers.
  • Develop automation scripts for day-to-day operations (DevOps).
  • Full-stack troubleshooting from OS down to hardware; database backup/restore, deployments, user management.
NOC Technician โ†’ NOC Supervisor โ†’ Systems Admin
HyperMedia Systems ยท Los Angeles, CA
2008 โ€” 2022
  • Led a NOC team of 8; oversaw 500+ servers off-hours and escalated major incidents (DoS attacks, circuit failures).
  • Built bare-metal and virtual servers; planned and built 42U racks with redundant power and cooling.
  • Ran data-center cage consolidation (~11 racks) and managed structured cabling with dual-redundant power.
  • Maintained server inventory and coordinated the full RMA lifecycle with vendors.
Server / Call-Center Technician
Aberdeen Inc ยท Santa Fe Springs, CA
2005 โ€” 2008
  • Created and maintained large RAID arrays (4โ€“30 TB).
  • System builds, remote troubleshooting (WebEx), VMware Workstation/ESX problem recreation, OS imaging.
Computer Technician
Schat.Net ยท Bishop, CA
2004 โ€” 2005
  • Desktop troubleshooting, virus removal, data recovery, system upgrades, on-site networking and user training.
Contact

Let's talk infrastructure.

โœ‰๏ธ FrazierPhillips@gmail.com ๐Ÿ“ž 760-920-9848 ๐Ÿ“ Big Pine, CA ยท Open to remote

References available upon request.